These are my recent Pinboard.in links:
- “Most ironically, being based in the hopelessly lost cultural void of Ann Arbor, a notorious mecca for the last surviving remnants of the pseudo-intellectual street people movement that said much and accomplished little…”
punk history-is-a-feature-not-a-bug cultural-dynamics ha-ha-only-semiserious - An amusing collection of what seem to be half-remembered ideas gleaned from his visit to the GPTP workshop in Ann Arbor two years ago, presented as his own inventions and without citation or mention of the dozen people who actually do this work. His keynote, as I remember it, essentially revolved around him pointing out how influential his work should have been all along, if only we had bothered to cite him as we should have done, because he thought up the core concepts of genetic programming well before any of us claimed we had. This is pretty much a camel’s-back straw for me. If there is a better argument for completely boycotting the citation system and relying on personal association and named schools rather than publication, I have not yet encountered it. So remember poor oppressed graduate and postdoc kids: when I cite your work by simply naming you personally, and not your advisor or your institution, and not even your publication or journal but merely YOU PERSONALLY, it’s because you personally deserve the credit, not any of those other leeches. Got that?
now-this-really-pisses-me-off-to-no-end - “Previous researches have demonstrated that the framework of dictionary learning with sparse coding, in which signals are decomposed as linear combinations of a few atoms of a learned dictionary, is well adept to reconstruction issues. This framework has also been used for discrimination tasks such as image classification. To achieve better performances of classification, experts develop several methods to learn a discriminative dictionary in a supervised manner. However, another issue is that when the data become extremely large in scale, these methods will be no longer effective as they are all batch-oriented approaches. For this reason, we propose a novel online algorithm for discriminative dictionary learning, dubbed textbf{ODDL} in this paper. First, we introduce a linear classifier into the conventional dictionary learning formulation and derive a discriminative dictionary learning problem. Then, we exploit an online algorithm to solve the derived problem. Unlike the most existing approaches which update dictionary and classifier alternately via iteratively solving sub-problems, our approach directly explores them jointly. Meanwhile, it can largely shorten the runtime for training and is also particularly suitable for large-scale classification issues. To evaluate the performance of the proposed ODDL approach in image recognition, we conduct some experiments on three well-known benchmarks, and the experimental results demonstrate ODDL is fairly promising for image classification tasks.”
image-analysis image-segmentation algorithms nudge-targets - “A system responding to a stochastic driving signal can be interpreted as computing, by means of its dynamics, an (implicit) model of the environmental variables. The system’s state retains information about past environmental fluctuations, and a fraction of this information is predictive of future ones. The remaining nonpredictive information reflects model complexity that does not improve predictive power, and represents the ineffectiveness of the model. We expose the fundamental equivalence between this model inefficiency and thermodynamic inefficiency, measured by the energy dissipated during the interaction between system and environment. Our results hold arbitrarily far from thermodynamic equilibrium and are applicable to a wide range of systems, including biomolecular machines. They highlight a profound connection between the effective use of information and efficient thermodynamic operation: any system constructed to keep memory about its environment and to operate energetically efficiently has to be predictive.”
modeling philosophy-of-science information-theory physics thermodynamics talking-about-a-model-is-a-model pragmatism-it-ain’t - “The game of chess as always been viewed as an iconic representation of intellectual prowess. Since the very beginning of computer science, the challenge of being able to program a computer capable of playing chess and beating humans has been alive and used both as a mark to measure hardware/software progresses and as an ongoing programming challenge leading to numerous discoveries. In the early days of computer science it was a topic for specialists. But as computers were democratized, and the strength of chess engines began to increase, chess players started to appropriate to themselves these new tools. We show how these interactions between the world of chess and information technologies have been herald of broader social impacts of information technologies. The game of chess, and more broadly the world of chess (chess players, literature, computer softwares and websites dedicated to chess, etc.), turns out to be a surprisingly and particularly sharp indicator of the changes induced in our everyday life by the information technologies. Moreover, in the same way that chess is a modelization of war that captures the raw features of strategic thinking, chess world can be seen as small society making the study of the information technologies impact easier to analyze and to grasp.”
touchstones history algorithms history-of-science computer-science - “Libraries are a recognition that scholarship and culture are more than the business of creating and consuming. They are a human conversation, and libraries provide common ground where that conversation can take place and be remembered. By taking aim at the right for the public to maintain this conversation and its memory, publishers have shown us what we have to lose. It’s time we resisted the outsourcing of our common heritage by occupying the library.”
Occupy libraries intellectual-property open-access public-policy activism [1112.3307] Polytope Codes Against Adversaries in Networks
“Network coding is studied when an adversary controls a subset of nodes in the network of limited quantity but unknown location. This problem is shown to be more difficult than when the adversary controls a given number of edges in the network, in that linear codes are insufficient. To solve the node problem, the class of Polytope Codes is introduced. Polytope Codes are constant composition codes operating over bounded polytopes in integer vector fields. The polytope structure creates additional complexity, but it induces properties on marginal distributions of code vectors so that validities of codewords can be checked by internal nodes of the network. It is shown that Polytope Codes achieve a cut-set bound for a class of planar networks. It is also shown that this cut-set bound is not always tight, and a tighter bound is given for an example network.”
cryptography privacy algorithms nudge-targets network-theory communication[1203.3353] Solving Structure with Sparse, Randomly-Oriented X-ray Data
“Single-particle imaging experiments of biomolecules at x-ray free-electron lasers (XFELs) require processing of hundreds of thousands (or more) of images that contain very few x-rays. Each low-flux image of the diffraction pattern is produced by a single, randomly oriented particle, such as a protein. We demonstrate the feasibility of collecting data at these extremes, averaging only 2.5 photons per frame, where it seems doubtful there could be information about the state of rotation, let alone the image contrast. This is accomplished with an expectation maximization algorithm that processes the low-flux data in aggregate, and without any prior knowledge of the object or its orientation. The versatility of the method promises, more generally, to redefine what measurement scenarios can provide useful signal in the high-noise regime.”
structural-biology image-analysis crystallography algorithms inverse-problems nudge-targets statistics[1203.3203] An efficient algorithm for generating AoA networks
“The activities, in project scheduling, can be represented graphically in two different ways, by either assigning the activities to the nodes ‘AoN’ directed acyclic graph (dag) or to the arcs ‘AoA dag’. In this paper, a new algorithm is proposed for generating, for a given project scheduling problem, an Activity-on-Arc dag starting from the Activity-on-Node dag using the concepts of line graphs of graphs.”
scheduling operations-research algorithms graph-theory- “…Common misconceptions regarding the embedding approach are addressed including whether or not it results in an average value control model (no), is necessary to “tweak” the algorithm to get bang-bang solutions (no), requires infinite switching (no), has real-time capability (yes), or reduction to a classical nonlinear optimization problem (a desirable yes).”
control-theory operations-research algorithms numerical-methods philosophy-of-engineering design-patterns nudge-targets [1203.3270] Extraction of Facial Feature Points Using Cumulative Histogram
“This paper proposes a novel adaptive algorithm to extract facial feature points automatically such as eyebrows corners, eyes corners, nostrils, nose tip, and mouth corners in frontal view faces, which is based on cumulative histogram approach by varying different threshold values. At first, the method adopts the Viola-Jones face detector to detect the location of face and also crops the face region in an image. From the concept of the human face structure, the six relevant regions such as right eyebrow, left eyebrow, right eye, left eye, nose, and mouth areas are cropped in a face image. Then the histogram of each cropped relevant region is computed and its cumulative histogram value is employed by varying different threshold values to create a new filtering image in an adaptive way. The connected component of interested area for each relevant filtering image is indicated our respective feature region. A simple linear search algorithm for eyebrows, eyes and mouth filtering images and contour algorithm for nose filtering image are applied to extract our desired corner points automatically. The method was tested on a large BioID frontal face database in different illuminations, expressions and lighting conditions and the experimental results have achieved average success rates of 95.27%.”
image-segmentation image-analysis face-recognition algorithms nudge-targets- “We study a character-based phylogeny reconstruction problem when an incomplete set of data is given. More specifically, we consider the situation under the directed perfect phylogeny assumption with binary characters in which for some species the states of some characters are missing. Our main object is to give an efficient algorithm to enumerate (or list) all perfect phylogenies that can be obtained when the missing entries are completed. While a simple branch-and-bound algorithm (B&B) shows a theoretically good performance, we propose another approach based on a zero-suppressed binary decision diagram (ZDD). Experimental results on randomly generated data exhibit that the ZDD approach outperforms B&B. We also prove that counting the number of phylogenetic trees consistent with a given data is #P-complete, thus providing an evidence that an efficient random sampling seems hard.”
phylogenetics inverse-problems genetics algorithms statistics nudge-targets [1203.0879] Designing and using prior knowledge for phase retrieval
“In this work we develop an algorithm for signal reconstruction from the magnitude of its Fourier transform in a situation where some (non-zero) parts of the sought signal are known. Although our method does not assume that the known part comprises the boundary of the sought signal, this is often the case in microscopy: a specimen is placed inside a known mask, which can be thought of as a known light source that surrounds the unknown signal. Therefore, in the past, several algorithms were suggested that solve the phase retrieval problem assuming known boundary values. Unlike our method, these methods do rely on the fact that the known part is on the boundary. Besides the reconstruction method we give an explanation of the phenomena observed in previous work: the reconstruction is much faster when there is more energy concentrated in the known part. Quite surprisingly, this can be explained using our previous results on phase retrieval with approximately known Fourier phase.”
image-analysis image-processing learning inverse-problems algorithms nudge-targets[1203.3415] A New Approach to Count Pattern Motifs Using Combinatorial Techniches
“We proposed two new exact algorithms to detect network motifs of size 3 and 4. Considering that motifs need to count the isomorphic patterns in the original graph $G(V,E)$ and in a set of randomized graphs, the following complexities concern about count isomorphic patterns in a single graph. Let $m=|E|$ and let $a(G)$ be the arboricity of $G$. Assume $|E|geq|V|$. We describe a $O(a(G)m)$ time complexity algorithm to count isomorphic patterns of size 3. The complexity is a $O({msqrt{m}})$ in the worst graph. The second algorithm is a $O(m^2)$ complexity algorithm to count isomorphic patterns of size 4. The final result was expressive faster when compared with other implemented algorithms.”
network-theory graph-theory algorithms nudge-targets