Items of some interest:

These are my recent Pin​board​.in links:

Items of some interest:

These are my recent Pin​board​.in links:

  • Nicholas Rombes: Punk | berfrois

    “Most iron­i­cally, being based in the hope­lessly lost cul­tural void of Ann Arbor, a noto­ri­ous mecca for the last sur­viv­ing rem­nants of the pseudo-​​intellectual street peo­ple move­ment that said much and accom­plished little…”

    punk history-​​is-​​a-​​feature-​​not-​​a-​​bug cultural-​​dynamics ha-​​ha-​​only-​​semiserious
  • [1112.5309] POWERPLAY: Train­ing an Increas­ingly Gen­eral Prob­lem Solver by Con­tin­u­ally Search­ing for the Sim­plest Still Unsolv­able Problem

    An amus­ing col­lec­tion of what seem to be half-​​remembered ideas gleaned from his visit to the GPTP work­shop in Ann Arbor two years ago, pre­sented as his own inven­tions and with­out cita­tion or men­tion of the dozen peo­ple who actu­ally do this work. His keynote, as I remem­ber it, essen­tially revolved around him point­ing out how influ­en­tial his work should have been all along, if only we had both­ered to cite him as we should have done, because he thought up the core con­cepts of genetic pro­gram­ming well before any of us claimed we had. This is pretty much a camel’s-back straw for me. If there is a bet­ter argu­ment for com­pletely boy­cotting the cita­tion sys­tem and rely­ing on per­sonal asso­ci­a­tion and named schools rather than pub­li­ca­tion, I have not yet encoun­tered it. So remem­ber poor oppressed grad­u­ate and post­doc kids: when I cite your work by sim­ply nam­ing you per­son­ally, and not your advi­sor or your insti­tu­tion, and not even your pub­li­ca­tion or jour­nal but merely YOU PERSONALLY, it’s because you per­son­ally deserve the credit, not any of those other leeches. Got that?

    now-​​this-​​really-​​pisses-​​me-​​off-​​to-​​no-​​end
  • [1203.0856] Online Dis­crim­i­na­tive Dic­tio­nary Learn­ing for Image Clas­si­fi­ca­tion Based on Block-​​Coordinate Descent Method

    “Pre­vi­ous researches have demon­strated that the frame­work of dic­tio­nary learn­ing with sparse cod­ing, in which sig­nals are decom­posed as lin­ear com­bi­na­tions of a few atoms of a learned dic­tio­nary, is well adept to recon­struc­tion issues. This frame­work has also been used for dis­crim­i­na­tion tasks such as image clas­si­fi­ca­tion. To achieve bet­ter per­for­mances of clas­si­fi­ca­tion, experts develop sev­eral meth­ods to learn a dis­crim­i­na­tive dic­tio­nary in a super­vised man­ner. How­ever, another issue is that when the data become extremely large in scale, these meth­ods will be no longer effec­tive as they are all batch-​​oriented approaches. For this rea­son, we pro­pose a novel online algo­rithm for dis­crim­i­na­tive dic­tio­nary learn­ing, dubbed textbf{ODDL} in this paper. First, we intro­duce a lin­ear clas­si­fier into the con­ven­tional dic­tio­nary learn­ing for­mu­la­tion and derive a dis­crim­i­na­tive dic­tio­nary learn­ing prob­lem. Then, we exploit an online algo­rithm to solve the derived prob­lem. Unlike the most exist­ing approaches which update dic­tio­nary and clas­si­fier alter­nately via iter­a­tively solv­ing sub-​​problems, our approach directly explores them jointly. Mean­while, it can largely shorten the run­time for train­ing and is also par­tic­u­larly suit­able for large-​​scale clas­si­fi­ca­tion issues. To eval­u­ate the per­for­mance of the pro­posed ODDL approach in image recog­ni­tion, we con­duct some exper­i­ments on three well-​​known bench­marks, and the exper­i­men­tal results demon­strate ODDL is fairly promis­ing for image clas­si­fi­ca­tion tasks.”

    image-​​analysis image-​​segmentation algo­rithms nudge-​​targets
  • [1203.3271] The ther­mo­dy­nam­ics of prediction

    “A sys­tem respond­ing to a sto­chas­tic dri­ving sig­nal can be inter­preted as com­put­ing, by means of its dynam­ics, an (implicit) model of the envi­ron­men­tal vari­ables. The system’s state retains infor­ma­tion about past envi­ron­men­tal fluc­tu­a­tions, and a frac­tion of this infor­ma­tion is pre­dic­tive of future ones. The remain­ing non­pre­dic­tive infor­ma­tion reflects model com­plex­ity that does not improve pre­dic­tive power, and rep­re­sents the inef­fec­tive­ness of the model. We expose the fun­da­men­tal equiv­a­lence between this model inef­fi­ciency and ther­mo­dy­namic inef­fi­ciency, mea­sured by the energy dis­si­pated dur­ing the inter­ac­tion between sys­tem and envi­ron­ment. Our results hold arbi­trar­ily far from ther­mo­dy­namic equi­lib­rium and are applic­a­ble to a wide range of sys­tems, includ­ing bio­mol­e­c­u­lar machines. They high­light a pro­found con­nec­tion between the effec­tive use of infor­ma­tion and effi­cient ther­mo­dy­namic oper­a­tion: any sys­tem con­structed to keep mem­ory about its envi­ron­ment and to oper­ate ener­get­i­cally effi­ciently has to be predictive.”

    mod­el­ing philosophy-​​of-​​science information-​​theory physics ther­mo­dy­nam­ics talking-​​about-​​a-​​model-​​is-​​a-​​model pragmatism-it-ain’t
  • [1203.3434] On the Impact of Infor­ma­tion Tech­nolo­gies on Soci­ety: an His­tor­i­cal Per­spec­tive through the Game of Chess

    “The game of chess as always been viewed as an iconic rep­re­sen­ta­tion of intel­lec­tual prowess. Since the very begin­ning of com­puter sci­ence, the chal­lenge of being able to pro­gram a com­puter capa­ble of play­ing chess and beat­ing humans has been alive and used both as a mark to mea­sure hardware/​software pro­gresses and as an ongo­ing pro­gram­ming chal­lenge lead­ing to numer­ous dis­cov­er­ies. In the early days of com­puter sci­ence it was a topic for spe­cial­ists. But as com­put­ers were democ­ra­tized, and the strength of chess engines began to increase, chess play­ers started to appro­pri­ate to them­selves these new tools. We show how these inter­ac­tions between the world of chess and infor­ma­tion tech­nolo­gies have been her­ald of broader social impacts of infor­ma­tion tech­nolo­gies. The game of chess, and more broadly the world of chess (chess play­ers, lit­er­a­ture, com­puter soft­wares and web­sites ded­i­cated to chess, etc.), turns out to be a sur­pris­ingly and par­tic­u­larly sharp indi­ca­tor of the changes induced in our every­day life by the infor­ma­tion tech­nolo­gies. More­over, in the same way that chess is a mod­eliza­tion of war that cap­tures the raw fea­tures of strate­gic think­ing, chess world can be seen as small soci­ety mak­ing the study of the infor­ma­tion tech­nolo­gies impact eas­ier to ana­lyze and to grasp.”

    touch­stones his­tory algo­rithms history-​​of-​​science computer-​​science
  • Share Books | berfrois

    “Libraries are a recog­ni­tion that schol­ar­ship and cul­ture are more than the busi­ness of cre­at­ing and con­sum­ing. They are a human con­ver­sa­tion, and libraries pro­vide com­mon ground where that con­ver­sa­tion can take place and be remem­bered. By tak­ing aim at the right for the pub­lic to main­tain this con­ver­sa­tion and its mem­ory, pub­lish­ers have shown us what we have to lose. It’s time we resisted the out­sourc­ing of our com­mon her­itage by occu­py­ing the library.”

    Occupy libraries intellectual-​​property open-​​access public-​​policy activism
  • [1112.3307] Poly­tope Codes Against Adver­saries in Networks

    “Net­work cod­ing is stud­ied when an adver­sary con­trols a sub­set of nodes in the net­work of lim­ited quan­tity but unknown loca­tion. This prob­lem is shown to be more dif­fi­cult than when the adver­sary con­trols a given num­ber of edges in the net­work, in that lin­ear codes are insuf­fi­cient. To solve the node prob­lem, the class of Poly­tope Codes is intro­duced. Poly­tope Codes are con­stant com­po­si­tion codes oper­at­ing over bounded poly­topes in inte­ger vec­tor fields. The poly­tope struc­ture cre­ates addi­tional com­plex­ity, but it induces prop­er­ties on mar­ginal dis­tri­b­u­tions of code vec­tors so that validi­ties of code­words can be checked by inter­nal nodes of the net­work. It is shown that Poly­tope Codes achieve a cut-​​set bound for a class of pla­nar net­works. It is also shown that this cut-​​set bound is not always tight, and a tighter bound is given for an exam­ple network.”

    cryp­tog­ra­phy pri­vacy algo­rithms nudge-​​targets network-​​theory com­mu­ni­ca­tion
  • [1203.3353] Solv­ing Struc­ture with Sparse, Randomly-​​Oriented X-​​ray Data

    “Single-​​particle imag­ing exper­i­ments of bio­mol­e­cules at x-​​ray free-​​electron lasers (XFELs) require pro­cess­ing of hun­dreds of thou­sands (or more) of images that con­tain very few x-​​rays. Each low-​​flux image of the dif­frac­tion pat­tern is pro­duced by a sin­gle, ran­domly ori­ented par­ti­cle, such as a pro­tein. We demon­strate the fea­si­bil­ity of col­lect­ing data at these extremes, aver­ag­ing only 2.5 pho­tons per frame, where it seems doubt­ful there could be infor­ma­tion about the state of rota­tion, let alone the image con­trast. This is accom­plished with an expec­ta­tion max­i­miza­tion algo­rithm that processes the low-​​flux data in aggre­gate, and with­out any prior knowl­edge of the object or its ori­en­ta­tion. The ver­sa­til­ity of the method promises, more gen­er­ally, to rede­fine what mea­sure­ment sce­nar­ios can pro­vide use­ful sig­nal in the high-​​noise regime.”

    structural-​​biology image-​​analysis crys­tal­log­ra­phy algo­rithms inverse-​​problems nudge-​​targets sta­tis­tics
  • [1203.3203] An effi­cient algo­rithm for gen­er­at­ing AoA networks

    “The activ­i­ties, in project sched­ul­ing, can be rep­re­sented graph­i­cally in two dif­fer­ent ways, by either assign­ing the activ­i­ties to the nodes ‘AoN’ directed acyclic graph (dag) or to the arcs ‘AoA dag’. In this paper, a new algo­rithm is pro­posed for gen­er­at­ing, for a given project sched­ul­ing prob­lem, an Activity-​​on-​​Arc dag start­ing from the Activity-​​on-​​Node dag using the con­cepts of line graphs of graphs.”

    sched­ul­ing operations-​​research algo­rithms graph-​​theory
  • [1203.3341] A Com­par­i­son of Multi-​​Parametric Pro­gram­ming, Mixed-​​Integer Pro­gram­ming, Gra­di­ent Descent Based, and the Embed­ding Approach on Four Pub­lished Hybrid Opti­mal Con­trol Examples

    “…Com­mon mis­con­cep­tions regard­ing the embed­ding approach are addressed includ­ing whether or not it results in an aver­age value con­trol model (no), is nec­es­sary to “tweak” the algo­rithm to get bang-​​bang solu­tions (no), requires infi­nite switch­ing (no), has real-​​time capa­bil­ity (yes), or reduc­tion to a clas­si­cal non­lin­ear opti­miza­tion prob­lem (a desir­able yes).”

    control-​​theory operations-​​research algo­rithms numerical-​​methods philosophy-​​of-​​engineering design-​​patterns nudge-​​targets
  • [1203.3270] Extrac­tion of Facial Fea­ture Points Using Cumu­la­tive Histogram

    “This paper pro­poses a novel adap­tive algo­rithm to extract facial fea­ture points auto­mat­i­cally such as eye­brows cor­ners, eyes cor­ners, nos­trils, nose tip, and mouth cor­ners in frontal view faces, which is based on cumu­la­tive his­togram approach by vary­ing dif­fer­ent thresh­old val­ues. At first, the method adopts the Viola-​​Jones face detec­tor to detect the loca­tion of face and also crops the face region in an image. From the con­cept of the human face struc­ture, the six rel­e­vant regions such as right eye­brow, left eye­brow, right eye, left eye, nose, and mouth areas are cropped in a face image. Then the his­togram of each cropped rel­e­vant region is com­puted and its cumu­la­tive his­togram value is employed by vary­ing dif­fer­ent thresh­old val­ues to cre­ate a new fil­ter­ing image in an adap­tive way. The con­nected com­po­nent of inter­ested area for each rel­e­vant fil­ter­ing image is indi­cated our respec­tive fea­ture region. A sim­ple lin­ear search algo­rithm for eye­brows, eyes and mouth fil­ter­ing images and con­tour algo­rithm for nose fil­ter­ing image are applied to extract our desired cor­ner points auto­mat­i­cally. The method was tested on a large BioID frontal face data­base in dif­fer­ent illu­mi­na­tions, expres­sions and light­ing con­di­tions and the exper­i­men­tal results have achieved aver­age suc­cess rates of 95.27%.”

    image-​​segmentation image-​​analysis face-​​recognition algo­rithms nudge-​​targets
  • [1203.3284] Effi­cient Enu­mer­a­tion of the Directed Binary Per­fect Phy­lo­ge­nies from Incom­plete Data

    “We study a character-​​based phy­logeny recon­struc­tion prob­lem when an incom­plete set of data is given. More specif­i­cally, we con­sider the sit­u­a­tion under the directed per­fect phy­logeny assump­tion with binary char­ac­ters in which for some species the states of some char­ac­ters are miss­ing. Our main object is to give an effi­cient algo­rithm to enu­mer­ate (or list) all per­fect phy­lo­ge­nies that can be obtained when the miss­ing entries are com­pleted. While a sim­ple branch-​​and-​​bound algo­rithm (B&B) shows a the­o­ret­i­cally good per­for­mance, we pro­pose another approach based on a zero-​​suppressed binary deci­sion dia­gram (ZDD). Exper­i­men­tal results on ran­domly gen­er­ated data exhibit that the ZDD approach out­per­forms B&B. We also prove that count­ing the num­ber of phy­lo­ge­netic trees con­sis­tent with a given data is #P-​​complete, thus pro­vid­ing an evi­dence that an effi­cient ran­dom sam­pling seems hard.”

    phy­lo­ge­net­ics inverse-​​problems genet­ics algo­rithms sta­tis­tics nudge-​​targets
  • [1203.0879] Design­ing and using prior knowl­edge for phase retrieval

    “In this work we develop an algo­rithm for sig­nal recon­struc­tion from the mag­ni­tude of its Fourier trans­form in a sit­u­a­tion where some (non-​​zero) parts of the sought sig­nal are known. Although our method does not assume that the known part com­prises the bound­ary of the sought sig­nal, this is often the case in microscopy: a spec­i­men is placed inside a known mask, which can be thought of as a known light source that sur­rounds the unknown sig­nal. There­fore, in the past, sev­eral algo­rithms were sug­gested that solve the phase retrieval prob­lem assum­ing known bound­ary val­ues. Unlike our method, these meth­ods do rely on the fact that the known part is on the bound­ary. Besides the recon­struc­tion method we give an expla­na­tion of the phe­nom­ena observed in pre­vi­ous work: the recon­struc­tion is much faster when there is more energy con­cen­trated in the known part. Quite sur­pris­ingly, this can be explained using our pre­vi­ous results on phase retrieval with approx­i­mately known Fourier phase.”

    image-​​analysis image-​​processing learn­ing inverse-​​problems algo­rithms nudge-​​targets
  • [1203.3415] A New Approach to Count Pat­tern Motifs Using Com­bi­na­to­r­ial Techniches

    “We pro­posed two new exact algo­rithms to detect net­work motifs of size 3 and 4. Con­sid­er­ing that motifs need to count the iso­mor­phic pat­terns in the orig­i­nal graph $G(V,E)$ and in a set of ran­dom­ized graphs, the fol­low­ing com­plex­i­ties con­cern about count iso­mor­phic pat­terns in a sin­gle graph. Let $m=|E|$ and let $a(G)$ be the arboric­ity of $G$. Assume $|E|geq|V|$. We describe a $O(a(G)m)$ time com­plex­ity algo­rithm to count iso­mor­phic pat­terns of size 3. The com­plex­ity is a $O({msqrt{m}})$ in the worst graph. The sec­ond algo­rithm is a $O(m^2)$ com­plex­ity algo­rithm to count iso­mor­phic pat­terns of size 4. The final result was expres­sive faster when com­pared with other imple­mented algorithms.”

    network-​​theory graph-​​theory algo­rithms nudge-​​targets

Items of some interest:

These are my recent Pin​board​.in links:

  • [1201.5440] Self-​​assembly of anisotropic soft par­ti­cles in two dimensions

    “The self assem­bly of core-​​corona discs inter­act­ing via anisotropic poten­tials is inves­ti­gated using Monte Carlo com­puter sim­u­la­tions. A min­i­mal inter­ac­tion poten­tial that incor­po­rates anisotropy in a sim­ple way is intro­duced. It con­sists in a core-​​corona archi­tec­ture in which the cen­ter of the core is shifted with respect to the cen­ter of the corona. Anisotropy can thus be tuned by pro­gres­sively shift­ing the posi­tion of the core. Despite its sim­plic­ity, the sys­tem self orga­nize in a rich vari­ety of struc­tures includ­ing stripes, tri­an­gu­lar and rec­tan­gu­lar lat­tices, and unusual plas­tic crys­tals. Our results indi­cate that the amount of anisotropy does not alter the lat­tice spac­ing and only influ­ences the type of clus­ter­ing (stripes, micells, etc.) of the indi­vid­ual particles.”

    self-​​assembly biologically-​​inspired sim­u­la­tion pattern-​​formation condensed-​​matter
  • [1201.5477] Entropy-​​growth-​​based model of emo­tion­ally charged online dialogues

    “We ana­lyze emo­tion­ally anno­tated mas­sive data from IRC (Inter­net Relay Chat) and model the dia­logues between its par­tic­i­pants by assum­ing that the dri­ving force for the dis­cus­sion is the entropy growth of emo­tional prob­a­bil­ity dis­tri­b­u­tion. This process is claimed to be cor­re­lated to the emer­gence of the power-​​law dis­tri­b­u­tion of the dis­cus­sion lengths observed in the dia­logues. We per­form numer­i­cal sim­u­la­tions based on the noticed phe­nom­e­non obtain­ing a good agree­ment with the real data. Finally, we pro­pose a method to arti­fi­cially pro­long the dura­tion of the dis­cus­sion that relies on the entropy of emo­tional prob­a­bil­ity distribution.”

    oh-​​look-​​power-​​laws flame-​​wars social-​​dynamics com­plex­ol­ogy cultural-​​dynamics
  • [1201.4955] Coor­di­na­tion, Dif­fer­en­ti­a­tion and Fair­ness in a pop­u­la­tion of coop­er­at­ing agents

    “In a recent paper, we ana­lyzed the self-​​assembly of a com­plex coop­er­a­tion net­work. The net­work was shown to approach a state, where every agent invests the same amount of resources. Nev­er­the­less, highly-​​connected agents arise that extract extra-​​ordinarily high pay­offs while con­tribut­ing com­pa­ra­bly lit­tle to any of their coop­er­a­tions. Here, we inves­ti­gate a vari­ant of the model, in which highly-​​connected agents have access to addi­tional resources. We study ana­lyt­i­cally and numer­i­cally whether these resources are invested in exist­ing col­lab­o­ra­tions, lead­ing to a fairer load dis­tri­b­u­tion, or in estab­lish­ing new col­lab­o­ra­tions, lead­ing to an even less fair dis­tri­b­u­tion of loads and payoffs.”

    col­lab­o­ra­tion social-​​capital agent-​​based network-​​theory com­plex­ol­ogy nudge-​​targets
  • [1201.5426] Con­straint Prop­a­ga­tion as Infor­ma­tion Maximization

    “Dana Scott used the par­tial order among par­tial func­tions for his math­e­mat­i­cal model of recur­sively defined func­tions. He inter­preted the par­tial order as one of infor­ma­tion con­tent. In this paper we elab­o­rate on Scott’s sug­ges­tion of regard­ing com­pu­ta­tion as a process of infor­ma­tion max­i­miza­tion by apply­ing it to the solu­tion of con­straint sat­is­fac­tion prob­lems. Here the method of con­straint prop­a­ga­tion can be inter­preted as decreas­ing uncer­tainty about the solu­tion — that is, as gain in infor­ma­tion about the solu­tion. As illus­tra­tive exam­ple we choose numer­i­cal con­straint sat­is­fac­tion prob­lems to be solved by inter­val con­straints. To facil­i­tate this approach to con­straint solv­ing we for­mu­late con­straint sat­is­fac­tion prob­lems as for­mu­las in pred­i­cate logic. This neces­si­tates extend­ing the usual seman­tics for pred­i­cate logic so that mean­ing is assigned not only to sen­tences but also to for­mu­las with free variables.”

    computer-​​science quite-​​interesting constraint-​​processing computational-​​methods
  • [1201.4459] An effi­cient par­al­lel algo­rithm for the longest path prob­lem in meshes

    “In this paper, first we give a sequen­tial linear-​​time algo­rithm for the longest path prob­lem in meshes. This algo­rithm can be con­sid­ered as an improve­ment of [13]. Then based on this sequen­tial algo­rithm, we present a constant-​​time par­al­lel algo­rithm for the prob­lem which can be run on every par­al­lel machine.”

    algo­rithms graph-​​theory computational-​​complexity nudge-​​targets
  • [1201.4417] Insta­bil­i­ties and Pat­terns in Cou­pled Reaction-​​Diffusion Layers

    “We study insta­bil­i­ties and pat­tern for­ma­tion in reaction-​​diffusion lay­ers that are dif­fu­sively cou­pled. For two-​​layer sys­tems of iden­ti­cal two-​​component reac­tions, we ana­lyze the sta­bil­ity of homo­ge­neous steady states by exploit­ing the block sym­met­ric struc­ture of the lin­ear prob­lem. There are eight pos­si­ble pri­mary bifur­ca­tion sce­nar­ios, includ­ing a Turing-​​Turing bifur­ca­tion that involves two dis­parate length scales whose ratio may be tuned via the inter-​​layer cou­pling. For sys­tems of $n$-component lay­ers and non-​​identical lay­ers, the lin­ear problem’s block form allows approx­i­mate decom­po­si­tion into lower-​​dimensional lin­ear prob­lems if the cou­pling is suf­fi­ciently weak. As an exam­ple, we apply these results to a two-​​layer Brus­se­la­tor sys­tem. The com­pet­ing length scales engi­neered within the lin­ear prob­lem are read­ily appar­ent in numer­i­cal sim­u­la­tions of the full sys­tem. Select­ing a $sqrt{2}$:1 length scale ratio pro­duces an unusual steady square pattern.”

    cute emergent-​​design pattern-​​formation com­plex­ol­ogy nudge-​​targets nonlinear-​​dynamics
  • [1201.4737] Pro­duc­tion Sys­tem Rules as Pro­tein Com­plexes from Genetic Reg­u­la­tory Networks

    “This short paper intro­duces a new way by which to design pro­duc­tion sys­tem rules. An indi­rect encod­ing scheme is pre­sented which views such rules as pro­tein com­plexes pro­duced by the tem­po­ral behav­iour of an arti­fi­cial genetic reg­u­la­tory net­work. This ini­tial study begins by using a sim­ple Boolean reg­u­la­tory net­work to pro­duce tra­di­tional ternary-​​encoded rules before mov­ing to a fuzzy vari­ant to pro­duce real-​​valued rules. Com­pet­i­tive per­for­mance is shown with related genetic reg­u­la­tory net­works and rule-​​based sys­tems on bench­mark problems.”

    evolutionary-​​algorithms production-​​systems computer-​​science emergent-​​design

Items of some interest…

These are my recent Pin​board​.in links:

  • [1107.0674] “Mem­ory foam” approach to unsu­per­vised learning

    “We pro­pose an alter­na­tive approach to con­struct an arti­fi­cial learn­ing sys­tem, which nat­u­rally learns in an unsu­per­vised man­ner. Its math­e­mat­i­cal pro­to­type is a dynam­i­cal sys­tem, which auto­mat­i­cally shapes its vec­tor field in response to the input sig­nal. The vec­tor field con­verges to a gra­di­ent of a multi-​​dimensional prob­a­bil­ity den­sity dis­tri­b­u­tion of the input process, taken with neg­a­tive sign. The most prob­a­ble pat­terns are rep­re­sented by the sta­ble fixed points, whose basins of attrac­tion are formed auto­mat­i­cally. The per­for­mance of this sys­tem is illus­trated with musi­cal signals.”

    machine-​​learning clas­si­fi­ca­tion learning-​​from-​​data algo­rithms nudge-​​targets
  • [1107.0550] 3D Ter­res­trial LiDAR data clas­si­fi­ca­tion of com­plex nat­ural scenes using a multi-​​scale dimen­sion­al­ity cri­te­rion: appli­ca­tions in geomorphology

    3D point clouds of nat­ural envi­ron­ments rel­e­vant to geo­mor­phol­ogy prob­lems (rivers, cliffs…) often require to clas­sify the data into ele­men­tary rel­e­vant classes. A typ­i­cal exam­ple is the sep­a­ra­tion of ripar­ian veg­e­ta­tion from soil in flu­vial envi­ron­ments, the dis­tinc­tion between fresh sur­faces and rock­fall in cliff envi­ron­ments, or more gen­er­ally the clas­si­fi­ca­tion of sur­faces accord­ing to their mor­phol­ogy (rip­ples, grain size…). Nat­ural sur­faces are very het­ero­ge­neous and their dis­tinc­tive prop­er­ties are sel­dom defined at a unique scale. We have thus defined a multi-​​scale mea­sure of the point cloud dimen­sion­al­ity around each point. The dimen­sion­al­ity char­ac­ter­izes the local 3D orga­ni­za­tion of the point cloud and varies from being 1D (points set along a line) to really tak­ing all 3D vol­ume, at each scale. We present the tech­nique and illus­trate its effi­ciency in sep­a­rat­ing ripar­ian veg­e­ta­tion from ground and clas­si­fy­ing a moun­tain stream in veg­e­ta­tion, rock, gravel and water sur­face. The supe­ri­or­ity of the multi-​​scale analy­sis in enhanc­ing class sep­a­ra­bil­ity and spa­tial res­o­lu­tion of the clas­si­fi­ca­tion is also demon­strated. Large scenes can be clas­si­fied on a com­mod­ity lap­top in a rea­son­able time. The tech­nique is robust to miss­ing data and espe­cially shadow zones. The clas­si­fi­ca­tion is fast and accu­rate and can account for some degree of intra-​​class mor­pho­log­i­cal vari­abil­ity such as dif­fer­ent veg­e­ta­tion types. A prob­a­bilis­tic con­fi­dence in the clas­si­fi­ca­tion result is given at each point allow­ing the user to remove the points for which the clas­si­fi­ca­tion is uncer­tain. The process can be both fully auto­mated but also fully cus­tomized by the user includ­ing a graph­i­cal def­i­n­i­tion of the clas­si­fiers if so desired. Although devel­oped for fully 3D data, the method can be read­ily applied to 2.5D air­borne LiDAR data.”

    image-​​analysis image-​​segmentation learning-​​from-​​data clas­si­fi­ca­tion nudge-​​targets
  • [1105.6001] A Call to Arms: Revis­it­ing Data­base Design

    “Good data­base design is cru­cial to obtain a sound, con­sis­tent data­base, and — in turn — good data­base design method­olo­gies are the best way to achieve the right design. These method­olo­gies are taught to most Com­puter Sci­ence under­grad­u­ates, as part of any Intro­duc­tion to Data­base class. They can be con­sid­ered part of the “canon”, and indeed, the over­all approach to data­base design has been unchanged for years. More­over, none of the major data­base research assess­ments iden­tify data­base design as a strate­gic research direc­tion. Should we con­clude that data­base design is a solved prob­lem? Our the­sis is that data­base design remains a crit­i­cal unsolved prob­lem. Hence, it should be the sub­ject of more research. Our start­ing point is the obser­va­tion that tra­di­tional data­base design is not used in prac­tice — and if it were used it would result in designs that are not well adapted to cur­rent envi­ron­ments. In short, data­base design has failed to keep up with the times. In this paper, we put forth argu­ments to sup­port our view­point, ana­lyze the root causes of this sit­u­a­tion and sug­gest some avenues of research.”

    data­base ontol­ogy software-​​development computer-​​science design-​​patterns