links for 2010-​​05-​​26

links for 2010-​​05-​​25

links for 2010-​​05-​​24

  • “The Pre­dic­tion API allows you to get more from your data and makes its pat­terns more acces­si­ble. Specif­i­cally, the Pre­dic­tion API lever­ages Google’s machine learn­ing infra­struc­ture to give you the tools to bet­ter ana­lyze your data and reveal pat­terns that are often dif­fi­cult to man­u­ally dis­cover. The API also enables you to use those pat­terns to pre­dict new out­comes, which facil­i­tates the devel­op­ment of all types of soft­ware, from tex­tual analy­sis sys­tems to rec­om­men­da­tion sys­tems. Because the Pre­dic­tion API is a REST­ful HTTP ser­vice, you can eas­ily access it from Google App Engine, Apps Script, and other Internet-​​connected desk­top applications.”
  • “Het­ero­gene­ity in the degree dis­tri­b­u­tion is known to sup­press global syn­chro­niza­tion in com­plex net­works of sym­met­ri­cally cou­pled oscil­la­tors. Scale-​​free net­works present strong het­ero­gene­ity, there are a few highly con­nected nodes, termed hubs, while the major­ity of nodes has only a few con­nec­tions. We show that a sta­ble par­tially syn­chro­nized state may take place in scale-​​free net­works: hubs undergo a tran­si­tion to syn­chro­niza­tion while the remain­ing nodes are unsyn­chro­nized. This phe­nom­e­non may occur in any large het­ero­ge­neous net­work, regard­less of the net­work global syn­chro­niza­tion prop­er­ties. We pro­vide the­ory and numer­i­cal evi­dence to estab­lish this phenomenon.”
  • “… In this paper we present BRACE (Big Red Agent-​​based Com­pu­ta­tion Engine), which extends the MapRe­duce frame­work to process these sim­u­la­tions effi­ciently across a clus­ter. We can lever­age spa­tial local­ity to treat behav­ioral sim­u­la­tions as iter­ated spa­tial joins and greatly reduce the com­mu­ni­ca­tion between nodes. In our exper­i­ments we achieve nearly lin­ear scale-​​up on sev­eral real­is­tic simulations.…”
  • “A wide fam­ily of non­lin­ear sequence gen­er­a­tors, the so-​​called clock-​​controlled shrink­ing gen­er­a­tors, has been ana­lyzed and iden­ti­fied with a sub­set of lin­ear cel­lu­lar automata. The algo­rithm that con­verts the given gen­er­a­tor into a lin­ear model based on automata is very sim­ple and can be applied in a range of prac­ti­cal inter­est. Due to the lin­ear­ity of these automata as well as the char­ac­ter­is­tics of this class of gen­er­a­tors, a crypt­an­a­lytic approach can be pro­posed. Lin­ear cel­lu­lar struc­tures eas­ily model keystream gen­er­a­tors with appli­ca­tion in stream cipher cryptography.”
  • “Con­nec­tions in com­plex net­works are inher­ently fluc­tu­at­ing over time and exhibit more dimen­sion­al­ity than analy­sis based on stan­dard sta­tic graph mea­sures can cap­ture. Here, we intro­duce the con­cepts of tem­po­ral paths and dis­tance in time-​​varying graphs. We define as tem­po­ral small world a time-​​varying graph in which the links are highly clus­tered in time, yet the nodes are at small aver­age tem­po­ral dis­tances. We explore the small-​​world behav­ior in syn­thetic time-​​varying net­works of mobile agents, and in real social and bio­log­i­cal time-​​varying systems.”
  • “The inverse Ising prob­lem is a dif­fi­cult com­bi­na­to­r­ial opti­miza­tion prob­lem in the class known as “NP-​​hard”. In the­ory, only approx­i­mate schemes, or meth­ods that take more than poly­no­mial time to find the answer are pos­si­ble. Boltz­mann Learn­ing [1] is an iter­a­tive method where in one step the cor­re­la­tion func­tions are com­puted given an Ising model, and in another step the Ising model cou­plings are mod­i­fied to adjust to data. In prin­ci­ple, Boltz­mann learn­ing can be employed to find the cou­plings with arbi– trary accu­racy given accu­rate data and suf­fi­cient time, but the slow con­ver­gence of the Boltz­mann learn­ing makes it a very inef­fi­cient algo­rithm for most prac­ti­cal purposes.”
  • “Approx­i­mate dynamic pro­gram­ming has been used suc­cess­fully in a large vari­ety of domains, but it relies on a small set of pro­vided approx­i­ma­tion fea­tures to cal­cu­late solu­tions reli­ably. Large and rich sets of fea­tures can cause exist­ing algo­rithms to over­fit because of a lim­ited num­ber of sam­ples. We address this short­com­ing using $L_​1$ reg­u­lar­iza­tion in approx­i­mate lin­ear pro­gram­ming. Because the pro­posed method can auto­mat­i­cally select the appro­pri­ate rich­ness of fea­tures, its per­for­mance does not degrade with an increas­ing num­ber of fea­tures. These results rely on new and stronger sam­pling bounds for reg­u­lar­ized approx­i­mate lin­ear pro­grams. We also pro­pose a com­pu­ta­tion­ally effi­cient homo­topy method. The empir­i­cal eval­u­a­tion of the approach shows that the pro­posed method per­forms well on sim­ple MDPs and stan­dard bench­mark problems.”
  • “There seems to be a brief recog­ni­tion period for Bdellovib­rio to iden­tify its prey after a col­li­sion with another cell (Shilo, 1969). Ini­tially, the attach­ment to a cell sur­face is reversible. Bdellovib­rio is still able to swim away a few sec­onds after rec­og­niz­ing that the cell is not a right tar­get (gram– pos­i­tive bac­te­ria). When a gram-​​negative bac­terium is encoun­tered, Bdellovib­rio cell becomes com­mit­ted to inva­sion. The whole process usu­ally takes around 5 – 10 min­utes. Bdellovib­rio drops its fla­gel­lum. It has been hypoth­e­sized that Bdellovib­rio may adhere to the cell sur­face using pilus-​​like fibre struc­ture expressed on its pen­e­tra­tion pole.”
  • “Small world net­works inter­po­late between fully reg­u­lar and fully ran­dom topolo­gies and simul­ta­ne­ously exhibit large local clus­ter­ing as well as short aver­age path length. Small world topol­ogy has there­fore been sug­gested to sup­port net­work syn­chro­niza­tion. Here we study the asymp­totic speed of syn­chro­niza­tion of cou­pled oscil­la­tors in depen­dence on the degree of ran­dom­ness of their inter­ac­tion topol­ogy in gen­er­al­ized Watts-​​Strogatz ensem­bles. We find that net­works with fixed in-​​degree syn­chro­nize faster the more ran­dom they are, with small worlds just appear­ing as an inter­me­di­ate case. For any generic net­work ensem­ble, if syn­chro­niza­tion speed is at all extremal at inter­me­di­ate ran­dom­ness, it is slow­est in the small world regime. This phe­nom­e­non occurs for var­i­ous types of oscil­la­tors, intrin­sic dynam­ics and cou­pling schemes.”
  • “Empir­i­cal stud­ies show that real world net­works often exhibit mul­ti­ple scales of topo­log­i­cal descrip­tions. How­ever, it is still an open prob­lem how to iden­tify the intrin­sic mul­ti­ple scales of net­works. In this arti­cle, we con­sider detect­ing the multi-​​scale com­mu­nity struc­ture of net­works from the per­spec­tive of dimen­sion reduc­tion. …The­o­ret­i­cal analy­sis indi­cates that all these three trans­for­ma­tions are cru­cial to iden­tify the multi-​​scale com­mu­nity struc­ture of net­works. Exten­sive tests on real world and arti­fi­cial net­works demon­strate that the cor­re­la­tion matrix sig­nif­i­cantly out­per­forms the mod­u­lar­ity matrix as regards iden­ti­fy­ing the multi-​​scale com­mu­nity struc­ture of networks.”
  • “Now, to return to the news arti­cle. If the eigen­value dis­tri­b­u­tion is an attrac­tor, this means that a lot of phys­i­cal and social phe­nom­ena which can be mod­eled by eigen­val­ues (includ­ing, appar­ently, quan­tum energy lev­els and some prop­er­ties of sta­tis­ti­cal tests) might have a com­mon struc­ture. Just as, at a sim­i­lar level, we see the nor­mal dis­tri­b­u­tion and related func­tions in all sorts of unusual places.”
  • “…Through sev­eral nu– mer­i­cal exper­i­ments, we demon­strate that both matrix– and tensor-​​based tech­niques are effec­tive for tem­po­ral link pre­dic­tion despite the inher­ent dif­fi­culty of the prob­lem. Addi­tion­ally, we show that tensor-​​based tech­niques are par­tic­u­larly effec­tive for tem­po­ral data with vary­ing peri­odic patterns.”
  • “I envi­sion a future in which pro­gram­mers are the con­scious repos­i­to­ries of a body of knowl­edge. A future in which they regain their craft, instead of tweak­ing frame­works they don’t under­stand. A future, even­tu­ally, in which pro­gram­mers say “no” to demands at odds with their ethics.

    It is cru­cial to cre­ate ways, spaces and for­mats for pro­gram­mers to share their knowl­edge with other pro­gram­mers. It is vital we keep this knowl­edge (espe­cially ver­bal­ized knowl­edge) among pro­gram­mers and out of salespeople’s hands. And it is urgent the IT crowd rec­og­nize soft­ware mak­ing as a craft, instead of a commodity.”

  • “We con­sider sched­ul­ing weighted pack­ets in a capacity-​​bounded buffer. In this model, there is a buffer with a lim­ited capac­ity B such that at any time, the buffer can­not accom­mo­date more than B pack­ets. Pack­ets arrive over time. Each packet has a non-​​negative real value. Pack­ets do not expire and they leave the buffer only because either we send them or we drop them. The pack­ets that have left the buffer will not be recon­sid­ered for deliv­ery any more. In each time step, at most one packet in the buffer can be sent. The order in which the pack­ets are sent should com­ply with the order of their arriv­ing time. The objec­tive is to max­i­mize the total value of the pack­ets sent in an online manner.…”
  • “Each com­bi­na­tion of val­ues was then fed into a sim­u­la­tor to give an idea of the grid’s per­for­mance and its expected life­time. If the per­for­mance was promis­ing, the “genetic mate­r­ial” was sub­jected to fur­ther ran­dom changes, or muta­tion, and this process was repeated until no more improve­ments were forthcoming.

    After 100 gen­er­a­tions, the GA spawned a geometry/​voltage set that boosted the ion engine grid’s life­time to 5.1 years — at least in the sim­u­la­tor (Jour­nal of Propul­sion and Power, DOI: 10.2514÷1.44358). Fac­tors opti­mised included grid hole diam­e­ter, hole spac­ing and the thick­ness of the grids. The engine could be improved fur­ther, says Far­nell, by evolv­ing the other parts too.…”

  • “Results on a com­modi­ties basket”
  • “In this paper we con­sider spa­tial net­works that real­ize a bal­ance between an infra­struc­ture cost (the cost of wire needed to con­nect the net­work in space) and com­mu­ni­ca­tion effi­ciency, mea­sured by aver­age short­est path­length. A global opti­miza­tion pro­ce­dure yields net­work topolo­gies in which this bal­ance is opti­mized. These are com­pared with net­work topolo­gies gen­er­ated by a com­pet­i­tive process in which each node strives to opti­mize its own cost-​​communication bal­ance. Three phases are observed in glob­ally opti­mal con­fig­u­ra­tions for dif­fer­ent cost-​​communication trade-​​offs: (i) reg­u­lar small worlds, (ii) star-​​like net­works and (iii) trees with a cen­tre of inter­con­nected hubs. In the lat­ter regime, i.e. for very expen­sive wire, power laws in the link length dis­tri­b­u­tions $P(w)\propto w^{-\alpha}$ are found, which can be explained by a hier­ar­chi­cal orga­ni­za­tion of the networks…”
  • “We present a near-​​linear time algo­rithm that approx­i­mates the edit dis­tance between two strings within a poly­log­a­rith­mic fac­tor; specif­i­cally, for strings of length n and every fixed epsilon>0, it can com­pute a (log n)^O(1/epsilon) approx­i­ma­tion in n^(1+epsilon) time. This is an expo­nen­tial improve­ment over the pre­vi­ously known fac­tor, 2^(O (sqrt(log n))), with a com­pa­ra­ble run­ning time (Ostro­vsky and Rabani J.ACM 2007; Andoni and Onak STOC 2009). Pre­vi­ously, no effi­cient poly­log­a­rith­mic approx­i­ma­tion algo­rithm was known for any com­pu­ta­tional task involv­ing edit dis­tance (e.g., near­est neigh­bor search or sketching). ”
  • “In many dynam­i­cal sys­tems there is a large sep­a­ra­tion of time scales between typ­i­cal events and “rare” events which can be the cases of inter­est. Rare-​​event rates are quite dif­fi­cult to com­pute numer­i­cally, but they are of con­sid­er­able prac­ti­cal impor­tance in many fields: for exam­ple tran­si­tion times in chem­i­cal physics and extinc­tion times in epi­demi­ol­ogy can be very long, but are quite impor­tant. We present a very fast numer­i­cal tech­nique that can be used to find long tran­si­tion times (very small rates) in low-​​dimensional sys­tems, even if they lack detailed bal­ance. We illus­trate the method for a bistable non-​​equilibrium sys­tem intro­duced by Maier and Stein and a two-​​dimensional (in para­me­ter space) epi­demi­ol­ogy model.”
  • “Image Effects With Cel­lu­lar Automata (PDF) Abstract:This paper presents some tech­niques for cre­at­ing var­i­ous artis­tic effects on dig­i­tal pho­tog­ra­phy using the con­cept of cel­lu­lar automata. All exam­ples in this paper are cre­ated by “Image Infec­tor” pro­gram, which is a plu­gin for Pixo­pe­dia 24 image edi­tor and painter (www​.sigmapi​-design​.com).”
  • “I’ll start by high­light­ing that while all the soft­ware in this post is indeed free (true to FOSS), an account with Inter­ac­tive Bro­kers is needed to make use of it. For those not famil­iar with IB, they offer a trad­ing plat­form that excels on numer­ous fronts but is most appeal­ing to those of us who trade algo­rith­mi­cally. IB makes avail­able a rather com­pre­hen­sive API that makes data access and trade exe­cu­tion entirely pos­si­ble pro­gram­mat­i­cally via a hand­ful of “sup­ported” lan­guages. These include Java (the lan­guage of the plat­form), C#, VBA and even Excel. The also have a POSIX com­pli­ant C++ ver­sion for those who enjoy C++ but dis­like Windows.

    For those who dis­like Win­dows and C++, the com­mu­nity of IB users have a few “non-​​official” options. They include some nice imple­men­ta­tions in C, Python (2), Mat­lab, and some­thing even more abstracted in the trading-​​shim. While all well and good, there was one miss­ing: R.…”

  • “This paper aims at gen­er­at­ing a new face based on the human like descrip­tion using a new con­cept. The FASY (FAce SYn­the­sis) Sys­tem is a Face Data­base Retrieval and new Face gen­er­a­tion Sys­tem that is under devel­op­ment. One of its main fea­tures is the gen­er­a­tion of the requested face when it is not found in the exist­ing data­base, which allows a con­tin­u­ous grow­ing of the data­base also.”
  • “The per­for­mance assess­ment of algo­rithms for mul­ti­ob­jec­tive opti­miza­tion prob­lems is far from being a triv­ial issue. Recent results indi­cate that unary per­for­mance mea­sures, i.e. per­for­mance mea­sures which assign a sin­gle value to each non-​​dominated point set, are inher­ently lim­ited in their infer­en­tial power. Despite these lim­i­ta­tions, the hyper­vol­ume indi­ca­tor (also known as Lebesgue mea­sure or S met­ric) is still con­sid­ered to pos­sess some rea­son­able prop­er­ties, hav­ing also been pro­posed as a guid­ance cri­te­rion for accept­ing solu­tions in Mul­ti­ob­jec­tive Evo­lu­tion­ary Algo­rithms. There­fore, the com­pu­ta­tional time taken for com­put­ing the hyper­vol­ume indi­ca­tor is a cru­cial fac­tor for the per­for­mance of such algorithms.…”

links for 2010-​​05-​​19

links for 2010-​​05-​​18