-
"The goals of the journal are to promote sharing of high-quality scientific software (e.g. there must be a test suite with 100% code coverage), promote discussion of best practice in research software development, and to enable researchers to be rewarded through publication for the time spent on developing software tools for others to use."
-
"In a landmark decision issued today in the criminal appeal of U.S. v. Warshak, the Sixth Circuit Court of Appeals has ruled that the government must have a search warrant before it can secretly seize and search emails stored by email service providers. Closely tracking arguments made by EFF in its amicus brief, the court found that email users have the same reasonable expectation of privacy in their stored email as they do in their phone calls and postal mail."
-
"But the privacy divide is deeper than that. Are the poor begging for extra helpings of privacy? No, not much, not even where they could use it. Pollsters in England have found that when they ask people in hard-up communities what things would improve their lives, the top three items almost always include more closed-circuit TV cameras on the streets. The people who get worked up about universal ID cards, DNA databases and CCTV monitoring are almost always the wealthy elite.
It’s the poor who are forced to live with crime, violence, harassment from unstable and marginalized people — exactly the sort of stuff that these supposedly privacy-invading conveniences are designed to prevent. When your life is hard, privacy equals isolation equals death. If you consider it a right, it’s a pretty good sign that you’ve got too much money and too little to worry about."
Monthly Archives: December 2010
links for 2010-12-13
-
"The rhetoric of social mobility helps to legitimize class hierarchies, by maintaining the pretence that management is a technical skills. In fact, bosses' power derives from other sources.And what's worst of all is that such hierarchies might not be needed anyway. In many firms, "management" is either a redundant function – because good companies run themselves – or it's worse than useless."
-
"Overall, these empirical patterns suggests that we need to be less cavalier in addressing questions of human nature on the basis of data drawn from this particularly thin, and rather unusual, slice of humanity."
-
"Mr. President, I don't believe that word means what you think it means."
-
"[A]narchists resemble the American Founders, who saw the spirit of liberty as a necessary bulwark against concentrations of power, and were themselves partly embedded in international networks preaching revolution and social upheaval. In building a truly global economy, the great states have given anarchists the opportunity to rebuild their networks of sympathy and common political purpose across borders. Today’s anarchists want to change the world through distributed action rather than a pistol shot. It seems to be working out better."
-
""They told me that EPA scientists had reviewed the originally lifecycle study and determined it wasn't scientifically sound, and I asked if it had been documented, if there was a hard copy," he says, "The [employee] said yes, and I asked if I could get a copy." And just like that, he had the proof he needed that the EPA had overlooked something that could be killing America's bees."
-
"You already have geniuses working on your infrastructure. Listen to them. This article is for people who don’t have any option between “code runs on developer’s laptop” and “code runs in production.”"
-
"Netflix has been a business partner to the movie and television studios through licensing deals, but increasingly it is seen as a partner with its hands far deeper in the pockets of the media companies than anyone thought. Through its success, the company has positioned itself at the center of the media universe — at the nexus of technology and content — and is now finding it a place increasingly under attack."
for the prettiest ones
A lively breakfast conversation with old friend Cosma Shalizi this morning leaves me hanging on two or three technical confusions. Basically I buttonholed him and made him buy me breakfast today because I’m trying to understand his people better.
Statisticians, I mean.
In particular, I wanted to talk at him a while about what I see and hear in his student’s recent research project.
Some headway was made. But the time and effort spent translating from the [whatever the fuck it is I speak] to a series of relevant and semantically reasonable-sounding noises meant I didn’t quite get far enough to close all the open questions I had.
So I’m just going to dump them here. In more or less unedited form.
File this all under either “notes to self” or “cramped rants scribbled on ceiling of patient’s cell”, as you prefer.
For the prettiest ones
I didn’t get to a basic question about statistics that’s been developing in my practice lately. Viz: when are they necessary, and when are they even a good idea?
By “statistics” I don’t mean Statistics—the field—but rather the mathematical functions we use to summarize sets of observations: mean, variance, maximum, mode, range, and so forth.
Suppose you’re modeling some dataset containing example training points, using a model
, where
Applying some particular model to the dataset produces a vector of predictions
, which we want to compare to our expected responses
. And given some other model
, we’ll get a different vector of predicted values (one for each training input),
.
Now normally we drop from this big honking e-dimensional space into one (1) dimension by doing some statistics at this point. Adding the absolute deviations up, or residual sum of squares: all those standard measures of model fitting. Those functions of random variables are “statistics”.
As I understand it from the sidelines, in the Statistics community (note the big ‘S’) there’s a lot of time spent working out modes and manners for model selection. These amount to comparison metrics which project these vectors I’ve described into various lower-dimensional spaces, with accompanying caveats about special conditions under which the assumptions underlying that given projection may fail, or be questionable. Particular kinds of models, particular biases in the datasets, and unusual interactions between those two.
Now suppose for the sake of building up slowly in this argument, we have one (1) input:output pair in our training set; one vector of values, in a tuple with a single expected
value. Given two models
and
, if I plug those
values in I’ll get predictions
and
.
Clearly, the best of these two models the one whose distance from the expected is smallest. In one dimension, it doesn’t really matter to us which of the metric norms you prefer; you can use absolute difference and be intuitive about “distance in one dimension”, or square the distances if you want to [prematurely] plan ahead for higher-dimensional cases.
In this trivial case with one training point, let’s just say we pick norm—absolute value—and we’ll stick with it as we move up the dimensions. OK?
Now suppose we have two (2) input:output pairs in our training set; two tuples of values and accompanying expected
values. Given our two models
and
, when I plug those two sets of
values in I’ll get predictions
and
.
Let’s just dismiss Doktor Gauss for a while. Send him out of the room.
When I say that dominates
under some metric
against
, I mean that for every training case in our dataset,
and for at least one training case,
.
Notice that for any norm, the order of observations doesn’t actually change. We’re not aggregating them over all training cases, we’re producing vectors that can be stretched or squared—as long as you do it independently across training data—without changing the domination relationship. If some model dominates some
given the training data, it doesn’t matter whether we square the residuals, or take their absolute values, or whatever. As long as we don’t aggregate them.
In other words, I’m wondering whether it’s possible to keep “fit with regards to a given training tuple” separate, and use standard multi-objective ranking methods to discriminate models that are dominated from those that are non-dominated, given a particular training set.
Multicriterion sorting is a partial ordering relationship, and that means there are often—especially in higher dimensions—mutually nondominated points. But that doesn’t mean it’s different from traditional scalar ordering, which is just a degenerate case: there are ties in races, after all.
So what does holding back on aggregation do?
Well, for one thing, it brings into focus the notion of broad and wide-ranging families of models, not the scant handful many statisticians are used to working with.
It leads to some interesting possibilities in understanding the relationship between model performance over test data (generalization) and training data.
It leads to the useful notion of sheafs of multiple nondominated models, some “specialized” for modeling one training point, others specialized in modeling other data points, and a few generalists that do quite well on all of them.
It opens up some interesting questions (to me, at least) about leave-one-out validation methods. Especially about how robust rankings might be.
Finally, it seems that it opens a door for the sort of “data balancing” work Katya Vladislavleva has made such progress with… and maybe a direction through which it can be communicated to the folks on the machine learning side.
We’ll see.
Component #1 of “What to Do”
“Pragmatic Genetic Programming: What to Do”
Sometimes there’s too much of a gap between what you know and what’s available for other people to learn. That can be frustrating.