Signal-to-noise ratio

From The Jolly Contrarian
Revision as of 22:20, 24 November 2020 by Amwelladmin (talk | contribs)
Jump to navigation Jump to search
In which the curmudgeonly old sod puts the world to rights.
Index — Click ᐅ to expand:
Tell me more
Sign up for our newsletter — or just get in touch: for ½ a weekly 🍺 you get to consult JC. Ask about it here.


Caught in a mesh of living veins,
In cell of padded bone,
He loneliest is when he pretends
That he is not alone.
We’d free the incarcerate race of man
That such a doom endures
Could only you unlock my skull,
Or I creep into yours.
Ogden Nash, Listen...
In God we trust, all others must bring data.
W. Edwards Deming

If we accept that the information content of the universe, through all time and space is, if not exactly, then as good as infinite,[1] and the data homo sapiens has collected or generated to the point of reading is necessarily finite, even if we’ve lost quite a lot of it along the way, then it follows that the total value of data in which W.Edwards Deming would have us trust is, mathematically, as good as nil. There is no data from the future.

And that is before considering its quality. If 90% of all gathered data originates from the internet age,[2] so a good portion is cat videos and hot takes on Twitter — so fairly shite data, even on its own terms. Have you read Twitter?[3]

But leave the banality of our age to one side — we don’t need it to make out the argument. It would hold even if every hot take on Twitter were an incandescent pearl of unique genius.

In any case, it follows mathematically that, should we transcend our meagre hermeneutic bubbles — free the incarcerate race of man, so to speak — the signal of our data to the noise of all possible data out there is infinitesimal.[4]

If this is the data we’re meant to trust, you might wonder what is so wrong with God. But we humans are pattern-seeking machines. We don’t take the data as we see it cold, and fashion objective axioms from it, carving nature at its joints: we bring our idiosyncratic prisms and pre-existing cognitive structures to it —our hot takes, if you like — and willfully create patterns from it to support our convictions.

This is not a criticism about the human modus operandi, but an observation. This is the doom our incarcerate race endures.

It is not just the Twitterati. Science, too, has its confirmation bias, that subsists at a meta-level, uncontrollable even by double-blind testing methodologies. Experiments which confirm a hypothesis are a lot more likely to be published than those which don’t.[5] Of those failed experiments that are published, far fewer are cited in other literature. Falsifications die.

This is neither a cause for alarm nor is it new. It is just a reminder how important, in all human discourse, is contingency, provisionality, and above all humility. Your data is likely bunk.

All of these are another way of attacking a familiar problem: the universe, the world, the nation, your market, your workplace and even your interpersonal relationships are complex, not merely complicated. Complication is a function of a paradigm. It is part of the game. It is within the rules. It is soluble, by sufficiently skilled application of the rules. Complication can be beaten by an algorithm. You can brute force it.

Complexity, you cannot.

Complexity, describes the limits of the paradigm. Complexity is the wilderness beyond the rules of the game. Complexity inhabits the noise, not the signal. Where there is complexity, the rules do not work. Here data is relegated to noise.[6]

This is why physical sciences apparently have a greater success than social sciences — cue Richard Dawkins’ obligatory scoff. Physical sciences generally address behaviour of independent events — rolling balls, flipping coins, waves and/or particles of light. But rolling balls are not autonomous agents. They act independently. The behaviour of one will not influence that of another. Each coin flip is, as a condition of probability theory — independent.[7] Independent events obey Gaussian principles. They may be modelled. That is to say, they may be complicated but they remain predictable, at least in theory. When physical systems inexplicably go bang — Chernobyl, the Space Shuttle Challenger, the Torrey Canyon — the root cause will not be a failure of the physical science underlying the engineering, but some supervening cause invalidating the underlying assumptions on which the physical science was based.

Social sciences don’t have that get-out-of-jail-free card: they address precisely that kind of supervening cause: behaviour that is, intrinsically, unpredictable. Psychology, sociology, anthropology, economics — these concern themselves with human agents, who are influenced by each other — which is why we don’t use physical science to predict their behaviour. Social sciences have to deal with the inherently complex, non-Gaussian interactions between human beings.[8]


See also

References

  1. This assumes there is not a finite end-point to the Universe; by no means settled cosmology, but hardly a rash assumption. And given how little we have of it, the universe’s total information content might as well be infinite, when compared to our finite collection of mortal data. Even the total, ungathered-by-mortal-hand, information content generated by the whole universe to date, not even counting the unknowable future, is as good as infinite.
  2. Eric Schmidt said something like this in 2011, and it sounds totally made up, but let’s run with it, hey?
  3. Get off Twitter, okay? For all of our sakes.
  4. That means, really small.
  5. The Hidden Half: How the World Conceals its Secrets, by Michael Blastland.
  6. Provisional theory: “information” is data framed with a hypothesis.
  7. The technical term: “platykurtic”.
  8. physical sciences set up closed logical systems within which their rules will work, and often these systems are dramatically simplified as compared with anything you see in the real world: Newton, for example, assumes a frictionless, stationery, stable, neutral frame of reference: circumstances which, in any observed environment, do not and cannot not exist. Nancy Cartwright calls these structures “nomological machines”. Because of this explicit caveat, we can put any variances between Newton’s prediction and the observed outcome down not to falsification, but to the messy real world “contaminating” the idealised experimental conditions. Hence, the proverbial crisp packet blowing across St Mark’s Square.