Big data: Difference between revisions

466 bytes added ,  21 October 2020
no edit summary
No edit summary
No edit summary
Line 4: Line 4:
The obsession with [[big data]] has a few implications:  
The obsession with [[big data]] has a few implications:  
===It is [[premium mediocre]]===
===It is [[premium mediocre]]===
Firstly it expresses a preference for the ''aggregate'' over the specific, and the ''average'' over the outlier, the individual, the unique or extraordinary. It is to prefer the mediocre, for its weight of numbers, over the isolated vision of a genius and the depravity of the ugliest man.   
Firstly it expresses a preference for the ''aggregate'' over the specific, and the ''average'' over the outlier, the individual, the unique or extraordinary. It is to prefer the mediocre, for its weight of numbers, over the isolated vision of a genius or the depravity of the ugliest man.   


As surely as [[ugliest man|the ugliest man]] killed God, so did data kill the [[superman]]. The ''will to power'' is defeated by the million-strong dull blades of the ''will to entropy''. It is the ''will to [[premium mediocre]]''.
As surely as [[ugliest man|the ugliest man]] killed God, so did data kill the [[superman]]. The ''will to power'' is defeated by the million-strong dull blades of the ''will to entropy''. It is the ''will to [[premium mediocre]]''.
Line 11: Line 11:
===It is noisy===
===It is noisy===
Thirdly, to embrace all the data you can find is to degrade the [[signal-to-noise ratio]]. Even if you buy into the incoherent [[reductionist]] idea that the “signal” is some kind of transcendent truth, by industrialising your data, you risk burying it and if you don’t — if like we pluralists you see ''any'' signal as not just a suitable narrative for your present purposes, the more data you gather, the more possible narratives — conflicting narratives; [[incommensurable]] narratives — you will have. Now this is, for a pluralist, is a good thing: every narrative is a tool in your workshop, the more you have the better you are equipped to deal with the [[unknown unknown]]s our [[complex]] world will surely throw at us — but that tends ''not'' to be what big data disciples are after.  
Thirdly, to embrace all the data you can find is to degrade the [[signal-to-noise ratio]]. Even if you buy into the incoherent [[reductionist]] idea that the “signal” is some kind of transcendent truth, by industrialising your data, you risk burying it and if you don’t — if like we pluralists you see ''any'' signal as not just a suitable narrative for your present purposes, the more data you gather, the more possible narratives — conflicting narratives; [[incommensurable]] narratives — you will have. Now this is, for a pluralist, is a good thing: every narrative is a tool in your workshop, the more you have the better you are equipped to deal with the [[unknown unknown]]s our [[complex]] world will surely throw at us — but that tends ''not'' to be what big data disciples are after.  
===It is not a [[universal affirmative]]===
Even if, from pure data, you ''could'' establish the causal relationship between data you have observed and an event that drives it (it is axiomatic that you ''can’t'': you can only derive a [[correlation]], and we know how spurious ''those'' can be) you still can’t conclude that the cause propelling the ''general'' compelled any ''particular''.
 
The JC has invoked the [[parable of the squirrels]] to sort this out.
 
{{sa}}
{{sa}}
*[[Signal-to-noise ratio]]
*[[Signal-to-noise ratio]]
*[[In God we trust, all others must bring data]]
*[[In God we trust, all others must bring data]]