Big data: Difference between revisions

46 bytes added ,  8 November 2022
no edit summary
No edit summary
No edit summary
Line 12: Line 12:
As surely as [[ugliest man|the ugliest man]] killed God, so did data kill the [[superman]]. The ''will to power'' is defeated by the million-strong dull blades of the ''[[will to entropy]]''. It is the ''will to [[premium mediocre]]''.
As surely as [[ugliest man|the ugliest man]] killed God, so did data kill the [[superman]]. The ''will to power'' is defeated by the million-strong dull blades of the ''[[will to entropy]]''. It is the ''will to [[premium mediocre]]''.
===It is historical===
===It is historical===
All data is from the past, as Roger Martin has it.
All data is from the past, as Roger Martin has it. That means all data is skewed in time
===It is bad===
===It is bad===
Not only (per our careful argument at [[signal-to-noise ratio]]) is the overall quantity of data we have skewed in time (all from the past, none from the future), place (only what we’ve been looking at, none of what we haven’t) and practically ''nil'' in quantity, the ''quality'' of data in our tawdry collection is ''poor''. And not just in its profusion of cat videos and [[hot takes on Twitter]], either. For, as an evolutionary record, it contains ''all'' the errors and the one successful trial; all the abandoned drafts, all the false starts, all the typos, [[split infinitive]]s, tendentious arguments, feeble caveats and needless [[for the avoidance of doubt|avoidances of doubt]]. The data we have, that is, even on our own rationalised terms, mainly noise. ''Bad'' noise.
Not only (per our careful argument at [[signal-to-noise ratio]]) is the overall quantity of data we have skewed in ''time'' (all from the past, none from the future), ''place'' (only what we’ve been looking at, none of what we haven’t) and practically ''nil'' in quantity, the ''quality'' of data in our tawdry collection is ''poor''. And not just in its profusion of cat videos and [[hot takes on Twitter]], either. For, as an evolutionary record, it contains ''all'' the errors and the one successful trial; all the abandoned drafts, all the false starts, all the typos, [[split infinitive]]s, tendentious arguments, feeble caveats and needless [[for the avoidance of doubt|avoidances of doubt]]. The data we have, that is, even on our own rationalised terms, mainly noise. ''Bad'' noise.


===Noise===
===Noise===