#status/processed
# Metadata
Author:: [[Health Insurance]]
Title:: How Data Scientists Turned Against Statistics
Full Title:: How Data Scientists Turned Against Statistics
Import Date:: 2023-05-13
Source:: #source/readwise/instapaper
Source URL:: [Source URL](https://www.forbes.com/sites/kalevleetaru/2019/03/07/how-data-scientists-turned-against-statistics/amp/?__twitter_impression=true)
Review URL:: [Review URL](https://readwise.io/bookreview/26339317)
# Document
Tags:: [[Data Science]]
# Highlights
- ==Our great leap into the world of data has come with a giant leap of faith that the core tenets of statistics no longer apply when one works with sufficiently large datasets.==
- Date:: [[2019-03-10]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10378778)
- There is no “methodology appendix” attached to a keyword search in most commercial platforms that specifies precisely how much data was searched, whether and what kind of sampling was used or how much missing data there is in its index.
- Date:: [[2019-03-10]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10378781)
- Partially this reflects the influx of non-traditional disciplines into the data sciences.
- Date:: [[2019-03-11]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10384132)
- Eager to project a proprietary edge, companies wrap known algorithms in unknown preprocessing steps to obfuscate their use but in doing so introduce unknown accuracy implications.
- Date:: [[2019-03-11]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10384133)
- Coinciding with this shift is the ==loss of the denominator and the trend away from normalization in data analysis.==
- Date:: [[2019-03-11]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10384134)
- The lack of a solid statistical foundation means many data scientists don’t understand why reporting raw counts from a rapidly changing dataset can lead to incorrect findings.
- Date:: [[2019-03-11]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10384135)
- Note: really?
- In the end, it seems we no longer actually care what our data says or whether our results are actually right.
- Date:: [[2019-03-11]]
- Find: [View Highlight](https://instapaper.com/read/1170545087/10384136)