KDnuggets Home » News » 2015 » May » Opinions, Interviews, Reports » The Inconvenient Truth About Data Science ( 15:n14 )

The Inconvenient Truth About Data Science


Data is never clean, you will spend most of your time cleaning and preparing data, 95% of tasks do not require deep learning, and more inconvenient wisdom.



By Kamil Bartocha (lastminute.com)

  1. data-cleaningData is never clean.

  2. You will spend most of your time cleaning and preparing data.

  3. 95% of tasks do not require deep learning.

  4. In 90% of cases generalized linear regression will do the trick.

  5. Big Data is just a tool.

  6. You should embrace the Bayesian approach.

  7. No one cares how you did it.

  8. Academia and business are two different worlds.

  9. Presentation is key - be a master of Power Point.

  10. All models are false, but some are useful.

  11. There is no fully automated Data Science. You need to get your hands dirty.



Bio: Kamil Bartocha is Head of Data Science at lastminute.com, and an expert in the field of data processing, data systems architecture and artificial intelligence.

Original.

(Editor: What are your inconvenient truths? Please comment)

Related: