The Inconvenient Truth About Data Science

Data is never clean, you will spend most of your time cleaning and preparing data, 95% of tasks do not require deep learning, and more inconvenient wisdom.



By Kamil Bartocha (lastminute.com)

  1. data-cleaningData is never clean.

  2. You will spend most of your time cleaning and preparing data.

  3. 95% of tasks do not require deep learning.

  4. In 90% of cases generalized linear regression will do the trick.

  5. Big Data is just a tool.

  6. You should embrace the Bayesian approach.

  7. No one cares how you did it.

  8. Academia and business are two different worlds.

  9. Presentation is key - be a master of Power Point.

  10. All models are false, but some are useful.

  11. There is no fully automated Data Science. You need to get your hands dirty.



Bio: Kamil Bartocha is Head of Data Science at lastminute.com, and an expert in the field of data processing, data systems architecture and artificial intelligence.

Original.

(Editor: What are your inconvenient truths? Please comment)

Related:


Get the FREE ebook 'KDnuggets Artificial Intelligence Pocket Dictionary' along with the leading newsletter on Data Science, Machine Learning, AI & Analytics straight to your inbox.

By subscribing you accept KDnuggets Privacy Policy


Get the FREE ebook 'KDnuggets Artificial Intelligence Pocket Dictionary' along with the leading newsletter on Data Science, Machine Learning, AI & Analytics straight to your inbox.

By subscribing you accept KDnuggets Privacy Policy

Get the FREE ebook 'KDnuggets Artificial Intelligence Pocket Dictionary' along with the leading newsletter on Data Science, Machine Learning, AI & Analytics straight to your inbox.

By subscribing you accept KDnuggets Privacy Policy

No, thanks!