Interview: Michael Li, Data Incubator on Data-driven Hiring for Data Scientists
We discuss the launch of the Data Incubator, its business model, why we need data-driven hiring, selection process for the incubator program and alumni feedback.

At Foursquare, Michael discovered that his favorite part of the job was teaching and mentoring smart people about data science. He decided to build a startup that lets him focus on what he really loves.
Here is my interview with him:
Anmol Rajpurohit: Q1. How and when did you get the idea to launch the Data Incubator? What is the business model?
Michael Li: The Data Incubator is a 7 week fellowship to help masters students, PhDs, and postdocs transition from academia into industry data science roles. The program is free and the tuition is paid for by partner hiring companies (including EBay, Palantir, Pfizer, and the New York Times). For more information, visit: http://www.thedataincubator.com/ or read about fellow experience on our blog http://blog.thedataincubator.com/.

The idea is born out of my frustration having been on both sides of the hiring table. While interviewing, I realized that companies (across a wide-range of data science industries) would often ask the same technical interview questions. As a hiring manager, I was surprised by how many people with strong resumes were unable to answer these basic questions. I figured it would be more efficient for someone to ask those questions to candidates just once. That way, we help aspiring data scientists identify and build their skills and help companies find top talent. We accept fellows who have the raw brainpower and give them a framework to analyze terabytes of data and save hiring managers time and resources in hiring.

AR: Q2. What do you mean by "Data-driven hiring for Data Scientists"?
ML: Hiring, even for data scientists, is often not data-driven. Like with

The other major flaw of most hiring is that a lot of policies are based on small sample sizes. Because we are working with thousands of applications each cohort, we’re able to get a lot more visibility into these stats than many hiring companies.
AR: Q3. Can you describe the selection process for this program? What are the important characteristics you are looking for in the pool of applicants?
ML: We look for people who have a solid foundation in mathematics (statistics, linear algebra, etc…) and computation. PhDs often get this from their research and coursework. The latter is mostly the ability to hack around, munge data, and get things done on a computer and is often a self-taught skill. Finally, we also look for people who can communicate complex, technical ideas to a general audience.
AR: Q4. A few batches have already graduated from the Data Incubator program. What is the common feedback you are getting from alumni?

Second and last part of the interview
Related: