Open drives Boston Open Data Science Conference, May 30-31

Data science is built on transparency, effort, and the exchange of ideas. Join Open Data Science Conference, Boston, May 30-31, 2015.

By Sheamus McGovern, Chair – Open Data Science Conference

Readers of this blog are obviously no strangers to open source in data science. Take R for example. R is the epitome of a successful open source project, having become one of the most popular languages for data analysis and used by millions. It is a thriving open-source project supported by a community of thousands of contributors. Yet R is but one of hundreds of open source languages and tools used by data scientists and developers: think also Python, Spark Julia, Hadoop, and Hive, to name but a few. Data science like other scientific disciplines is built on transparency, effort, and the exchange of ideas. We believe strongly enough in these ideals that we’ve organized a two-day conference, on May 30-31, devoted entirely to them.

Open Data Science Conference The Open Data Science Conference is a celebration of the openness of data science and the thriving community that surrounds it.

Contributors to open source are amazingly talented. They are the authors of many of the libraries and packages you use everyday. OpenDataSci conference gives you the chance to hear from them directly and meet them in person. Our speaker lineup includes Max Kuhn, the prolific author of no less than six R packages, including the caret package. Jared Lander is an R contributor and also the author of R for Everyone. Wes McKinney is the creator of the madly popular Pandas library. Andy Terrel contributes to numerous open source projects including Ignition, the FEniCS Project and Sympy. Anna Herlihy is a contributor to PyMongo and Monary, while Peter Prettenhofer is a contributor to the ubiquitous scikit-learn project.

This conference will, of course, present ample talks and workshops on open source languages and tools. Data science is a huge field and we will cover such topics as predictive analytics, deep learning, machine learning and so on. We are honored to have some of the best and brightest in the field share their insights with us. Owen Zhang is the number one ranked Kaggle data scientist and his talks are always insightful. Josh Wills, Director of Data Science at Cloudera, is a gifted speaker and like the best has a knack for distilling simplicity from complexity. Allen Downey, author of Think Python, and Think Stats, conducts illuminating workshops guided by his easy manner and command of the subject matter. We have many such talented presenters in our 70+ presenter lineup.

I’ve always felt that for something to be truly useful there needs to be a strong community around it to share experiences and lend support. There is a tremendous community around open source data science fueled lately by Meetup phenomena and groups such as NumFOCUS. Many of our speakers are directly involved, like Jared Langer who organizes the NYC Open Statistical Programming Meetup and runs his own R conference; James Powell, who runs many NYC Python events; and Lynn Root, who is involved with PyLadies SF. Many others from the open source community will be present, too.

Ok, this may not be the last conference you’ll every need to attend, but we are confident you will come away with enough ideas and contacts to fill the remainder of your year. Be a part of the open source data community and join us in Boston on May 30-31.

UPDATE: The Open Data Science Conference has added Usama Fayyad, Chief Data Officer at Barclays Bank to its list of speakers. Click here for the full schedule. Use discount code NL-ODSC for an additional 5% off for newsletter readers.