KDnuggets™ News 14:n23, Sep 3
Features | Software | News | Opinions | Interviews | Reports | Webcasts | Courses | Meetings | Jobs | Academic | Publications | Tweets | Quote
Features
- New Poll: Machine Data Science for Humans or for Machines? - Sep 1, 2014.
Did you do mainly Data Science for Humans (explainable models for a human decision maker), Data Science for Machines (accuracy-oriented black-box models for automated algorithms), or both? Please vote.
- KDD-2014 - The Biggest, Best, and Booming Data Science Meeting - Aug 28, 2014.
KDD-2014 was the largest (with over 2300 people) and the best Data Science meeting, highlighting the huge progress of Data Science made with Big Data, and its even more amazing potential.
- KDD-2014 report, part 2: The Magic Module network and Privacy vs Big Data - Sep 2, 2014.
Here is part 2 of my report on KDD-2014, the biggest and the best Data Science meeting: The Magic Module genes, Privacy vs Big Data, and should we ask for consent of data subjects?
- Cartoon: Robot Labor Day 2050 - Sep 1, 2014.
Amidst all the discussion about robots and automation taking over human jobs, new KDnuggets cartoon looks at how Labor Day can evolve by 2050.
- Sibyl: Google's system for Large Scale Machine Learning - Aug 20, 2014.
A review of 2014 keynote talk about Sibyl, Google system for large scale machine learning. Parallel Boosting algorithm and several design principles are introduced.
- PAW: Predictive Analytics World London, October - Sep 2, 2014.
See Dr. John Elder in London at the leading Predictive Analytics conference, and get KDnuggets Discount - 15% off full conference passes to Predictive Analytics World London.
- PAW: Don't Miss Predictive Analytics World for Healthcare - Aug 21, 2014.
PAW Healthcare gathers industry leaders who share case studies and knowledge about predictive analytics making significant impacts in the healthcare industry - KDnuggets readers get a special discount.
Software
- Ontotext text mining, semantic search, and graph database - Sep 2, 2014.
Ontotext blends text mining, semantic annotation and semantic search with a graph database that infers new meaning at scale, helping organizations find meaning in large volumes of structured and unstructured data.
- Scorto Plug and Score Modeler - for easy scorecard development - Aug 30, 2014.
New version of Plug&Score Modeler is an easy-to-use credit scorecard development software designed for credit risk professionals in banks and other financial institutions.
- Dataiku Data Science Studio - Aug 26, 2014.
Data Science Studio (DSS) from Dataiku is a complete Data Science software tool for developers and analysts, which significantly shortens the time-consuming load-clean-train-test-deploy cycles of building predictive applications. A community edition and a free trial available.
- Deep Learning - important resources for learning and understanding - Aug 21, 2014.
New and fundamental resources for learning about Deep Learning - the hottest machine learning method, which is approaching human performance level.
News
- Analytical Skills Survey for 2014 - Sep 2, 2014.
Have your voice be heard - participate in the Analytical Skills, Tools and Attitudes Survey 2014, and see the challenges faced by others.
- Additions to KDnuggets Directory in August - Sep 1, 2014.
New analytics, data science, data mining top stories page, meetings, KDD-2015, Certificates and Certification option, Data Analytics BS, software for text analytics and visualization and more.
- KDD-2014 Ice Bucket Challenge - Aug 30, 2014.
KDD-2014 team of Jure Leskovec, Claudia Perlich, Sofus Macskassy, Gregory Piatetsky, and Rayid Ghani responds to the Ice Bucket challenge.
- KDD-2014 Awards Winners - Aug 22, 2014.
KDD-2014, the leading and the largest conference in data mining, data science, and knowledge discovery, recognizes the key researchers and contributors through several awards - read about the winners.
- Kaggle Epilepsy Seizure Prediction Challenge - Aug 28, 2014.
Create a forecasting system for predicting epileptic seizures in this Kaggle challenge to help improve the lives of epilepsy patients and win prizes. Competition ends on November 17.
- Top stories for Aug 24-30: 4 main languages for Analytics, Data Mining; Dataiku Data Science Studio - Aug 31, 2014.
Four main languages for Analytics, Data Mining, Data Science; Dataiku Data Science Studio; KDD-2014 - The Biggest, Best, and Booming Data Science Meeting; On the Secret Sauce of Impressive Content Curation.
- Top stories for Aug 17-23: Four main languages for Analytics, Data Mining, Data Science - Aug 24, 2014.
Four main languages for Analytics, Data Mining, Data Science; Sibyl: Google system for Large Scale Machine Learning; Top Research Leaders in Data Mining; Interview: Pedro Domingos: the Master Algorithm, new type of Deep Learning.
Opinions
- Age homophily for predicting age of mobile phone customers - Sep 2, 2014.
Homophily (a tendency of people to associate with others like them) is ubiquitous in real world and in social networks. We show the existence of age homophily in a mobile phone communication network and exploit it to predict the age group for all users in the network.
Interviews
- Interview: Debora Donato, StumbleUpon on the Secret Sauce of Impressive Content Curation - Aug 28, 2014.
We discuss the role of data science at StumbleUpon, the shift from search to discovery, metrics for user engagement, the art of collaborative filtering, how native ads improve user experience, major trends, advice and more.
- Interview: Arpit Gupta, CEO, Actionable Analytics on Enterprise Challenges in Big Data and Cloud - Aug 24, 2014.
We discuss Actionable Analytics start-up, enterprise challenges in Big Data, relationship with cloud computing, metrics vs. insights, Big Data expectations and more.
- Interview: Saikat Mukherjee, ShareThis on Why Marketers can no longer Ignore Social TV? - Aug 20, 2014.
We discuss the role of Analytics at ShareThis, the emergence of Social TV, better user behavior insights through Social TV, major challenges with Social TV analytics, interesting insights, future trends, recommendation and more.
Reports
- INFORMS The Business of Big Data 2014: Day 1 Highlights - Aug 21, 2014.
Highlights from the presentations by Big Data technology practitioners from Teradata, Booz Allen Hamilton, Databricks and ProbabilityManagement.org during INFORMS The Business of Big Data in San Jose.
Webcasts and Webinars
- Webcast - Analytically Speaking Featuring Michael J. A. Berry and Gordon S. Linoff - Sep 2, 2014.
Berry and Linoff talk about the current data mining landscape, including new methods, new types of data and the importance of using the right analysis for your problem.
- Upcoming Webcasts on Analytics, Big Data, Data Science - Sep 2 and beyond - Sep 1, 2014.
Streaming Analytics, Analytical Lifecycle, Modern Regression Analysis, Hadoop for Machine Learning, NASA Earth Science Data, Strata + Hadoop NYC preview, Ontotext, and more.
Courses
- Online Master of Science in Data Science - Sep 2, 2014.
Lewis University online MS in Data Science program offers two concentrations: Data Science for Computer Scientists and Data Science for Life Scientists.
Meetings
- Sep 2014 - Mar 2015 Meetings in Analytics, Big Data, Data Mining, and Data Science - Sep 3, 2014.
Coming soon: PAW Gov, ECML/PKDD, BayesiaLab UC, Big Data Innovation Summit, PAW Boston, PAW Healthcare, RecSys, Strata + Hadoop NYC, Analytics 2014, BigData TechCon, Data Analytics Week, Text Analytics Summit West,and many more.
- BayesiaLab User Conference, Sep 16-24, UCLA - Sep 2, 2014.
Research practitioners from leading organizations will gather at UCLA for the only event dedicated to applied research and analytics with Bayesian networks and BayesiaLab. Pre-conference program includes courses on BayesiaLab and Causal Inference with Graphical Models.
- PAW: Predictive Analytics World London, October - Sep 2, 2014.
See Dr. John Elder in London at the leading Predictive Analytics conference, and get KDnuggets Discount - 15% off full conference passes to Predictive Analytics World London.
- ACM Data Science Camp, San Jose, Oct 25 - Aug 31, 2014.
ACM Data Mining Camps, held since 2009, are renamed this year to Data Science Camp to be more inclusive of Big Data & Data Science. Propose or organize sessions and attend this great meeting.
- Leeds Business Analytics Conference, Sep 18-19, CU Boulder - Aug 24, 2014.
Top names in business analytics come to the Leeds Analytics conference at CU Boulder, creating a unique opportunity to learn practical applications of analytics today and to network with key analytics professionals.
- PAW: Don't Miss Predictive Analytics World for Healthcare - Aug 21, 2014.
PAW Healthcare gathers industry leaders who share case studies and knowledge about predictive analytics making significant impacts in the healthcare industry - KDnuggets readers get a special discount.
Jobs
- Symantec: IoT Analytics Engineer - Principal (Data Scientist) - Aug 29, 2014.
A key, hands-on contributor to the development and integration of analytics and machine learning technologies under the Internet of Things (IoT) umbrella.
- Resonate: Data Scientist - Aug 29, 2014.
Performing complex data analysis on large datasets to further develop our campaign optimization and targeting algorithms.
- Microsoft: Principal Data Scientist, Skype - Aug 29, 2014.
Responsible for analyzing call quality for Skype and Lync and driving improvements into the product, to make Skype and Lync have the best audio and video communication experience possible.
- Apple: Server Software Engineer - Maps Community - Aug 24, 2014.
Help develop the next generation of community services, build the scalable infrastructure that drives crowdsourced improvements to the Apple Maps experience.
- Flipkart: Data Scientist - Aug 22, 2014.
If you are passionate about Data, looking to impact billion+ people, and have an advanced degree (MS or PhD) in Computer Science, Machine Learning, Statistics, Optimization, NLP or related areas, we would love to talk!
- Pew Research Center: Research Analyst, Internet, Science and Technology - Aug 22, 2014.
Expert in Big Data and non-survey data collection and analysis to support Pew Research on the impact of the internet and other digital technologies on users and society.
- company: Sr. Data Analytics Engineer/Scientist - Machine Learning - Aug 21, 2014.
Develop an analytic strategy by creating methodologies, algorithms and prototypes, and showing their value; build statistical models with large datasets to derive meaningful insights that effect actionable decisions.
- Nike: Senior Data Scientist, Integrated Analytics, Global Consumer Knowledge - Aug 21, 2014.
Work across Nike consumer facing businesses to define and implement measurement strategies, instrument and analyze consumer behavior, and inform Nike global strategy.
Academic and Research positions
- PNNL: Post Doctorate RA - Data Sciences and Analytics - Aug 28, 2014.
Do research related to social network analysis, human language technologies, machine learning, infectious disease epidemiology, and data collection and analysis.
- Becker College: Founding Director of the Data Science Program - Aug 26, 2014.
Responsibilities include: thought leadership, program development, network development, industry engagement, research, effective classroom teaching, student advising, college service, and professional scholarship.
Publications
- White paper: Making the business case for text analytics - Aug 29, 2014.
Download a free white paper which focuses on the business benefits (and challenges!) of text analytics from the perspectives of 4 experts from slated to speak at the 13th Annual Text Analytics Summit West, Nov 4-5 in San Francisco.
- Best Text Analytics Summit Presentations - Aug 26, 2014.
Before this year's Text Analytics Summit West, read the previous year's summit's best received presentations and learn about leveraging text analytics for business gain.
- DataReview interview with me on KDnuggets, Data Mining, and Data Science - Aug 20, 2014.
I have recently given an interview to DataReview, an Ukrainian site, where we talked about KDnuggets origins, history of Data Mining, interesting problems I worked on, and typical problems faced by young data scientists.
Top Tweets
- Top KDnuggets tweets, Aug 29-31 - Sep 1, 2014.
Data Analytics vs Predictive Modeling vs Data Mining vs Big Data;
100 most popular machine learning talks at videolectures;
Intro to parallel iterative Deep Learning on Hadoop YARN;
Jeremy Howard answers why he left Kaggle and what are his plans now. - Top KDnuggets tweets, Aug 27-28: Worst Venn Diagram ever? KDD-2014 - Aug 29, 2014.
Worst Venn Diagram ever?
New Microsoft Azure SQL Database service tiers with reduced pricing;
KDD-2014 - The Biggest, Best, and Booming Data Science Meeting;
NYTimes on KDD-2014: Looking to the Future of Data Science. - Top KDnuggets tweets, Aug 25-26 - Aug 27, 2014.
KDD-2014 organizers, including Gregory, take Ice Bucket challenge;
Most important APIs every Data Scientist should know;
Best Text Analytics Summit Presentations;
Artificial Dataset Generation for Machine Learning. - Top KDnuggets tweets, Aug 22-24 - Aug 25, 2014.
A Day in the Life of a Functional Data Scientist;
How to keep yourself up to data on Data Science, Data Mining;
Automatic Storytelling: How to Build Your Very Own Data Scientist;
Four main languages for Analytics, Data Mining, Data. - Top KDnuggets tweets, Aug 20-21: Deep Learning book and more; Sibyl - Aug 22, 2014.
Deep Learning book draft - read and help improve it;
Deep Learning - important resources for learning and understanding;
Sibyl: Google's system for Large Scale Machine Learning;
Excellent Intro to Deep Learning. - Top KDnuggets tweets, Aug 18-19 - Aug 20, 2014.
Big Data moves to "Trough of Disillusionment" in Gartner Hype Cycle;
Four main languages for Analytics, Data Mining;
Statistics.com Online Courses and Certificate Programs in Data Science;
Exploring Hotel Review Data from Trip Advisor with R.