KDnuggets™ News 15:n01, Jan 7: Clever methods of overfitting; 5 Analytics Rules to cut thru the Hype
11 Clever Methods of Overfitting and how to avoid them, Data Mining and Text Analytics of World Cup 2014, iMath Cloud Data Science Platform beta, Platfora CEO on Insightful Analytics for Big Data, and more analytics, big data, data science, and data mining stories.
Features | Software | Opinions | Interviews | Reports | News | Webcasts | Courses | Meetings | Jobs | Academic | Tweets | CFP | Quote
Features
- 11 Clever Methods of Overfitting and how to avoid them - Jan 2, 2015.
Overfitting is the bane of Data Science in the age of Big Data. John Langford reviews "clever" methods of overfitting, including traditional, parameter tweak, brittle measures, bad statistics, human-loop overfitting, and gives suggestions and directions for avoiding overfitting.
- Data Mining and Text Analytics of World Cup 2014 - Jan 3, 2015.
Explore how text analysis techniques to dig into some of the data in a series of blog posts, focusing on matches and their events, tweets languages, tweets volumes for different teams and sentiment analysis.
- PAW: 5 Co-Located Analytics Events in San Francisco, March 29 - Apr 2, 2015 - Jan 7, 2015.
March 29 - Apr 2, 2015 San Francisco hosts PAW for Business, eMetrics Summit, Predictive Analytics World for Workforce, Text Analytics World and the PA Times Executive Breakfast.
Software (see also All Software )
- iMath Cloud Data Science Platform beta - Jan 6, 2015.
iMathResearch presents a Data Science platform, offering development in Python, R or Octave, cloud-based collaboration, private computational instances and visualization from the browser.
Opinions (see also All Opinions, Interviews for this month )
- Causation vs Correlation: Visualization, Statistics, and Intuition - Jan 4, 2015.
Visualizations of correlation vs. causation and some common pitfalls and insights involving the statistics are explored in this case study involving stock price time series.
- Analytics: Five Rules to Cut Through the Hype - Jan 1, 2015.
Cut through the analytics hype by asking the right questions, discerning between value-add analytics, considering in and out of house solutions, forming an iterative analytics process, and making sure your organization uses it.
Interviews (see also All Opinions, Interviews for this month )
- Interview: Paul Robbins, STATS on the Potential and Challenges for Sports Analytics - Jan 5, 2015.
We discuss Analytics at STATS, typical daily tasks, ICE Analytics platform, key challenges, response from coaches/players, career advice and more.
- Interview: Ben Werther, CEO, Platfora on Insightful Analytics for Big Data - Dec 30, 2014.
We discuss the challenges in implementing end-to-end solutions for Big Data, Platfora use cases, Big Data trends, advice and more.
- Interview: Ben Werther, CEO, Platfora on Why Big Data Needs Self-Service Tools - Dec 29, 2014.
We discuss the importance of self-service model for Big Data tools, Small Data vs. Big Data, unique advantages of Platfora, key enhancements in Platfora 4.0 and more.
Reports
- Predictive Analytics Innovation Summit, Chicago - Day 1 Highlights - Jan 6, 2015.
Highlights from the presentations by Predictive Analytics leaders from Netflix, LinkedIn and Mashable on day 1 of Predictive Analytics Innovation Summit 2014 in Chicago.
News (see also All News )
- Agnik Connected Insurance Program powered by Vehicle Analytics - Jan 6, 2015.
Car owners can save up to 20% with Connected Insurance Program with participating insurance carriers by opting in after driving for least 50 miles with an Agnik connected car product.
- Top stories for Dec 28 - Jan 3: What will happen to big data and data science? Analytics: Five Rules to Cut Through the Hype - Jan 4, 2015.
2015 Predictions: What will happen to big data and data science?; Data Mining is LinkedIn Hottest Skill in 2014; Analytics: Five Rules to Cut Through the Hype; 11 Clever Methods of Overfitting and how to avoid them.
- Top stories in December: If programming languages were vehicles; Cartoon: Unexpected Data Science Recommendations - Jan 2, 2015.
If programming languages were vehicles, what would be R, Python, SAS, and SQL? Cartoon: Unexpected Data Science Recommendations; Geoffrey Hinton talks about Deep Learning, Google and Everything; IBM Watson Analytics vs. Microsoft Azure Machine Learning.
- Additions to KDnuggets Directory in December 2014 - Jan 1, 2015.
TDWI Las Vegas (Feb 22), UIC Big Data Symposium (Mar 17), INFORMS (Apr 12-14), BVA Data Science, Silk visualization and more.
Webcasts and Webinars (see also All Webcasts and Webinars )
- Upcoming Webcasts on Analytics, Big Data, Data Science - Jan 6, 2015 and beyond - Jan 6, 2015.
Hadoop, Top BI Trends, Enter a KDD Cup or Kaggle, YARN, In-Database Analytics Deep Dive, Data Mining: Failure to Launch, and more.
- Webinar: Data Mining: Failure to Launch [Jan 15] - Jan 5, 2015.
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is Jan 15.
- Salford Webinar: Enter a KDD Cup or Kaggle Competition - Jan 4, 2015.
The webinar will show on the example of KDD Cup 2009 how Salford TreeNet can quickly achieve a top 5 result, and how to quickly build great models even if you are not an expert.
Courses (see also All Courses )
- Big Data Bootcamp, Santa Clara, Jan 16-18 - Jan 6, 2015.
This is a fast paced, vendor agnostic, technical overview of the Big Data landscape targeted towards both technical and non-technical people who want to understand the emerging world of Big Data. Special KDnuggets discount till Jan 14.
- NYC Data Science Academy Bootcamp, Feb 2 - Apr 24 - Jan 1, 2015.
Learn from one of NYC top Data Science Instructors and receive mentorship from Chief Data Scientists, ending with interview prep and job placement at top firms in New York and the Tri-State area.
Meetings (see also All Meetings )
- PAW: 5 Co-Located Analytics Events in San Francisco, March 29 - Apr 2, 2015 - Jan 7, 2015.
March 29 - Apr 2, 2015 San Francisco hosts PAW for Business, eMetrics Summit, Predictive Analytics World for Workforce, Text Analytics World and the PA Times Executive Breakfast.
- Upcoming Jan - Jun 2015 Meetings in Analytics, Big Data, Data Mining, Data Science, Machine Learning - Jan 5, 2015.
Coming soon: Big Data Innovation Las Vegas, EGC, Deep Learning Summit SF, TDWI Las Vegas, IKDD India, PAW Business and PAW Workforce San Francisco, Text Analytics World, MLconf NYC, SBP15.
- NYC Open Data Meetups in January - Jan 3, 2015.
Upcoming events including Python Machine learning class Demo Day, Data Science Bootcamp and more.
Jobs (see also All Jobs )
- Bell Labs: Member of Technical Staff in Data Science - Jan 5, 2015.
Solve challenging problems and build analytic products with real-world impact; design and implement efficient and accurate algorithms suitable for large scale data.
Academic and Research positions (see also All Academic positions )
- WPI: Teaching Professor or Instructor, Data Science - Jan 5, 2015.
Full-time non-tenure track position for Fall 2015 to strengthen WPI fast-growing interdisciplinary Data Science program.
- CUNY: Distinguished Lecturer - Data Analytics and Information Systems - Jan 5, 2015.
Teach courses in Data analytics and serve as the academic director for CUNY SPS online BS in Information Systems and online MS in Data Analytics.
Top Tweets (see also All top tweets for this month )
- Top KDnuggets tweets, Dec 29 - Jan 04 - Jan 5, 2015.
SAS is n1 among major BI vendors whose users plan to discontinue use; How #MachineLearning, #BigData, and image recognition could revolutionize search; A brilliant way to tell causation from correlation; Machine Learning Experts You Need to Know: Geoff Hinton, Michael Jordan, Andrew Ng.
- 2014 in Review: Top KDnuggets tweets in August - Jan 1, 2015.
Worst Venn Diagram ever? ThomsonReuters needs a new graphic designer; The World Bank sums up the entire global economy in one chart; #BigData moves to "Trough of Disillusionment" in Gartner 2014 Hype Cycle; xkcd: Boyfriend as a statistically "significant" other.
- Year in Review: Top KDnuggets tweets in September - Dec 30, 2014.
One pattern is random, other is machine-generated. Can you guess which?; 14 Awesome (and Free) #DataScience Books; Dilbert 20 funniest cartoons on #BigData, data mining, privacy; Watch: Statistical, Machine learning with R, great 15 hour online course.
- Year in Review: Top KDnuggets tweets in October - Dec 29, 2014.
Data mining classics: Classifying Shakespearean Drama; Air traffic data is being analyzed to predict Ebola Spread; A Great Collection of #MachineLearning Algorithms; Best Programming Language for Machine Learning.
- Top KDnuggets tweets, Dec 22-28 - Dec 29, 2014.
Top 10 Data Science Skills, and How to Learn Them; Mathematicians claim to figure out how to tell correlation from causation; Review of #MOOC Learning from Data - the class that changed everything; Free Big Data sources every Data Science enthusiast should know.
CFP - Calls for Papers (see also All Calls for Papers )
- Due Jan 15, 15th Industrial Conf. on Data Mining ICDM 2015 , Hamburg, Germany. Jul 15-19, 2015
- Due Jan 15, 11th Int. Conf. on Machine Learning and Data Mining (MLDM 2015) , Hamburg, Germany. Jul 20-23, 2015
- Due Jan 20, Call for workshop proposals: The 19th Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD-2015) , May 19-22, 2015, Ho Chi Minh City, Viet Nam
- Due Feb 8, IJCAI-15 Machine Learning Track , Buenos Aires, Argentina. Jul 25-31, 2015
- Due Mar 9, EPIA 2015: Knowledge Discovery and Business Intelligence , Coimbra, Portugal. 8-11 Sep 2015
- Due Mar 30, JTAER Special Issue on Big Data Analytics , Guest Editors: Jouni Markkula, Marikka Heikkila, Christopher Westland, Zhangxi Lin, Jukka Heikkila. Email jtaer.big.data@utalca.cl