2014 April
- Massachusetts releases Big Data Report 2014
- Apr 30, 2014.
Massachusetts Big Data Report 2014 (free download) highlights state successes, including almost 500 Big Data companies, $2.5B invested, 5600 students graduating from 14 data science-related programs, and identifies key priorities and growth opportunities.
- Top KDnuggets tweets, Apr 28-29
- Apr 30, 2014.
9 Free Books for Learning Data Mining; Cartoon: Data Scientist Salary Negotiation; statsTeachR - great free resource; What every Data Scientist needs to know about SQL.
- KDnuggets Interview: Juan Miguel Lavista, Microsoft Data Science Team
- Apr 30, 2014.
We discuss Randomized Controlled Experiments, common errors during A/B testing, Correlation vs. Causality, Big Data Myths and setting up realistic expectations from Big Data and more...
- KDnuggets 14:n10, Data Scientist Salary examined; Big Data Highlights; Michael Brodie Interview
- Apr 30, 2014.
Latest analytics/data mining news, including Features, Opinions, Software, News, Webcasts, Courses, Meetings, Jobs, Academic positions, Publications, Top Tweets, and CFP.
- Cartoon: Data Scientist Salary Negotiation
- Apr 29, 2014.
The new KDnuggets Cartoon looks at a Data Scientist salary negotiation.
- PAW: Predictive Analytics World Toronto – New Data Paradigm
- Apr 29, 2014.
2014 means big changes for big data. Get ready for a fierce debate on the new data paradigm at Predictive Analytics World Toronto on May 15. Special KDnuggets discount.
- Big Data Leads Top Paying Skills
- Apr 29, 2014.
Big Data related skills led the list of top paying technical skills (six-figure salaries) in 2013. Several other useful insights are available in the Dice Tech Survey Report, available for free download.
- Big Data Innovation Summit 2014 Santa Clara: Highlights of Selected Talks on Day 2
- Apr 29, 2014.
Highlights from the presentations by big data technology practitioners from NYSE, Glassdoor, Slice and Paychex on day 2 of Big Data Innovation Summit 2014 in Santa Clara.
- 9 Free Books for Learning Data Mining and Data Analysis
- Apr 29, 2014.
Whether you are learning data science for the first time or refreshing your memory or catching up on latest trends, these free books will help you excel through self-study.
- New Poll: What data types/sources you analyzed in the past 12 months?
- Apr 29, 2014.
The new KDnuggets Poll asks: What data types/sources have you analyzed in the past 12 months? Please vote at www.kdnuggets.com.
- U. Cincinnati Analytics Summit 2014, May 23
- Apr 28, 2014.
Keynotes by Eric Siegel (PAW Founder) and Jack Levis (UPS Director of Process Management), Tracks on Predictive Analytics, Descriptive Analytics, Prescriptive Analytics, Social and Mobile Media Analytics.
- Top KDnuggets tweets, Apr 25-27: Recommended Tutorials for Data Scientists; How One Woman Hid Her Pregnancy from Big Data
- Apr 28, 2014.
Recommended Tutorials for Data Scientists from PyCon 2014; How One Woman Hid Her Pregnancy from #BigData; MLTK: Machine Learning Toolkit in Java - free download; Deep Learning for Natural Language Processing.
- Upcoming Webcasts on Analytics, Big Data, Data Science – April 28 and beyond
- Apr 28, 2014.
Stuck in Traffic, Beyond Excel, Evolving Your BI Strategy, BLU Acceleration with Cognos, SAS and Cloudera, Analytically Speaking Featuring David Meintrup, Data Mining: Failure to Launch, and more.
- KDnuggets Interview: Michael Brodie on Data Curation, Cloud Computing, Startup Quality, Verizon (part 2)
- Apr 28, 2014.
The second part of our exclusive interview focuses on Data Curation, Cloud Computing, the Data Tamer and Jisto startups, and his experience as Chief Scientist of Verizon, and how that relates to a teenager never tidying a room for 60 years.
- Where are your users? Geo-localization with KNIME
- Apr 28, 2014.
Learn how KNIME can help you improve user understanding through Geo-localization of IP addresses and dynamic visualization. Access free white paper for more details.
- MLTK: Machine Learning Toolkit in Java – free download
- Apr 27, 2014.
MLTK is a collection of machine learning algorithms in Java, supporting Generalized Linear Models: Ridge, Lasso, Elastic Net, Regression Trees, Random Forests, and more. Free download under BSD license.
- KDD 2014 Workshops – the leading edge of Data Science Research
- Apr 27, 2014.
KDD 2014 workshops provide a forum for leading-edge research on topics like Data Science for Social Good, Crowd Sensing, Mobile Health, Stream Mining, Data Ethics, Sports Analytics, Social Networks, and much more. Papers due in June.
- Top stories for Apr 20-26
- Apr 27, 2014.
Elusive Data Scientists Driving High Salaries; Data Workflows for Machine Learning; New Book: Social Media Mining - free PDF download; Microsoft Expands Big Data Platform.
- Top KDnuggets tweets, Apr 23-24: It does look similar, but …; Why people are bad at technology predictions
- Apr 25, 2014.
#BigData Cartoon: "It does look similar - but this one is powered by Hadoop"; Great list: 9 Python Machine Learning Books; Why people are bad at technology predictions; Too busy recommending things to experience them.
- Exclusive Interview: David Stringfellow, Chief Economist, State Utah Auditor
- Apr 25, 2014.
We discuss Analytics for Public Policy decisions, the responsibilities of the Utah Chief Data Officer, crowdsourcing analytics for resolving Government problems, and the most important skills for data science practitioners.
- Big Data Innovation Summit 2014 Santa Clara: Highlights of Selected Talks on Day 1
- Apr 25, 2014.
Highlights from the presentations by big data technology practitioners from eBay, YarcData, LinkedIn, Trulia, and other leading companies on day 1 of Big Data Innovation Summit 2014 in Santa Clara.
- Data Mining Medicare Data – What Can We Find?
- Apr 24, 2014.
Medicare released detailed reimbursement data for 2012: $77 billion paid to more than 880,000 health care providers, by doctor and procedure. We take an initial look and find large variances and potential indicators of fraud.
- USC Marshall MS in Business Analytics
- Apr 24, 2014.
The new USC Marshall MS in Business Analytics will give you the tools to leverage big and unstructured data for effective decision-making. Study full-time or part-time, and customize your degree to your career goals.
- Top KDnuggets tweets, Apr 21-22
- Apr 23, 2014.
Sweet! Chocolate Consumption strongly correlated to Nobel Prizes; Cheat Sheets for Data Scientists; New Book: Social Media Mining - free PDF download; Elusive Data Scientists Driving High Salaries.
- Are Big Data and Privacy at odds? FICO Interview
- Apr 23, 2014.
We discuss privacy, FICO scores, balancing predictive power and non-discrimination, whether technology is bringing big data and privacy closer, and the most important privacy issues for FICO.
- Big Data Innovation Summit 2014: Highlights of Keynote Speeches on Day 2
- Apr 23, 2014.
Highlights from keynote speeches by big data experts from Facebook, RedPoint Global, Quintiles, Samsung, GMU, PayPal, and others on Day 2 of Big Data Innovation Summit 2014 in Santa Clara.
- Big Data Innovation Summit 2014: Highlights of Keynote Speeches on Day 1
- Apr 23, 2014.
Highlights from keynote speeches by big data technology leaders from industry and academia on the first day of Big Data Innovation Summit 2014 in Santa Clara.
- SIGKDD Data Science/Data Mining PhD Dissertation Award – Nominations Due Apr 30
- Apr 23, 2014.
This annual award by ACM SIGKDD seeks to recognize outstanding research by doctoral candidates in the field of data mining, data science, and knowledge discovery. Nominations due Apr 30.
- MMDS 2014: Workshop on Algorithms for Modern Massive Data Sets, Berkeley
- Apr 22, 2014.
The MMDS 2014 workshop (Berkeley, June 17-20) will bring top researchers to address algorithmic, mathematical, and statistical challenges in modern statistical data analysis. Early registration deadline May 1.
- New Book: Social Media Mining – free PDF download
- Apr 22, 2014.
Social Media Mining integrates social media, social network analysis, and data mining to enable students, practitioners, researchers, and managers to understand the basics and potentials of this field.
- TESC Online, Affordable MBA in Data Analytics – Recharge Your Career
- Apr 22, 2014.
This online program enables graduates to use advanced data analytics to drive continuous improvement in business and organizations, and lets you earn credit for professional certifications/expertise.
- Top KDnuggets tweets, Apr 18-20
- Apr 22, 2014.
Cross-validation pitfalls for regression/classification and how to avoid them; Data Workflows for Machine Learning; Apache Spark, the hot new trend in Big Data; Visual Analysis Best Practices - download a free guidebook from Tableau.
- Upcoming Webcasts on Analytics, Big Data, Data Science – April 21 and beyond
- Apr 21, 2014.
Traditional RDBMS Wisdom is All Wrong, What Is Hadoop and Where Is It Going, Trupanion and SiSense, Measuring Skill Level and Optimizing Player-Matching Algorithms in Online Games, and Analytically Speaking Featuring David Meintrup.
- Microsoft Expands Big Data Platform
- Apr 21, 2014.
Microsoft expands its data platform with 3 major features: SQL Server 2014 with in-memory technology, Azure Intelligent Systems Service, and Analytics Platform System - SQL Server + Hadoop. New CEO Satya Nadella gives a low-key but impressive presentation.
- Exclusive Interview: Michael Brodie, Leading Database Researcher, Industry Leader, Thinker
- Apr 21, 2014.
We discuss the most important database research advances, industry developments, role of relational, NoSQL, Graph databases, Computing Reality, and more.
- Elusive Data Scientists Driving High Salaries
- Apr 21, 2014.
Recent study tracks experience, salary, industry and location of Data scientists, finds they are earning base salaries over $200K. Download free report.
- Data Workflows for Machine Learning
- Apr 20, 2014.
Paco Nathan compares several open source frameworks for Machine Learning workflows, including KNIME, IPython Notebook and related libraries, Cascading, Cascalog, and Spark/MLbase, and proposes 9 criteria to evaluate the best alternatives.
- Top stories for Apr 13-19
- Apr 20, 2014.
Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Science; Data Analytics Handbook, free download; Apache Spark, the hot new trend in Big Data; GoodData Open Analytics Platform.
- Top KDnuggets tweets, Apr 16-17
- Apr 19, 2014.
Scikit-Learn: a great Python library for machine learning; A map of where nobody lives in the US; Apache Spark, the hot new trend in Big Data; NYU @aghose on Est. Demand for Mobile Apps - Learn more: NYU Stern MS in Biz Analytics.
- Apache Spark, the hot new trend in Big Data
- Apr 18, 2014.
Spark solves problems similar to those addressed by Hadoop MapReduce, but with a fast in-memory approach and a clean functional-style API. Leveraging Hadoop YARN, Alpine has made it very simple to get started with Spark.
- Big Data TechCon – Great How-To Conference
- Apr 17, 2014.
The recent BigData TechCon conference in Boston featured practical, how-to classes and tutorials for IT and Big Data professionals. It is the how-to training conference for professionals implementing and analyzing Big Data.
- Exclusive Interview: Peter Bruce, President Statistics.com
- Apr 17, 2014.
We discuss the mission of Statistics.com, the selection of analytics courses and certificates, the future of analytics education, MOOCs, whether Statistics is disconnected from Big Data, the role of a data scientist, and more.
- UC Berkeley Master of Information and Data Science, Online
- Apr 17, 2014.
This online degree is for professionals who want to become leaders in the field of data science. Students benefit from UC Berkeley's strong ties to Silicon Valley and a multidisciplinary approach that teaches the entire data life cycle.
- Examining GoodData Open Analytics Platform
- Apr 16, 2014.
KDnuggets examines the main features of GoodData Open Analytics Platform, its users, how it compares to competition, and future plans.
- Top KDnuggets tweets, Apr 14-15
- Apr 16, 2014.
9 Free Books for Learning Data Mining and Data Science; Coursera #DataScience Specialization: 10 courses from JHU; Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Mining, and Data Science; EMC Data Science and Big Data Analytics Offer.
- KDnuggets 14:n09: Top LinkedIn Groups in 2014; Big Data Vendor Analysis; Teaching Machine Learning
- Apr 16, 2014.
Latest analytics, data mining, and data science stories, including Features, Opinions, Software, News, Webcasts, Courses, Meetings, Jobs, Academic positions, Publications, Tweets, CFP.
- Vendor-Neutral Hands-On Training in Data Mining [ Wash-DC, May | Denver-CO, July ]
- Apr 15, 2014.
Successful analytics in the big data era does not start with data and software, but with immersive hands-on training and goal-driven strategy. Get this training from The Modeling Agency.
- Webinar: Data Mining: Failure to Launch [Apr 17]
- Apr 15, 2014.
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is Apr 17.
- PAW: Predictive Analytics World, Toronto, May 12-15
- Apr 15, 2014.
Predictive Analytics World in Toronto brings you the leading experts in predictive analytics, with keynotes from top speakers. Join PAW and access the best keynotes, sessions, workshops, and more. Bonus: tips on getting approval to attend.
- Oracle Academy – Teaching Students Around The World
- Apr 15, 2014.
Oracle Academy teaches millions of students around the world and supports Oracle and open-source applications, with courses ranging from computer science for kids to Big Data education.
- EMC Data Science and Big Data Analytics Offer
- Apr 15, 2014.
Gain the skills to become an immediate contributor on a data science team - get EMC Starter Kit or get instructor-led training in Big Data Analytics, now 50% off until May 15 with special code.
- Top KDnuggets tweets, Apr 11-13: Influential Data Scientists on Twitter; Data Analytics Handbook – free download
- Apr 14, 2014.
Influential Data Scientists on Twitter and what they do now; Data Analytics Handbook - Interviews with Data Scientists and CEO, free download; An Introduction to Deep Learning in Java; #BigData Salaries for Data Analysts, Data Scientists, DBAs.
- Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Mining, and Data Science
- Apr 14, 2014.
We analyze Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science. Overall activity drops about 25%, but membership growth accelerates in Q4 2013. We identify 4 group quadrants and find which groups are fastest growing and most active.
- Top stories for Apr 6-12
- Apr 13, 2014.
Learning and Teaching Machine Learning: A Personal Journey; Interactive Big Data Timeline; Big Data Vendor Analysis; Beyond the Science of Data Science.
- Data Analytics Handbook – Interviews with Data Scientists and Tech Leaders, free download
- Apr 12, 2014.
A young team of UC Berkeley students produced a Data Analytics Handbook - free to download, featuring interviews with data scientists and tech leaders from leading companies including LinkedIn, Cloudera, Facebook, Yelp, and Flurry.
- Vincent Granville Data Science Book
- Apr 12, 2014.
The Data Science book from Analytic Bridge founder Vincent Granville shows you what employers want and the skill set that separates the quality data scientist from other IT professionals.
- Top KDnuggets tweets, Apr 9-10: MLlib: Scalable Machine Learning on Spark; Ensemble methods overview
- Apr 11, 2014.
MLlib: Scalable Machine Learning on Spark (free ebook); Ensemble methods usually give best results in Machine Learning - an overview; Prediction.io open source machine learning server; Maslow Hierarchy of Analytical Needs - too clever?
- SiSense Crowd Accelerated BI
- Apr 11, 2014.
We examine SiSense Crowd Accelerated BI - an innovative approach which paradoxically enables faster response as the number of users grows.
- Prediction.io open source machine learning server
- Apr 10, 2014.
Prediction.io is an open source machine learning server for predictive solutions, such as personalization or recommendations, built on top of scalable frameworks such as Hadoop and Cascading - ready to handle Big Data.
- Upcoming Webcasts on Analytics, Big Data, Data Science – April 10 and beyond
- Apr 10, 2014.
In-Database Scalable R & Python, Why Analytics Belongs in the Cloud, Data Mining: Failure To Launch, Pivotal Big Data Suite, What Is Hadoop and Where Is It Going, Optimizing Player-Matching Algorithms in Online Games, and more.
- Open Analytics NYC Summit May 8
- Apr 10, 2014.
Open Analytics Summits are a great place for CTOs, Engineers, Developers, Data Scientists, and others to connect, network, and learn about open source technologies and big data analytics. Early reg by Apr 18 + KDnuggets discount.
- Big Data Vendor Analysis
- Apr 10, 2014.
The Big Data market in hardware, software, and services reached $18.6 billion in 2013. We analyze the vendor data and identify 4 main clusters of Big Data companies.
- Salford Hands On Data Mining Mini Training, San Diego, April 25
- Apr 9, 2014.
Attend this mini-training to get step-by-step instruction for the most popular data mining techniques, and walk away with the models you create, ready to start your own data projects.
- Top KDnuggets tweets, Apr 7-8: Beware of P values – they are not reliable; The Data Scientist Toolbox online course
- Apr 9, 2014.
Data scientists beware: P values are not reliable; The Data Scientist Toolbox course - online at Coursera; Beyond the Science of Data Science; EU recommends changing copyright law to enable scientific text data mining.
- Interactive Big Data Timeline
- Apr 8, 2014.
A very interesting interactive Big Data timeline takes you from the beginning of information overload in the 1880s to Business Intelligence, the World Wide Web, Hadoop, Cloud, and more.
- March Analytics, Big Data, Data Mining Acquisitions and Startups Activity
- Apr 8, 2014.
March 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Vizify, Echo Nest, Coinalytics, VoltDB, Platfora, Dell buys Statsoft, Vicarious, RelateIQ, Cloudera.
- Beyond the Science of Data Science
- Apr 8, 2014.
The difference between what is required to be a successful analytics consultant and what is taught in school.
- Useful Business Analytics Summit in Boston, June 10-11
- Apr 8, 2014.
Join the Useful Business Analytics Summit (Boston, June 10-11), the unique conference where corporate peers can meet and determine how USEFUL analytics can improve business decision making. Special KDnuggets discount and early bird by Apr 18.
- Top KDnuggets tweets, Apr 4-6: Apache Spark – a Fast #BigData Analytics Engine; Facebook #DataScience tools
- Apr 7, 2014.
Apache Spark - a Fast #BigData Analytics Engine - very good, detailed overview! Facebook #DataScience team releases open-source tools; Top #BigData start-ups by employee satisfaction; My answer to "Which will be better for career prospects in Machine Learning".
- CEOWorld Top Big Data Executives and Experts to Follow on Twitter
- Apr 7, 2014.
Another list of 64 Top Big Data Execs and Experts on Twitter from CEO World is led by Hilary Mason (@hmason), @todd_park, @SethGrimes, Cindi Howson (@biscorecard), and Gregory Piatetsky-Shapiro (@kdnuggets).
- Book Review: Data Just Right
- Apr 7, 2014.
An introduction to technology and software at play in the current quest to define the Big Data Analytics computing paradigm, the book Data Just Right is reviewed in detail here.
- Additions to KDnuggets Directory in March
- Apr 7, 2014.
MLconf (NYC), Customer Analytics (Philadelphia), AnalyticsWorld (Chicago), Useful Business Analytics Summit (Boston), CBIG Consulting, QueBIT, and other meetings, companies, and education in Analytics, Big Data, Data Mining, and Data Science.
- Top stories for Mar 30 – Apr 5: Exclusive Interview – LinkedIn Economic Graph
- Apr 6, 2014.
Is Data Scientist the right career path - Candid advice; Exclusive: Interview with Sriram Sankar - LinkedIn Economic Graph; Information Management 10 More Big Data Companies; Top stories in March: Machine Learning in 7 Pictures.
- VAST Visual Analytics Challenge 2014: Visualization and Crime-Solving
- Apr 5, 2014.
VAST Challenge 2014 involves three mini-challenges that focus on text, location, and streaming data analysis, and an overall Grand Challenge to test your skills. Submit entries before July 8.
- Learning and Teaching Machine Learning: A Personal Journey
- Apr 5, 2014.
Joseph Barr examines the history and origins of Machine Learning and Artificial Intelligence and recounts his personal journey from statistics to industry to teaching machine learning and running R on Unix clusters.
- Three ways to extract business value from analytics
- Apr 4, 2014.
Data Driven Business recently interviewed 9 corporate analytics experts from companies like Johnson & Johnson, L’Oreal Paris, and Google about current trends in analytics and what companies should focus on in 2014, and found 3 main areas of focus.
- Top KDnuggets tweets, Apr 2-3: Data scientists need their GitHub; How to make Data Scientist job less tedious
- Apr 4, 2014.
Also Top stories in March: Machine Learning in 7 Pictures; import.io adds authenticated APIs, command line crawlers.
- Employee Churn 201: Calculating Employee Value
- Apr 4, 2014.
Much has been written about customer churn. This post examines employee churn, an equally important problem, and its unique dynamics.
- Madrid Summer School 2014 on Advanced Statistics and Data Mining
- Apr 4, 2014.
The annual summer school on "Advanced Statistics and Data Mining" will be held in English, in Madrid, Spain, June 23 - July 4, and will include 12 courses on the latest and most important topics. Registration is open, and you can register for each course independently.
- HP Perspective on Big Data and Analytics: Interview with Mazhar Hussain
- Apr 3, 2014.
KDnuggets talks with Mazhar Hussain, HP Big Data & Analytics Services Leader, on key topics for the industry and the 4 next big areas in Big Data.
- DataScience Central competition: Automate jackknife regression
- Apr 3, 2014.
Data Science Central holds a competition to get statisticians more involved in Data Science - create a black-box, automated, easy-to-interpret, sample-based, robust technique called jackknife regression.
- Upcoming Webcasts on Analytics, Big Data, Data Science – April 3 and beyond
- Apr 3, 2014.
HP Vertica, Hadoop Data Warehouse with Impala, In-Database Scalable R & Python, Data Mining Failure To Launch, What Is Hadoop and Where Is It Going, and more.
- KDnuggets 14:n08, Is Data Scientist the right career path for you? FiveThirtyEight stumbles
- Apr 3, 2014.
Latest analytics/data mining news, including candid advice on Data Scientist career, FiveThirtyEight stumbles on climate change, Boston Panel on Next Big Thing in Big Data, and White House/MIT Big Data Privacy Workshop report.
- Top stories in March: Machine Learning in 7 Pictures; How Many Data Scientists?
- Apr 2, 2014.
Also - The Dos and Don'ts of Data Mining; Is Data Scientist the right career path for you - Candid advice.
- Webcast – Analytically Speaking featuring Rob Reul
- Apr 2, 2014.
Focusing on customer intelligence research, including why customer satisfaction is not enough; survey analysis and the role of interactive data visualization; and the importance of choice experiments.
- Top KDnuggets tweets, Mar 31 – Apr 1: Experfy marketplace for Data Science projects; Event Recommendation in Python
- Apr 2, 2014.
Experfy launches a marketplace for #DataScience projects; Machine Learning Project: Event Recommendation in Python; Anyone can see your email address on LinkedIn with this Chrome extension; 5 reasons to use R: free, popularity, power, flexibility, support.
- Exclusive: Interview with Sriram Sankar – LinkedIn Economic Graph
- Apr 2, 2014.
KDnuggets talks with Sriram Sankar, Principal Staff Engineer at LinkedIn, about LinkedIn’s “Economic Graph”, Entity-Oriented Search, and the biggest challenges in delivering relevant, personalized search results.
- SAS Competition for Top Data Scientist in UK, Ireland
- Apr 2, 2014.
The contest, open to the academic and business communities, aims to find the best talent in analytics and data science in the UK and Ireland. The goal is to produce an innovative forecast of energy demand.
- Book: Visual Analytics of Movement
- Apr 2, 2014.
This new book is about the exciting possibilities created by visual analytics for anyone interested in understanding movement, analyzing movement, or simply making decisions influenced by the way people, animals, and objects move.
- April-July 2014 Meetings in Analytics, Big Data, Data Mining, and Data Science
- Apr 1, 2014.
Coming soon - Big Data Innovation Summit (Santa Clara), MLconf NYC, SDM 13, PAW Toronto, PAKDD, AnalyticsWorld Chicago, Future of Consumer Intelligence, Useful Business Analytics Boston, and many more.
- Forrester Research: Build Trusted Data with Data Quality
- Apr 1, 2014.
Key takeaways of the report include: How managing data quality brings IT and the business closer together, Different data quality definitions, and advantages of transparency in data quality.