Big Data Innovation Summit 2014 London: Highlights
Highlights from the presentations by Big Data technology practitioners from Sears Holdings, Microsoft, Ticketmaster during Big Data Innovation Summit 2014 in London.
on May 31, 2014 in Big Data, Data Visualization, IE Group, London-UK, Microsoft, Privacy, Sears Holdings, Social Analytics
US Open Data Action Plan and Datasets
We summarize the key findings in the recently released US Open Data Action Plan, highlighting the principles, commitments, datasets released and future outlook.
on May 31, 2014 in Datasets, Government, Open Data, Social Participation, White House
Interview: Kirk Borne, Data Scientist, GMU on Decision Science as a Service and Data Science curriculum
We discuss Kirk's role at Syntasa, the concept of "Decision Science as a Service", key components of a well-designed Data Science education curriculum, advice for young aspirants and more.
on May 31, 2014 in Advice, Decision Making, Education, Interview, Kirk D. Borne, Skills, Syntasa
Gaming Analytics Innovation Summit: Day 2 Highlights
Highlights from the presentations by Gaming Analytics experts from Ubisoft, Electronic Arts, Sega on Day 2 of Gaming Analytics Summit 2014.
on May 30, 2014 in Acquisitions, Analytics, Data Science, Games, Gaming, IE Group, San Francisco-CA
Preparing Industry for the Upcoming Data Deluge: PAW-Manufacturing 2014
Predictive analytics will become more powerful in industry as the volume of data that computers collect and analyze in consumer and manufacturing contexts grows.
on May 30, 2014 in Bala Deshpande, Manufacturing, PAW, Predictive Analytics, Quality Control
Interview: Kirk Borne, Data Scientist, GMU on Big Data in Astrophysics and Correlation vs. Causality
We discuss how to build the best data models, significance of correlation and causality in Predictive Analytics, and impact of Big Data on Astrophysics.
on May 30, 2014 in Correlation, Interview, Kirk D. Borne, Predictive Analytics, Recommendations
Top KDnuggets tweets, May 28-29: SAS University Edition free software; Google Quantum Computing Playground
SAS University Edition offers free #SAS software for higher education, teaching; Ultra-cool! Google "Quantum Computing Playground" - fiddle with quantum algorithms; Thomson Reuters: Data Scientist; Realtime Personalization and Recommendation with Stream Mining.
on May 30, 2014 in Google, Quantum Computing, SAS, Stream Mining, Thomson Reuters
Data Mining Modern Languages
We examine the trends and implications in modern language enrollment in the United States, and also show an excellent example of using rCharts and ggplot2 for interactive visualization.
on May 30, 2014 in Data Visualization, Education, India, R, Spanish, Vivek Patil
Data Discovery to Real Business Value – INFORMS Conference, June 22-24, San Jose
Learn how to achieve return on investment and real business value from your Big Data investments through real-life case studies, insightful presentations, and a lot more at the INFORMS Conference in San Jose.
on May 30, 2014 in Big Data, Business Value, Conference, INFORMS, San Jose-CA
Big Data for Executives 2014: Day 2 Highlights
Highlights from the presentations by Big Data experts from McKinsey Solutions, SAP, Techfetch, Weather Analytics on Day 2 of Big Data for Executives 2014.
on May 29, 2014 in Analytics, Big Data, Machine Learning, Visualization, Washington-DC
KDnuggets Social Network in NodeXL, May 2014
We examine the KDnuggets Twitter Social Network, as generated by NodeXL, looking at clusters, top Twitter accounts, URLs, hashtags, words, and what it all means.
on May 29, 2014 in Clustering, Excel, Gregory Piatetsky, Marc Smith, NodeXL, Social Network Analysis
Gaming Analytics Summit 2014: Day 1 Highlights
Highlights from the presentations by Gaming Analytics experts from Activision, Valve, Microsoft and Broken Bulb Studios on Day 1 of Gaming Analytics Summit 2014.
on May 29, 2014 in Analytics, Boosting, Customer Engagement, Games, Gaming, IE Group, San Francisco-CA
Interview: Walter Maguire, Chief Field Technologist on HP Big Data Strategy and HAVEn
We discuss how HP views Big Data, capabilities of HP HAVEn, leveraging Big Data for improving customer experience, Analytics challenges, outsourcing criteria and current trends.
on May 28, 2014 in Business Intelligence, Customer Experience, Haven OnDemand, Infrastructure, Interview, Vertica, Walter Maguire
Top KDnuggets tweets, May 26-27
Machine Learning Algorithms Tour: Regression, kNN, Regularization, Decision Tree; Where to Learn Deep Learning - Courses, Tutorials, Software; 9 Courses on Data Science, R, Machine Learning start on Coursera.
on May 28, 2014 in Algorithms, Brazil, Coursera, Data Science Education, Deep Learning, Machine Learning, World Cup
KDnuggets 14:n13, Where to learn Deep Learning; Vowpal Wabbit; 10 Big Data Pros
Latest analytics/data mining news, including Features, Software, Opinions, News, Webcasts, Courses, Meetings and Reports, Jobs, Academic Positions, Publications, Tweets, and CFP.
on May 28, 2014 in
Book: Data Mining and Analysis: Fundamental Concepts and Algorithms
This textbook for senior undergraduate and graduate data mining courses provides a broad yet in-depth overview of data mining, integrating related concepts from machine learning and statistics. Companion website has data, slides and other teaching material.
on May 27, 2014 in Algorithms, Book, Data Mining, Mohammed Zaki, Textbook
Big Data Use Case: Zookeeper at Rubicon Project
What is the big idea with ZooKeeper - a summary of an excellent Big Data use case using Apache ZooKeeper for Hadoop implementation.
on May 27, 2014 in Daniel D. Gutierrez, Hadoop, Jan Gelin, ZooKeeper
Free Data Science Workshops at Bournemouth University, UK, June 9-10
2 Data Science Workshops: "Data scientist: The sexiest job of the 21st century?" and "Data as a utility and analytics as a service". Attendance is free, but registration required.
on May 27, 2014 in Bournemouth, Data Scientist, Workshop
Interview: Martin Hack, CEO, Skytree on Industrializing Machine Learning for Big Data
We discuss the mission of Skytree, product strategy, complimentary consulting programs, recent trends, and current expectations from Machine Learning.
on May 26, 2014 in Advanced Analytics, Machine Learning, Martin Hack, Predictive Analytics, Skytree
Upcoming Webcasts on Analytics, Big Data, Data Science – May 26 and beyond
Purchase history to customer projects, Hadoop, YARN, BigML, Amazon Redshift, ClearStory Data, and Analytically Speaking Featuring Dan Ariely - author of Predictably Irrational.
on May 26, 2014 in Analytically Speaking, BigML, Dan Ariely, Hadoop
Vowpal Wabbit: Fast Learning on Big Data
Vowpal Wabbit is a fast out-of-core machine learning system, which can learn from huge, terascale datasets faster than any other current algorithm. We also explain the cute name.
on May 26, 2014 in Fast Learning, John Langford, Machine Learning, Microsoft, Vowpal Wabbit
Top KDnuggets tweets, May 23-25: Data Science vs. Statistics: one big difference; A SQL query walks into a bar
Data Science vs. Statistics: one big difference in Data Science focus; TGIF: A SQL query walks into a bar, approaches two girls at two tables ...; Amazing demo - IBM #Watson analyzes topic, presents a speech, can debate opponents; Microsoft #Kinect as Inexpensive #BigData Tool.
on May 26, 2014 in Data Science, Deep Learning, Humor, Kinect, SQL, Statistics, Watson
Where to Learn Deep Learning – Courses, Tutorials, Software
Deep Learning is a very hot Machine Learning technique that has been achieving remarkable results recently. We give a list of free resources for learning and using Deep Learning.
on May 26, 2014 in Andrew Ng, Deep Learning, Geoff Hinton, Machine Learning, Yann LeCun
Top stories for May 18-24
New Poll: Analytics, Data Mining, Data Science Software Used? Stacking the Deck: The Next Wave of Opportunity in Big Data; Michael O'Connell on how to lead in Big Data; Tamr at the New Frontier of Big Data Curation.
on May 25, 2014 in Data Curation, Full Stack Analytics, Michael O'Connell, Next Big Thing, Poll, Tamr, Top stories
Interview: Richard Wendell, VP, Data Science, TE Connectivity on the Role of Analytics in Organizations
We discuss the organizational structure of a data science team, making Analytics an integral component of all projects, the future of Big Data, and crucial soft skills for aspiring practitioners.
on May 24, 2014 in Advanced Analytics, Advice, Data Science, Richard Wendell, TE Connectivity
OnlyBoth Startup is like IBM Watson in Reverse
OnlyBoth startup from Vivisimo co-founder Raul Valdes-Perez is like IBM Watson in reverse - it discovers new insights in data and writes them in English.
on May 24, 2014 in CMU, Education, IBM, OnlyBoth, Raul Valdes-Perez, Vivisimo
Saxon Global, fast growing BI, Big Data, Cloud Service Provider
Why India is emerging as a powerhouse of Analytics, Big Data Applications, Privacy, What will replace Big Data, and more.
on May 24, 2014 in Applications, Big Data, India, Interview, Privacy, Saxon Global
Big Data for Executives 2014: Day 1 Highlights
Highlights from the presentations by Big Data experts from Sears Holdings, PWC, Oracle, Altamira, Tesora on Day 1 of Big Data for Executives 2014.
on May 23, 2014 in Big Data, Oracle, Predictive Analytics, PWC, Sears Holdings, Washington-DC, Wikibon
Top KDnuggets tweets, May 21-22: Outlier Detection for Temporal Data; Become a Big Data mgr with #ieMBD
Outlier Detection for Temporal Data; 1.5M #BigData managers will be needed - Become one with #ieMBD; Goldman Sachs Surveillance Analytics; InformationWeek 10 Big Data Pros To Follow On Twitter.
on May 23, 2014 in Anomaly Detection, Data Science Education, Goldman Sachs, Temporal Data, Twitter
Interview: Richard Wendell, VP, Data Science, TE Connectivity on Strategy for Analytics Projects
We discuss the last mile of the execution path of Analytics projects, five critical pillars of success and data-driven decision making through advanced analytics.
on May 23, 2014 in Advanced Analytics, Big Data Strategy, Project Fail, Richard Wendell, TE Connectivity
Registration open for KDD 2014 – Best Data Mining and Data Science Research
KDD 2014, held Aug 24-27 in New York City, brings you the best research and the state of the art in Knowledge Discovery, Data Mining, and Data Science. Registration is now open - early rate till July 15.
on May 23, 2014 in KDD-2014, New York-NY, Oren Etzioni, Registration
InformationWeek 10 Big Data Pros To Follow On Twitter
The InformationWeek list of 10 Big Data Pros includes leading industry experts @merv, @sogrady, @Sve_Sic, @KirkDBorne, @KDnuggets, @BigDataGal, @Data_Nerd, @JaimeFitzgerald, @TonyBaer, and @marcusborba.
on May 22, 2014 in Big Data, Gregory Piatetsky, InformationWeek, KDnuggets Honors, Kirk D. Borne, Top 10, Twitter
Data Literacy: Education for the Information Economy
Brian Liou discusses how data literacy adopts an even more important role for professionals in the data-driven world of today as seen in the interviews conducted for the Analytics Handbook.
on May 22, 2014 in Brian Liou, Hal Varian, Handbook, UC Berkeley
Outlier Detection for Temporal Data
Outlier Detection for Temporal Data covers topics in temporal outlier detection, which have applications in numerous fields. It starts with the basic topics, then moves on to state-of-the-art techniques in the field.
on May 22, 2014 in Anomaly Detection, Book, Charu Aggarwal, Jiawei Han, Morgan & Claypool, Outliers, Temporal Data
Zementis – Cool Vendor in Data Science, 2014
Zementis, named by Gartner a Cool Vendor in Data Science, provides a scalable, standards-based platform to rapidly deploy and execute predictive models.
on May 22, 2014 in Data Science, Gartner, PMML, Vendors, Zementis
MADALGO Summer School on LEARNING AT SCALE, August 11-14, Denmark
MADALGO Summer School will teach the latest developments in learning at scale as applied to Big Data. Registration is free on a first-come, first-served basis. Denmark, Aug 11 - 14, 2014.
on May 22, 2014 in Big Data, Denmark, Machine Learning, Optimization, Summer School
Interview: Dale Russell, CTO, Talksum on Building Talksum Router and Real-time Analytics
We discuss challenges in building Talksum data stream solution, current trends in real-time analytics, advice for Data Science aspirants and more.
on May 21, 2014 in Advice, Analytics, Dale Russell, Data Management, Interview, Real-time, Talksum
Signi-Trend App: Detecting Significant Trends in Text
Signi-Trend is a visual explorer tool for a new, heavy-hitters style, trend detection algorithm. Details will be published at KDD 2014.
on May 21, 2014 in Erich Schubert, KDD-2014, Text Analytics, Trend Detection, Twitter
Top KDnuggets tweets, May 19-20: 12 Free Data Mining, Data Science books; Exclusive: How to Lead in Big Data
12 Free Data Mining, Data Science, Applied Stats, and Machine Learning Books; Exclusive Interview: Michael O'Connell on How to Lead in Big Data; Free ebook: The Complete Guide to Facebook Analytics; AI, Deep Learning race between Google, Facebook, and Baidu.
on May 21, 2014 in Andrew Ng, Baidu, Deep Learning, Facebook, Free ebook, Google, Michael O'Connell
KDnuggets 14:n12, Annual Poll: Software Used? Tamr & Data Curation; Data Science Cheat Sheets
Latest analytics/data mining news, including Features, Software, Opinions and Interviews, News, Webcasts, Courses, Meetings and Reports, Jobs, Publications, Top Tweets, and CFP.
on May 21, 2014 in Cheat Sheet, Data Curation, Poll, Tamr
Interview: Dale Russell, CTO, Talksum on Winning the IE Big Data Startup Award
We discuss Talksum data stream router and cross-domain networking with real-time data management using data streams.
on May 20, 2014 in Awards, Dale Russell, Data Management, IE Group, Interview, Startup, Talksum
PAW: Predictive Analytics World Chicago, Expert-led Workshops
Discover best practices and sharpen your skills by attending one of our expert-led workshops: The Best and Worst of Predictive Analytics, R for Predictive Modeling - Hands-on, and Advanced Methods Hands-on.
on May 20, 2014 in Chicago-IL, Hands-On, PAW, Predictive Analytics World, R
New Poll: Analytics, Data Mining, Data Science Software Used?
Please vote in our well-known annual KDnuggets Software Poll: What Analytics, Big Data, Data Mining, Data Science software have you used in the past 12 months for a real project?
on May 20, 2014 in Data Mining Software, Poll
Stacking the Deck: The Next Wave of Opportunity in Big Data
A leading venture capitalist explains why the Big Data infrastructure market is mostly mature and where the next big area of Big Data opportunity lies.
on May 20, 2014 in Chip Hazard, Full Stack Analytics, Machine Learning, Network Effects, Startups, VC
Upcoming Webcasts on Analytics, Big Data, Data Science – May 19 and beyond
Data Mining: FTL; Deep Learning with H2O; Purchase history to Customer Projects; Apache Hadoop, Hive, Kafka, Solr; Python for Big Data Analytics, and more.
on May 19, 2014 in Deep Learning, Hadoop, Hortonworks, Solr, WCAI, Wharton, YARN
BabelNet 2.5: Very Large Multilingual Encyclopedic Dictionary and Semantic Network
BabelNet 2.5 covers 50 languages, and offers seamless integration of WordNet, Open Multilingual WordNet, Wikipedia, OmegaWiki, Wikidata (NEW), and Wiktionary (NEW). Check upcoming BabelNet workshops.
on May 19, 2014 in BabelNet, Semantic Network, Wikipedia, WordNet
Code for India 2014 Global Hack-a-thon – Building a Better India through Innovative Solutions
Non-stop 24 hours of coding at the Code for India 2014 hackathon leads to creative solutions for major social problems of India through interesting software applications.
on May 19, 2014 in Data Mining, Google, Hackathon, India, Mountain View-CA, Predictive Modeling
Exclusive: Tamr at the New Frontier of Big Data Curation
Our exclusive profile of Tamr (formerly Data Tamer), the latest startup from the legendary Michael Stonebraker, which emerged from stealth mode to address the new field of Big Data Curation.
on May 19, 2014 in Andy Palmer, Data Curation, Machine Learning, Michael Brodie, Michael Stonebraker, Startups, Tamr
Top KDnuggets tweets, May 16-18: Great find – Intro to Data Science, free download; Why code written by scientists gets ugly
Great find! Intro. to Data Science, v2 (170 pages), free download; Why code written by scientists gets ugly; A Statistician's View on #BigData and Data Science - updated; CIOReview Top 100 Most Promising Big Data Companies.
on May 19, 2014 in CIOReview, Companies, Free ebook, Kaggle, Startups, Statisticians
Exclusive Interview: Michael O’Connell, Chief Data Scientist, TIBCO on How to Lead in Big Data
We discuss Big Data vs. Fast Data, Data Visualization trends, Jaspersoft acquisition, factors differentiating future leaders of Big Data and more.
on May 19, 2014 in Analytics Leader, Data Visualization, Interview, Michael O'Connell, Statistics, TIBCO
Top stories for May 11-17
Guide to Data Science Cheat Sheets; Watch: Basics of Machine Learning; Cartoon: Data Visualization meets 3-D Printer; Social Media and Web Analytics Innovation Summit 2014 Highlights.
on May 19, 2014 in 3-D Printing, Cartoon, Cheat Sheet, Machine Learning, Summit, Top stories
Top 100 Startup Experts to Follow on Twitter
A list of Top 100 Startup Experts to Follow on Twitter is headed by @kdnuggets. Check our tweets on Analytics, Big Data, Data Mining, and Data Science startups and acquisitions under hashtag #BigDataCo.
on May 17, 2014 in Experts, Gregory Piatetsky, KDnuggets Honors, Startups, Twitter
CIOReview Top 100 Most Promising Big Data Companies
Top 100 Most Promising Big Data Companies according to CIO Review, from 7Segments to OpenBI to Zeta Interactive.
on May 17, 2014 in Big Data, CIOReview, Companies, Startups
Poll Results: Data Types/Sources Analyzed
Trends in data sources for data mining include: table data dominates, followed by time series and text; audio and JSON grow in popularity, while itemsets decline; 70% access DB engines, but only 20% access NoSQL stores; Hadoop and MongoDB are used more for text; Europe is lagging in NoSQL usage.
on May 17, 2014 in Data types, Hadoop, NoSQL, Poll, Relational Databases
Top KDnuggets Tweets, May 14-15: Easier Facebook Network Analysis; Cloudera Live, a New Way to Start with Hadoop
Facebook Network analysis, visualization is easier with httr from R wizard; Cloudera Live offers a new way to start with #Hadoop - No downloads; Watch: Basics of Machine Learning; BigML Machine Learning platform Spring Release.
on May 16, 2014 in BigML, Cloudera, Facebook, Hadoop, Machine Learning, R
Predict Soccer World Cup 2014 Winner, Get Prizes from RapidMiner
Use a free edition of RapidMiner to have fun and bring sports predictions to another level by making a prediction of Soccer (Futbol) World Cup 2014, which starts on June 12 in Brazil.
on May 16, 2014 in Boston-MA, Brazil, Competition, RapidMiner, Soccer, World Cup
KDD Cup 2014 – Predicting Excitement at DonorsChoose.org
Predict which DonorsChoose projects will be exciting. The 2014 edition of KDD Cup, the first data mining competition, is on Kaggle. Submissions due June 15.
on May 16, 2014 in Charity, Competition, DonorsChoose, Kaggle, KDD-2014, New York-NY
Resource-aware Machine Learning – Summer School 2014, Germany
Summer school in Dortmund, Germany covers Machine Learning with Constrained Resources including topics like detecting astro particles using smartphones. Applications are due by June 30.
on May 16, 2014 in Dortmund, Germany, Machine Learning, Resource-aware, Summer School
Big Data Landscape, v 3.0, analyzed
We analyze the Big Data Landscape and identify the most popular market segments in Analytics, Infrastructure, Applications, Open Source, and Data Sources categories. It is still early - only 4.5% of companies had exits.
on May 15, 2014 in Big Data, Big Data Analytics, Data Platform, Infrastructure, Landscape, Open Source, Startups
Sentiment Analysis Innovation Summit 2014: Day 2 Highlights
Highlights from the presentations by opinion mining experts from Fujitsu, FindiLike and Stanford University on Day 2 of Sentiment Analysis Innovation Summit 2014 in San Francisco.
on May 15, 2014 in Deep Learning, Innovation, San Francisco-CA, Semantic Analysis, Sentiment Analysis
Has Predictive Analytics Crossed The Chasm?
Recent study highlights the increasing market perception that Predictive Analytics leads to competitive advantage. The report also outlines current trends and challenges for Predictive Analytics.
on May 15, 2014 in Big Data, Competition, Crossing the Chasm, IBM, Predictive Analytics, Ventana Research
Interview: Gary Shorter, Director of Data Science, Quintiles on Big Data for Healthcare
We discuss rising medical costs, how Big Data can help, key features of Quintiles Infosario, and Topological Data Analysis.
on May 15, 2014 in Data, Data Science, Gary Shorter, Healthcare, Interview, Medical, Quintiles
Social Media & Web Analytics Innovation Summit 2014: Day 2 Highlights
Highlights from the presentations by analytics experts from Youtube, Evernote and Wikia on day 2 of Social Media & Web Analytics Innovation Summit 2014 in San Francisco.
on May 15, 2014 in Market Research, San Francisco-CA, Social Media Analytics, Web Analytics, Youtube
Uppd8: An Engine for the Wisdom of Crowds
What people think matters. Uppd8 focuses on crowd sentiment analysis and provides tag-scored data based on different user types. Basic services will be provided for free.
on May 15, 2014 in NoSQL, Quality Score, Sentiment Analysis, SQL, Startup, Uppd8
Northwestern Online MS in Predictive Analytics
Prepare for a leadership-level career, learn from top faculty and industry experts, and earn your analytics degree online. Fall Quarter application deadline July 15.
on May 15, 2014 in MS in Data Science, Northwestern, Online Education
Sentiment Analysis Innovation Summit 2014: Day 1 Highlights
Highlights from the presentations by opinion mining experts from Twitter, eBay and Samsung on Day 1 of Sentiment Analysis Innovation Summit 2014 in San Francisco.
on May 14, 2014 in Innovation, Machine Learning, San Francisco-CA, Semantic Analysis, Sentiment Analysis
Social Media & Web Analytics Innovation Summit 2014: Day 1 Highlights
Highlights from the presentations by experts from Google, CapitalOne, StubHub and Social Media Research Foundation on day 1 of Social Media & Web Analytics Innovation Summit 2014 in San Francisco.
on May 14, 2014 in Google Analytics, NodeXL, San Francisco-CA, Social Media Analytics, Web Analytics
Interview: Prateek Jain, Director of Engineering, eHarmony on Fast Search and Sharding
We discuss Big Data architecture, fast multi-attribute searches, database sharding and scaling challenges at eHarmony.
on May 14, 2014 in eHarmony, Interview, MongoDB, Prateek Jain, Search Infrastructure, Sharding
Top KDnuggets tweets, May 12-13: Guide to Data Science Cheat Sheets; How to analyze Facebook Networks using R
Guide to Data Science Cheat Sheets; Clever hack: How to analyze Facebook Networks using R; Very useful - Introduction to #SQL for Data Scientists; Planning a late career shift to Analytics /Data Science? Be prepared.
on May 14, 2014 in Career, Cheat Sheet, Facebook, R, SQL
Vendor-Neutral Hands-On Training in Data Mining [Denver-CO, July | Wash-DC, Sep]
Successful analytics in the big data era does not start with data and software, but with immersive hands-on training and goal-driven strategy. Get this training from The Modeling Agency.
on May 14, 2014 in Data Science Education, Denver-CO, Hands-On, TMA, Washington-DC
BPDM 2014: Broadening Participation in Data Mining Program
The BPDM Program, held at KDD-2014, aims to foster mentorship, guidance, and connections of minority and underrepresented groups in Data Mining/Data Science by providing scholarships to interact with and learn from senior researchers. Apply by June 12.
on May 14, 2014 in BPDM, KDD-2014, Mentorship, New York-NY
Watch: Basics of Machine Learning
Watch a series on machine learning, going from basics like Naive Bayes, Decision Tree, Generalization and Overfitting, to more complex topics like Hierarchical Agglomerative Clustering.
on May 14, 2014 in Machine Learning, Online Education, Victor Lavrenko, Youtube
Do Analytics as well as Google, Johnson&Johnson, and AT&T
Attend Useful Business Analytics Summit in Boston and learn how the leading companies do analytics. Early reg by May 16.
on May 13, 2014 in Boston-MA, Business Analytics, Google, Nokia
New Book: Analytics in a Big Data World – The Essential Guide to Data Science
For organizations looking to enhance their capabilities via data analytics, this book is the go-to reference for applying Data Science to make the right business decisions.
on May 13, 2014 in Analytics, Applications, Bart Baesens, Big Data, Book, Data Science, Wiley
Media Industry Embracing Analytics for Innovation and Competitive Edge
Survey results highlight the importance of Analytics capability in media industry and the consumer beliefs on privacy vs. personalization benefits.
on May 13, 2014 in Analytics, Bain, Media, Personalization, Privacy
Data Analytics Handbook p. 3, Interviews with Research Leaders and Academics, Free Download
Part 3 features interviews with research leaders and academics, including Hal Varian (Chief Economist, Google), Gregory Piatetsky (Editor, KDnuggets), and Analytics Thought Leader Tom Davenport (Professor, Babson College). Free download.
on May 13, 2014 in Data Analytics, Hal Varian, Handbook, Tom Davenport, UC Berkeley
Forrester Research: Transform Your Organization with Strong Data Management
New Forrester Research report shows how to build a more elastic and flexible data management practice to meet the new data demands. Free download compliments of Lavastorm Analytics.
on May 13, 2014 in Data Management, Forrester, Lavastorm, Report
Upcoming Webcasts on Analytics, Big Data, Data Science – May 12 and beyond
Who Owns the Data, The New Database Frontier, Analytically Speaking, Data Mining: Failure To Launch, Deep Learning with H2O, Purchase history to Customer Projects, Hadoop and YARN, and more.
on May 12, 2014 in Analytically Speaking, Apache Storm, Deep Learning, Hadoop, SAS, YARN
Big Data BootCamp: Highlights of talks on Day 3
Highlights from the presentations by big data technology practitioners from Hortonworks, Intel, Rackspace, SciSpike, and Yahoo at Big Data Bootcamp 2014 in Santa Clara.
on May 12, 2014 in Apache Spark, Big Data, Bootcamp, Hands-On, Santa Clara-CA, Startup, Workshops
Interview: George Corugedo, CTO, RedPoint on YARN and Customer Analytics
We discuss significance of YARN for Hadoop 2.0 platform, unique benefits of RedPoint Convergent Marketing Platform and Master Key Management for Customer Analytics.
on May 12, 2014 in Customer Analytics, George Corugedo, Hadoop 2.0, Interview, RedPoint, YARN
Healthcare Analytics: Identifying Leaders and Key Trends
We review a recently released report on Healthcare perceptions of BI/Analytics and share key insights into who is leading healthcare analytics in different categories and what the dominant trends are.
on May 12, 2014 in Analytics Leader, Business Intelligence, Healthcare, KLAS Research, Trends
Top KDnuggets tweets, May 9-11: Data Mining for Statisticians; For teachers (and students) of Machine Learning
Data Mining for Statisticians; For teachers (and students) of #MachineLearning - Slides for LIONbook; Build a word cloud using R text mining tools - step-by-step; Graph Theory: Key to Understanding #BigData - graphs are not just for Google or eBay.
on May 12, 2014 in Graph Theory, LIONbook, Machine Learning, R, Statisticians, Text Mining
KDD 2014 Workshops – the latest in Data Mining and Data Science Research
KDD 2014 workshops are the forum for the latest data mining and data science research. Workshop topics include Data Science for Social Good, Big Data Discovery and Curation, Big Data Analytics for Bio/Health Informatics, Stream Mining, Data Ethics, Sports Analytics, and more. Submission dates from late May to late June.
on May 12, 2014 in KDD-2014, New York-NY, Workshops
Guide to Data Science Cheat Sheets
Selection of the most useful Data Science cheat sheets, covering SQL, Python (including NumPy, SciPy and Pandas), R (including Regression, Time Series, Data Mining), MATLAB, and more.
on May 12, 2014 in Cheat Sheet, Data Science, Python, R, SQL
Cartoon: Data Visualization meets 3-D Printer
New KDnuggets Cartoon looks at what happens when Data Visualization meets 3-D Printer.
on May 11, 2014 in 3-D Printing, Cartoon, Data Visualization
Top stories for May 4-10
Data Scientists Not Required with Alteryx Analytics 9.0; 9 Free Books for Learning Data Mining and Data Analysis; Exclusive Interview: Todd Holloway, Data Science Lead, Trulia; Did Target really predict teen pregnancy?
on May 11, 2014 in Alteryx, Free ebook, Pregnancy, Target, Todd Holloway, Top stories, Trulia
Data Mining for Statisticians
New video series from Salford Systems presents an approach to data mining from a statistical point of view.
on May 10, 2014 in
NineSigma Big Data Analytics RFP
NineSigma is seeking proposals for mining user browsing/operations history, social networking services, and sensing devices to improve personalization and recommendation of products. Submit by May 23, 2014.
on May 9, 2014 in NineSigma, Personalization, Proposals, Recommendations, RFP
White House Report on Big Data: Opportunities and Values
We summarize the key findings in the recently released White House report on Big Data, highlight the key opportunities and concerns, and list the recommendations made to the President.
on May 9, 2014 in Big Data Privacy, Government, Recommendations, Report, White House
Big Data BootCamp Santa Clara: Highlights of talks on Days 1-2
Highlights from the presentations by big data technology practitioners from Caspida, Datastax, ElephantScale, Hortonworks, MapR and Qubole at Big Data Bootcamp 2014 in Santa Clara.
on May 9, 2014 in Big Data, Bootcamp, Hadoop 2.0, Hands-On, Santa Clara-CA, Tools, Workshops
Interview: Arijit Sengupta, CEO, BeyondCore on Advanced Analytics and Big Data
We discuss traditional analytics vs. modern analytics, avoiding over-simplification, human-technology interaction for Big Data, challenges in democratizing analytics and more.
on May 9, 2014 in Advanced Analytics, Arijit Sengupta, BeyondCore, High-dimensional, Interview
Top KDnuggets tweets, May 7-8: 30 Simple Tools for Data Visualization; Did Target Really Predict Pregnancy?
30 Simple Tools for Data and Geo-Visualization: iCharts, Fusion, Modest Maps, Raw ...; Did Target Really Predict a Teen's Pregnancy? The Inside Story; Sense, new Data Science startup, builds a Data Science Platform of the Future; Analytics Experts on #BigData Misconceptions.
on May 9, 2014 in Data Science Platform, Data Visualization, Misconceptions, Pregnancy, Target
MassTLC Big Data Meeting Delivers Insights, Perspective
Summit highlights: Digitization and Datification - a love story, Strategies for creating a competitive advantage in #BigData world, Boston open data, Balancing privacy and governance, and the most widely used #BigData tool in the future.
on May 8, 2014 in Big Data Privacy, Big Data Strategy, Boston-MA, Ingo Mierswa, Massachusetts, MassTLC, Open Data, Paul Sonderegger
April 2014 Analytics, Big Data, Data Mining Acquisitions and Startups Activity
April 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Experfy, Dunnhumby, NexGraph, Fundbox, FICO, Gnip, Fliptop, InBloom, Jaspersoft, and more.
on May 8, 2014 in Acquisitions, dunnhumby, Experfy, FICO, Gnip, India, Jaspersoft, Startups, Twitter
ClearStory – The Fastest, Simplest Way to Analyze Data
ClearStory is modernizing how diverse data is accessed, merged, and analyzed, and how insights are consumed by analysts and business users. Try ClearStory today.
on May 8, 2014 in ClearStory, Free trial
Spotlight: RapidMiner New Predictive Analytics Platform-as-a-Service
We examine the newly announced RapidMiner Platform-as-a-Service, installed on AWS and managed by RapidMiner experts.
on May 7, 2014 in AWS, Cloud Analytics, PaaS, RapidMiner
Did Target Really Predict a Teen’s Pregnancy? The Inside Story
We examine the origin and the facts behind this explosive story, the importance of headlines, and how unsubstantiated assumptions gain traction and mainstream attention and help create myths around Predictive Analytics.
on May 7, 2014 in Book, Charles Duhigg, Eric Siegel, Predictive Analytics, Pregnancy, Target
JMP White Paper: Advantages of Bootstrap Forest for Yield Analysis
This white paper highlights practical examples of how to use partitioning techniques for semiconductor manufacturing data. These methods also have wider applicability.
on May 7, 2014 in Bootstrap Forests, JMP, White Paper
Interview: Xinghua Lou (Microsoft) on Mining Clinical Notes and Big Data in Healthcare
We discuss data mining of cancer clinical data, the LDA topic model, challenges in mining clinical notes, big data in healthcare and more.
on May 7, 2014 in Healthcare, Interview, Machine Learning, Microsoft, Researchers, Xinghua Lou
Top KDnuggets tweets, May 5-6: xkcd looks at Love and Statistics; Analytics job applicants – avoid these mistakes
xkcd looks at Love and Statistics: Why it is important to label your axes; Analytics job applicants - avoid these common mistakes; Stanford Online Courses: Education + Advanced Skills = Awesome Career; Landmark for AI: computer system solves Algebra word problems.
on May 7, 2014 in Artificial Intelligence, Cartoon, MIT, Online Education, Stanford, xkcd
KDnuggets 14:n11, Poincare, Perelman, and Social networks; Industry Lessons; Experfy
Latest analytics/data mining news, including Michael Brodie on Industry Lessons, Knowledge Discovery, Future Trends, and QCRI; Features, Opinions, Software, News, Webcasts, Meetings, Jobs, Publications, Tweets and CFP.
on May 7, 2014
Webinar: Data Mining: Failure to Launch [May 20]
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is May 20.
on May 6, 2014 in Data Mining, Failure to Launch, TMA
PAW: Predictive Analytics World, Manufacturing, Chicago, June
PAW Chicago and PAW Manufacturing are the business events for predictive analytics professionals, managers and commercial practitioners, covering commercial deployment of predictive analytics, across industries, and dedicated to manufacturing. Special KDnuggets discount.
on May 6, 2014 in Chicago-IL, Manufacturing, PAW, Predictive Analytics World
Human Dynamics – Data Mining Mobile Phone Usage
Mobile phone usage contains a gold mine of insights. We examine what was learned about human social connections from the first-ever extensive study of social interactions in Mexico.
on May 6, 2014 in Algorithms, Carlos Sarraute, Communication, Data Mining, Phone Usage
Interview: Michael Brodie on Industry Lessons, Knowledge Discovery, and Future Trends
The last part of our exclusive interview focuses on Industry Lessons, Knowledge Discovery, Privacy Issues, Expected Technical Developments in the next 5 years and more.
on May 6, 2014 in Big Data Privacy, Computing Reality, Data Science, Interview, Michael Brodie, QCRI
Stanford Online Courses: Education + Advanced Skills = Awesome Career
With Stanford's world-class online certificates, show advanced knowledge in Data Mining and Applications, Mining Massive Data Sets, and more. Enrollment for summer quarter is open now till June 9.
on May 6, 2014 in Data Science Certificate, Online Education, Stanford
Upcoming Webcasts on Analytics, Big Data, Data Science – May 5 and beyond
SAS and Cloudera, Analytically Speaking, Data Mining: Failure To Launch, Deep Learning with H2O, Hadoop, WCAI Research Opportunity: Using Purchase History to Identify Customer Projects, and more.
on May 5, 2014 in Analytically Speaking, Cloudera, Deep Learning, SAS
Top KDnuggets tweets, May 2-4: Big List of Machine Learning, #DataScience, and Statistics Resources
Big List of Machine Learning, #DataScience, and Statistics Resources; 7 Free or low-cost ways to Learn Data Mining & Data Science; Datasight.io - machine learning for the masses - now in beta; Every MIT undergrad will get $100 of Bitcoin.
on May 5, 2014 in Bitcoin, Data Science Education, Datasight.io, Machine Learning, MIT
May-Sep 2014 Meetings in Analytics, Big Data, Data Mining, and Data Science
Coming soon: Big Data Week in 30 cities, PASS Business Analytics (San Jose), PAW Toronto and Chicago, Business Analytics Innovation Summits, MMDS Berkeley, Useful Business Analytics Boston, KDD-2014 NYC, and much more.
on May 5, 2014 in Boston-MA, Chicago-IL, London-UK, New York-NY, San Francisco-CA, Toronto-Canada
Data Scientists Not Required: Promises the recently launched Alteryx Analytics 9.0
Alteryx Analytics 9.0 blends new sources of customer insight such as Social Media, Google Analytics, and Marketo with data from legacy environments such as SAS Analytics.
on May 4, 2014 in Alteryx, Amazon, Data blending, Data Scientist, Google Analytics, Pivotal
Employee Churn 202: Good and Bad Churn
This post extends the “quantitative scissors” approach to employee churn and examines the factors that underlie attrition cost.
on May 4, 2014 in Churn, Employee, GitHub, Pasha Roberts, Talent Analytics
Top stories for Apr 27 – May 3
Cartoon: Data Scientist Salary Negotiation; 9 Free Books for Learning Data Mining and Data Analysis; MLTK: Machine Learning Toolkit in Java - free download; Mass Big Data Report 2014.
on May 4, 2014 in Cartoon, Free ebook, Machine Learning, Massachusetts, Top stories
Additions to KDnuggets Directory in April
24 new Big Data and Data Science meetings, analytic consulting companies, Virginia data, Data Science certificates, Dean Abbott book on Applied Predictive Analytics, and more.
on May 4, 2014 in Added to KDnuggets, Consulting, Data Science Certificate, Meetings, Virginia
3 Key Trends in the DBMS Market
The top 3 trends in DBMS include market consolidation, moving beyond OLTP, and distributed computing - we examine them in detail.
on May 3, 2014 in DBMS, Distributed, Gartner, Michael Waclawiczek, NuoDB, OLTP, SQL, Trends
Poincare Conjecture, Perelman way, and Topology of social networks
We examine the connections between the $1 million proof of the Poincare conjecture by a reclusive math genius and the topological behavior of, and information diffusion over, social networks.
on May 3, 2014 in Mathematics, Social Networks, Topology
WCAI Research Opportunity: Using Purchase History to Identify Customer Projects
Customers frequently purchase a collection of products to complete a specific project. A rich data set from a Fortune 500 Specialty Retailer will allow researchers to study this issue - register for May 28 webinar and submit your proposal afterwards.
on May 3, 2014 in Customer Behavior, Research proposal, WCAI, Wharton
Top KDnuggets tweets, Apr 30 – May 1: Chart Cheat Sheet: When to use Bar, Stacked, Donut
Useful! Chart Cheat Sheet: When to use Bar, Stacked, Line, Donut, Choropleth; Microsoft Data Scientist; The Deep Dive: SAS vs. R: younger Data Scientists prefer R; Massachusetts releases Big Data Report 2014.
on May 2, 2014 in Cheat Sheet, Massachusetts, Microsoft, R, SAS
Top stories in April
Apache Spark, the hot new trend in Big Data; Data Analytics Handbook - interviews with tech leaders, free download; Learning and Teaching Machine Learning; 9 Free Books for Learning Data Mining and Data Analysis.
on May 2, 2014 in Apache Spark, Data Science Education, Free ebook, Machine Learning, Top stories
Experfy, Big Data Consulting Marketplace from Harvard
Good news for data experts - you have many new options with Experfy, a startup from Harvard Innovation Lab, which is a consulting marketplace where companies hire talent for Big Data, Analytics, and BI projects.
on May 2, 2014 in Consulting, Harvard, Marketplace, Startup
Interview: Vasanth Kumar, Principal Data Scientist, Live Nation
We discuss challenges in analyzing bursty data, real-time classification, relevance of statistics and advice for newcomers to Data Science.
on May 2, 2014 in Advice, Classification, Live Nation, Statistics, Vasanth Kumar
Deep Learning with H2O, May 21 Webcast
H2O is a Google-scale open source machine learning engine for R and Big Data. Learn how Deep Learning in H2O is unlocking never-before-seen performance for prediction - May 21.
on May 1, 2014 in Deep Learning, H2O, Prediction, R