5 Lessons from a Data Science Chat
Data science applications, key challenges, appropriate skills and more – key takeaways from a data science Tweet chat.
on Mar 19, 2015 in Booz Allen, Chat, Data Science, Data Science Skills, Twitter
IEEE ICDM 2015 Call for Data Mining Contest/Competition Proposals, due Mar 29
We invite proposals for the ICDM 2015 Data Mining Contest, which is an integral part of the IEEE ICDM conference and provides an opportunity for teams of scientists and domain experts to compete in order to develop data mining techniques for real-world applications.
on Feb 28, 2015 in Atlantic City, Competition, Data Mining, ICDM, IEEE, NJ
Brandeis Analytics Symposium, Apr 8: Industry Insights into Data and Intelligence
The Apr 8 Brandeis Analytics symposium will focus on promoting a discussion of the growing field of analytics and how organizations can leverage big data to make more strategic decisions.
on Feb 27, 2015 in Analytics, Brandeis, MA, Symposium, Waltham
Strata + Hadoop World 2015 San Jose – report and highlights
Highlights of Strata + Hadoop World San Jose, including Apache Spark vs Storm vs Samza for streaming data, Kafka as a universal message bus, what Netflix puts in front of HDFS, Parquet as a basis for ETL and analytics, DJ Patil, Internet of Things, and more.
on Feb 27, 2015 in Apache Spark, CA, DJ Patil, Hadoop, Kafka, San Jose, Strata
Interview: Nicholas Marko, Geisinger on the Skills Needed for Healthcare Analytics
We discuss challenges of dealing with healthcare data, trends in healthcare analytics, important skills for data scientists and more.
on Feb 27, 2015 in Analytics, Data Science Skills, Geisinger Health System, Healthcare, Nicholas Marko
The Internet of People: 4 key principles for analyzing personal data
The “Internet of People” raises a host of legal and ethical questions about the ownership and use of our personal data. We all have a stake in joining and navigating this increasingly stormy debate. Here are 4 key principles.
on Feb 27, 2015 in Andrew Jennings, Big Data, Ethics, FICO, Personal Data, Predictive Analytics, Privacy
Interview: Nicholas Marko, Geisinger on Building the Analytics Culture for Healthcare
We discuss how to establish credibility of data analytics, recommendations for a data-driven culture, analytics challenges in healthcare and more.
on Feb 26, 2015 in Analytics, Data-Driven Business, Decision Making, Geisinger Health System, Healthcare, Nicholas Marko
RE.WORK Deep Learning Summit, San Francisco, January – videos, presentations
Interesting videos and presentations from January Deep Learning Summit include a Fireside Chat with Andrew Ng and Derrick Harris, presentations by Richard Socher (Founder, MetaMind), Quoc Le (Research Scientist, Google), Greg Corrado (Sr Research Scientist at Google), and more.
on Feb 26, 2015 in Andrew Ng, CA, Deep Learning, Google, MetaMind, Quoc Le, RE.WORK, Richard Socher, San Francisco
Top KDnuggets tweets, Feb 23-25: Microsoft is building fast, low-power Deep Learning networks; Lucrative tech careers: Data Scientist, Data Engineer
5 lucrative tech careers in 2015: Data Scientist ($150K), Data Engineer ($148K); Which SQL on Hadoop? Gartner Poll Still Says "Whatever" But DBMS Providers Gain; 10 Most-Funded #BigData #Startups; DataRPM 8 runs in #Hadoop, uses #MachineLearning to find insights.
on Feb 26, 2015 in Big Data, Data Engineer, Data Scientist, DataRPM, Hadoop, Salary, SQL, Startups, Trevor Hastie
Simplilearn Big Data, Analytics Online Certification Courses – 50% off for a limited time
Get 50% off Simplilearn Big Data & Analytics Online Certification Courses, including a new course - Big Data and Hadoop Administrator. Get a new iPhone5 with Classroom Enrollment - see details.
on Feb 26, 2015 in Big Data Analytics, Certification, Hadoop, R, SAS, Simplilearn
IBM Big Data & Analytics Heroes
IBM's Big Data & Analytics Heroes include leaders in the field that propel the industry in order to promote thought leadership and progress in Big Data Analytics.
on Feb 26, 2015 in About Gregory Piatetsky, Analytics Leader, Big Data, Big Data Analytics, IBM, Jake Porway, Kirk D. Borne
Beagli: Finding value in your personal data
Personal analytics products can help users extract value from their data. This post describes our development of Beagli, a platform for mining and auctioning personal data.
on Feb 25, 2015 in Marketplace, Privacy, Startups
Top KDnuggets tweets, January: Good list of Machine Learning Resources, Deep Learning, Graphical Models; Sample solutions with R on Azure ML Marketplace
Good list of #MachineLearning Resources, #DeepLearning, Graphical Models; Sample #MachineLearning solutions with R on #Azure ML Marketplace #rstats; New book: Data Driven: Creating a Data Culture, by @dpatil, @hmason; Intro to #Python and #IPython for #DataMining.
on Feb 24, 2015 in Azure ML, Deep Learning, DJ Patil, Hilary Mason, IPython, Machine Learning, Marketplace, Python, R
PAW San Francisco: Learn Uplift Modeling
The analytical method to optimize for influence is uplift modeling (aka persuasion modeling) and its adoption is rapidly growing. Learn it in two sessions and a full-day training workshop at PAW Business, San Francisco, Mar 29 - Apr 2, 2015. KDnuggets discount.
on Feb 24, 2015 in CA, PAW, Predictive Analytics World, San Francisco, Uplift Modeling, USA
Big Data TechCon, the HOW-TO conference, Boston, April 26-28
Plan now to attend Big Data TechCon, April 26-28 in Boston, to learn HOW-TO master and analyze Big Data. Learn Hadoop, Spark, Yarn, HBase, R, and Hive from the smartest, hardest-working faculty. Special discount.
on Feb 24, 2015 in Apache Hive, Apache Spark, Big Data, Boston, Hadoop, HBase, MA, R, Techcon, USA, YARN
Top /r/MachineLearning Posts, Feb 15-21: The Elephant in the Room of ML Research
Problems with deep learning papers, Coursera linear algebra courses, Reddit comment visualizations, deep learning lectures, and genetic algorithm introductions make up the top posts this week on /r/MachineLearning.
on Feb 24, 2015 in Coursera, Deep Learning, Genetic, Graph Visualization, Network Graph, Overfitting, Python, Reddit
Upcoming Webcasts on Analytics, Big Data, Data Science – Feb 24 and beyond
Winning with Big Data Analytics, a Roadmap for Data-Driven Culture, Data Science for Workforce Optimization, Text Mining and Knowledge Graphs in the Cloud, Performance and Scale Options for R with Hadoop, and more.
on Feb 23, 2015 in Big Data Analytics, Datameer, Hadoop, Ontotext, Text Mining, Workforce Analytics
Top KDnuggets tweets, Feb 16-22: History of Data Science across 5 strands; Most Popular Coding Languages of 2015
History of #DataScience across 5 strands; Most Popular Coding Languages of 2015: #Python 31% ...; #BigData reveals how information travels: 8 clusters in Europe; New Face Detection Algorithm to revolutionize search: finding faces no longer unique to humans.
on Feb 23, 2015 in Data Science, Europe, Face Detection, History, IBM Watson, Programming Languages, Python
Gartner 2015 Magic Quadrant for Advanced Analytics Platforms: who gained and who lost
SAS, IBM, KNIME, and RapidMiner lead in Gartner 2015 Magic Quadrant for Advanced Analytics Platforms. We analyze who gained and who lost versus last year.
on Feb 23, 2015 in Advanced Analytics, Alteryx, Dell, Gartner, IBM, Knime, Magic Quadrant, Microsoft, RapidMiner, Salford Systems, SAS
Interview: David Kasik, Boeing on Data Analysis vs Data Analytics
We discuss the impact of increasing amount of data on visualization, difference between Data Analysis and Data Analytics, motivation, trends, desired skills and more.
on Feb 23, 2015 in 3D, Boeing, Career, Data Analysis, Data Analytics, David Kasik, Trends, Visualization
Statistical Learning and Data Mining III: 10 Hot Ideas for Learning from Data, Mar 19-20, Palo Alto
Taught by top Stanford professors and leading statisticians Trevor Hastie and Robert Tibshirani, this course presents 10 hot ideas for learning from data, and gives a detailed overview of statistical models for data mining, inference and prediction.
on Feb 23, 2015 in CA, Lasso, Palo Alto, Regression, Robert Tibshirani, Statistical Learning, Trevor Hastie, USA
Top stories for Feb 15-21: 10 things statistics taught us about big data analysis; History of Data Science in 5 strands
My Brief Guide to Big Data and Predictive Analytics for non-experts; 10 things statistics taught us about big data analysis; History of Data Science Infographic in 5 strands; Automatic Statistician and the Profoundly Desired Automation for Data Science.
on Feb 22, 2015 in Automation, Data Science, History, Jargon, Statistics, Strata, Top stories
Prismatic Interest Graph [API]: Organize and Recommend Content
Prismatic Interest Graph API provides a set of tools for automatically analyzing unstructured text and annotating it with a variety of tags that are useful for organizing and recommending content.
on Feb 20, 2015 in Machine Learning, Prismatic, Recommendations, Text Analytics, Text Mining
Top KDnuggets tweets, Feb 18-19: New Face Detection Algorithm to revolutionize search; How to transition from Excel to R
Practical #DataScience in #Python #MachineLearning - nice intro; New Face Detection Algorithm to revolutionize search; Well written: How to Transition from Excel to R; Microsoft launches #Azure #MachineLearning Platform for #BigData, adds Python.
on Feb 20, 2015 in Azure ML, Data Science, DJ Patil, Excel, Face Detection, Image Recognition, Microsoft, Python, R, White House
Google BigQuery Public Datasets
Google BigQuery is not only a fantastic tool to analyze data, but it also has a repository of public data, including GDELT world events database, NYC Taxi rides, GitHub archive, Reddit top posts, and more.
on Feb 20, 2015 in BigQuery, GDELT, Google, New York City, Reddit
Fun and Top! US States in 2 Words using twitteR
Combining twitteR package with text mining techniques and visualization tools can produce interesting outputs. Find out which US state is fun and top, and which is good and crazy, according to Twitter.
on Feb 19, 2015 in R, Text Mining, Twitter, USA
Simplilearn Big Data and Analytics Online Courses
Be Big Data Ready - get 30% Off Simplilearn Big Data and Analytics Online Courses with code FEB30A, valid till 28 Feb 2015.
on Feb 19, 2015 in Big Data Analytics, Certification, Hadoop, Online Education, R, SAS, Simplilearn
Top KDnuggets tweets, Feb 16-17: Most Popular Coding Languages of 2015; History of Data Science across 5 strands
Most Popular Coding Languages of 2015: #Python 31%, Java 20%, C++ 9.8%; History of #DataScience across 5 strands: CS, #Data, #Visualization, Math, Stats; IBM Verse new messaging software will use #Watson to declutter your inbox; Doctors store 1,600 digital #hearts for #BigData study.
on Feb 18, 2015 in Blogs, Data Science, History, IBM Watson, Java, Programming Languages, Python
Watch Keynotes Live: Strata + Hadoop World San Jose, Feb 19-20
Watch live on Feb 19 & 20 ! The Strata + Hadoop World, Feb 17-20, San Jose conference is sold-out again, but if you are not there, here is how you can watch the keynotes live on Feb 19 and Feb 20, 2015.
on Feb 18, 2015 in DJ Patil, Hadoop, Keynote Speech, Strata
Big Data, Privacy, and Security – which side are you on?
After all the positive promise, the hype, and predictions about Big Data, 2015 started with a debate about privacy and specifically whether or not companies like Google and Facebook should be allowed to encrypt their users data.
on Feb 18, 2015 in Big Data, Chris Pearson, Encryption, Facebook, Google, NSA, Paris, Privacy, Security
January 2015 Analytics, Big Data, Data Mining Acquisitions and Startups Activity
January acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Microsoft buys Revolution Analytics, Why dunnhumby is worth billions, GraphLab renames to Dato, raises $18M, MongoDB raised $80M, and more.
on Feb 17, 2015 in Dato, dunnhumby, GraphLab, Microsoft, MongoDB, R, Revolution Analytics
History of Data Science Infographic in 5 strands
History of Data Science infographic presents key events in Data Science across 5 strands: Computer Science, Data Technology, Visualization, Mathematics/OR, and Statistics.
on Feb 17, 2015 in About Gregory Piatetsky, Data Science, History
Automatic Statistician and the Profoundly Desired Automation for Data Science
The Automatic Statistician project by Univ. of Cambridge and MIT is pushing ahead the frontiers of automation for the selection and evaluation of machine learning models. In general, what does automation mean to Data Science?
on Feb 17, 2015 in Automation, Cambridge, Data Cleaning, Data Science, Machine Learning, MIT, Modeling, Statistician
Statistics.com courses on RESTful APIs
Applying analytics to big data requires a mechanism to rapidly get and share data and RESTful APIs is the standard way doing it. Learn how to write Python code to ingest data, communicate with, and create RESTful APIs with online courses from Statistics.com.
on Feb 17, 2015 in RESTful API, Statistics.com
Tamr Enterprise Platform for Scalable, End-to-End Data Unification
The new Tamr Platform radically simplifies and speeds the availability of unified data for analytics and downstream application, with key new features: catalog, connect, and consume. Tamr also announced solutions for Pharma and Procurement.
on Feb 17, 2015 in Cambridge, Data Preparation, MA, Pharma, Procurement, Tamr
Data Mining finds JASBUG, a Critical Security Vulnerability
We explain how the critical Microsoft security vulnerability JASBUG that existed for 15 years was detected with similarity search and regular expression inference.
on Feb 17, 2015 in Biology, gTLD, JASBUG, Microsoft, Security, Similarity, simMachines
TEDx RheinMain Datanauts Competition
Send your project idea to TEDx RheinMain “Datanauts” Competition. It should fit to one of the following categories: mobility, environment, culture, common Good.
on Feb 17, 2015 in Competition, Germany, TED
Top /r/MachineLearning Posts, Feb 8-14: Automating Tinder, Statistics and Machine Learning
Automating Tinder with Eigenfaces, statistics lessons in big data analysis, an upcoming AMA, the basics of PCA, and neural network programming in Python are all topics covered in the last week on Reddit.
on Feb 17, 2015 in Big Data Analytics, Eigenface, Neural Networks, Python, Reddit
Big Data Innovation Summit, San Jose, Apr 28-29, 2015
The Summit will bring 800+ data practitioners for 7 business and technical focused stages, 70+ sessions, keynotes, workshops, panels and countless networking opportunities. Early Bird until Feb 27.
on Feb 17, 2015 in Big Data, CA, IE Group, San Jose, Summit, USA
Upcoming Webcasts on Analytics, Big Data, Data Science – Feb 17 and beyond
Pivotal Update, Moving Targets, Secure Because Math, Data Driven: Creating a Data Culture, Maximize the effectiveness of your text analytics initiatives, Text Mining and Knowledge Graphs in the Cloud.
on Feb 16, 2015 in DJ Patil, Hilary Mason, Ontotext, Pivotal, Security, Text Analytics
Interview: David Kasik, Boeing on How Visual Analytics is Improving Aviation Safety
We discuss data visualization at Boeing, the importance of Visual Analytics, Aviation Safety improvement through Analytics and augmented reality.
on Feb 16, 2015 in Augmented Reality, Aviation, Boeing, Data Visualization, David Kasik, Graphics, Interview, Visual Analytics
Top KDnuggets tweets, Feb 9-15: Why limit yourself to “50 Shades of Grey?” R has 102 shades; Why Electric Cars Dont Have Better Batteries
Why limit yourself to "50 Shades of Grey?" R has 102; Why Electric Cars Don't Have Better Batteries - a sad story of Envia; More evidence that #sports is a goldmine for #MachineLearning; Wedding with 200+ guests is 92% less likely to lead to divorce.
on Feb 16, 2015 in 50 Shades of Grey, Divorce, Electric Car, Envia, Image Recognition, R, Sports, Wedding
Webinar: Drive effective text analytics initiatives, Feb 19
On February 19, join Meta Brown (author of Data Mining for Dummies), Howard Lyeth (Senior Analyst at L.L. Bean), Steven Scarr (CEO of eContext), and Ramkumar Ravichandran (Director of Analytics at Visa) to learn how to maximize the effectiveness of your text analytics.
on Feb 16, 2015 in Data-Driven Business, Meta Brown, Text Analytics
Active Data Mining, Data Science blogs
Here are 85 or so active (recently updated) data mining, data science, and machine learning blogs.
on Feb 16, 2015 in Blogs, Data Mining, Data Science
Tinderbox: Automating Romance with Tinder and Eigenfaces
Tinderbox is a software uses machine learning and image recognition to automate Tinder, a popular app for single meetings. The author describes his experience and feedback until it started to work too well.
on Feb 15, 2015 in Bots, Eigenface, Image Recognition, Romance, Tinder
Top stories for Feb 8-14: 10 things statistics taught us about Big Data; Data Science Most Confused Jargon
10 things statistics taught us about big data analysis; Data Science's Most Used, Confused, and Abused Jargon; Top 30 people in Big Data and Analytics; Cartoon: Data Scientist 3 wishes for Valentine Day.
on Feb 15, 2015 in Big Data Influencers, Cartoon, Data Science, Jargon, Statistics, Top stories
Top KDnuggets tweets, Feb 11-12: Automating romance with Eigenfaces; My Brief Guide to Big Data, Predictive Analytics for non-experts
Romantic #DataScientist @crockpotveggies automates #Tinder with Eigenfaces; My Brief Guide to Big Data and Predictive Analytics for non-experts; #DataMining finds corruption is correlated with low income, low development MIT; Hitachi buys Pentaho to extend Its #BigData footprint.
on Feb 13, 2015 in Big Data, Causation, Correlation, Corruption, Hitachi, Pentaho, Tinder
NYC DSA Data Science Bootcamp,
June 1 – August 21
NYC Data Science Academy offers the highest quality in data science training, designed specifically around the skills employers are seeking, including R, Python, Hadoop, github, D3.js, raspberry pi and much more.
on Feb 13, 2015 in Bootcamp, Data Science Education, New York City, NY, Python, R, USA
Top /r/MachineLearning posts, January
Talking Machines, SVM lectures, a new Stanford statistical learning online course, and a listing of open-source datasets top the most popular Reddit posts on /r/MachineLearning for the month of January.
on Feb 13, 2015 in Geoff Hinton, Online Education, Open Data, Podcast, Reddit, Statistical Learning, SVM, Yann LeCun, Yoshua Bengio
Cartoon: Data Scientist gets 3 wishes for Valentine’s Day
New KDnuggets cartoon imagines what could happen if a Big Data genie would grant a romantic Data Scientist 3 wishes for a Valentine's Day.
on Feb 13, 2015 in Cartoon, Data Scientist, Hadoop, Python, Scarlett Johansson, Valentine's Day
My Brief Guide to Big Data and Predictive Analytics for non-experts
My brief guide to Big Data and Predictive Analytics for non-experts suggests key books, films, and websites to learn more.
on Feb 12, 2015 in Big Data, Charles Stross, Deep Learning, Eric Siegel, Kenneth Cukier, Nate Silver, The Guardian
Interview: M.C. Srivas, CTO, MapR on Data Agility – The Next Frontier of Big Data
We discuss the competitive differentiation of MapR, challenges in consumerizing Big Data, trends, strategy recommendations, desired skills and more.
on Feb 12, 2015 in Big Data, Business Strategy, Challenges, Competition, Education, Interview, M. C. Srivas, MapR, Trends
Statistics.com Online Data Science Courses and Certificates
Accelerate your career and upgrade your skills With Statistics.com training provided by top experts who will answer your questions on a daily basis. Work on practical exercises with real problems, real data and multiple software tools.
on Feb 12, 2015 in Data Science Certificate, Data Science Education, Online Education, R, Statistics.com, Text Analytics
Lipari Summer School: Algorithms, Data, and Models for Social and Urban Systems
Lipari Summer School will address the role of GIS, social media, big social data, agent-based models, network models, and their integration in the study, design, and implementation of social and urban systems.
on Feb 12, 2015 in Italy, Lipari, Social Networks, Social Science, Summer School
Ontotext: Integrated Text Mining and Triplestores, a form of graph database
Learn about 2 hot trends: RDF triplestores, a form of graph database, and the use of text mining to extract meaning from Big Data, and how Ontotext enables both. Free eval, Feb 26 webinar, and more.
on Feb 12, 2015 in Graph Databases, Ontotext, RDF, Text Mining, Triplestore
Top KDnuggets tweets, Feb 9-10: Teach #Python or R for #DataScience? #BigData suprising finding on Divorce
#BigData on Divorce: Wedding with 200+ guests is 92% less likely to end in divorce; Should you teach #Python or R #rstats for #DataScience?; Top 30 people in Big Data and Analytics; Paris history, captured in its streets, visualized with R.
on Feb 11, 2015 in Big Data Influencers, Data Science Education, Differential Pricing, Divorce, Paris, Python, R, White House
Interview: M.C. Srivas, MapR on Demystifying the Art of Processing Massive Data
We discuss the launch and evolution of MapR, achievements, key characteristics of MapR-DB, significance of Apache Drill, MapR use cases and more.
on Feb 11, 2015 in Apache Drill, Apache Spark, Google, M. C. Srivas, MapR, Mining Massive Datasets, Shark
Big Data Innovators Under 35
Young innovators in deep learning, interface design, and data science automation are all included in MIT Technology Review's Innovators Under 35 list.
on Feb 11, 2015 in Big Data, Deep Learning, Innovation, MIT, Samsung
Top Analytics and Big Data Trends ahead of Strata Hadoop World San Jose
Top 2015 Analytics and Big Data trends from our readers were Apache Spark, Deep Learning, Real-Time technology, Internet of Things (IoT).
on Feb 11, 2015 in Apache Spark, Deep Learning, Hadoop, IoT, Real-time, Strata
KDnuggets Annual Poll: Analytics/Data Science/Data Mining income/salary, role, and employment
Please vote in KDnuggets annual poll on analytics / data science / data mining field income/salary vs analytic role and employment. We will analyze and publish the results.
on Feb 11, 2015 in Data Analyst, Data Scientist, Poll, Salary
Expert Training within Reach
SAS has partnered with some of the world’s most respected analytical thought leaders to offer you the best in analytics training.
on Feb 10, 2015 in Boston, CA, MA, New York City, NY, Online Education, San Francisco, SAS
PAW: How Predictive Analytics Reinvents These Six Industries
Predictive analytics is a game-changer - it's like "Moneyball" for... money. This article summarizes and links resources with late-breaking coverage of how predictive analytics reinvents six industries.
on Feb 10, 2015 in Eric Siegel, Finance, Government, Healthcare, Manufacturing, Marketing, PAW, Predictive Analytics World, Workforce Analytics
Info Kit: Statistics, Predictive Modeling, Data Mining with JMP
Find out how to challenge assumptions, spot patterns and reveal potential solutions to problems that otherwise would not be visible. Register for this complimentary info kit.
on Feb 10, 2015 in Data Mining, Data Preparation, JMP, Online Education, Statistics
TMA Predictive Analytics Data Mining Training [Orlando, Mar | Las Vegas, Apr]
Successful analytics in the big data era does not start with data and software, but with hands-on, immersive training and goal-driven strategy - get it from The Modeling Agency in Orlando (Mar), Las Vegas (Apr), or Washington DC (May).
on Feb 10, 2015 in Data Mining Training, FL, Las Vegas, NV, Orlando, The Modeling Agency, TMA, USA
Interview: Phani Nagarjuna, Nuevora on Right Data and Business Analytics Roadmap
We discuss the journey of Business Analytics, definition of Right Data, competitive differentiation of Nuevora, challenges in the large-scale consumerization of analytics, and more.
on Feb 10, 2015 in Business Analytics, Competition, Data Science Skills, Marketing, Nuevora, Phani Nagarjuna, Predictive Analytics
Data Science’s Most Used, Confused, and Abused Jargon
As data science has spread through the mainstream, so too has a dense vocabulary of ill-defined jargon. In a split-personality post, we offer several perspectives on many of data science's most confused terms.
on Feb 10, 2015 in Big Data Privacy, Data Science, Deep Learning, Zachary Lipton
Analyzing Analysts to Build Better Analysis Software
Our study how analysts used Mode led to major updates designed to fit how data analysts and business analysts actually use data - there's no one-size-fits-all tool and analysis doesn't end with the analyst.
on Feb 10, 2015 in Business Analyst, Data Analyst, Mode Analytics, SQL
10 things statistics taught us about big data analysis
There are 10 ideas in applied statistics are relevant for big data analysis, focusing on prediction accuracy, interactive analysis and more.
on Feb 10, 2015 in Best Practices, Big Data, Overfitting, Statistics
Stanford Online Courses: Use data to solve biological, medical problems
Stay at the cutting-edge and take courses online from Stanford that will teach you techniques used in new applications in biomedicine. Enrollment for spring open till Mar 20.
on Feb 10, 2015 in Bioinformatics, Biomedical, Data Mining, Online Education, Stanford
Interview: Phani Nagarjuna, Nuevora on CMO Expectations from Analytics
We discuss the value proposition of Nuevora, founding story, CMO expectations from Analytics and the Nuevora nBAAP platform.
on Feb 9, 2015 in Analytics, Applications, Interview, Nuevora, Phani Nagarjuna, Platform, Prescriptive Analytics, Real-time
Top KDnuggets tweets, Feb 02-08: 10 things statistics teaches about Big Data analysis; Where is Waldo ruined by Machine Learning
Very useful: 10 things statistics teaches about #BigData analysis; Where's Waldo ruined by a PhD student and #MachineLearning; Great Tutorial: Getting Started with Apache Spark and Python; Clarifai Machine Learning software can understand what is in your videos.
on Feb 9, 2015 in Apache Spark, Big Data, Clarifai, Machine Learning, Python, Statistics, Waldo
Upcoming Webcasts on Analytics, Big Data, Data Science – Feb 10 and beyond
Data Mining: Failure to Launch, 3 Ways to Improve your Regression, The Pragmatic Text Miner, Make It Big As a Data Scientist in 2015, Managing Big Data in Production and more.
on Feb 9, 2015 in Data Scientist, Failure to Launch, Regression, Text Mining
Top /r/MachineLearning posts, Feb 1-7: Music recognition, Text Understanding from scratch
Shazam music recognition techniques, deep learning for text understanding, neuroscience history, Neural Turing Machines using Torch, and genetic algorithms are the top topics on Reddit last week.
on Feb 9, 2015 in AI, Deep Learning, Genetic, Reddit, Text Analytics, Torch, Yann LeCun
Top 30 people in Big Data and Analytics
Innovation Enterprise has compiled a top 30 list for individuals in big data that have had a large impact on the development or popularity of the industry.
on Feb 9, 2015 in About Gregory Piatetsky, Andy Palmer, Big Data, Big Data Influencers, DJ Patil, Hilary Mason, IE Group, Kirk D. Borne, Paco Nathan, Tom Davenport, Wolfram
Facebook Open Sources deep-learning modules for Torch
We review Facebook recently released Torch module for Deep Learning, which helps researchers train large scale convolutional neural networks for image recognition, natural language processing and other AI applications.
on Feb 9, 2015 in Artificial Intelligence, Deep Learning, Facebook, GPU, Neural Networks, NYU, Ran Bi, Torch, Yann LeCun
Top stories for Feb 1-7: Avoiding a Common Mistake with Time Series; Top Big Data Influencers and Brands
Avoiding a Common Mistake with Time Series; (Deep Learning Deep Flaws) Deep Flaws; Top Big Data Influencers and Brands; Two Most Important Trends in Analytics and Big Data.
on Feb 8, 2015 in Big Data Influencers, Deep Learning, Hadoop, Predictions for 2015, Time Series, Top stories
Women in Data: Top Practitioners on Critical Skills, Background, and Education
Cornelia Levy-Bencheton interviewed 15 women in data to learn how they achieved their current level of success, what motivated them to get there, and their views about opportunities for women.
on Feb 7, 2015 in Data Scientist, O'Reilly, STEM, Women
Get practical training at PASS Business Analytics Conference, Santa Clara, April 20-22
Get practical training on Predictive Analytics, R, Data Visualization, Big Data, Excel, Power BI, and more at the PASS Business Analytics Conference, Santa Clara, April 20-22. Get $150 off with KDnuggets discount.
on Feb 7, 2015 in Big Data, Business Analytics, CA, Data Visualization, Dean Abbott, PASS, Santa Clara, USA
Top stories in January: (Deep Learning Deep Flaws) Deep Flaws; Research Leaders on key trends, papers
Research Leaders on Data Science and Big Data key trends, papers; (Deep Learning Deep Flaws) Deep Flaws; Analytics: Five Rules to Cut Through the Hype; 11 Clever Methods of Overfitting and how to avoid them.
on Feb 6, 2015 in Arno Candel, Data Science Education, Deep Learning, Overfitting, Research, Top stories, Trends
Top KDnuggets tweets, Feb 4-5: Clarifai Machine Learning software can understand what is in your videos
Clarifai #MachineLearning software can understand what is in your videos; #BigData Lessons From @Netflix: comparing House of Cards and Macbeth insights; 2014 was the biggest year for #AI startups; Top Data Scientist @DPatil joined the #WhiteHouse as a data scientist-in-residence.
on Feb 6, 2015 in AI, Clarifai, DJ Patil, Image Recognition, Netflix, Startups, Video recognition, White House
Interview: Jason Bloomberg, Intellyx on the Tricky Balance of Optimization and Innovation
We discuss Agile Digital Transformation, Optimization vs Innovation trade-off, best innovations of 2014, trends, advice and more.
on Feb 6, 2015 in Data-Driven Business, Disruptive, Innovation, Intellyx, Jason Bloomberg, Optimization, Recommendations
Top /r/MachineLearning posts, Jan 25-31
Downsides to jobs in machine learning fields, AI learning materials, novel topic modelling techniques and weekly simple question threads are all topics of discussion this week on Reddit /r/MachineLearning.
on Feb 6, 2015 in AI, Neural Networks, Reddit, Topic Modeling
Two Most Important Trends in Analytics and Big Data in 2015
In 2015, two most important trends in Analytics and Big Data are in developing countries and big data security.
on Feb 6, 2015 in Big Data, Predictions for 2015, Security, Trends
How Many Quants are Changing Jobs?
Being a quantitative recruiter, I have had a unique perspective on the current climate and how often are Quants changing jobs? What does this mean if you’re a Quant? What does this mean if you’re trying to hire a Quant?
on Feb 6, 2015 in Burtch Works, Hiring, Quants
How Big Data Pieces, Technology, and Animals fit together
How Big Data Pieces and animals fit together: MapReduce, HDFS, Apache Spark,, Pregel, Zookeeper, Flume, Hive, Pig, and more, explained by a Quora (and past Facebook) Data Scientist.
on Feb 5, 2015 in Apache Hive, Apache Spark, Google, Hadoop, MLlib
The Post-Hadoop World: New Kid On The Block Technologies
Big Data technology has evolved rapidly, and although Hadoop and Hive are still its core components, a new breed of technologies has emerged and is changing how we work with data, enabling more fluid ways to process, store, and manage it.
on Feb 5, 2015 in Apache Spark, Big Data, Chronos, Docker, Hadoop, YARN
Analytics Outsourcing to India: Should or Shouldn’t?
Outsourcing analytics talent to India will continue to grow as a trend as evidenced by the increasing number of Fortune 500 companies participating in the practice.
on Feb 5, 2015 in Analytics, Data Science Team, India, Outsourcing
Top KDnuggets tweets, Feb 2-3: Avoiding a Common Mistake with Time Series; A New Year in Data Science, great overview
Avoiding a Common Mistake with Time Series: use de-trending; A New Year in #DataScience, great overview of the #MachineLearning and #BigData; Data scientist memes - the 'hottest profession'; Top Big Data Influencers and Brands.
on Feb 4, 2015 in Big Data Influencers, Machine Learning, Paco Nathan, Time Series, xkcd
Interview: Rachel Hawley, SAS on Why Data Science Needs Communication Skills
We discuss SAS Analytics Center of Excellence, trends, advice, desired skills in data science and more.
on Feb 4, 2015 in Advice, Communication, Data Science, Hiring, Rachel Hawley, SAS, Skills, Trends
Webinar: Data Mining: Failure to Launch [Feb 10]
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is Feb 10.
on Feb 3, 2015 in Data Mining, Failure to Launch, TMA
Interview: Rachel Hawley, SAS on the Quest for Agile Analytics
We discuss Agile Analytics, moving from traditional Analytics to Agile, challenges in operationalizing Analytics, SAS Enterprise Decision Management and SAS In-Memory Statistics.
on Feb 3, 2015 in Agile, Analytics, Big Data ROI, Challenges, Decision Management, In-Memory Computing, Rachel Hawley, SAS, Statistics
Gartner Business Intelligence & Analytics Summit, Las Vegas, Mar 30 – Apr 1
Learn how to remaster your skills to deliver the analytic advantage your organization needs in the digital age in order to succeed, get new best practices and leading-edge strategies. KDnuggets discount.
on Feb 3, 2015 in Analytics, Business Intelligence, Gartner, Las Vegas, NV, Summit, USA
PASS Business Analytics Conference, Santa Clara, Apr 20-22
Set yourself apart with world-class analytics training in 60+ sessions at the PASS Business Analytics Conference. Special KDnuggets discount.
on Feb 3, 2015 in CA, Dean Abbott, PASS, Santa Clara, USA
Upcoming Feb – July 2015 Meetings in Analytics, Big Data, Mining, Data Science
Coming soon: Strata + Hadoop World (San Jose), TDWI (Las Vegas), MLDAS (Qatar), TDWI Solution Summit (Savannah), GigaOM Structure (NYC), Chief Data Strategy (Boston), Gartner BI&Analytics (LV), PAW/TAW (SF), PASS (Santa Clara) and more.
on Feb 2, 2015 in Boston, CA, Chicago, London, MA, San Diego, San Francisco, UK, USA
Upcoming Webcasts on Analytics, Big Data, Data Science – Feb 3 and beyond
SAP HANA Launch event, Analytics/Data Science Hiring Market, Platfora, Data Mining: Failure To Launch, 3 Ways to Improve your Regression, BigML 2015 Winter Release, and more.
on Feb 2, 2015 in BigML, Platfora, Salford Systems, SAP, TMA
Interview: Eli Collins, Cloudera on Evolution and Future of Big Data Ecosystem
We discuss the change in Big Data priorities, risks, Big Data ecosystem, rise of data culture in organizations, challenges, advice and more.
on Feb 2, 2015 in Advice, Big Data, Cloudera, ecosystem, Eli Collins, Strategy, Trends, Vendors
Top KDnuggets tweets, Jan 26 – Feb 1: Good list of Machine Learning Resources; Sample Machine Learning solutions with R
Good list of #MachineLearning Resources, #DeepLearning, Graphical Models; Sample #MachineLearning solutions with R on #Azure ML Marketplace #rstats; Decision Tree Algorithms: comparing Gini Index, Chi-Square, Information Gain; Cartoon: Lets solve 2+4=? first, worry about #DataMining Later.
on Feb 2, 2015 in Azure ML, Cartoon, Decision Trees, Machine Learning, Marketplace, R, xkcd
BigML machine learning platform Winter 2015 Release, Feb 11
See the latest in BigML's continuously evolved machine learning platform with its emphasis on consumability, programmability, and scalability. Feb 11 webinar at 9 am PT and 5 pm PT.
on Feb 2, 2015 in BigML, Clustering, Data Science Platform, Google, Machine Learning
Top Big Data Influencers and Brands
Top Big Data influencers and brands on Twitter, selected by Onalytica based on the Pagerank analysis of Twitter graph.
on Feb 2, 2015 in About Gregory Piatetsky, About KDnuggets, Bernard Marr, Big Data, Big Data Influencers, Brands, Cloudera, IBM, Kirk D. Borne, Twitter
Avoiding a Common Mistake with Time Series
We explore a common mistake in analyzing relationships between time series, and show how de-trending helps to avoid this error.
on Feb 2, 2015 in De-trending, Regression, Time Series, Tom Fawcett
Comics Recommendations: “Tinder for Comics” built with Tapastic and PredictionIO
Here is how we built a cool demo of recommending comics, using PredictionIO new Similar Product Template and dataset provided by Tapastic.com.
on Feb 2, 2015 in Cartoon, PredictionIO, Recommendations, Tapastic, Tinder
Top stories for Jan 25-31: (Deep Learning Deep Flaws) Deep Flaws; Text Analysis 101: Document Classification
(Deep Learning Deep Flaws) Deep Flaws; Text Analysis 101: Document Classification; Interview: Anthony Bak, Ayasdi on Managing Data Complexity through Topology.
on Feb 1, 2015 in Anthony Bak, Ayasdi, Deep Learning, Text Analytics, Top stories
Additions to KDnuggets Directory in January
53 new meetings, Silicon Valley Data Science, BayeSniffer, Data Science education in Paris and Nice, bootcamps in NYC, fraud detection solutions and more.
on Feb 1, 2015 in Boston, CA, Chicago, IL, London, MA, San Diego, UK, USA