MoDAT: Designing the Market of DATa – Workshop Report
An overview of MoDAT workshop on "Designing the Market of DATa" - key research ideas such as recommending expertise, chance discovery, "data jackets", privacy risks, and more.
on May 28, 2014 in Dallas-TX, ICDM, IEEE, Marketplace, Workshop, Yukio Ohsawa
Exclusive Interview: Richard Socher, founder of etcML, Easy Text Classification
An exclusive interview with Richard Socher, co-founder of etcML, a new and free tool for helping users with creating classifiers for text using machine learning.
on Mar 31, 2014 in etcML, Machine Learning, Richard Socher, Startups, Text Classification
Top KDnuggets tweets, Mar 28-30: SAS vs R vs Python, ecosystem comparison; Practical Data Science with R
SAS vs. R vs. Python - Which should you learn?; New Book: Practical Data Science with R ; Is Data Scientist the right career path for you? Candid advice; Must read books for people interested in Analytics.
on Mar 31, 2014 in Advice, Book, Career, Data Scientist, Python, R, SAS
Information Management 10 More Big Data Companies
Information Management selects 10 additional less known but promising companies offering Big Data platforms, solutions, and services.
on Mar 30, 2014 in Appuri, Fractal Analytics, Scientel, Skytree, Think Big Analytics
Top stories for Mar 23-29
Data Scientists Salary Poll: US, Canada, Australia lead; Is Data Scientist the right career path for you? New Book: Practical Data Science with R; Test your numbersense.
on Mar 30, 2014 in Book, Career, Data Scientist, Dell, Numbersense, R, Salary, StatSoft
New Book: Practical Data Science with R
This new book will help you learn and apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business intelligence, and decision support.
on Mar 29, 2014 in Book, Business Intelligence, Data Science, Decision Support, R, Statistics
Top KDnuggets tweets, Mar 26-27: Watch “Statistics with R for newbies”; Coursera free #DataScience courses
Also free ebooks on Practical Machine Learning: Innovations in Recommendations, and Apache Hive - How to access big data on Hadoop with SQL/HiveQL.
on Mar 28, 2014 in Apache Hive, Coursera, Hadoop, Machine Learning, newbies, R, Recommendations, Statistics
Nate Silver FiveThirtyEight tackles Climate Change, fails on Data Science
Nate Silver FiveThirtyEight forays into climate science and drowns in criticism for bad data analysis. We examine a basic data science question: Can we tell if natural disasters are related to climate?
on Mar 28, 2014 in Climate Change, FiveThirtyEight, Munich Re, Nate Silver, Natural Disasters
Is Data Scientist the right career path for you? Candid advice
Candid advice from an industry veteran reveals the true picture behind the much-talked-about Data Scientist "glamour" and helps people have the right expectations for a Data Science career.
on Mar 28, 2014 in Advice, Career, Data Science, Data Scientist, Hadoop, Paco Nathan, Recommendation, Visualization
Swiss Analytics Magazine launched
The very first issue of the Swiss Analytics Magazine is now available online. The primary objective of this publication is to provide original analytics content to Swiss practitioners.
on Mar 28, 2014 in Analytics, Magazine, Recommendations, SAA, Sandro Saitta, Switzerland
Fractal Analytics Interview Highlights
Fractal Analytics CEO on starting the company, competing with the best, managing attrition, attributes he looks for when hiring, 4 different analytics career tracks, strategic bets, and advice for starting data scientists.
on Mar 27, 2014 in Advice, Career, Fractal Analytics, Hiring, Interview
Boston AnalyticsWeek Panel Highlights: Next Big Thing in Big Data
Boston AnalyticsWeek opens with a vigorous panel discussion, which debates the next "Big Thing" in #BigData, Replacing data scientists by an algorithm, and is Privacy a big obstacle to Big Data?
on Mar 27, 2014 in AnalyticsWeek, Automating, Big Data, Boston-MA, Gregory Piatetsky, Next Big Thing, Panel, Paul Sonderegger, Privacy
Upcoming Webcasts on Analytics, Big Data, Data Science – Mar 27 and beyond
Best Practices in Predictive Analytics, Best Decision Trees with Angoss 9, Thick Data, In-Database Scalable R and Python, Data Mining - Failure to Launch, Algorithms in Online Games, and more.
on Mar 27, 2014 in Hadoop, Online Games, RapidInsight, SciDB, Wharton
White House-MIT Big Data Privacy Workshop – Top Researcher Reports
Leading database researcher Michael Brodie gives a summary of an important White House-MIT Big Data Privacy workshop and discusses privacy, government, technical solutions, Edward Snowden, SXSW, and technical challenges associated with big data and privacy.
on Mar 27, 2014 in Big Data Privacy, CSAIL, Michael Brodie, Michael Stonebraker, MIT, Snowden, SXSW, White House
SciDB: Big Analytics without Big Hassles: In-Database Scalable R and Python, Apr 10
Next advance in analytical databases from renowned database researcher Mike Stonebraker - watch April webinar 10 about SciDB - open source, array database with native scalable complex analytics, programmable from R and Python.
on Mar 27, 2014 in Array database, Michael Stonebraker, Python, R, SciDB, Webcast
WCAI: Measuring Skill Level and Optimizing Player-Matching Algorithms in Online Games
New research opportunity from Wharton Customer Analytics Initiative (WCAI) involves a unique data set from a major gaming company, with historic behavioral data for 9.5 million users who played 882K games. Learn more on Apr 25.
on Mar 26, 2014 in Customer Analytics, Online Games, Optimization, Skills, Video Games, WCAI, Wharton
Top KDnuggets tweets, Mar 24-25: Is a Data Science Certificate sufficient? Kaggle branches beyond competitions
My answer to Is a Data Science Certificate sufficient to become a data scientist? Kaggle branches beyond data mining competitions, will build oil and gas vertical solution; Vicarious, developing brain-inspired machine learning, gets $40M from Mark and Elon; OkCupid "Love" Analytics finds best three questions.
on Mar 26, 2014 in Data Science Certificate, Kaggle, Mark Zuckerberg, Oil & Gas, OkCupid, Vicarious
Identity Fraud and Analytics – An Overview
With the consumers being increasingly concerned about identity theft, leading financial institutions are leveraging analytics to detect Identity Fraud as it happens.
on Mar 26, 2014 in Boosting, Decision Trees, FCRA, Identity Fraud, Identity Theft, Logistic Regression, Machine Learning
KDnuggets 14:n07, Data Scientist Salary; Test your Numbersense; 4 LinkedIn steps
Latest Analytics, Big Data, Data science and Data mining news, including Features, Opinions, Software, News Briefs, Webcasts, Courses, Meetings, Jobs, Academic positions, Top Tweets, and CFP .
on Mar 26, 2014 in Daniel Tunkelang, LinkedIn, Numbersense, Salary
Additions to KDnuggets Directory in February
CSMR Data Miner software suite, Ascribe text mining software, and more added to KDnuggets in February 2014.
on Mar 25, 2014 in Added to KDnuggets, Asia, CSMR Data Miner, Edvancer, Fotetah, Text Mining
MMDS 2014: Workshop on Algorithms for Modern Massive Data Sets, Berkeley, June 2014
The MMDS 2014 will address algorithmic, mathematical, and statistical challenges in modern statistical data analysis. Registration is open and you can apply to present a poster.
on Mar 25, 2014 in Algorithms, Berkeley-CA, High-dimensional, Massive Datasets, Poster, Workshop
What is numbersense – test yours
Kaiser Fung, Marketing and Analytics expert, and author of "Numbersense" book, explains what is numbersense in the age of Big Data. Test yours.
on Mar 25, 2014 in Anomaly Detection, Distribution, Kaiser Fung, Missing Values, Numbersense, Outliers, US Census
SAS Analytics U to offer free software, MOOCs
The new SAS University Edition and already established SAS OnDemand for Academics offer free use of SAS foundational technologies, ideal for data and statistical analysis in teaching, research and self-paced learning.
on Mar 25, 2014 in Academics, Free, MOOC, SAS, SAS Certification, SAS Programming
KDnuggets Exclusive: Interview with Anjul Bhambhri, VP of Big Data Products at IBM
KDnuggets talks with Anjul Bhambhri, IBM’s Vice President of Big Data Products about Big Data Trends, developing the Big Data capabilities in-house vs. outsourcing, five crucial steps to adopting a success big data strategy and advice for beginners.
on Mar 25, 2014 in Advice, Big Data Strategy, Challenges, Data Science, IBM, In-house, Interview, Outsourcing, Watson
Gartner 2014 Magic Quadrant for Advanced Analytics Platforms – view report
Pioneering predictive analytics vendor RapidMiner was positioned in the Leaders quadrant of the first "Gartner Magic Quadrant for Advanced Analytics Platforms" - view the full report.
on Mar 25, 2014 in Advanced Analytics, Gartner, Magic Quadrant, RapidMiner
Dell Buys Statsoft to Fill a Data-Mining Void
Dell buys StatSoft, a major provider of statistical, data mining, and text analytics software. Is it filling an important void to help Dell become more of a software company, or an uninspired acquisition?
on Mar 24, 2014 in Acquisitions, Data Mining, Dell, StatSoft
Top KDnuggets tweets, Mar 21-23: Machine Learning in Parallel with SVM; Good Data Sets for Data Science Practice
Machine Learning in Parallel with SVM, GLM; Good Data Sets for Data Science Practice: Big enough, requires data engineering, rich; Cartoon: Why Madame Zaza, Fortune Teller, changes to Predictive Analytics; Top 45 #BigData Tools and Platforms for Developers
on Mar 24, 2014 in Cartoon, Data Science Platform, Datasets, Machine Learning, Platform, Support Vector Machines, Tools
Webinar, Apr 3: Best Decision Trees just got better with Angoss KnowledgeSEEKER 9.0
This Apr 3 webinar will show how KnowledgeSEEKER 9.0 will make your modeling faster with automated workflow for building, refreshing, and reusing workflows - all with the click of a button.
on Mar 23, 2014 in Angoss, Decision Trees, KnowledgeSEEKER 9.0, Workflow
Top stories for Mar 16-22: Machine Learning in 7 Pictures; How Many Data Scientists are out there?
Machine Learning in 7 Pictures; How Many Data Scientists are out there? Predictive Analytics Marketplaces; Data Scientist Salary Survey; How Deep Learning Analytics Mimic the Mind.
on Mar 23, 2014 in Deep Learning, Machine Learning, Marketplace, Predictive Analytics, Salary
PAW: Predictive Analytics World, Toronto, May 12-15
Predictive Analytics World in Toronto brings you who is who in predictive analytics, with keynotes from top experts. Join PAW and access the best keynotes, sessions, workshops, and more. See tips on getting approval to attend.
on Mar 22, 2014 in John Elder, Obama for America, PAW, Predictive Analytics World, Talent Analytics, Toronto-Canada
INFORMS: The Business of Big Data Conference, San Jose, June 22-24
The new INFORMS conference would help Analytics Professionals and Operations Researchers to get from data discovery to real business value. Sign up at the early rates until May 23.
on Mar 21, 2014 in Business, INFORMS, San Jose-CA
Top KDnuggets tweets, Mar 19-20: Bitcoin 101 – everything you need to know; Top IPython Notebooks at #strataconf
Bitcoin 101 - covers everything you need to know, how it is traded, history, and future; Top 7 IPython Notebooks used for presentations at #strataconf; Online (streaming) Learning with Microsoft AdPredictor algorithm; IBM creates fraud & financial crimes prevention unit, leveraging #BigData Analytics.
on Mar 21, 2014 in Bitcoin, Fraud Prevention, IBM, IPython, Microsoft, Online advertising, Strata
Data Scientists Salary Survey: US, Canada, Australia lead
Data Scientists Salary Survey shows that industry data scientists are in a sweet spot, especially in US, Canada, and Australia, with average salary $135K. European and Asian data scientists salaries are significantly lower.
on Mar 21, 2014 in Asia, Australia, Canada, Data Scientist, Europe, Industry, Poll, Salary, USA
Zipfian Academy: Become a Data Scientist in 12 Intense Weeks
Learn the practical skills you need through our immersive program in San Francisco. Zipfian Academy alumni have joined some of the top data science teams in Silicon Valley.
on Mar 20, 2014 in Data Science Education, Machine Learning, San Francisco-CA, Statistical Analysis, Zipfian Academy
FICO Livecast: Analytically Powered Applications Driving Success, Mar 27
Learn how the world leading companies are building and using analytically powered applications to be more responsive to their customers, presented by Forrester research and FICO.
on Mar 20, 2014 in Applications, Customer Analytics, Decision Management, FICO, Forrester
Top KDnuggets tweets, Mar 17-18: NSA metadata can find medical/financial conditions; Machine Learning in 7 Pictures
Stanford students show NSA metadata can find medical, financial conditions; Machine Learning in 7 Pictures ; Social Networks are investing big in Artificial Intelligence; 7 Key Skills of Effective Data Scientists.
on Mar 19, 2014 in Artificial Intelligence, Machine Learning, Metadata, NSA, Privacy, Skills, Social Networks, Stanford
Exclusive: Interview with Daniel Tunkelang, Head of Query Understanding at LinkedIn
Daniel Tunkelang, Head of Query Understanding at LinkedIn talks about search quality, IR, query understanding, and advice for data science enthusiasts. Don't miss: 4 steps to get your LinkedIn profile show up on top of search results.
on Mar 19, 2014 in Daniel Tunkelang, Information Retrieval, LinkedIn, Ranking, Search Quality
How Deep Learning Analytics Mimic the Mind
There has been a lot of buzz surrounding the potential impact deep learning will have in the field of analytics. This post looks at the origins of deep learning.
on Mar 19, 2014 in Deep Learning, DeepMind, FICO, Fraud, Neural Networks, Shallow Learning
KDnuggets 14:n06, How Many Data Scientists? Crossing the Chasm and Big Data; Trifacta vs Paxata
Latest analytics, data mining, and data science news, including How Many Data Scientists are out there, exclusive interviews with Geoffrey Moore (Crossing the Chasm), Paco Nathan (Apache Mesos and Big Data Math), and Quentin Clark (Power of BI), and LIONbook completed.
on Mar 19, 2014 in Crossing the Chasm, Gregory Piatetsky, LIONbook, Machine Learning, Paco Nathan, Paxata, Quentin Clark, Trifacta
Open Analytics Summit – Chicago, March 27 – KDnuggets discount
The Open Analytics Summit, Chicago, March 27 is a great place for CTOs, Engineers, Developers, Data Scientists, and others to network and learn about open source technologies and big data analytics. Exclusive KDnuggets discount - register today!
on Mar 18, 2014 in Chicago-IL, Open Analytics, Open Source, Summit
Need more insight into Web analytics and intelligence?
This course will show you how to improve your web site effectiveness and marketing measurements, with state-of-the-art web analytics and data mining techniques and real-life case studies.
on Mar 18, 2014 in Chicago-IL, Marketing, SAS, SAS Enterprise Miner, Web Analytics
Data Mining Best Practices – coming to a city near you!
The "Data Mining: Principles and Best Practices" course, presented by SAS and Elder Research, introduces you to the power and potential of data mining and shows you how to discover useful patterns and trends from data.
on Mar 18, 2014 in Best Practices, Boston-MA, Chicago-IL, Data Mining, Elder Research, New York-NY, San Francisco-CA, SAS
Alpine Data expects faster, easier Data Science with Spark
Alpine Data Labs becomes one of the first companies to be certified on Apache Spark, reported up to 100x faster than Hadoop. Alpine answers 3 questions from KDnuggets.
on Mar 18, 2014 in Alpine, Apache Spark, Collaborative, Databricks, Hadoop, Workflow
Big Data Library on Demand: The Leading Edge of Data Science
IE group Big Data Library offers many interesting presentations on demand. This week it is "The Leading Edge of Data Science".
on Mar 18, 2014 in Cyber Psychology, Data Science, On demand, Presentation, Samsung
Machine Learning in 7 Pictures
Basic machine learning concepts of Bias vs Variance Tradeoff, Avoiding overfitting, Bayesian inference and Occam razor, Feature combination, Non-linear basis functions, and more - explained via pictures.
on Mar 18, 2014 in Basis functions, Bayesian, Concepts, Machine Learning, Pictures, Variance
Trifacta – Tackling Data Wrangling with Automation and Machine Learning
Trifacta wants to solve the important problem of data cleaning and transformation by building better interfaces which use machine learning. It then aims to help enterprises make sense of their disparate data sources and cut down on time needed to prepare data for data science.
on Mar 17, 2014 in Data Preparation, Joseph Hellerstein, Paxata, Trifacta
Top KDnuggets tweets, Mar 14-16: Is Apache Spark the Next Big Thing? R Meta-Book – best CRAN posts assembled
Apache Spark promises to be the Next Big Thing in #Big Data - 100x faster than #Hadoop; An R Meta-Book - best CRAN posts assembled; The Beauty of pi - the fastest (and most incomprehensible) formula; Tips for Hiring Data Scientists: look for quants with business hustle.
on Mar 17, 2014 in Apache Spark, Book, CRAN, Hiring, pi, R, Ramanujan
Innocentive: INSTINCT – The IARPA Trustworthiness Challenge
This challenge investigates novel statistical techniques to identify neurophysiological correlates of trustworthiness. Deadline: May 5.
on Mar 16, 2014 in Challenge, Competition, IARPA, Innocentive, Neurophysiology, Trust
Top stories for Mar 9-15: How Many Data Scientists?
How Many Data Scientists are out there? LIONbook: Machine Learning + Intelligent Optimization - completed, free personal download; Boston AnalyticsWeek: Big Data and Analytics Unconference, March 24-28; Upcoming Webcasts on Analytics, Big Data, Data Science.
on Mar 16, 2014 in Boston-MA, Data Scientist, free download, LIONbook, Machine Learning, Optimization, Unconference
SciCast Crowdsourcing search for Malaysian Air Flight MH370
Where will the Malaysia Airlines Flight MH370 be found and what happened to it? Scientific crowdsourcing site SciCast has some predictions.
on Mar 15, 2014 in Crowdsourcing, Malaysia Airlines, MH370, Scicast
New book: Big Data, Mining, and Analytics: Components of Strategic Decision Making
This book ties together big data, data mining, and analytics to explain how readers can leverage them to extract valuable insights from their data.
on Mar 15, 2014 in Book, Business Intelligence, Decision Making, Text Mining, Tom Davenport
Webinar: Best Decision Trees just got better with Angoss KnowledgeSEEKER 9.0, Apr 3
New KnowledgeSEEKER 9.0 makes your modeling faster with automated workflow for building, refreshing, and reusing workflows - all with the click of a button. Learn more on Apr 3.
on Mar 14, 2014 in Angoss, Decision Trees, KnowledgeSEEKER 9.0, Webinar, Workflow
KDnuggets Twitter Social Network
We examine KDnuggets Twitter social network, created by NodeXL, a free, open-source template for Microsoft Excel for social network analysis.
on Mar 14, 2014 in Excel, Kirk D. Borne, NodeXL, Social Networks, Twitter
Top KDnuggets tweets, Mar 12-13: Machine learning explained in 10 pictures; Tutorial: Using Google BigQuery
Machine learning explained in 10 pictures. The most important: Bias vs Variance; A Tutorial example: Using Google BigQuery with R; Visualizing Google Analytics Data With R; Exploratory Data Analysis on Udacity: Investigate, Visualize, and Summarize Data Using R.
on Mar 14, 2014 in Bias, BigQuery, Google, Google Analytics, Machine Learning, R, Udacity, Variance
KDnuggets Exclusive: Interview with Geoffrey Moore: Crossing the Chasm and Big Data
KDnuggets talks with a noted author Geoffrey Moore about his "Crossing the Chasm" book, his vision for Big Data analytics, when Big Data will cross the chasm, and advice for entrepreneurs.
on Mar 14, 2014 in Adoption, Business Strategy, Crossing the Chasm, Geoffrey Moore, Interview, Life Cycle, Strata
Evolution of Fraud Analytics – An Inside Story
The amazing analytic innovations in payment fraud prevention can be grouped into three major categories: large data-set modeling, sparse data-set modeling, and false-positive reductions - a view from the inside.
on Mar 14, 2014 in False positive, FICO, Fraud analytics, Fraud Prevention, Neural Networks, Sparse data
Wharton Conference: Successful Applications Of Customer Analytics, May 1
As a research center at the intersection between academics and industry, WCAI will be showcasing presentations that illustrate a high level of rigor but are also broadly accessible to practitioners, with case studies from top companies including MGM Resorts, Cleveland Indians, Pfizer/Kaggle, GE Capital.
on Mar 14, 2014 in Applications, Customer Analytics, Philadelphia-PA, Predictive Models, Tom Davenport, WCAI, Wharton
Complimentary Chapter – Numbersense: How to Use Big Data to Your Advantage
Download a chapter of best-selling "Numbersense" book and learn how Groupon deals with predictive modeling challenges.
on Mar 13, 2014 in Email marketing, Groupon, JMP, Kaiser Fung, Numbersense
How Many Data Scientists are out there?
We examine indeed, LinkedIn, Kaggle, and other sources to investigate how many data scientists - in name and in function - are out there, and how strong is the demand.
on Mar 13, 2014 in Data Scientist, indeed, Kaggle, LinkedIn, McKinsey
Scholarships for first-ever Women in Statistics conference, May 15-17, Cary, NC
The JMP team and SAS Women’s Initiatives Network want to empower three statistics students by helping them attend the Women in Statistics conference. Apply by April 11.
on Mar 13, 2014 in Cary-NC, Conference, JMP, SAS, Scholarships, Statistics, Women
Analyze Data 10X Faster with the Lavastorm Analytics Engine
Download your free copy of Lavastorm Analytics Engine Public today and analyze your data faster, handle more volume, eliminate Excel headaches and improve data visibility.
on Mar 13, 2014 in Analytics Engine, Bloor Group, Business Rules, free download, Lavastorm
FICO: 20+ Years of Analytics Innovations to fight Fraud
FICO infographic shows 20+ years of analytics innovations protecting consumers from payments fraud. It highlights the most significant innovations in anti-fraud analytics for card payments, and offers interesting facts about payment fraud.
on Mar 12, 2014 in Analytics, FICO, Fraud, Fraud Detection, Fraud Prevention, Infographic, Innovation, Real-time
Top KDnuggets tweets, Mar 10-11: Deep Learning overview, free book; Best machine learning interview questions
Deep Learning: Methods and Application, free book from Microsoft; Best interview questions to evaluate a machine learning researcher; Good list of Machine Learning Libraries in Python: scikit-learn, pandas, Theano, NLTK.
on Mar 12, 2014 in Dancing, Deep Learning, Healthcare, Interview Questions, Machine Learning, Python, scikit-learn
IBM Big Data & Analytics Heroes: Gregory Piatetsky
Meet IBM Big Data and Analytics Hero for this week: Gregory Piatetsky.
on Mar 11, 2014 in Gregory Piatetsky, Hero, IBM, KDnuggets Honors
Upcoming Webcasts on Analytics, Big Data, Data Science
State of BI, Risk Assessment, Data Science Summit, Large-Scale Real-Time Learning, Data Mining: failure to launch, Online fraud detection, and more.
on Mar 11, 2014 in BI, Cloudera, Data Science, RapidInsight, Real-time, Risk Assessment
LIONbook: Machine Learning + Intelligent Optimization – completed, free personal download
This book combines two usually separated topics: machine learning and intelligent optimization, and does it with enough technical details to satisfy professionals, but also with concrete examples, vivid images, and fun. Buy a low-cost paperback or ebook (Kindle), or download a free PDF.
on Mar 11, 2014 in Book, ebook, free download, Kindle, Machine Learning, Optimization
Sentiment Analysis Symposium Highlights
Highlights from Sentiment Analysis Symposium held recently in New York: Affective Computing: sentiment from facial expressions, need to market to a “tribe” of people, social media speed of appearance, IBM notion of an Engaged Employee, and more.
on Mar 11, 2014 in Affective Computing, Dell, Highlights, IBM, MIT Media Lab, Oxford, Sentiment Analysis, Steve Gallant, Symposium
Predictive Analytics World Tour
Predictive Analytics World is the industry's leading conference series for business professionals, managers and practitioners, held in 2014 in San Francisco, Toronto, Chicago, Boston, London, and Berlin.
on Mar 11, 2014 in Berlin-Germany, Boston-MA, Chicago-IL, London-UK, PAW, Predictive Analytics World, San Francisco-CA, Toronto-Canada
Cloudera Data Science Challenge
Your task is to analyze a large amount of data from Medicare and try to detect abnormal data -- providers, areas, or patients with unusual procedures and/or claims. Challenge starts March 31, 2014.
on Mar 10, 2014 in Anomalies, Challenge, Cloudera, Data Science, Medicare
Lipari Summer School on Computational Social Science
The Lipari 2014 Summer School will examine the intersection between parameters that define a city as smart and the enabling technology that guarantees proper planning and implementation. Apply by April 30.
on Mar 10, 2014 in Italy, Lipari, Smart City, Social Science, Spatio-Temporal, Summer School
Boston AnalyticsWeek: Big Data and Analytics Unconference, March 24-28
Boston first week-long Big Data And Analytics Unconference wants to bring the limelight back to Analytics, bringing 5 evenings of leading industry speakers and panels, and with registration at nominal $10 to cover expenses.
on Mar 10, 2014 in Analytics, AnalyticsWeek, Big Data, Boston-MA, Unconference
KDnuggets Exclusive: Part 2 of the interview with Paco Nathan
We discuss about Paco's upcoming book "Just Enough Math", problems with current university curriculum around Math for Data Science and Big Data trends.
on Mar 10, 2014 in Apache, Big Data Player, BioCoder, Hadoop, Interview, Mesos, Mesosphere, Paco Nathan, Trends
KDnuggets Exclusive: Interview with Paco Nathan, Chief Scientist at Mesosphere
KDnuggets talks with Paco Nathan, computer scientist, OSS developer, author, and advisor about Apache Mesos, Cascading, his books and Big Data trends.
on Mar 10, 2014 in Apache Mesos, Big Data Player, Cascading, Hadoop, Interview, Mesosphere, Monoids, Paco Nathan
Dancing Statistics – who says statistics cannot be fun?
Four little dance routines explain statistical concepts of frequency distributions, sampling, standard error, variance, correlation, and correlation != causation. Enjoy!
on Mar 10, 2014 in Correlation, Dancing, Statistics
Top KDnuggets tweets, Mar 7-9: Experiments with Twitter and IPython; Cloudera Data Scientist Solution Kit
Learn very useful skills! #DataScience Experiments with Twitter and IPython; Cloudera Data Scientist Solution Kit; For data science hackers: combining Emacs, ESS and R for Zombies; Mashape - Free Natural Language Processing Service.
on Mar 10, 2014 in Cloudera, Dancing, Emacs, ESS, IPython, Mashape, NLP, R, Twitter
Top stories for Mar 2-8: Do’s and Don’t of Data Mining; Wolfram Breakthtough language
The Dos and Donts of Data Mining; Wolfram Breakthrough Knowledge-based Programming Language - what it means for Data Science? Introduction to Random Forests for Beginners - free ebook; Exclusive Interview with Quentin Clark, Microsoft Data Platform Group.
on Mar 9, 2014 in Data Mining, ebook, Interview, Microsoft, Quentin Clark, random forests algorithm, Wolfram
February Analytics, Big Data, Data Mining companies and startups activity
February 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Cloudant, Klout, Statwing, 300, Palantir, Talent Neuron, BlueKai.
on Mar 9, 2014 in Acquisitions, Cloudant, Klout, Palantir, Startups
Future of Consumer Intelligence, May 19-21, KDnuggets Discount
FOCI is unique aggregation of diversity across insights, data science, marketing science, social science with technology as a common thread. 20% off with KDnuggets discount.
on Mar 8, 2014 in Consumer Intelligence, Consumer Loyalty, FOCI, Jer Thorp, Kurzweil, Los Angeles-CA
Top KDnuggets tweets, Mar 5-6: Data Science Backlash begins; Intro to Random Forests® for Beginners – free ebook
Backlash begins: Data Science is not a science, and not a good job prospect; Intro to Random Forests for Beginners - free ebook; Must read for data scientists: Q - new Data Language; Book: R for Business Analytics.
on Mar 7, 2014 in Backlash, Data Definition, Data Science, ebook, Q, R, random forests algorithm
Paxata automates Data Preparation for Big Data Analytics
Paxata wants to shorten and automate the data cleaning process, by augmenting data from a huge number of sources and by using machine learning to see statistical similarities between the data imported.
on Mar 7, 2014 in Accel Partners, Data Preparation, MDM, Paxata
Top stories in February: 3 Ways to test the accuracy; Exclusive Interview with Yann LeCun; One Page R
3 Ways to Test the Accuracy of Your Predictive Models; KDnuggets Exclusive: Interview with Yann LeCun, One Page R: A Survival Guide to Data Science with R; Cartoon: Data Scientist Valentine Day Prediction.
on Mar 6, 2014 in Accuracy, Cartoon, R, Top stories, Valentine's Day, Yann LeCun
MS in Data Mining, Analytics, and Knowledge Discovery at University Paris 13
This MSc focuses on data mining, business analytics, and knowledge discovery and is well-suited for students with BA in CS/Math/Stats.
on Mar 6, 2014 in Analytics, Data Mining, France, Knowledge Discovery, Master of Science, Paris-France
Introduction to Random Forests® for Beginners – free ebook
Random Forests is of the most powerful and successful machine learning techniques. This free ebook will help beginners to leverage the power of Random Forests.
on Mar 6, 2014 in Beginners, Decision Trees, ebook, Free, Kaggle, random forests algorithm, Salford Systems
KDnuggets Exclusive: Part 2 of the interview with Quentin Clark, CVP, Microsoft Data Platform Group
We discuss Microsoft decision to embrace Hadoop as the standard, collaboration with Hortonworks, and advice to newbies in Data Science.
on Mar 6, 2014 in Advice, Azure HDInsight, Data Platform, Hadoop, Hortonworks, Microsoft, Strata 2014
KDnuggets Exclusive: Interview with Quentin Clark, CVP, Microsoft Data Platform Group
KDnuggets talks with Quentin Clark, Corporate Vice President, Microsoft Data Platform Group. In the interview, we discuss Power BI for Office 365, Big Data trends and Microsoft’s strategic decisions.
on Mar 6, 2014 in Accessibility, Data Platform, Interview, Microsoft, Office 365, Power BI, Quentin Clark, Strata 2014
Big Data Innovation Summit, Apr 9-10, Santa Clara
Listen to 80+ speakers from top companies including JP Morgan Chase, Pfizer, Netflix, Bing, American Express, Samsung, IBM, and learn how to tackle Big Data challenges.
on Mar 6, 2014 in Innovation, Santa Clara-CA, Summit
Book: R for Business Analytics
This book helps you kick-start with analytics including chapters on data visualization, code examples on web analytics and social media analytics, clustering, regression models, text mining, data mining models and forecasting.
on Mar 5, 2014 in Business Analytics, R
SIGKDD Data Science/Data Mining Doctoral Dissertation Award Nominations
This annual award by ACM SIGKDD seeks to recognize outstanding research by doctoral candidates in the field of data mining, data science, and knowledge discovery. Submit nominations by Apr 30.
on Mar 5, 2014 in ACM, Awards, Dissertation, Nominations, SIGKDD
Top KDnuggets tweets, Mar 3-4: Accenture/MIT data science challenge; Spark graduates, 100x faster than Hadoop
Accenture and MIT data science challenge - analyze City of Chicago; Spark graduates from Apache Incubator, 100x faster than Hadoop over in-memory data; Stanford Data Mining, Finance, Statistics Courses Online; Data Mining Cup 2014 - Student Competition starts.
on Mar 5, 2014 in Accenture, Apache Spark, Chicago-IL, Hadoop, MIT, Online Education, Stanford, Student Competition
etcML Promises to Make Text Classification Easy
etcML is a new and free tool that allows even novice user use the power of machine learning and text classification.
on Mar 5, 2014 in etcML, Machine Learning, Stanford, Text Classification
KDnuggets 14:n05, Exclusive: Yann LeCun interview; New Salary Poll; Gartner Analytics MQ
Latest analytics/data mining news, including Features (10) | News (6) | Software (5) | Webcasts (1) | Courses (6) | Meetings (5) | Jobs (7) | Academic (1) | Publications (5) | Top Tweets (5) | CFP (18) .
on Mar 5, 2014 in Advanced Analytics, Awards, Boston-MA, Gartner, SIGKDD, Techcon, Yann LeCun
March-June Analytics, Big Data and Data Science Meetings
Coming soon: PAW San Francisco, GigaOM Structure, Predictive Analytics/Big Data Innovation Summits, Big Data Techcon, INFORMS Boston, SPB 14, SDM 14, PASS Business Analytics, PAKDD 2014, and many more.
on Mar 4, 2014 in Boston-MA, Chicago-IL, INFORMS, London-UK, PAW, Philadelphia-PA, San Francisco-CA, Toronto-Canada
KDnuggets Annual Analytics/Data Science Salary/Income Poll
KDnuggets 2013 Analytics/Data Science Salary/Income analysis was a top read story of the year. Please vote - anonymously - in 2014 KDnuggets Analytics & Data Science Salary/Income poll - we promise the results will be interesting!
on Mar 4, 2014 in Poll, Salary
Webinar: Building Predictive Apps with BigML API, March 11
BigML interface makes machine learning easy to use, the underlying API provides the same functionality enabling data scientists to quickly implement many machine learning and predictive applications. Learn more on March 11.
on Mar 4, 2014 in API, BigML, Machine Learning, Python
Stanford Data Mining, Finance, and Statistics Courses Online
With Stanford world-class online certificates, show advanced knowledge in Mining Massive Data Sets, Financial Risk Analysis, and Quantitative Methods in Finance. Spring quarter enrollment until Mar 26.
on Mar 4, 2014 in Certificate, Data Mining, Finance, Online Education, Stanford
Vendor-Neutral Hands-On Training in Data Mining [ Los Angeles, April | Wash-DC, May ]
Successful analytics in the big data era start not with data or software, but with immersive hands-on training and goal-driven strategy. Get this training from The Modeling Agency.
on Mar 3, 2014 in Analytic Maturity, Data Mining Training, Los Angeles-CA, TMA, Training, Washington-DC
Data Mining Cup 2014 – Student Competition – Starts
The DMC Competition is one of the largest Data Mining contests in the world, focusing on students. Registration starts Mar 3, task details will be published Apr 2.
on Mar 3, 2014 in Competition, Data Mining Cup, Germany, Registration, Student Competition
ACM SIGKDD Innovation and Service Awards – Call for Nominations
ACM SIGKDD Innovation and Service Awards recognize outstanding technical innovations and outstanding professional contributions to the field of Big data, Data Mining, Knowledge Discovery, and Predictive Analytics.
on Mar 3, 2014 in ACM, Awards, Data Mining, Innovation, Knowledge Discovery, Nominations, Service, SIGKDD
Top KDnuggets tweets, Feb 28 – Mar 2: Using R with Twitter – great tutorial; The Dos and Donts of Data Mining
Using R with Twitter - great tutorial in Rstudio; The Dos and Donts of Data Mining; Wolfram Breakthrough Knowledge-based Programming Language; Online Data Science Certificates in Analytics and Programming for Data Science.
on Mar 3, 2014 in Certificate, Data Mining, Online Education, R, Rstudio, Twitter, Wolfram
Book: Ask, Measure, Learn – Social Media Analytics for Customer Behavior
Ask-Measure-Learn is a rare book intended both for a manager and for a data scientist. It presents a framework that helps you ask the right questions, measure the right data, and then learn from the results.
on Mar 3, 2014 in Book, Customer Behavior, Social Media Analytics
PAW: Predictive Analytics World Toronto, May 12-15
The experts from leading companies are headed to Toronto to share with you how they are applying predictive analytics through case studies and in-depth sessions at PAW Toronto. KDnuggets Discount.
on Mar 2, 2014 in PAW, Predictive Analytics World, Toronto-Canada
Wolfram Breakthrough Knowledge-based Programming Language – what it means for Data Science?
The coming Wolfram Programming language, 30 years in making, will probably be the largest, most comprehensive, and most knowledge-based programming language ever, and can be a significant advance for data science.
on Mar 2, 2014 in Knowledge-Based, Programming, Raspberry Pi, Wolfram
Top stories for Feb 23 – Mar 1: Exclusive Interview with Yann LeCun; Graf.ly; Gartner MQ for Advanced Analytics
KDnuggets Exclusive: Interview with Yann LeCun, Deep Learning Expert, Director of Facebook AI Lab; Graf.ly: Making beautiful, interactive graphs; SAS, IBM, RapidMiner, Knime leaders in Gartner MQ for Advanced Analytics Platforms; The Dos and Donts of Data Mining.
on Mar 2, 2014 in Facebook, Gartner, Graf.ly, IBM, Knime, RapidMiner, SAS, Yann LeCun
Why Predictive Analytics Marketplaces are not taking off, and how to fix it
Three main hurdles holding back Predictive Analytics Marketplaces are a highly fragmented data mining tools market, limited support for customization, and lack of commitment. We examine how to overcome them.
on Mar 1, 2014 in How to fix, Hurdles, Marketplace, Predictive Analytics, Snap Analytx
The Do’s and Don’ts of Data Mining
Leading data mining and analytics experts give their favorite do's and don'ts, from "Do plan for data to be messy" to "Do not underestimate the power of a simpler-to-understand solution".
on Mar 1, 2014 in
Online Data Science Certificates: Analytics and Programming for Data Science
Statistics.com, a leading provider of online education in statistics and analytics announces two new online certificates for Data Science - "Analytics for Data Science" and "Programming for Data Science".
on Mar 1, 2014 in Certificate, Data Science, Hadoop, Python, Risk Modeling, SQL, Statistical Modeling, Statistics.com
|