Top KDnuggets tweets, Oct 29-30: If you can’t code, you can’t be a data scientist; 13 Machine Learning Books
The #strataconf debate: If You Can't Code, You Can't Be a Data Scientist; 13 Machine Learning Books recommended by Berkeley machine learning expert; Data Blending for Dummies - free ebook from Alteryx; Data Science 2.0, upcoming book from Vincent Granville.
on Oct 31, 2014 in Alteryx, Data Scientist, Deep Learning, Machine Learning, Michael Jordan, Vincent Granville
Wikibon Big Data Capital Markets Day – Big Data NYC 2014
One of the biggest events at Big Data NYC 2014 was the insightful presentation by Jeff Kelly from WikiBon. We provide here the key takeaways.
on Oct 30, 2014 in Big Data, Hadoop, Jeff Kelly, Market Research, New York-NY, NoSQL, Wikibon
Big Data Winter ahead – unless we change course, warns Michael Jordan
We have to have error bars around all our predictions, says machine learning expert Michael Jordan. Otherwise it's gambling, and too many failed predictions can lead to big disappointment with Big Data - a Big Data Winter.
on Oct 30, 2014 in Big Data Winter, Michael Jordan, Overfitting
Top KDnuggets tweets, Oct 27-28: Twitter Breakout detection in the wild; Marc Andreessen on #BigData and finance
Dilbert on inability of designers predict results of A/B tests; Marc Andreessen @pmarc, web pioneer, VC @a16z on #BigData, upending finance; Will Deep Learning take over Machine Learning, make other algorithms obsolete?;.@WillJHenry @data_nerd @KirkDBorne Data Scientists don't wear bowties!
on Oct 29, 2014 in Anomaly Detection, Cartoon, Data Scientist, Deep Learning, Dilbert, Marc Andreessen, Twitter
KDnuggets 14:n28, Top Data Science methodology; Big Data Halloween costume
KDnuggets latest stories, including: CRISP-DM, still the top methodology for analytics, data science; Cartoon: Halloween Costume for Big Data; Will Deep Learning take over Machine Learning? DM radio with KDnuggets, Oct 30.
on Oct 29, 2014 in Big Data Privacy, Cartoon, CRISP-DM, Deep Learning, DM Radio
Analytics Street, Boston Data Analytics Conference, Nov 5-7
Boston Analytics Street includes a workshop, a career fair, 3 Days, 45 Businesses, 16 Panels, 10 talks, 6 Keynotes and lots of learning and networking. Nov 5-7, 2014.
on Oct 28, 2014 in AnalyticsWeek, Boston-MA, Vishal Kumar
Cartoon: Halloween Costume for Big Data
New KDnuggets cartoon looks at the appropriate Halloween costume for Big Data and its companion, No Privacy.
on Oct 28, 2014 in Big Data, Cartoon, Halloween, Privacy
Big Data & Analytics Innovation Summit, Australia: Day 2 Highlights
Highlights from the presentations by Big Data leaders from Paypal, Huawei and Qantas on day 2 of Big Data & Analytics Innovation Summit 2014 in Sydney, Australia.
on Oct 28, 2014 in Analytics, Big Data, Conference, Consumer Insights, IE Group, Smartphone, Sydney-Australia, Text Mining
CRISP-DM, still the top methodology for analytics, data mining, or data science projects
CRISP-DM remains the most popular methodology for analytics, data mining, and data science projects, with 43% share in latest KDnuggets Poll, but a replacement for unmaintained CRISP-DM is long overdue.
on Oct 28, 2014 in CRISP-DM, Data Mining, James Taylor, Methodology, Poll
Upcoming Webcasts on Analytics, Big Data, Data Science – Oct 28 and beyond
DM Radio: Predictive Tools Are Pervasive - with Gregory Piatetsky (KDnuggets), Predixion, others; Deep Learning + Apache Spark; Data Mining - Failure to Launch; Demystify your data flows, Data hiding, and more.
on Oct 27, 2014 in Data Mining, Deep Learning, DM Radio, Gregory Piatetsky, Hadoop
Top KDnuggets tweets, Oct 24-26: Why Deep Learning is likely to make other Machine Learning algorithms obsolete
Why Deep Learning is likely to make other Machine Learning algorithms obsolete; Open Source Distributed Analytics Engine with SQL interface; Data Mining Reveals How News Coverage Varies Around the World; 3 Great (and Free) Data Science Books You Can Read Now.
on Oct 27, 2014 in Data Mining Books, Deep Learning, Free ebook, Hadoop, SQL
Will Deep Learning take over Machine Learning, make other algorithms obsolete?
Will deep learning will take over machine learning and make other algorithms obsolete, or is it too complex to use on simpler problems? We look at both sides of this discussion.
on Oct 27, 2014 in Deep Learning, Machine Learning, Quora
H2O World, Open Source Machine Learning Meeting, Nov 18-19, Mountain View
H2O World (Nov 18-19, Mountain View) is where the users of the very popular Open Source Machine Learning Engine H2O gather to share their knowledge and know-how to build Smart Applications.
on Oct 27, 2014 in Deep Learning, H2O, Machine Learning, Mountain View-CA, Open Source, Python, R, Scala
Simplilearn: Enroll in Analytics and Big Data Training, get free Amazon Fire (until Oct 31)
Be technically competent in data analytics processes with Simplilearn Business Analytics, Big Data, SAS, R, and Hadoop training. Enroll by Oct 31 and get a free Amazon Fire.
on Oct 27, 2014 in Data Science Education, Hadoop, Online Education, R, SAS, Simplilearn
WCAI Research: Desktop Software Subscription Analysis
A new dataset on when and how customers renew software licenses is now available for research into software purchase behavior. Register for Nov 21 webinar and submit proposals by Dec 8.
on Oct 27, 2014 in Churn, Research proposal, WCAI, Wharton
Making Sense of Public Data – Wrangling Jeopardy – Part 2
Wrangling Jeopardy (Part 2) describes the remaining steps of the data transformation process, detailing how we used Trifacta to structure, clean, enrich and distill Jeopardy data for analysis.
on Oct 27, 2014 in Data Preparation, Data Processing, Jeopardy, Trifacta
Big Data accelerates medical research? Or not?
Take a look at how big data in healthcare brings big opportunities, but along with those opportunities come great risk if statistics aren't carefully applied to those large datasets.
on Oct 26, 2014 in Big Data, Healthcare, Overfitting, Research
Top stories for Oct 19-25: Ebola Data Science Lessons; DM Radio, Oct 30 on Predictive Tools with KDnuggets, Predixion
Ebola Analytics and Data Science Lessons; DM Radio: Predictive Tools Are Pervasive, with KDnuggets, Predixion, RedPoint, and Appnomic, Oct 30; Big Data for Social Good IBM + Hadoop Challenge; TweetNLP: Twitter Natural Language Processing.
on Oct 26, 2014 in Challenge, DM Radio, Ebola, Hadoop, NLP, Social Good, Top stories, Twitter
Zipfian Academy: Become a Data Scientist in 12 Weeks
Zipfian Academy trains engineers, scientists, and analysts to become data scientists in an intense full-time program in San Francisco. 93% of graduates find data science roles in 6 months. Applications for Winter cohort due Nov 14.
on Oct 25, 2014 in Data Science Education, San Francisco-CA, Zipfian Academy
Top-Read Big Data Journal Open Access Articles
Top-read open access articles include: Why Big Data Won't Cure Us, Predictive Modeling With Big Data: Is Bigger Really Better?, and A Data Scientist's Guide to Start-Ups.
on Oct 25, 2014 in Big Data, Big Data Journal, Foster Provost, Journal
IEEE ICDM 2014 Outstanding Service Award: Prof. Rao Kotagiri
IEEE ICDM 2014 highest recognition for service achievements in Data Mining goes to Prof. Rao Kotagiri from U. of Melbourne, for his work on editorial boards of IEEE TKDE, VLDB, and SADM journals, and major contributions to ICDM, ICDE, SIGMOD, PAKDD, and other conferences.
on Oct 25, 2014 in Awards, ICDM, IEEE, Rao Kotagiri
IEEE ICDM 2014 Research Contributions Award: Prof. Jian Pei
Jian Pei wins a highest recognition for research in Data Mining for his work on the core frontiers of data mining, including pattern mining, classification, clustering, anomaly detection and outlier analysis.
on Oct 25, 2014 in Awards, ICDM, IEEE, Jian Pei
Top KDnuggets tweets, Oct 22-23: Baidu revenue jumps after Deep Learning use; Great viz: chess piece survival
Great viz: chances of survival of #chess pieces in average game; Baidu, 'Chinese Google', had big revenue jump after it started using Deep Learning; 4 ways to become a Data Scientist w/out a PhD; Machine-Learning expert Michael Jordan on the Delusions of #BigData.
on Oct 24, 2014 in Ajay Ohri, Baidu, Chess, Deep Learning, Mark Zuckerberg, NLP
TweetNLP: Twitter Natural Language Processing
A short overview of Natural Language Processing tools and utilities developed by Prof. Noah Smith, CMU and his team to analyze Twitter data.
on Oct 24, 2014 in Advanced Analytics, ARK, CMU, Datasets, NLP, Speech, Tools, Twitter
Text Mining and Election Analytics in Massachusetts
Election season is coming! Take a deeper look at some political dynamics with indico’s political analysis API.
on Oct 24, 2014 in API, Elections, indico, Massachusetts, Startups, Text Analytics, Web services
Supermarket customers segmentation using Self-Organizing Mapping
See how a leading European supermarket chain improved customer value and profitability and identified key customer groups by applying business intelligence and analytics techniques like self-organizing maps.
on Oct 23, 2014 in Business Intelligence, Clustering, Consumer Insights, Neural Networks
Predictive Analytics World events in 2015
Check 2015 Predictive Analytics World events focused on business, workforce, healthcare, government, and manufacturing - in San Francisco, Chicago, Boston, Washington DC, London, and Berlin.
on Oct 23, 2014 in Berlin-Germany, Boston-MA, Chicago-IL, London-UK, PAW, Predictive Analytics World, San Francisco-CA, Washington-DC
Top KDnuggets tweets, Oct 20-21: 4 ways to become a Data Scientist w/out a PhD; Ebola and Data Science Lessons
4 ways to become a Data Scientist w/out getting a PhD; Ebola Analytics and Data Science Lessons; Apple: Data Analyst; How to Be a Data Scientist: An Interview with Dr. Pete Meyers of Moz.
on Oct 22, 2014 in Apple, Data Scientist, Ebola, Internet of Things, Startups
Big Data & Analytics Innovation Summit, Australia: Day 1 Highlights
Highlights from the presentations by Big Data leaders from GE Capital, Datawatch and MapR Technologies on day 1 of Big Data & Analytics Innovation Summit 2014 in Sydney, Australia.
on Oct 22, 2014 in Conference, Data Visualization, Datawatch, GE, Hadoop, IE Group, MapR, Sydney-Australia
Piketty Revisited: Improving Economics through Data Science
How Data Curation and Data Science Enable More Faithful Economics (In Much Less Time) - a leading database researcher explains.
on Oct 22, 2014 in Data Curation, Economics, Michael Brodie, Tamr, Thomas Piketty
KDnuggets 14:n27, Ebola Data Science Lessons; Data Science methodology? Spotlight on Academic Research
Ebola Analytics and Data Science Lessons; New Poll: Methodology for Analytics, Data Mining Projects? Can Data Science Save Humanity from Mosquitoes? GraphDB - 3 versions; DM Radio with KDnuggets, Predixion on Oct 30.
on Oct 22, 2014 in Data Science, DM Radio, Ebola, GraphDB, Mosquitoes
TMA Predictive Analytics Data Mining Training
[Las Vegas, Dec | Orlando, Feb]
Successful analytics in the big data era does not start with data and software, but with hands-on, immersive training and goal-driven strategy - get it from The Modeling Agency. Next courses in Las Vegas (Dec) and Orlando (Feb).
on Oct 21, 2014 in Data Mining Training, Las Vegas-NV, Orlando-FL, TMA
DM Radio: Predictive Tools Are Pervasive, with KDnuggets, Predixion, RedPoint, and Appnomic, Oct 30
Today there are many companies offering predictive analytics tools and solutions. How, where, and when can these new tools be leveraged? Listen to DM Radio with KDnuggets, Predixion, RedPoint, and Appnomic, on Oct 30.
on Oct 21, 2014 in DM Radio, George Corugedo, Gregory Piatetsky, Ray Solnik, Will Ford
Salford Comprehensive Data Science Training, Dec 3-5, San Diego or Online
Learn the basics tree-structured data mining with CART, and progress to more advanced topics including Linear, Logistic, Nonlinear, Regularized, Lasso, MARS, TreeNet (Stochastic Gradient Boosting) and RandomForests(r), including Latest Refinements and Model Compression.
on Oct 21, 2014 in CART, Data Mining Training, MARS, Online Education, random forests algorithm, Salford Systems, San Diego-CA
TESC 18-month Online MBA in Data Analytics
Get an affordable online MBA in Data Analytics from Thomas Edison State College - study both foundational business courses and how to analyze and present the data.
on Oct 21, 2014 in MBA, Online Education, TESC
Ebola Analytics and Data Science Lessons
We analyze latest Ebola data, examine the recent slowdown in growth of cases in Liberia, and analyze its likely causes. Many problems with data lend themselves to good data science lessons.
on Oct 20, 2014 in Data Science, Ebola, Guinea, Healthcare, Liberia, Sierra Leone
Upcoming Webcasts on Analytics, Big Data, Data Science – Oct 21 and beyond
Big Data Changes everything, Deep Learning + Apache Spark, Data Mining - Failure to Launch, Linear Regression in Python, Demystify your data flows, and more.
on Oct 20, 2014 in Apache Spark, Datameer, Deep Learning, Lavastorm, Python
Top KDnuggets tweets, Oct 17-19: Air traffic analyzed to predict Ebola spread; Cool public data for data science
Air traffic data analyzed to predict Ebola spread; Some cool public data sources you can use for your next data science project; Data science can't be point and click ! Finding random correlation is too easy; Bayes Rule in an animated gif.
on Oct 20, 2014 in Bayes Rule, Data Science, Datasets, Ebola, Overfitting
Request: Crowdsourcing Health and Nutrition Tweets
Help investigate the relationships between geo-location, age, gender, and nutrition through the medium of Twitter by labeling tweets for this research project.
on Oct 20, 2014 in Crowdsourcing, Healthcare, Nutrition, Social Media Analytics, Twitter
Big Data for Social Good IBM + Hadoop Challenge
Use city data to develop great applications for social good and earn prizes in IBM's new Big Data for Social Good challenge, starting in November 10. More information on eligibility, terms, and prizes will be available at launch.
on Oct 20, 2014 in Big Data, Challenge, Hadoop, IBM, Social Good
Boston Docker Global Hack Day and Meetup, Oct 30
Docker is an open platform for developers and sysadmins to build, ship, and run distributed applications. Join other Boston-area developers for Docker Global Hack Day #2 at O'Reilly Media in Cambridge, MA.
on Oct 19, 2014 in Boston-MA, Cambridge-MA, Competition, Docker, Hackathon
Top stories for Oct 12-18: New Poll: Methodology for Analytics, Data Mining Projects? Big Data Is Not Big Context
New Poll: Methodology for Analytics, Data Mining, Data Science Projects? Big Data Is Not Big Context; Big Data on the Internet of Things; ADW, free software to measure semantic similarity.
on Oct 19, 2014 in Big Data, Context, Internet of Things, Poll, Top stories
Big Data and Hadoop, Big Data Boot Camp LA
Big Data Boot Camp LA provided attendees a comprehensive understanding of Big Data and Hadoop technologies. Sujee Maniyam provided a good technical overview of Hadoop and current trends. We provide key takeaways.
on Oct 17, 2014 in Big Data, Bootcamp, Elephant Scale, Global Big Data Conference, Hadoop, Los Angeles-CA, Sujee Maniyam, Training
Overcoming Text Analytics Barriers
Getting the value from companies text assets can be both time consuming and expensive. Learn how to overcome these barriers with “Overcoming Text Analytics Barriers" whitepaper, and at Text Analytics Summit West in San Francisco, Nov 4-5. KDnuggets discount.
on Oct 17, 2014 in Data-Driven Business, Janine Johnson, San Francisco-CA, Text Analytics
Interactive Network and Graph Data Repository
The network repository currently hosts over 500+ graphs/networks that span 19 collections of graphs from social science, machine learning, scientific computing, and many others.
on Oct 17, 2014 in Datasets, Graph Analytics, Graph Visualization, Network Graph
Top KDnuggets tweets, Oct 15-16: STOP and THINK cartoon; Math Model predicts Ebola to burn out in December
STOP and THINK, sometimes the simplest caption is the best; This model tracks Ebola outbreak well so far, predicts Ebola to burn out in December; BAH launches online course "Explore Data Science"; Watch: R wizard Hadley Wickham dplyr tutorial at useR! 2014 conf.
on Oct 17, 2014 in Bob Mankoff, Cartoon, Data Science Education, Ebola, GraphLab, Hadley Wickham, R, Strata
DataLadder outperforms IBM and SAS in Record Linkage
Data scientists from the Centre for Data Linkage at Curtin U. found that Connecticut-based firm Data Ladder has outperformed several major companies on record linkage.
on Oct 16, 2014 in Data Ladder, Deduplication, IBM, Record Linkage, SAS
Boston Data Festival Celebrates Big Data Community, Nov 3-8
Celebrate the big data community, see many world-class speakers, and participate in insightful events at this year's Boston Data Festival. The event takes place November 3-8.
on Oct 16, 2014 in Andy Palmer, Big Data, Boston-MA, Festival, Kaggle, Thomson Reuters
LinkedIn Economic Graph Challenge
Leverage the LinkedIn Economic Graph for your innovative and ambitious ideas for increasing economic value and gaining insights into economic opportunities using LinkedIn data and support. Proposals due Dec 15.
on Oct 16, 2014 in Challenge, Deepak Agarwal, Economic Graph, Graph Analytics, LinkedIn, Research proposal
TDWI Orlando, Dec 7-12, Premier Education Event for BI, Big Data and Analytics
Plan your week with the complete 6-day agenda, including course descriptions, keynotes, exhibit hall times, networking events, and BI certification opportunities.
on Oct 16, 2014 in Business Intelligence, Data Science Education, Data Warehouse, Orlando-FL, TDWI
Top KDnuggets tweets, Oct 13-14: Data mining classics: Classifying Shakespearean Drama
Also - The Open Source Data Science MS Curriculum: UW/Coursera + Harvard ; Statistical Modeling vs Machine Learning - mapping the terms and concepts; Very useful! Python 2.7 Quick Reference Sheet.
on Oct 15, 2014 in Cheat Sheet, Coursera, Data Science Education, MS in Data Science, Python, Shakespeare
Strata Hadoop World NYC – Watch Live, Oct 16
Strata + Hadoop is a leading conference on Big Data. The Strata + Hadoop NYC is sold out, but you can watch it live starting Oct 16 - see how.
on Oct 15, 2014 in Hadoop, New York-NY, Strata
Lavastorm Wizard and Witches Challenge
Make-believe costume company, WigWarts Costumes, is launching a new glow in the dark range of costumes in time for Halloween 2014. Help them combine and analyze data in Lavastorm Wizard and Witches Challenge - entries due Oct 30.
on Oct 15, 2014 in Analytics Engine, Challenge, Halloween, Lavastorm
MS in Analytics from the University of San Francisco
The MS in Analytics at U. San Francisco is an intensive one-year program that provides students with the skills necessary to develop techniques and processes for data-driven decision-making.
on Oct 14, 2014 in Master of Science, MS in Analytics, San Francisco-CA, University of San Francisco
Big Data on the Internet of Things
ParStream unveils the first analytics platform purpose-built for the speed and scale of the Internet of Things (IoT).
on Oct 14, 2014 in Gartner, Internet of Things, IoT, ParStream
Book: Data Mining for Managers
This book by a leading data mining consultant is meant for both practitioners and end users of data mining solutions, and it focuses more on the data and less on the math.
on Oct 14, 2014 in Big Data, Book, Data Mining, Manager, Richard Boire
Text Analytics West Summit – Use Data Scientists time productively
Data scientists time is expensive - it should be used productively to help answer important questions and help business grow. Use their time well at the Text Analytics West Summit, SF, Nov 4-5 - see KDnuggets Offer.
on Oct 14, 2014 in Data-Driven Business, San Francisco-CA, Summit, Text Analytics
GraphDB, a powerful graph database, 3 versions and KDnuggets offer
GraphDB blends text mining, powerful SPARQL queries, semantic annotation and semantic search into a powerful database that infers new meaning at scale. Free GraphDB Lite and special KDnuggets offer for GraphDB Standard and Enterprise.
on Oct 14, 2014 in Graph Databases, GraphDB, Ontotext, RDF, Semantic Analysis, SPARQL, Triplestore
New Poll: Methodology for Analytics, Data Mining, Data Science Projects?
KDnuggets revisits the question of methodology, and asks "What main methodology are you using for your analytics, data mining, or data science projects?" Please vote.
on Oct 13, 2014 in CRISP-DM, Methodology, Poll, SEMMA
Upcoming Webcasts on Analytics, Big Data, Data Science – Oct 14 and beyond
Hadoop means Business, Which Half of Your Graphs are Lying, Deep Learning + Apache Spark, Data Mining - Failure to Launch, Linear Regression in Python, and many more.
on Oct 13, 2014 in Apache Spark, Data Visualization, Deep Learning, Hadoop, Python
Top KDnuggets tweets, Oct 10-12: 7 Most Data Rich Companies in the World
7 Most Data Rich Companies in the World; R and #DataScience Webinar slides - status, why, code examples; Another list of 200+ #BigData thought leaders to follow on Twitter; Popular #BigData predictive apps and APIs.
on Oct 13, 2014 in API, Data Science, GE, Humor, IBM, Kaggle, R
ADW, free software to measure semantic similarity
ADW is a software for measuring semantic similarity of arbitrary pairs of lexical items, from word senses to texts, based on "Align, Disambiguate, and Walk", a WordNet-based state-of-the-art semantic similarity approach. Get it on github.
on Oct 13, 2014 in Natural Language Processing, Semantic Analysis, Similarity, WordNet
Big Data Is Not Big Context
Learn about common misconceptions when approaching big data problems, and how the ambiguity of human language requires more sophisticated techniques for more accurate understanding.
on Oct 12, 2014 in Big Data, Context, Natural Language Processing, Semantic Analysis
Top stories for Oct 5-11: Analyzing Ebola spread; Data science shows surveys may assess language more than attitudes
Analyzing Ebola - Is it spreading at exponential rate?; Data science shows surveys may assess language more than attitudes; Making Sense of Public Data - Wrangling Jeopardy.
on Oct 12, 2014 in Data Preparation, Ebola, Jeopardy, Surveys, Top stories
Sports Analytics Innovation Summit 2014 San Francisco: Day 2 Highlights
Highlights from the presentations by Analytics leaders from San Francisco Giants, New York University and LA Dodgers on day 2 of Sports Analytics Innovation Summit 2014 in San Francisco.
on Oct 11, 2014 in Analytics, Conference, Data, IE Group, Metrics, NBA, San Francisco-CA, Sports, Statistics
Salaries in IT – Scrape, refine, and plot case study
Very good case study, showing how to scrape with import.io, refine with OpenRefine, and plot with Plot.ly. Also learn about salaries vs age in Belgium.
on Oct 11, 2014 in Belgium, Data Preparation, Data Visualization, import.io, OpenRefine, Plotly, Salary
Top KDnuggets tweets, Oct 8-9: Clinical data determines only 10% of health; Kaggle hero 100-line Python code
IBM #Watson presentation: Clinical data determines only 10% of health; A @Kaggle hero 100-line Python code for online logistic regression; The Winner of Kaggle Criteo Data Science on his Odyssey; For Data Viz lovers: Keynote by Tableau CEO Christian Chabot on "Art of Analytics".
on Oct 10, 2014 in Healthcare, Kaggle, Python, R, Tableau, Watson
Book: Modern Optimization with R
Learn the most relevant concepts related to modern optimization methods and how to apply them using multi-platform, open source, R tools in this new book on metaheuristics.
on Oct 10, 2014 in Book, Open Source, Optimization, Paulo Cortez, R, Springer
Sports Analytics Innovation Summit 2014 San Francisco: Day 1 Highlights
Highlights from the presentations by Analytics leaders from San Francisco 49ers, United States Olympic Committee, and Chelsea FC on day 1 of Sports Analytics Innovation Summit 2014 in San Francisco.
on Oct 10, 2014 in Analytics, Conference, IE Group, NFL, Olympic, San Francisco-CA, Sports
Develve statistical software, free for non-commercial use
Check out Develve 2.0, a six-sigma tool, the new version featuring new utilities for measure system analysis and the design of sophisticated experiments.
on Oct 10, 2014 in Experimentation, Free Software, Statistical Modeling
September 2014 Analytics, Big Data, Data Mining Acquisitions and Startups Activity
September 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Hootsuite, eBay - Paypal, MemSQL/In-Q-Tel, Qualtrics, SingTel, Radius, Numerify, DataStax, Nielsen/Indicus, Mail.ru/VKontakte, Teradata / Think Big Analytics.
on Oct 9, 2014 in Companies, eBay, Hootsuite, PayPal, Startups, Teradata
Deep Learning RNNaissance, an insightful, comprehensive, and entertaining overview
Watch this great overview of history and present state of Deep Learning, which is revolutionizing Machine learning, vision, robotics, and many other areas.
on Oct 9, 2014 in Deep Learning, Jurgen Schmidhuber, Machine Learning, Neural Networks, Recurrent Neural Networks
Big Data & Analytics for Retail Summit 2014 Chicago: Day 2 Highlights
Highlights from the presentations by Big Data leaders from The Hershey Company, Gongos, Clarks, and Mediacom on day 2 of Big Data & Analytics for Retail Summit 2014 in Chicago.
on Oct 9, 2014 in Big Data, Business Analytics, Chicago-IL, Conference, IE Group, Location Analytics, Retail, Strategy, Text Mining
SPOTLIGHT: Can Data Science Save Humanity from Mosquitoes and other Deadly Insects? #2
KDnuggets launches Spotlight initiative to bring attention to academic research. The journey begins with Prof. Eamonn Keogh, UCR and his talented student, Yanping Chen, who are applying data mining to save us all from insect-vectored diseases.
on Oct 9, 2014 in Acoustics, Andrew Ng, Data Mining, Interview, Machine Learning, Research, Time Series, UC Riverside, Yanping Chen
Predictive Analytics Innovation Summit, Chicago, Nov 12-13
Join other data scientists and decision-makers to learn about practical Predictive Analytics from top companies like Amazon, Intel, Twitter, Verizon, and many others. KDnuggets discount.
on Oct 9, 2014 in Chicago-IL, IE Group, Predictive Analytics, Summit
Perfume, computer programming, and Harvard
What is the connection between Perfume, computer programming, and Harvard education? Peter Bruce explains.
on Oct 8, 2014 in edX, Harvard, Programming, Statistics.com
Big Data & Analytics for Retail Summit 2014 Chicago: Day 1 Highlights
Highlights from the presentations by Big Data leaders from Sony Pictures Entertainment, Macy's and Nuevora on day 1 of Big Data & Analytics for Retail Summit 2014 in Chicago.
on Oct 8, 2014 in Big Data, Business Analytics, Chicago-IL, Conference, IE Group, Macy's, Retail, Sony
SPOTLIGHT: Can Data Science Save Humanity from Mosquitoes and other Deadly Insects?
KDnuggets launches Spotlight initiative to bring attention to academic research. The journey begins with Prof. Eamonn Keogh and his student, Yanping Chen, who are applying data mining to save us all from insect-vectored diseases.
on Oct 8, 2014 in Data Mining, Eamonn Keogh, Entomology, Interview, Mosquitoes, Skills, Time Series, UC Riverside
Top KDnuggets tweets, Oct 6-7: Great TED talk by @KnCukier “Big Data is better data”; Top 10 One-Person Startups
Great TED talk by @KnCukier "Big Data is better data"; Top 10 One-Person Startups; 7 critical elements of effective dashboards and visualizations; Making Sense of Public Data - Wrangling Jeopardy.
on Oct 8, 2014 in Dashboard, Data Preparation, Data Wrangling, Jeopardy, Kenneth Cukier, Startups, TED
KDnuggets 14:n26, Analyzing Ebola and World Events; Mirador, a free tool for visual exploration
Is Ebola spreading exponentially? Mirador, free data exploration tool; GDELT data on World Events; Surveys may assess language more than attitudes; KDnuggets Pass to Big Data TechCon; 17 jobs, and much more.
on Oct 8, 2014 in Data Visualization, Ebola, Mirador, Surveys
PAW: Predictive Analytics World London, Oct 29-30
PAW focuses on concrete examples of deployed predictive analytics - go to PAW London to learn exactly how top practitioners deploy predictive analytics, and the business impact it delivers.
on Oct 7, 2014 in London-UK, PAW, Predictive Analytics World
Webinar: Data Mining: Failure to Launch [Oct 16]
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is Oct 16.
on Oct 7, 2014 in Data Mining, Failure to Launch, TMA
Making Sense of Public Data – Wrangling Jeopardy
Trifacta’s Alon Bartur & Will Davis detail their process for transforming or “wrangling” publicly available Jeopardy data found on the web for downstream analysis.
on Oct 7, 2014 in Data Preparation, Data Processing, Data Science Platform, import.io, Jeopardy, Trifacta
Top stories in September: Data Science is mainly a Human Science; Hiring Data Scientists: What to look for?
Data Science is mainly a Human Science; Hiring Data Scientists: What to look for?; Most Viewed Machine Learning Talks at Videolectures; Neural Networks and Deep Learning, free online book (draft).
on Oct 7, 2014 in Coursera, Data Science, edX, Hiring, Machine Learning, Neural Networks, Top stories, Videolectures
Request: Top Business Analytics Journals?
For a young business school professor in business analytics, what are the five to eight A-level journals in which he/she should try to publish?
on Oct 7, 2014 in Bruce Golden, Business Analytics, Business School, Journal
Top KDnuggets tweets, Oct 3-5: Best Programming Languages for Machine Learning; Analyzing Ebola
Best Programming Language for Machine Learning: R, Python, MATLAB - when yo use what; Analyzing Ebola - Is it spreading at exponential rate?; 31,000 people/hour are joining the new private social network Ello; Booking: Data Scientist.
on Oct 6, 2014 in Ebola, Ello, Machine Learning, MATLAB, Octave, Python, R
Upcoming Webcasts on Analytics, Big Data, Data Science – Oct 7 and beyond
Evolution of Classification, Billion Dollar Fraud Detection, Big Data Visualization, Deep Learning on Apache Spark, and more.
on Oct 6, 2014 in Apache Spark, Classification, Data Visualization, Deep Learning, Fraud Detection
Competition: Forecasting social network dynamic graph
Forecast the creation and disruption of edges in social networks representing social platforms, mobile clients, or the research community. Competition runs until Nov 28.
on Oct 6, 2014 in AlgoMost, Competition, Graph Analytics, Social Networks
Learn how Sparkling Water brings H2O Deep Learning to Apache Spark, Oct 29 Webinar
Sparkling Water is the latest innovation to combine two best-of-breed open source technologies Apache Spark and H2O. Learn how to setup your own Sparkling Water environment at Oct 29 Webinar.
on Oct 6, 2014 in Apache Spark, Deep Learning, H2O, Mountain View-CA
Webcast: Analytically Speaking Featuring Dan Ariely
Dan Ariely, author of several best-sellers, and TED speaker watched by millions, uses simple experiments to study how people actually act when making real-life decisions. Watch his "Analytically Speaking" webcast.
on Oct 5, 2014 in Analytically Speaking, Dan Ariely, JMP
Interview: Toni Jones, U-Haul on Deriving Business Insights from Social Media
We discuss social media strategy at U-Haul, the key drivers of a social media campaign, identifying what data to focus on, important metrics, career advice and more.
on Oct 5, 2014 in Advice, Analytics Strategy, Data, Interview, Marketing, Social Media Analytics, Toni Jones, U-Haul
Data science shows surveys may assess language more than attitudes
Breakthrough research shows that current data science approaches may not just supplant traditional surveys (as seen through the Facebook experiments), but also suggests that the last 70 years of foundations for survey science require re-examination.
on Oct 5, 2014 in Data Science, language, PLOS, Semantic Analysis, Surveys
Top stories for Sep 28 – Oct 4: Mirador, a free tool for visual exploration of complex datasets
Mirador, a free tool for visual exploration of complex datasets; Data Science is mainly a Human Science; Get Started in Text Analytics; Associations and Text Mining of World Events.
on Oct 5, 2014 in Association Rules, Data Visualization, GDELT, Mirador, Text Mining, Top stories
Analyzing Ebola – Is it spreading at exponential rate?
We examine how fast Ebola actually spreads in West Africa, and find a very different situation in Liberia, Guinea, and Sierra Leone. Unfortunately, recently it does spread exponentially.
on Oct 4, 2014 in CDC, Ebola, Guinea, Liberia, Sierra Leone
KDnuggets Free Pass to Big Data TechCon How-To Conference, Oct 27-29, San Francisco
Win a free KDnuggets Pass for Big Data TechCon in San Francisco - the conference to learn HOW-TO accommodate the terabytes and petabytes of data, learn the latest big data technologies, mingle and network.
on Oct 3, 2014 in Big Data, Free Pass, San Francisco-CA, Techcon
Big Data and Humanitarian Efforts
Discover ways that big data is impacting and improving humanitarianism, including ways it crowdsources accounts of events and disaster relief in the Philippines.
on Oct 3, 2014 in Big Data, Crowdsourcing, Haiti, Kenya, Natural Disasters, Philippines, Social Good
Top KDnuggets tweets, Oct 1-2: R wizard Hadley Wickham “Advanced R” book online
R wizard Hadley Wickham "Advanced R" book online; IPython Interactive Computing and Visualization Cookbook; Mirador, a tool for visual exploration of complex datasets; The End of the (Human) Data Scientist Bubble?
on Oct 3, 2014 in Bubble, Data Scientist, Data Visualization, Hadley Wickham, IPython, Mirador, R
Mirador Open Data Competition
This competition wants to get people interested in data-driven hypothesis making by offering a tool that is as intuitive and engaging as possible, while maintaining solid scientific and statistical foundations. Submissions due Oct 28.
on Oct 2, 2014 in Competition, Data Visualization, Healthcare, Mirador, Open Data
Upcoming Oct – Apr Meetings in Analytics, Big Data, Mining, Data Science
Coming soon: PAW Boston, PAW Health, Strata + Hadoop World NYC, SAS Analytics 2014, IEEE Big Data, Big Data TechCon, PAW London, Text Analytics Summit West, Boston Data Festival, Data Analytics Week, and many more.
on Oct 2, 2014 in Big Data Summit, Boston-MA, London-UK, PAW, Predictive Analytics World, San Francisco-CA
Text Analytics Summit West, San Francisco, Nov 4-5
Text Analytics West Summit (San Francisco, November 4-5) has an exceptional speaker roster and insightful agenda. Learn from LinkedIn, Toyota, Blue Shield, Mozilla, Twitter, and others. Reg by Oct 10 for KDnuggets Discount.
on Oct 2, 2014 in Data-Driven Business, San Francisco-CA, Text Analytics
One-handed Keystroke Biometric Identification Competition
Build a biometric keystroke classifier in this new competition to help identify the features that best predict one-handed typing samples. The prize for first place is a fingerprint scanner.
on Oct 2, 2014 in Biometrics, Classification, Competition, Identification, Python
Top KDnuggets tweets, Sep 29-30: Machine learning #cheatsheet; Can you find McDonald faster than a computer?
Machine learning #cheatsheet on github; Great list of resources on Cloud Computing, #BigData; Excellent (but scary) demo: Can you find the nearest McDonald faster than a computer? Top 10 presentations about data science / #BigData on SlideShare.
on Oct 1, 2014 in Cheat Sheet, Julia, Juno, Machine Learning, McDonalds, NoSQL, SlideShare
Mirador, a free tool for visual exploration of complex datasets
Mirador is an open-source tool for visual exploration of complex datasets, enabling users to discover correlation patterns and derive new hypotheses from the data. Download Windows and Mac OS X versions from Github.
on Oct 1, 2014 in Ben Fry, Data Visualization, GitHub, Mirador, Open Source
Additions to KDnuggets Directory in September
Boston DataFest and Analytics Street, Big Data Summits, PAKDD, and more meetings, DecisionIQ and Quantum Data Science, BS in Data Science from Ottawa, MS from RPI, and more.
on Oct 1, 2014 in Added to KDnuggets, Big Data, Boston-MA, Data Science Education