The most popular languages continue to be R (used by 61% of KDnuggets readers), Python (39%), and SQL (37%). SAS is stable at around 20%. The highest growth was for Pig/Hive/Hadoop-based languages, R, and SQL, while Perl, C/C++, and Unix tools declined. We also find a small affinity between R and Python users.
Stanford Data Mining and Statistics Online Courses; Data Scientists Guide to Making Money from Start-ups; 2013 Acquisitions in Analytics and Big Data
Top jobs: Research Scientist: Data Mining at Bethesda company, Bethesda, MD; Data Mining Programmer at Real Time Data Solution, Toronto, Canada
Could a modestly funded group deliver nation-state-type effects using only public data? This DARPA SBIR call seeks to investigate the US national security threat posed by public data and to develop tools that characterize and assess the nature, persistence, and quality of that data. Opens Aug 26, closes Sep 25, 2013.
The focus of this competition is on applying knowledge discovery techniques to protect personal computer information by detecting, preventing, and responding to various attacks.
Nate Silver at JSM: 11 statistics principles for journalists; Mining a Data Mining Conference: Analytics on KDD-2013; Coursera Andrew Ng: Education for Everyone
Top jobs: Software Developer, Machine Learning at SGI; Data Mining Programmer at Real Time Data Solution
My report on the KDD-2013 keynote talk by Coursera co-founder Andrew Ng on Coursera's far-reaching experiment in education, which collected more educational data in one year than all the universities in the history of mankind. Andrew Ng believes that great education should not be only for the privileged but should be a fundamental human right.
How should data scientists think about starting or joining a start-up? We summarize the advice from a high-powered KDD-2013 panel of leading data scientists and entrepreneurs who share their start-up experience.
The Age of Big Data - BBC Documentary; 10 Enterprise Predictive Analytics Platforms Compared; RapidMiner and Big Data - In-Memory, In-Database, and In-Hadoop
Top jobs: Data Mining, Research SDE at Bing; Analyst - Web Commerce/Marketing at UFC.
Develop novel thinking for fusion of background radiation measurements, GPS, high-resolution video, and LIDAR and propose algorithms for detection, localization, and identification of radiation anomalies. Submissions due Aug 26.
IKANOW Infinit.e is a scalable framework for collecting, storing, processing, retrieving, analyzing, and visualizing unstructured documents and structured records. It is available as a free community edition and an enterprise edition, and it includes a developer API.
The Age of Big Data - BBC Documentary; McKinsey eBook (free): Big Data, Analytics, and the Future of Marketing and Sales; Data: Portals, Government, State, City, Local, and Public
Top jobs: Data Scientist, Strategic at Groupon, Palo Alto, CA; Data Scientist at Groupon, Seattle, WA
Lavastorm Analytics Engine breaks down data silos, giving business users the ability to acquire, integrate, and analyze data 10 times faster than traditional tools. As a first step, read "Breaking Through the Analytics Limitations of Access and SQL" and try our Lavastorm Free for Life software yourself.
KDnuggets Big Data Science Summer Reading List; DataMind: FREE Online Interactive Learning Platform for R; 5 Roles You Need on Your Big Data Team
Top jobs: Statisticians at AIG; PhD Student, Mixing Meta-Modeling and Data-Mining