KDnuggets™ News 15:n06, Feb 25: My brief guide to Big Data; Data Scientist 3 wishes; Active Data Mining Blogs
My Brief Guide to Big Data; Cartoon: Data Scientist gets 3 wishes for Valentine Day; Active Data Mining, Data Science blogs; Gartner 2015 Magic Quadrant for Advanced Analytics - gainers and losers.
Features | Software | Opinions | Interviews | News | Webcasts | Courses | Meetings | Jobs | Academic | Publications | Tweets | CFP | Quote
Features blue line corresponds to post views, red line to post shares
My Brief Guide to Big Data and Predictive Analytics for non-experts - Feb 12, 2015.
My brief guide to Big Data and Predictive Analytics for non-experts suggests key books, films, and websites to learn more.- Cartoon: Data Scientist gets 3 wishes for Valentine's Day
- Feb 13, 2015.
New KDnuggets cartoon imagines what could happen if a Big Data genie would grant a romantic Data Scientist 3 wishes for a Valentine's Day.
Active Data Mining, Data Science blogs - Feb 16, 2015.
Here are 85 or so active (recently updated) data mining, data science, and machine learning blogs.- Gartner 2015 Magic Quadrant for Advanced Analytics Platforms: who gained and who lost - Feb 23, 2015.
SAS, IBM, KNIME, and RapidMiner lead in Gartner 2015 Magic Quadrant for Advanced Analytics Platforms. We analyze who gained and who lost versus last year.
History of Data Science Infographic in 5 strands - Feb 17, 2015.
History of Data Science infographic presents key events in Data Science across 5 strands: Computer Science, Data Technology, Visualization, Mathematics/OR, and Statistics.
Automatic Statistician and the Profoundly Desired Automation for Data Science - Feb 17, 2015.
The Automatic Statistician project by Univ. of Cambridge and MIT is pushing ahead the frontiers of automation for the selection and evaluation of machine learning models. In general, what does automation mean to Data Science?- PAW San Francisco: Learn Uplift Modeling - Feb 24, 2015.
The analytical method to optimize for influence is uplift modeling (aka persuasion modeling) and its adoption is rapidly growing. Learn it in two sessions and a full-day training workshop at PAW Business, San Francisco, Mar 29 - Apr 2, 2015. KDnuggets discount.- Cartoon: Data Scientist gets 3 wishes for Valentine's Day
Software (see also All Software )
- Ontotext: Integrated Text Mining and Triplestores, a form of graph database - Feb 12, 2015.
Learn about 2 hot trends: RDF triplestores, a form of graph database, and the use of text mining to extract meaning from Big Data, and how Ontotext enables both. Free eval, Feb 26 webinar, and more. - Prismatic Interest Graph [API]: Organize and Recommend Content - Feb 20, 2015.
Prismatic Interest Graph API provides a set of tools for automatically analyzing unstructured text and annotating it with a variety of tags that are useful for organizing and recommending content. - Google BigQuery Public Datasets - Feb 20, 2015.Google BigQuery is not only a fantastic tool to analyze data, but it also has a repository of public data, including GDELT world events database, NYC Taxi rides, GitHub archive, Reddit top posts, and more.
- Fun and Top! US States in 2 Words using twitteR - Feb 19, 2015.
Combining twitteR package with text mining techniques and visualization tools can produce interesting outputs. Find out which US state is fun and top, and which is good and crazy, according to Twitter. - Tamr Enterprise Platform for Scalable, End-to-End Data Unification - Feb 17, 2015.
The new Tamr Platform radically simplifies and speeds the availability of unified data for analytics and downstream application, with key new features: catalog, connect, and consume. Tamr also announced solutions for Pharma and Procurement. - Tinderbox: Automating Romance with Tinder and Eigenfaces - Feb 15, 2015.
Tinderbox is a software uses machine learning and image recognition to automate Tinder, a popular app for single meetings. The author describes his experience and feedback until it started to work too well.
Opinions (see also All Opinions for this month )
- Big Data, Privacy, and Security - which side are you on? - Feb 18, 2015.
After all the positive promise, the hype, and predictions about Big Data, 2015 started with a debate about privacy and specifically whether or not companies like Google and Facebook should be allowed to encrypt their users data. - Data Mining finds JASBUG, a Critical Security Vulnerability - Feb 17, 2015.
We explain how the critical Microsoft security vulnerability JASBUG that existed for 15 years was detected with similarity search and regular expression inference.
Interviews (see also All Interviews for this month )
- Interview: David Kasik, Boeing on Data Analysis vs Data Analytics - Feb 23, 2015.
We discuss the impact of increasing amount of data on visualization, difference between Data Analysis and Data Analytics, motivation, trends, desired skills and more. - Interview: David Kasik, Boeing on How Visual Analytics is Improving Aviation Safety - Feb 16, 2015.
We discuss data visualization at Boeing, the importance of Visual Analytics, Aviation Safety improvement through Analytics and augmented reality. - Interview: M.C. Srivas, CTO, MapR on Data Agility - The Next Frontier of Big Data - Feb 12, 2015.
We discuss the competitive differentiation of MapR, challenges in consumerizing Big Data, trends, strategy recommendations, desired skills and more. - Interview: M.C. Srivas, MapR on Demystifying the Art of Processing Massive Data - Feb 11, 2015.
We discuss the launch and evolution of MapR, achievements, key characteristics of MapR-DB, significance of Apache Drill, MapR use cases and more.
News (see also All News )
- January 2015 Analytics, Big Data, Data Mining Acquisitions and Startups Activity - Feb 17, 2015.January acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: Microsoft buys Revolution Analytics, Why dunnhumby is worth billions, GraphLab renames to Dato, raises $18M, MongoDB raised $80M, and more.
- Top stories for Feb 15-21: 10 things statistics taught us about big data analysis; History of Data Science in 5 strands - Feb 22, 2015.
My Brief Guide to Big Data and Predictive Analytics for non-experts; 10 things statistics taught us about big data analysis; History of Data Science Infographic in 5 strands; Automatic Statistician and the Profoundly Desired Automation for Data Science. - Top stories for Feb 8-14: 10 things statistics taught us about Big Data; Data Science Most Confused Jargon - Feb 15, 2015.
10 things statistics taught us about big data analysis; Data Science's Most Used, Confused, and Abused Jargon; Top 30 people in Big Data and Analytics; Cartoon: Data Scientist 3 wishes for Valentine Day. - Top /r/MachineLearning Posts, Feb 15-21: The Elephant in the Room of ML Research - Feb 24, 2015.
Problems with deep learning papers, Coursera linear algebra courses, Reddit comment visualizations, deep learning lectures, and genetic algorithm introductions make up the top posts this week on /r/MachineLearning. - Top /r/MachineLearning Posts, Feb 8-14: Automating Tinder, Statistics and Machine Learning - Feb 17, 2015.
Automating Tinder with Eigenfaces, statistics lessons in big data analysis, an upcoming AMA, the basics of PCA, and neural network programming in Python are all topics covered in the last week on Reddit. - Top /r/MachineLearning posts, January - Feb 13, 2015.
Talking Machines, SVM lectures, a new Stanford statistical learning online course, and a listing of open-source datasets top the most popular Reddit posts on /r/MachineLearning for the month of January. - TEDx RheinMain Datanauts Competition - Feb 17, 2015.
Send your project idea to TEDx RheinMain "Datanauts" Competition. It should fit to one of the following categories: mobility, environment, culture, common Good.
Webcasts and Webinars (see also All Webcasts and Webinars )
- Upcoming Webcasts on Analytics, Big Data, Data Science - Feb 24 and beyond - Feb 23, 2015.
Winning with Big Data Analytics, a Roadmap for Data-Driven Culture, Data Science for Workforce Optimization, Text Mining and Knowledge Graphs in the Cloud, Performance and Scale Options for R with Hadoop, and more.
Courses (see also All Courses )
- PAW San Francisco: Learn Uplift Modeling - Feb 24, 2015.
The analytical method to optimize for influence is uplift modeling (aka persuasion modeling) and its adoption is rapidly growing. Learn it in two sessions and a full-day training workshop at PAW Business, San Francisco, Mar 29 - Apr 2, 2015. KDnuggets discount. - Statistical Learning and Data Mining III: 10 Hot Ideas for Learning from Data, Mar 19-20, Palo Alto - Feb 23, 2015.
Taught by top Stanford professors and leading statisticians Trevor Hastie and Robert Tibshirani, this course presents 10 hot ideas for learning from data, and gives a detailed overview of statistical models for data mining, inference and prediction. - Simplilearn Big Data and Analytics Online Courses - Feb 19, 2015.
Be Big Data Ready - get 30% Off Simplilearn Big Data and Analytics Online Courses with code FEB30A, valid till 28 Feb 2015. - Statistics.com courses on RESTful APIs - Feb 17, 2015.
Applying analytics to big data requires a mechanism to rapidly get and share data and RESTful APIs is the standard way doing it. Learn how to write Python code to ingest data, communicate with, and create RESTful APIs with online courses from Statistics.com. - NYC DSA Data Science Bootcamp,
June 1 - August 21 - Feb 13, 2015.
NYC Data Science Academy offers the highest quality in data science training, designed specifically around the skills employers are seeking, including R, Python, Hadoop, github, D3.js, raspberry pi and much more. - Statistics.com Online Data Science Courses and Certificates - Feb 12, 2015.
Accelerate your career and upgrade your skills With Statistics.com training provided by top experts who will answer your questions on a daily basis. Work on practical exercises with real problems, real data and multiple software tools. - Lipari Summer School: Algorithms, Data, and Models for Social and Urban Systems - Feb 12, 2015.
Lipari Summer School will address the role of GIS, social media, big social data, agent-based models, network models, and their integration in the study, design, and implementation of social and urban systems.
Meetings (see also All Meetings )
- Big Data TechCon, the HOW-TO conference, Boston, April 26-28 - Feb 24, 2015.
Plan now to attend Big Data TechCon, April 26-28 in Boston, to learn HOW-TO master and analyze Big Data. Learn Hadoop, Spark, Yarn, HBase, R, and Hive from the smartest, hardest-working faculty. Special discount. - Watch Keynotes
LiveRecorded: Strata + Hadoop World San Jose, Feb 19-20 - Feb 18, 2015.WatchliveFeb 19, 20 Keynotes! The Strata + Hadoop World, Feb 17-20, San Jose conference is sold-out again, but if you are not there, here is how you can watch the keynotes live on Feb 19 and Feb 20, 2015. - Big Data Innovation Summit, San Jose, Apr 28-29, 2015 - Feb 17, 2015.
The Summit will bring 800+ data practitioners for 7 business and technical focused stages, 70+ sessions, keynotes, workshops, panels and countless networking opportunities. Early Bird until Feb 27.
Jobs (see also All Jobs )
- Booking: Data Scientist - Machine Learning - Feb 23, 2015.
Work side by side with Developers, Designers and Product Owners to translate terabytes of data into unforgettable holidays for millions of people around the globe. Generous worldwide relocation package. - Booking: Data Scientist - General - Feb 23, 2015.
You will be working with stakeholders throughout the company to generate understanding, strategy and suggest actions based on data. Open to worldwide candidates - a generous relocation package available. - Megaputer: Data Analysis Consultant - Feb 20, 2015.
Create data analysis and reporting solutions for Megaputer customers with the help of PolyAnalyst(tm) platform: experimental, proof-of-concept, implementation, and production projects. Develop successful long-term relationships with customers. - Collective: Data Scientist - Feb 18, 2015.
Work on our first-of-its-kind sales analytics platform, which combines a proprietary, always-learning network with data-backed, predictive applications. - HP: Master Data Scientist - Feb 18, 2015.
Apply advanced analytics, data mining and statistical techniques to design and develop enterprise analytic solutions which are focused on specific industry solutions for HP clients. - Localytics: Data Scientist - Feb 12, 2015.
Build the future of mobile with Localytics. Named among the top places to work by The Boston Globe, we're changing mobile marketing and analytics through predictive modeling and machine learning. - Apple: iOS/OS X Data Analysis Data Scientist - Feb 11, 2015.
The iOS/OS X Data Analysis team analyzes and produces insights from diagnostic and usage data from iPhone, iPad, Apple Watch, and Macintosh systems. - Apple: Data Mining Scientist - Feb 11, 2015.
Apple Data Mining Lab looks for an outstanding data scientist to design, develop, and field data mining and data science solutions with measurable impact.
Academic and Research positions (see also All Academic positions )
- Syracuse University: Interdisciplinary Faculty - Feb 20, 2015.
The school hosts 5 research centers, including a new created Center for Computational and Data Sciences, which advances scholarships on computational data analytics. - UWS (U. of Western Sydney): Manager, eResearch - Feb 19, 2015.
Embed eResearch across the research lifecycle as an essential component of research; enhance, develop, promote and support the eResearch capabilities within UWS. Apply by Mar 4. - Bournemouth University: Professor (Associate) in Data Science and Analytics, Data Science Institute - Feb 16, 2015.
The Faculty of Science and Technology seeks an exceptional candidate for a research intensive position at a Professor or Associate Professor level to further our growth and reputation in Data Science. - SUTD: Postdoctoral Fellowship at MIT and SUTD - Feb 12, 2015.
The SUTD-MIT Postdoctoral Program offers unique research opportunities to highly talented individuals to engage in new or ongoing research programs at MIT and SUTD.
Publications
- Big Data Innovators Under 35 - Feb 11, 2015.
Young innovators in deep learning, interface design, and data science automation are all included in MIT Technology Review's Innovators Under 35 list.
Top Tweets (see also All top tweets for this month )
- Top KDnuggets tweets, January - Feb 24, 2015.
Good list of #MachineLearning Resources, #DeepLearning, Graphical Models;
Sample #MachineLearning solutions with R on #Azure ML Marketplace #rstats;
New book: Data Driven: Creating a Data Culture, by @dpatil, @hmason;
Intro to #Python and #IPython for #DataMining. - Top KDnuggets tweets, Feb 16-22 - Feb 23, 2015.
History of #DataScience across 5 strands;
Most Popular Coding Languages of 2015: #Python 31% ...;
#BigData reveals how information travels: 8 clusters in Europe;
New Face Detection Algorithm to revolutionize search: finding faces no longer unique to humans. - Top KDnuggets tweets, Feb 18-19 - Feb 20, 2015.
Practical #DataScience in #Python #MachineLearning - nice intro;
New Face Detection Algorithm to revolutionize search;
Well written: How to Transition from Excel to R;
Microsoft launches #Azure #MachineLearning Platform for #BigData, adds Python. - Top KDnuggets tweets, Feb 16-17 - Feb 18, 2015.
Most Popular Coding Languages of 2015: #Python 31%, Java 20%, C++ 9.8%;
History of #DataScience across 5 strands: CS, #Data, #Visualization, Math, Stats;
IBM Verse new messaging software will use #Watson to declutter your inbox;
Doctors store 1,600 digital #hearts for #BigData study. - Top KDnuggets tweets, Feb 9-15 - Feb 16, 2015.
Why limit yourself to "50 Shades of Grey?" R has 102;
Why Electric Cars Don't Have Better Batteries - a sad story of Envia;
More evidence that #sports is a goldmine for #MachineLearning;
Wedding with 200+ guests is 92% less likely to lead to divorce. - Top KDnuggets tweets, Feb 11-12 - Feb 13, 2015.
Romantic #DataScientist @crockpotveggies automates #Tinder with Eigenfaces;
My Brief Guide to Big Data and Predictive Analytics for non-experts;
#DataMining finds corruption is correlated with low income, low development MIT;
Hitachi buys Pentaho to extend Its #BigData footprint. - Top KDnuggets tweets, Feb 9-10 - Feb 11, 2015.
#BigData on Divorce: Wedding with 200+ guests is 92% less likely to end in divorce;
Should you teach #Python or R #rstats for #DataScience?;
Top 30 people in Big Data and Analytics;
Paris history, captured in its streets, visualized with R.
CFP - Calls for Papers (see also All Calls for Papers )
- Due Feb 27, Emerging Software as a Service and Analytics 2015 (ESaaSA 2015) , Lisbon, Portugal. 20-22 May 2015
- Due Mar 2, PAKDD 2015 Workshop: Data Analytics for Evidence-based Healthcare , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 2, PAKDD 2015 Workshop: Quality issues, measures of interestingness and evaluation of data mining models , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 2, PAKDD 2015 Workshop: Pacific Asia Workshop on Intelligence and Security Informatics (PAISI 2015) , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 2, PAKDD 2015 Workshop: 4th PAKDD workshop on Biologically inspired data mining techniques , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 2, PAKDD 2015 Workshop: The Third Int. Workshop on Vietnamese Language and Speech Processing , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 2, PAKDD 2015 Workshop: The 2nd workshop on Pattern Mining and Application of Big Data , Ho Chi Minh City, Viet Nam. May 19-22, 2015
- Due Mar 9, The 15th IEEE Int. Conf. on Data Mining (IEEE ICDM 2015), workshop proposals , Atlantic City, NJ, USA. Nov 14-17, 2015
- Due Mar 12, Data Analytics 2015, The Fourth Int. Conf. on Data Analytics , Nice, France. Jul 19-24, 2015
- Due Apr 3, NGDM'2015: BIG DATA FOR CONNECTED CARS AND IOT CONFERENCE , Novi, MI, USA
- Due May 18, WISE 2015: Web Information System Engineering , Miami, Florida, USA. Oct 18-20, 2015
- Due Jul 1, 2015 IEEE Int. Conf. on Big Data (IEEE Big Data 2015) , Santa Clara, CA, USA. Oct 29 - Nov 1, 2015
Quote
"Meet Scarledoopython - you did ask for Scarlett Johansson, Hadoop, and Python?" KDnuggets Valentine Day CartoonTop Stories Past 30 Days
|
|