Seeking industry practitioners to develop and teach courses in the areas of Data Mining, Data Science or Business Analytics. We are interested in faculty with advanced degrees and experience teaching courses on-site, on-line or in a blended format.
The Mathematical Shape of Big Science Data - new calculus of network analysis; Great read: HP Guide to NoSQL explains CAP theorem, MapReduce, new RDBMS systems; 10 rules for reproducible computation research (and data science); Strata #BigData Conference + Hadoop World 2013 in NYC - watch keynotes live
Chordalysis is a log-linear analysis method for big data, which exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures (aka chordal graphs).
SiSense unveils Crowd Accelerated Analytics, which gets faster with more users, who benefit from each other queries. SiSense showcases In-Chip 2.0 technology at Strata + Hadoop World and opens NYC headquarters.
Waterfront International, Toronto-based quantitative finance research firm, specializing in statistical trading strategies, is looking for a talented analyst to perform sentiment analysis using various news data feeds, to find patterns, build models, and derive insight.
Predictive Analytics World is the leader in educating and supporting the greater analytics industry. The upcoming San Francisco event (Mar 16-20) will feature killer keynotes, conference sessions and networking, and 6 great workshops. Special KDnuggets discount
Free Book: Theory and Applications for Advanced Text Mining; SAS CEO Jim Goodnight says #BigData hype manufactured by analysts and media; Big Data is not enough for better decisions - you need to connect diverse data; 0xdata releases H2O, open-source fast machine learning engine for #BigData
The data revolution requires an equal revolution in statistical methods, software, education, and collaborations with natural sciences, social sciences, and industry. Listen to some of the brightest minds, including Hilary Mason, Hadley Wickham, and Sinan Aral.
Data Factory is a Chrome extension for quick access to import.io library of web APIs, for converting a web page into a table of data. import.io launched Data Factory with 1,000 public APIs and plans to release 10,000 by end of 2013.
Coordinating and managing all data design and analysis-related functions, planning and managing complex data-related projects, coordinating access to key healthcare and quality data sources. Apply by Nov 8.
Predict major life events based on a unique anonymized dataset from a major financial services company that allows for an unprecedented view into customer-company interaction. More in a webinar on Nov 22.
Automating the Black Art and "Oral traditions" of Deep Learning; Top 10 Ways You Know You're a Data Scientist - very funny; LIONbook Chapter 11: Democracy in machine learning - how to combine different models
The 7th Sentiment Analysis Symposium, March 5-6, 2014 in New York City, will feature presentations, panels, and workshops covering Digital Measurement, Intelligent Customer Experience, Sentiment Analysis, and Speech and Text Analytics. Call for speakers until Oct 28.
Develve statistical software (beta), written by Frank Pauw, aims for a direct experience of your data, with no deep hidden menus, making all functions directly accessible, and results directly visible.
7 Steps for Learning Data Mining and Data Science; Predictive Analytics in China; Exclusive: Cognitive Mining and Data Mining; Top jobs: Senior Data Scientist - Discovery and Personalization at Netflix; Applied Data Scientist at Intel
Hans-Peter Kriegel is recognized for his outstanding contributions to data mining and knowledge discovery research over a wide range of topics including clustering, outlier detection and high-dimensional data analysis.
Geoff Webb has been an active promoter of ICDM conference since its inception, and has many contributions to the entire data mining field, as Editor-in-Chief of Data Mining and Knowledge Discovery journal, as organizer and PC member of many top conferences, and as an active researcher in data mining.
Data Science Toolkit on AWS Marketplace; LinkedIn Top Scientist @dtunkelang on How to Interview a Data Scientist; Intel: Applied Data Scientist, Graph Analytics, Big Data Analytics; BBVA Innova Data Mining Challenge, 1st time bank releases anonymized card transaction
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 11 looks at Democracy in machine learning - how to combine different models in flexible, creative and effective ways.
Join a passionate team of entrepreneurs, be on the cutting edge of big data development and predictive modeling, interpret health data to bridge the gap with hospitals, health systems, payers, and patients.
The U. of Helsinki, a top ten European university, seeks faculty in data management including distributed DB, management of big data systems, data models and data description languages. Apply by Oct 31.
Tutorial: The Naive Bayes Text Classifier; How Quantum Computers and Machine Learning Will Revolutionize #BigData; See how easy it is to find patterns in random data; Applied Data Science - free, self-guided online course
BabelNet is a multilingual "encyclopedic dictionary" and a multilingual ontology created by mapping the Wikipedia to WordNet, the top English computational lexicon, and by integrating other lexical resources such as OmegaWiki and the Open Multilingual WordNet.
This part-time MSc program designed for working professionals will give students thorough knowledge of analytics techniques, and the ability to apply them to real-world and business scenarios. Apply by Nov 13 for Winter 2014 start.
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 10 looks at Statistical Learning Theory and Support Vector Machines (SVM).
This course introduces Data Science content in self-guided learning format - work your way through the course at your own pace. This course is free and a select number of participants will be invited to try it on October 28.
Python extensions for machine learning and Data Science; Google releases new R package HistogramTools for #BigData; Top news, Oct 6-12: 3 Free Big Data books on Amazon; 7 Steps for Learning Data Mining; Twitter Analytics: A Beginner's Guide
3 Free Big Data books on Amazon; 7 Steps for Learning Data Mining and Data Science; Circle of Trust and Google Plus; Top jobs: Big Data/Econometric Internship at Democrats for Education Reform; Data Mining Scientist at Apple
To keep large scientific data for long periods of time special-purpose technologies and expertise are required. That is the purpose of Corral big data repository, which is celebrating addition of 100th unique scientific research collection.
Gregory Piatetsky outlines 7 Steps for learning Data Mining and Data Science; 5 Data Science Deadly Sins: Cherry Picking, Confirmation Bias, Data Selection Bias ...; Great job for data scientist who loves to travel; New algorithm mines your Twitter stream, finds most significant events
Join the best corporate analytics practitioners from companies like Bank of America, LinkedIn, and Express Scripts at the Text Analytics Summit West, Dec 3-4 in San Francisco - see details and get KDnuggets discount.
Despite big investments, BI projects often fail to deliver, and traditional waterfall methods have proven ineffective. The iterative approach proposed here outlines how to break large projects into more manageable pieces, and uses the idea of a "parking lot" of value-adding features.
A self-motivated, high-energy and organized graduate student with experience in data analysis, statistical analysis, modeling and techniques to assist DFER in an initial build-out and implementation of a robust data-centric political platform and smaller research projects.
Build in-demand skills for the growing analytics field, prepare for leadership-level career opportunities, learn from distinguished faculty and industry experts. Winter quarter application deadline: Oct 15.
Does Big Data imply "You have collected all there is - all the data there is about a phenomenon". I strongly disagree with this quote from Viktor Mayer-Schonberger and Kenneth Cukier book on Big Data - here is my letter to the editor.
Sample source code for various data science tasks and projects; To Hadoop or Not to Hadoop? Questions to determine if you need Hadoop; Big Data experts get big salaries - $115K on average; Data Mining reveals the emotional differences in emails written by Men and Women
Data Mining and Analysis: Fundamental Concepts and Algorithms, free PDF download (draft); Statistical Modeling: The Two Cultures, by Leo Breiman; To Hadoop or Not to Hadoop?
Top jobs: Sr. Data Mining Analyst at Genworth Financial, Richmond, VA; Data Mining Scientist at Apple, Austin, TX;
Top 5 most used tools were R (used by 70% of data miners), IBM SPSS Statistics, Rapid Miner, SAS, and Weka, while STATISTICA, KNIME, SAS JMP, IBM SPSS Modeler, and RapidMiner had the the highest satisfaction. Big Data is actually used only in a small fraction of projects.
The September 2013 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: SAP buys KXEN, Rocket Fuel IPO, Clarabridge Indisys, Narrative Science, Practice Fusion and more.
Data Science with R: Getting Started with Rattle - a survival guide; KDD 2013 videolectures: the top researchers in Data Mining, Data Science; Statistical Modeling: The Two Cultures, by Leo Breiman; Social Media Analytics, free e-book, an overview of theory, applications, and economics
Kiji is an open source framework for building big data apps with Apache HBase, launched by WibiData to fill the gap between a key-value store functionality and the needs of a predictive modeling application.
The School of Information at U. of Texas at Austin looks for full-time, tenure-track junior and senior faculty, especially in the areas of data analytics, human-computer interaction, and archival studies.
Many interesting upcoming meetings in Q4 2013, including Discovery Science, IEEE Big Data, ACM Mining Big Data Camp, Big Data Techcon, SAS Analytics 2013, PAW London, Strata + Hadoop World NYC, AusDM, Big Data Festival, Text Analytics Summit West, ICDM 2013, Toronto Data Marketing Conference, and many more.
Thasos, founded by top scientists from MIT Media Lab and Sense Networks, combines and analyzes non-financial Big Data sources in order to measure real-time company fundamentals and macro-economic developments.
Thasos, founded by top MIT scientists, combines and analyzes non-financial Big Data sources in order to measure real-time company fundamentals and macro-economic developments. Expertise with Hadoop, distributed file systems and large-scale datasets needed.
Many great courses, including Text Analytics and Sentiment Mining, Data Mining: Principles and Best Practices, Supercomputer Data Mining Boot Camp, Survival Analysis, Net lift (Uplift) models, Machine Learning, and Predictive Analytics and Data Mining Model Development and Strategic Implementation.
There are two cultures in the use of statistical modeling to reach conclusions from data. One assumes that the data are generated by a given stochastic data model. The other uses algorithmic models and treats the data mechanism as unknown - read the full paper.
New Book: Data Mining and Analysis: Fundamental Concepts and Algorithms, free PDF dow; Random Forests Algorithm - what is it, why does it work so well; Penn researchers use Facebook data to predict users age, gender, personality; Google Hummingbird is a completely new search algorithm and incredibly no one noticed
KDnuggets Cartoon: Next Trend after Big Data; New Poll: Has Big Data Reached the Hype Peak and is due for Decline and Disillusionment?; edX: Learning from Data, free online course
Top jobs: Data Mining Scientist at Apple, Austin, TX; Machine Learning Scientists at Amazon, Bangalore, India;