Seeking industry practitioners to develop and teach courses in the areas of Data Mining, Data Science or Business Analytics. We are interested in faculty with advanced degrees and experience teaching courses on-site, on-line or in a blended format.
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 11 looks at Top-down clustering: K-means.
Watch a very interesting discussion of top statisticians and data scientists on the future of statistics. How can we optimize software for both cognitive and computational challenges?
The Mathematical Shape of Big Science Data - new calculus of network analysis; Great read: HP Guide to NoSQL explains CAP theorem, MapReduce, new RDBMS systems; 10 rules for reproducible computation research (and data science); Strata #BigData Conference + Hadoop World 2013 in NYC - watch keynotes live
Chordalysis is a log-linear analysis method for big data, which exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures (aka chordal graphs).
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is Nov 19.
Upping Your Analytic IQ, CAP Certification, The New World of Data Visualization, The Economics of SQL on Hadoop, Analytically Speaking, SPSS Modeler Courses, Data Mining: Failure to Launch, and more.
SiSense unveils Crowd Accelerated Analytics, which gets faster with more users, who benefit from each other queries. SiSense showcases In-Chip 2.0 technology at Strata + Hadoop World and opens NYC headquarters.
Stevens Capital Management LP, a large hedge fund, seeks talented and motivated analyst to conduct text mining and sentiment analysis on large news data to identify useful patterns and insight.
Waterfront International, Toronto-based quantitative finance research firm, specializing in statistical trading strategies, is looking for a talented analyst to perform sentiment analysis using various news data feeds, to find patterns, build models, and derive insight.
Predictive Analytics World is the leader in educating and supporting the greater analytics industry. The upcoming San Francisco event (Mar 16-20) will feature killer keynotes, conference sessions and networking, and 6 great workshops. Special KDnuggets discount
Free Book: Theory and Applications for Advanced Text Mining; SAS CEO Jim Goodnight says #BigData hype manufactured by analysts and media; Big Data is not enough for better decisions - you need to connect diverse data; 0xdata releases H2O, open-source fast machine learning engine for #BigData
The data revolution requires an equal revolution in statistical methods, software, education, and collaborations with natural sciences, social sciences, and industry. Listen to some of the brightest minds, including Hilary Mason, Hadley Wickham, and Sinan Aral.
Data Factory is a Chrome extension for quick access to import.io library of web APIs, for converting a web page into a table of data. import.io launched Data Factory with 1,000 public APIs and plans to release 10,000 by end of 2013.
Coordinating and managing all data design and analysis-related functions, planning and managing complex data-related projects, coordinating access to key healthcare and quality data sources. Apply by Nov 8.
Math informs; design compels. Which matters more? A well-designed collection of flawed information-or an opaque, hard-to-parse, but unerringly accurate model?
This book has 9 chapters introducing text mining techniques, including Relation Extraction, ontology learning using Word Net, and automatic compilation of travel information from texts.
Company seeks evaluators to validate authentic use cases, test SQL coverage and benchmark performance; initial results show a 10x improvement in hardware price/performance over Oracle databases.
7 Steps for Learning Data Mining and Data Science; IEEE BigData 2013 report; Top jobs: Applied Data Scientist at Intel; Big Data Analyst at RightCareSolutions
Seeking professionals with 5-8 years of quantitative experience within the financial service industry, risk management, academic research, for work on dynamic client engagements.
Predict major life events based on a unique anonymized dataset from a major financial services company that allows for an unprecedented view into customer-company interaction. More in a webinar on Nov 22.
Automating the Black Art and "Oral traditions" of Deep Learning; Top 10 Ways You Know You're a Data Scientist - very funny; LIONbook Chapter 11: Democracy in machine learning - how to combine different models
The 7th Sentiment Analysis Symposium, March 5-6, 2014 in New York City, will feature presentations, panels, and workshops covering Digital Measurement, Intelligent Customer Experience, Sentiment Analysis, and Speech and Text Analytics. Call for speakers until Oct 28.
Develve statistical software (beta), written by Frank Pauw, aims for a direct experience of your data, with no deep hidden menus, making all functions directly accessible, and results directly visible.
A report from inaugural IEEE Big Data conference highlights, including Berkeley Data Analytics Stack, Crowdsourcing for Data Analytics, Security for Big Data and more.
New Society of Data Miners will be launched at Predictive Analytics World, London on 23 Oct 2013. It will have to compete with SIGKDD, INFORMS, and other groups in this field.
7 Steps for Learning Data Mining and Data Science; Predictive Analytics in China; Exclusive: Cognitive Mining and Data Mining; Top jobs: Senior Data Scientist - Discovery and Personalization at Netflix; Applied Data Scientist at Intel
Hans-Peter Kriegel is recognized for his outstanding contributions to data mining and knowledge discovery research over a wide range of topics including clustering, outlier detection and high-dimensional data analysis.
Geoff Webb has been an active promoter of ICDM conference since its inception, and has many contributions to the entire data mining field, as Editor-in-Chief of Data Mining and Knowledge Discovery journal, as organizer and PC member of many top conferences, and as an active researcher in data mining.
Outstanding candidates in all areas of computer science will be considered, with priority given to candidates with a research focus in either Data Mining or Cybersecurity, both defined broadly.
A new Big Data lab was launched by top VC firms in San Francisco, with plans to support 5-10 Big Data startups at a time, and providing them with space and mentorship to get started.
Data Science Toolkit on AWS Marketplace; LinkedIn Top Scientist @dtunkelang on How to Interview a Data Scientist; Intel: Applied Data Scientist, Graph Analytics, Big Data Analytics; BBVA Innova Data Mining Challenge, 1st time bank releases anonymized card transaction
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 11 looks at Democracy in machine learning - how to combine different models in flexible, creative and effective ways.
Join a passionate team of entrepreneurs, be on the cutting edge of big data development and predictive modeling, interpret health data to bridge the gap with hospitals, health systems, payers, and patients.
First time ever a bank (BBVA) allows developers and researchers to use anonymized card transaction data to design new services, applications and content, and to creating stunning visualizations.
The U. of Helsinki, a top ten European university, seeks faculty in data management including distributed DB, management of big data systems, data models and data description languages. Apply by Oct 31.
Tutorial: The Naive Bayes Text Classifier; How Quantum Computers and Machine Learning Will Revolutionize #BigData; See how easy it is to find patterns in random data; Applied Data Science - free, self-guided online course
BabelNet is a multilingual "encyclopedic dictionary" and a multilingual ontology created by mapping the Wikipedia to WordNet, the top English computational lexicon, and by integrating other lexical resources such as OmegaWiki and the Open Multilingual WordNet.
Solve our customer problems, manage the full lifecycle of machine learning and big data solutions, using Graph Analytics, Big Data Analytics and Large-Scale Machine Learning.
This part-time MSc program designed for working professionals will give students thorough knowledge of analytics techniques, and the ability to apply them to real-world and business scenarios. Apply by Nov 13 for Winter 2014 start.
The LIONbook on machine learning and optimization, written by co-founders of LionSolver software, is provided free for personal and non-profit usage. Chapter 10 looks at Statistical Learning Theory and Support Vector Machines (SVM).
InMaps provides a visual representation of your professional Linkedin universe, and allows you to better understand your professional ties and the relationship patterns.
Cognitive Mining and Data Mining, data scientists mission, Big Data, privacy, and advice for beginning data scientists - Part 2 of the KDnuggets exclusive interview with StatSoft VP Dr. Thomas Hill
What is the relationship between Cognitive Mining and Data Mining? I discuss this, what makes StatSoft different, achieveing user satisfaction, Big Data and Privacy with StatSoft VP Dr. Thomas Hill.
This course introduces Data Science content in self-guided learning format - work your way through the course at your own pace. This course is free and a select number of participants will be invited to try it on October 28.
Python extensions for machine learning and Data Science; Google releases new R package HistogramTools for #BigData; Top news, Oct 6-12: 3 Free Big Data books on Amazon; 7 Steps for Learning Data Mining; Twitter Analytics: A Beginner's Guide
The goal is to identify high-quality, anonymous, and unbiased longitudinal data sets that monitor education, work, life and career experiences of people of ages 5 to 65.
3 Free Big Data books on Amazon; 7 Steps for Learning Data Mining and Data Science; Circle of Trust and Google Plus; Top jobs: Big Data/Econometric Internship at Democrats for Education Reform; Data Mining Scientist at Apple
Join Data Science experts David Smith (Revolution Analytics) and Gregory Piatetsky (KDnuggets) for an open-panel discussion hosted by Kalido, focusing on Data Science principles for any data size.
To keep large scientific data for long periods of time special-purpose technologies and expertise are required. That is the purpose of Corral big data repository, which is celebrating addition of 100th unique scientific research collection.
Gregory Piatetsky outlines 7 Steps for learning Data Mining and Data Science; 5 Data Science Deadly Sins: Cherry Picking, Confirmation Bias, Data Selection Bias ...; Great job for data scientist who loves to travel; New algorithm mines your Twitter stream, finds most significant events
Predictive Analytics World features sessions with beginner tracks to advanced tracks. There is something there for everyone - register today and save with KDnuggets discount code.
Join the best corporate analytics practitioners from companies like Bank of America, LinkedIn, and Express Scripts at the Text Analytics Summit West, Dec 3-4 in San Francisco - see details and get KDnuggets discount.
Despite big investments, BI projects often fail to deliver, and traditional waterfall methods have proven ineffective. The iterative approach proposed here outlines how to break large projects into more manageable pieces, and uses the idea of a "parking lot" of value-adding features.
Free ebooks from O'Reilly Media, available on Amazon, look at Big Data disruptive possibilities, emerging architecture, tools, applications, and trends, with a special section on health care.
Data Scientists need to be Polyglots; NaSent, new algorithm from Stanford, uses recursive deep learning; Less is more: How to Simplify & Sexify your Graphs; RAW: A Data Visualization tool
A self-motivated, high-energy and organized graduate student with experience in data analysis, statistical analysis, modeling and techniques to assist DFER in an initial build-out and implementation of a robust data-centric political platform and smaller research projects.
Build in-demand skills for the growing analytics field, prepare for leadership-level career opportunities, learn from distinguished faculty and industry experts. Winter quarter application deadline: Oct 15.
Watch a Primer on Predictive Analytics for Business webinar (Oct 8) and learn how affordable CUNY Online MS in Data Analytics can help you excel in data science.
Does Big Data imply "You have collected all there is - all the data there is about a phenomenon". I strongly disagree with this quote from Viktor Mayer-Schonberger and Kenneth Cukier book on Big Data - here is my letter to the editor.
Sample source code for various data science tasks and projects; To Hadoop or Not to Hadoop? Questions to determine if you need Hadoop; Big Data experts get big salaries - $115K on average; Data Mining reveals the emotional differences in emails written by Men and Women
Can we educate people about privacy via gaming? DataDealer is an award-winning new game, where a consumer manages the privacy of other people and organizations.
This challenge has three tracks and is based on two very large, multi-class, multi-label and hierarchical datasets created from the ODP web directory (DMOZ) and Wikipedia.
Davenport University seeks subject matter experts and instructors for the Online Data Analytics Program. Instructors should be familiar with WEKA and willing to teach Online.
Data Mining and Analysis: Fundamental Concepts and Algorithms, free PDF download (draft); Statistical Modeling: The Two Cultures, by Leo Breiman; To Hadoop or Not to Hadoop?
Top jobs: Sr. Data Mining Analyst at Genworth Financial, Richmond, VA; Data Mining Scientist at Apple, Austin, TX;
Top 5 most used tools were R (used by 70% of data miners), IBM SPSS Statistics, Rapid Miner, SAS, and Weka, while STATISTICA, KNIME, SAS JMP, IBM SPSS Modeler, and RapidMiner had the the highest satisfaction. Big Data is actually used only in a small fraction of projects.
The September 2013 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: SAP buys KXEN, Rocket Fuel IPO, Clarabridge Indisys, Narrative Science, Practice Fusion and more.
CIO Review selects 20 most promising Big Data companies, from Actian to Zementis, that have achieved significant momentum and will rise above the rest.
We are particularly interested in individuals who have extensive experiences in business and marketing analytics, data visualization, data mining and machine learning.
Data Science with R: Getting Started with Rattle - a survival guide; KDD 2013 videolectures: the top researchers in Data Mining, Data Science; Statistical Modeling: The Two Cultures, by Leo Breiman; Social Media Analytics, free e-book, an overview of theory, applications, and economics
Kiji is an open source framework for building big data apps with Apache HBase, launched by WibiData to fill the gap between a key-value store functionality and the needs of a predictive modeling application.
Complete data analysis of program information, create standardized and ad hoc reports and queries from multiple data sources, and develop and maintain the Division's website. Apply by Oct 18.
This book gives Social Media Analytics overview, techniques, theory and applications, examines the economic impact of social networks and appropriate analytics methods.
The School of Information at U. of Texas at Austin looks for full-time, tenure-track junior and senior faculty, especially in the areas of data analytics, human-computer interaction, and archival studies.
Many interesting upcoming meetings in Q4 2013, including Discovery Science, IEEE Big Data, ACM Mining Big Data Camp, Big Data Techcon, SAS Analytics 2013, PAW London, Strata + Hadoop World NYC, AusDM, Big Data Festival, Text Analytics Summit West, ICDM 2013, Toronto Data Marketing Conference, and many more.
Colombian UN Office for Human Rights is looking for proposals for a pilot project using text-mining software to effectively explore and analyze scanned text documents.
Thasos, founded by top scientists from MIT Media Lab and Sense Networks, combines and analyzes non-financial Big Data sources in order to measure real-time company fundamentals and macro-economic developments.
Thasos, founded by top MIT scientists, combines and analyzes non-financial Big Data sources in order to measure real-time company fundamentals and macro-economic developments. Expertise with Hadoop, distributed file systems and large-scale datasets needed.
Apple Data Mining Lab looks for an outstanding data scientist to to design, develop, and field data mining solutions with direct and measurable impact.
Many great courses, including Text Analytics and Sentiment Mining, Data Mining: Principles and Best Practices, Supercomputer Data Mining Boot Camp, Survival Analysis, Net lift (Uplift) models, Machine Learning, and Predictive Analytics and Data Mining Model Development and Strategic Implementation.
Check A Primer on Predictive Analytics for Business, Data Discovery Platforms, Data Mining: Failure to Launch, and Data Science: Not Just for Big Data - with Gregory Piatetsky and David Smith
There are two cultures in the use of statistical modeling to reach conclusions from data. One assumes that the data are generated by a given stochastic data model. The other uses algorithmic models and treats the data mechanism as unknown - read the full paper.
New Book: Data Mining and Analysis: Fundamental Concepts and Algorithms, free PDF dow; Random Forests Algorithm - what is it, why does it work so well; Penn researchers use Facebook data to predict users age, gender, personality; Google Hummingbird is a completely new search algorithm and incredibly no one noticed
A treasure of latest Data Mining and Data Science research is now available, with videolectures of KDD-2013, ACM SIGKDD Conference on Knowledge Discovery and Data Mining held recently in Chicago.
KDnuggets Cartoon: Next Trend after Big Data; New Poll: Has Big Data Reached the Hype Peak and is due for Decline and Disillusionment?; edX: Learning from Data, free online course
Top jobs: Data Mining Scientist at Apple, Austin, TX; Machine Learning Scientists at Amazon, Bangalore, India;
Luminoso, Quadbase, Skyttle, Vitria, Zoomdata, and more companies, datasets, education, Big Data and Analytics meetings, software, and solutions added to KDnuggets.