- Wikidata: Wikipedia Free Knowledge Base - Jul 15, 2013.
Wikidata is a centralized knowledge base for Wikipedia and it can be read and edited by humans and machines alike. It currently has over 10M items.
- Clustify 3.2 Adds Graphical Visualization of Document Clusters - Jul 15, 2013.
Clustify software does conceptual clustering, near-duplicate detection, content-based email threading, and automatic categorization / predictive coding.
- The Hymn of Acxiom – Sacred music the age of Big Data? - Jul 15, 2013.
A singer who spent a summer as an intern at Acxiom, working on databases of consumer profiles, wrote The Hymn of Acxiom - "let our formulas find your soul".
- 7 reasons not to miss Predictive Analytics World Boston, Sep 29 – Oct 3 - Jul 15, 2013.
2013/07/kdnuggets-cartoon-nsa-cat-videos-ufo-reports-pizza-connection.html
- Big Data Innovation Summit, Sep 12-13, Boston - Jul 15, 2013.
The Big Data Innovation Summit in Boston provides the ideal platform to discuss challenges with your peers – keeping you ahead of the curve. Get early bird passes by July 12.
- LionBook Chapter 3: Learning requires a method - Jul 15, 2013.
Real learning is associated with extracting the deep and basic relationships in a phenomenon, with summarizing with short models a wide range of events, with unifying different cases by discovering the underlying explanatory laws. This chapter explains the bias-variance dilemma.
- IDC Report: SAS Remains Top Advanced Analytics Provider - Jul 15, 2013.
The IDC research showed analytics leader SAS with a 36.2% share of the 2012 global market, up from 35.3% in 2011. The IDC report profiled Oracle, SAP, IBM, Microsoft, SAS, and Teradata.
- 2013 IEEE Big Data Workshop Papers and Posters - Jul 15, 2013.
The IEEE Int. Conference on Big Data 2013 is a major forum for the latest in Big Data Research, Development, and Applications. Submit poster and papers to 19 workshops (deadlines in late July/early August). Poster submissions due Aug 25.
- CrowdAnalytix: Predict the next phone call using RandomForests - Jul 15, 2013.
The goal of this contest is creating right set of features to model and predict a users next phone call, using only RandomForests. Submission deadline July 25.
- Northwestern Online MS in Predictive Analytics - Jul 15, 2013.
Learn from distinguished faculty and industry experts, build statistical and analytic expertise, and prepare for leadership-level career opportunities – build in-demand skills for the growing analytics field.
- LionBook Chapter 5: Mastering generalized linear least-squares - Jul 15, 2013.
After reading this chapter you are expected to improve from a casual modeler to a professional least-squares guru. Losing accuracy is not a weakness but a strength, an opportunity to create more powerful models by simplifying the analysis.
- IBM CXO Chat July 29: Segmentation and Big Data with Gregory Piatetsky-Shapiro - Jul 15, 2013.
Join July 29 twitterchat with Analytics and Data Science expert Gregory Piatetsky-Shapiro, Ph.D, as we discuss “Solving for Segmentation: Is Big Data X ?”
- Upcoming Meetings on Analytics, Big Data, Data Mining, and Knowledge Discovery, Jul-Sep - Jul 15, 2013.
30 upcoming Meetings and Conferences on Analytics, Big Data, Data Mining, and Knowledge Discovery, July – September 2013.
- Angoss Video Series: Customer Analytics Roadmap - Jul 15, 2013.
Watch this 4-part best practices video series, focusing on Customer Segmentation, Customer Acquisition, Upsell/Cross Sell, and Next Product Recommendation and transform your big data for marketing and customer intelligence.
- Lionsolver Machine Learning Approach to Smartphone Data Wins Parkinson Data Challenge - Jul 15, 2013.
Researchers from Lionsolver, Inc won first prize in The Michael J. Fox Foundation's $10K Parkinson Data Challenge which used smart phone to monitor disease progression. In spite of the very sparse data, the winning entry could predict incidence and monitor disease progression with 100% accuracy.
- GDELT: Global Data on Events, Location and Tone - Jul 15, 2013.
The GDELT database: Global Data on Events, Location and Tone, which is an amazing tool for data journalists. The Guardian described it as "a #BigData history of life, the universe and everything"
- June 1 Hackathon at the White House – apply by Apr 19 - Jul 15, 2013.
Participants will focus on producing full, production ready apps and visualization tools that will be featured on the We the People website and made available under an open source license - apply by April 19.
- Polychart.js JavaScript charting library - Jul 15, 2013.
Polychart.js is a JavaScript charting library with a powerful grammar, inspired by ggplot2, has built-in interactivity between charts.
- Bitcoin tools and datasets - Jul 15, 2013.
Bitcoin, a secure and anonymous internet currency, has recently experienced a bubble in value and attention. Here is a very useful (and free) set of data extraction scripts and datasets for analysts interested in Bitcoin.
- Quadrigram platform for customized data visualizations - Jul 15, 2013.
Quadrigram is a highly flexible platform for creating data visualizations and solutions. It uses a visual language, with modules to importing/exporting many data types, performing all kinds of operations, controlling the data flow of data, and a large catalogue of visualizations and visual metaphors.
- Rexer Analytics 2013 Data Miner Survey Closes April 22, Participate Now - Jul 15, 2013.
Data Analysts, Predictive Modelers, Data Scientists, Data Miners, and all other types of analytic professionals, students, and academics: Please participate in the Rexer Analytics 2013 Data Miner Survey. The survey closes on April 22, so please participate now!
- IBM Accelerates Big Data - Jul 15, 2013.
IBM announced several related technologies in a bid to lead the Big Data Market, including a dramatic 8-25x BLU Acceleration for DB2, an easy-to-use Big Data Platform, and a system for Hadoop.
- TMA Courses in Data Analytics[Aug: San Jose, Sep: Washington, DC] - Jul 15, 2013.
Get up to speed in data mining faster and more effectively than with any other training program available. Next courses in San Jose, CA and Washington, DC.
- KDnuggets 13:n16, New SIGKDD Chair; Stanford Learning Analytics Online; Data Science Jobs - Jul 15, 2013.
Latest analytics/data mining news, including Features (7) | Software (1) | Webcasts (1) | Courses, Events (2) | Meetings (2) | Jobs (13) | Academic (1) | Publications (4) | Tweets (3) | NewsBriefs (9) | CFP (6)
- ebook: Instant Weka How-to - Jul 15, 2013.
A practical guide with examples and applications of programming Weka in Java. Start with the basics and learn how to include Weka machinery in your Java application.
- Clustify 3.2 Adds Graphical Visualization of Document Clusters - Jul 15, 2013.
Clustify software does conceptual clustering, near-duplicate detection, content-based email threading, and automatic categorization / predictive coding.
- McKinsey eBook (free): Big Data, Analytics, and the Future of Marketing and Sales - Jul 15, 2013.
This ebook from McKinsey explores the business opportunities, company examples, and organizational implications of Big Data and advanced analytics.
- INSOFE: Master Big Data Analytics Online - Jul 15, 2013.
Taught by experts who are Carnegie Mellon, JHU, and Stanford alumni, INSOFE programs helped many to become data scientists and get industry certifications and at lower cost than similar programs.
- DMCS 2013: Data Mining Case Studies and Data Mining Practice Prize CFP - Jul 15, 2013.
DMCS will highlight data mining implementations that have been responsible for a significant and measurable improvement in business operations, or an equally important scientific discovery, or another benefit to humanity. Submissions due July 27.
- KXEN Location Analytics, Location-Aware Marketing Webinar (on-demand) - Jul 15, 2013.
Location-aware data is exploding, and you should use it to personalize customer relationships. Join KXEN to learn how you can use location-aware data to zoom in on each and every customer interaction.
- Machine Learning Online Roundtable: How to Make it Work, July 25 - Jul 15, 2013.
In this roundtable moderated by Ismail Parsa of Amazon, experts from Twitter, Skytree, Uber, and Adconion will discuss how to apply machine learning to practical problems in real organizations. Live on July 25 or on-demand afterwards.
- Webinar: Data Mining: Failure to Launch [July 18] - Jul 15, 2013.
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is July 18.
- Applied Predictive Analytics Training with Statistics.com - Jul 15, 2013.
Learn inside tricks and methods in a new online training course developed by CrowdAnalytix (a Kaggle competitor), in partnership with Statistics.com, the leading provider of online education in statistics and analytics.
- Anderson Analytics OdinText Patent for Powerful New Text Analytics Process - Jul 15, 2013.
US Patent 8,473,498 leverages contextual data and provides a process for filtering out the noise which is so common in unstructured data. Both of these important benefits have been deficient in text analytics software until now.
- Kaggle Belkin Energy Disaggregation Competition - Jul 15, 2013.
Use machine learning on EMI signatures and other data to understand what appliances are used as a step for providing personalized and cost-effective energy saving recommendations.
- Upcoming July Webcasts on Analytics, Big Data, Data Science - Jul 15, 2013.
Coming soon: Discovery analytics, Analytically Speaking, Survival Analysis, Data Discovery, Enterprise Informatics, When Hadoop Is Not Enough, Data Mining: Failure to Launch, and SciDB-Py.
- US has one third of world data - Jul 15, 2013.
The US stores 898 exabytes (898 billion gigabytes) of data, nearly a third of the global total. Western Europe has 19% and China has 13%.
- Large Scale Hierarchical Text Classification Challenge - Jul 15, 2013.
This challenge comprises three tracks and is based on two large datasets created from the ODP web directory (DMOZ) and Wikipedia. There are 3 tracks: Very Large Scale Supervised Learning; Multi-task learning; and Refinement-learning.
- US has one third of world data - Jul 15, 2013.
The US stores 898 exabytes (898 billion gigabytes) of data, nearly a third of the global total. Western Europe has 19% and China has 13%.
- PAWGOV: Predictive Analytics World for Government - Jul 15, 2013.
Unlike any other government conference, PAW-GOV is designed to help Federal, State and Local government leaders, analysts, and program managers to understand how they can apply predictive analytics more effectively and efficiently accomplish their mission.
- Book: Data Clustering: Algorithms and Applications - Jul 15, 2013.
The chapters are carefully constructed to cover the area of clustering comprehensively with up-to-date surveys, making this book accessible to beginning data scientists and analysts.
- Data Marketing 2013, Toronto, Dec 9-10 - Jul 15, 2013.
Technology and data enable marketers to deliver communications that are much more relevant through effective micro-segmentation, sentiment analysis, behavior prediction and personalization. DATA MARKETING 2013 will address these challenges with a unique approach.
- Supercomputer Data Mining Boot Camps,San Diego, Sep 12-13, Oct 17-18 - Jul 15, 2013.
The Power to Predict: The Sexiest Job in the 21st Century. Register for UCSD Data Mining Boot Camps scheduled to be held at the San Diego Supercomputer Center on Sep 12-13 and Oct 17-18.
- KDnuggets 13:n18, Analytics Education Poll; Public data sites; Data Mining “Nobel” - Jul 15, 2013.
Latest analytics/data mining news, including Features (8) | Software (4) | Webcasts (2) | Courses, Events (5) | Meetings (1) | Jobs (14) | Academic (2) | Competitions (2) | Publications (6) | Tweets (6) | NewsBriefs (4) | CFP (11)
- 5 Roles You Need on Your Big Data Team - Jul 15, 2013.
Getting value from Big Data requires also paying enough attention to people, and is not just about hiring the best talent. Also very important is identifying the roles the companies really need.
- Notre Dame CARE: Collaborative Assessment Recommendation Engine personalized disease risk predictions - Jul 15, 2013.
U. of Notre Dame researchers have developed a computer-aided method that uses electronic medical records to offer the promise of rapid advances toward personalized health care, disease management and wellness.
- Elder Research Course: Tools for Discovering Patterns in Data, Sep 9-10, Charlottesville, VA - Jul 15, 2013.
Drawing on 20 years of experience, Dr. John Elder will explain powerful analytic methods for classification and estimation, compare the leading algorithms, and demonstrate their effectiveness on practical applications. Attendees will receive the award-winning Handbook of Statistical Analysis and Data Mining Applications, and fully functional (limited time) data mining software from SAS, IBM/SPSS, and StatSoft.
- KDD 2013 Industry Practice Expo - Jul 15, 2013.
Meet the industry leaders at KDD-2013 Industry Practice Expo, and learn about Kaggle, Hadoop from the trenches, Decide.com - to buy or not to buy, Analytics from Presidential Elections to Social Good, and using Big Data to solve small data problems.
- CIO 10 Top Big Data Startups - Jul 15, 2013.
The final ranking is based on reader votes, but also big-name end users, VC funding, the management team and market positioning.
- Kaggle Belkin Energy Disaggregation Competition - Jul 15, 2013.
Use machine learning on EMI signatures and other data to understand what appliances are used as a step for providing personalized and cost-effective energy saving recommendations.
- Webinar (Aug 6), Text Analytics Case Studies from LinkedIn, BoA, and Serendio - Jul 15, 2013.
LinkedIn, Bank of America and Serendio will share case studies on how they use insights from text analytics to save time and money and build competitive strategies - join us on Aug 6.