KDnuggets™ News 13:n11, Apr 24
Features (8) | Software (1) | Webcasts (2) | Courses, Events (3) | Meetings (3) | Jobs (4) | Competitions (1) | Tweets (3) | NewsBriefs (2) | CFP (11) | Quote
Features
- Poll Results: Largest Dataset Analyzed/Data Mined - Apr 23, 2013.
The largest dataset analyzed kept growing, with the median value in 40-60 GB range, about twice the 2012 value. US data miners lead other regions in Big Data: about 28% of them worked with TB size databases. We again observed the 11-100 Petabyte gap.
- Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science - Apr 22, 2013.
We investigate the largest LinkedIn groups for Analytics, Big Data, Data Mining, and Data Science and look at their size, growth and activity levels. We identify 4 distinct types of group behavior, and find there are 50% more discussions/week than comments.
- PAW: Predictive Analytics World Chicago June - Apr 23, 2013.
Charge your expertise with the latest methodologies, connect with your peers, and experience unmatched networking and learning at PAW Chicago - June 10-13, 2013. Register before May 1, 2013 for savings up to $400.
- GE NFL NineSigma $10 Million Head Health Challenge - Apr 19, 2013.
GE and NFL Head Health Challenge will award up to $10 million for more accurate diagnoses of brain injury and prognosis for recovery. The Challenge is hosted by NineSigma.
- Data Scientist Hat - Best Comments - Apr 21, 2013.
Here are best comments on Data Scientist Hat - clearly data scientists need to wear multiple hats, including y-hat, to help protect from Hype and Overfitting.
- KDD Innovation and Service Awards, nominations due May 31 - Apr 23, 2013.
Nominate leaders in Big data, Data Mining, Knowledge Discovery, and Predictive Analytics for ACM SIGKDD Innovation and Service Awards. The awards recognize outstanding technical innovations and outstanding professional contributions to the field and are due May 31.
- Should Data Science become a Profession: Pro and Con - Apr 18, 2013.
A Data Science Code of Professional Conduct can protect both consumers of data science and data scientists themselves. But it is useful and possible without a single professional body? Read the pro and con arguments and join lively debate on this topic.
- Top news for Apr 14-20: KDD Cup 2013: Author-Paper ID; 5 Top Articles DMKD Articles; Microsoft ML Summit Streamed Live Apr 23 - Apr 21, 2013.
KDD Cup 2013: Author-Paper Identification Challenge; Free Read 5 Top Articles from DMKD journal; Microsoft Machine Learning Summit Streamed Live Apr 23
Top jobs: Machine Learning Scientist at Accretive Health; Data Scientist at American Express
Software
- GDELT: Global Data on Events, Location and Tone - Apr 17, 2013.
The GDELT database: Global Data on Events, Location and Tone, which is an amazing tool for data journalists. The Guardian described it as "a #BigData history of life, the universe and everything"
Webcasts
- Pivotal - EMC, VMware Big Data Spinoff: Apr 24 Webcast - Apr 19, 2013.
Pivotal, "A New Platform For A New Era", a spinoff from EMC and VMware, will be unveiled during a live live streaming event on April 24.
- IIA Webinar May 8: The Birds and the Bees: The Benefits of Cross-Pollination - Apr 19, 2013.
IIA Faculty Lead Dwight McNeill will review the strengths and analytic sweet spots of four industries: retail, banking, sports, and political campaigns and explore how these strengths could be adapted for healthcare.
Courses, Events
- SAS: New Text Analytics Course - Apr 23, 2013.
This course will help you organize, manage, and mine textual data to generate customer insights and to understand and predict customer sentiments.
- SAS: Exploratory Analysis for Large and Complex Problems - Apr 23, 2013.
This two-day course delivers highly pragmatic and practical methods with a unique approach to predictive analytics that will address these issues and more.
- Lipari Summer School Computational Social Science and Big Data - Apr 19, 2013.
The 2013 Lipari Summer School will bring together world-class experts in the fields of quantitative social science and big data analysis to provide a lecture and workshop series. Apply by April 30.
Meetings
- IEEE ICDM Conference on Data Mining Demos - Apr 23, 2013.
The demo session at ICDM-2013, the world's premier research conference in data mining, provides data mining researchers and practitioners an exciting and highly interactive way to explore new ideas and results. Submission deadline is August 9th, 2013.
- BPDM scholarships: Broadening Participation in Data Mining, KDD 2013 - Apr 22, 2013.
The BPDM Program cultivates a talented and diverse population of data mining researchers by providing scholarships for students of underrepresented groups to interact with and learn from senior researchers in industry. Apply by May 5.
- Meetup/Webcast Apr 23: Shark Data Analytics Stack on a Hadoop Cluster - Apr 18, 2013.
Data Science Meetup: "Shark Data Analytics Stack on a Hadoop Cluster", April 23, 2013, 6 pm MT in Denver, CO - Free and open to all. Live webcast for folks unable to attend in-person.
Jobs
- Analytic Application Developer at Nielsen, Schaumburg, IL - Apr 23, 2013.
Work with Nielsen Advanced Solutions Group analytic modelers to commercialize analytical methods that prove to be successful with our FMCG (fast moving consumer goods) customers.
- Business Intelligence Developer at Epic Systems, Madison, WI - Apr 23, 2013.
Apply your interest in analytics to one of the biggest challenges of our time: healthcare, and help our customers make sense of the wealth of information they capture within an electronic medical record system.
- Senior Editor, Statistics at Springer, New York, NY - Apr 23, 2013.
Responsible for increasing company profitability via acquisitions of books, journals, and electronic products in the field of Statistics and related subject areas.
- Senior Software Engineer -Data Mining Scientist at Rocketfuel, Redwood Shores, CA - Apr 20, 2013.
Rocket Fuel is a digital-advertising technology company in Silicon Valley that has grown rocket-fast since its founding in 2008, and is a leader in the emerging phenomenon of scientific advertising online.
Competitions
- KDD Cup 2013: Author-Paper Identification Challenge - Apr 18, 2013.
One of the main challenges of searching academic literature is resolving author-name ambiguity: many authors have similar names, and some authors publish under different variations of their name. This problem is the topic of KDD Cup 2013: determine which papers in Microsoft Academic Search author profile were truly written by that author. Submission deadline: June 12.
Top Tweets
- Top KDnuggets tweets, Apr 19-21: 2 Books on Data analysis with Python, other open source too; List of Machine Learning APIs - sentiment analysis, face rec - Apr 22, 2013.
2 Books on Data analysis with Python, other open source tools; List of Machine Learning APIs - sentiment analysis, face recognition, query classification; This video clip on Dzhokhar Tsarnaev VKontakte page likely shows his brother; 6 steps to get hired as a data scientist
- Top KDnuggets tweets, Apr 17-18: How eBay is using visualization and Tableau; Cannot predict age on Twitter after 30 - Apr 19, 2013.
How eBay is using visualization and Tableau to make data democratic; Hey, dude, can you tell, like, how old I am? Software cannot predict user age on Twitter after 30; Has the #BigData Bubble Burst? Not yet, but correction is due, especially for Hadoop; How to speed up Excel and process billions of rows in the cloud
- Top KDnuggets tweets, Apr 15-16: MSR Machine Learning Summit Streamed Live Apr 23; DMKD: Read 5 Top Articles from Data Mining and Knowledge Dis - Apr 17, 2013.
Microsoft Research Machine Learning Summit Streamed Live April; DMKD: Read 5 Top Articles from Data Mining and Knowledge Discovery journal; Where can you find a true Data Scientist? Julia vs R: Julia is fast, has native support for parallel computing
News Briefs
- INFORMS First CAPs: Certified Analytics Professionals - Apr 23, 2013.
CAP designation is earned through a combination of education, experience, confirmation of communication skills, and passing a rigorous examination. The passing rate for the first exam was 77%.
- Cogniview Data Conversion Software and Data Blog - Apr 19, 2013.
Cogniview is a leading provider of data conversion software, and also covers data mining and related topics in its blog.
CFP - Calls for Papers
- WI'13: IEEE/WIC/ACM Int. Conf. on Web Intelligence 2013, due May 1
- WI'13: IEEE/WIC/ACM Int. Conf. on Web Intelligence 2013, due May 1
- DMH: Data Mining for Healthcare, due May 5
- RecSys 2013: ACM Int. Conf. on Recommender Systems , due May 6
- ASONAM 2013 Industrial: ASONAM 2013 Industrial Track, due May 15
- IDEAL'2013: Intelligent Data Engineering and Automated Learning, due May 24
- ICDM '13 demos: IEEE ICDM Demos, due Aug 9
- MLSA13: Machine Learning and Data Mining for Sports Analytics , due Jun 28
- BDMT: Big Data Mining Techniques for Online Sales and Customer Service, due Jul 30
- ICDM'13-T: ICDM'13: Call for Tutorial Proposals, due Aug 3
- ICDM '13 demos: IEEE ICDM Demos, due Aug 9
Quote
One Fund Boston helps victims of Boston Marathon bombing - onefundboston.org/