KDnuggets™ News 14:n14, Jun 10
Features (8) | Software (3) | Opinions (14) | News (6) | Webcasts (3) | Courses (1) | Meetings and Reports (9) | Jobs (6) | Publications (1) | Tweets (6) | CFP (13) | Quote
Features
- KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead - Jun 7, 2014.
With over 3,000 data miners taking part in KDnuggets 15th Annual Software Poll, RapidMiner continues to lead. Free software is used much more outside US, and Hadoop usage grows fastest in Asia.
- US Open Data Action Plan and Datasets - May 31, 2014.
We summarize the key findings in the recently released US Open Data Action Plan, highlighting the principles, commitments, datasets released and future outlook.
- Big Data Strategy: Datafication - Jun 5, 2014.
Datafication of everything enables new ways of creating value and becoming more competitive. Oracle Big Data Strategist Paul Sonderegger explains.
- Exclusive: Raul Valdes-Perez on OnlyBoth, Scientific Discovery, Advice for Winners - Jun 5, 2014.
Our exclusive interview covers OnlyBoth and Vivisimo startups, Scientific Discovery, legendary Herbert A. Simon, venture capital, Big Data, advice for winners, and more.
- Interview: Kirk Borne, Data Scientist, GMU on Decision Science as a Service and Data Science curriculum - May 31, 2014.
We discuss Kirk's role at Syntasa, the concept of "Decision Science as a Service", key components of a well-designed Data Science education curriculum, advice for young aspirants and more.
- PAW: Predictive Analytics World Boston, Oct 5-9 - Jun 8, 2014.
Join predictive analytics experts from leading organizations at Predictive Analytics World Boston (Oct 5-9, 2014) to increase your knowledge and get insights into the ever-evolving field of analytics. Get KDnuggets discount.
- INFORMS, Uniting Operations Research and Analytics - Jun 4, 2014.
INFORMS is a large professional association which started in operations research and management science. I discuss their evolution to analytics, CAP certification, Big Data and more.
- KDnuggets Social Network in NodeXL, May 2014 - May 29, 2014.
We examine KDnuggets Twitter Social Network, as generated by NodeXL, looking at clusters, top Twitter accounts, URLs, hashtags, words, and what does it all mean?
Software
- Lavastorm - Top Rated Analytics Platform - Free Download - Jun 5, 2014.
Download your free copy of the new version of top-rated Lavastorm Analytics Engine Public, and respond faster to information requests, eliminate Excel headaches and scripting hassles, and improve data visibility.
- OpenNN, An Open Source Library For Neural Networks - Jun 2, 2014.
OpenNN is an open source class library written in C++ which implements neural networks, and runs on Windows, Apple, or Linux.
- InnovAccer: Simplifying Research and Analysis - Jun 5, 2014.
Innovaccer cleans and prepares data for analysis by researchers to save time and improve confidence in the quality of the data.
Opinions and Interviews
- The First Law of Data Science: Do Umbrellas Cause Rain? - Jun 9, 2014.
Michael Brodie on the first law of data science, the role of data curation in Big Data analysis, and Thomas Piketty economic theories.
- Interview: Lloyd Tabb, Chairman & CTO, Looker on Front-line Analytics and Data Democratization - Jun 9, 2014.
We discuss the capabilities of Looker, data democratization across organization, change in the tools being used by analytics-savvy business managers, front-line analytics, competitive landscape and more.
- Don Zereski, VP, Local Search & Discovery, HERE (Nokia) on Location Analytics and Architecture Evolution - Jun 8, 2014.
We discuss trends in location analytics, evolution of HERE's analytics architecture, infrastructure challenges, data governance and more.
- Data Lakes vs Data Warehouses - Jun 7, 2014.
Data Warehouses, traditionally popular for business intelligence tasks, are being replaced by less-structured Data Lakes which allow more flexibility.
- Interview: Santhosh Adayikkoth, CEO, BigInfo Labs on Big Data perception and learning Big Data skills - Jun 7, 2014.
We discuss BigInfo Labs' future plans, Big Data perception at C-level in large firms, most effective ways to learn Big Data skills and more.
- Interview: Santhosh Adayikkoth, CEO, BigInfo Labs on Data Relevance and Intel Partnership - Jun 6, 2014.
We discuss BigInfo Labs, the concept of "Data Relevance" in Big Data, experience of partnership with Intel, and BigInfo Labs' strategy for competitive differentiation.
- Data Science Last Mile - Jun 6, 2014.
This post discusses the Data Science "Last Mile", the final work to take the discovered insights and deliver them a highly usable format or integrate into a specific application.
- Lynn Goldstein, Chief Data Officer, NYU on the Need for Data Governance - Jun 3, 2014.
We discuss the role of Data Governance, establishing Big Data accountability, impact of Data Governance on Data Quality, and assessing the education available for Data Governance.
- Interview: Tom Kern, Risk Modeling Manager, Paychex on Risk Analytics and Sales Anticipation Model - Jun 2, 2014.
We discuss the role of Risk Analytics at Paychex, strategic importance of Sales Anticipation Model, optimizing business processes by leveraging Big Data, and advice for companies thinking about Big Data as well as aspiring students.
- Preparing Industry for the Upcoming Data Deluge: PAW-Manufacturing 2014 - May 30, 2014.
Predictive analytics will become more powerful in industry as the data that computers collect and analyze in consumer and manufacturing contexts becomes more numerous.
- Interview: Kirk Borne, Data Scientist, GMU on Big Data in Astrophysics and Correlation vs. Causality - May 30, 2014.
We discuss how to build the best data models, significance of correlation and causality in Predictive Analytics, and impact of Big Data on Astrophysics.
- Data Mining Modern Languages - May 30, 2014.
We examine the trends and implications in modern language enrollment in the United States, and also show an excellent example of using rCharts and ggplot2 for interactive visualization.
- Interview: Walter Maguire, Chief Field Technologist on HP Big Data Strategy and HAVEn - May 28, 2014.
We discuss how HP views Big Data, capabilities of HP HAVEn, leveraging Big Data for improving customer experience, Analytics challenges, outsourcing criteria and current trends.
- MoDAT: Designing the Market of DATa - Workshop Report - May 28, 2014.
An overview of MoDAT workshop on "Designing the Market of DATa" - key research ideas such as recommending expertise, chance discovery, "data jackets", privacy risks, and more.
News
- May 2014 Analytics, Big Data, Data Mining Acquisitions and Startups Activity - Jun 9, 2014.
May 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: 40 events, including ExtraHop, Capptain, Datalogix, "Data Nation", 6Sense, Sumo Logic, DataPad, SeeWhy, Tamr, LiveRamp, and Adometry.
- ICON Challenge on Forecasting and Scheduling - Jun 3, 2014.
ICON is a combined competition with both a machine learning component (predicting energy prices) and an scheduling component (using the predicted prices to schedule tasks on machines).
- Top stories for Jun 1-7 - Jun 8, 2014.
New Poll: Analytics, Data Mining, Data Science Software Used? OpenNN, An Open Source Library For Neural Networks; Data Lakes vs Data Warehouses; Stanford University: Data Analyst.
- Top stories in May: New Poll - Analytics, Data Mining Software; Data Science Cheat Sheets - Jun 5, 2014.
New Poll: Analytics, Data Mining, Data Science Software Used? Guide to Data Science Cheat Sheets; Big Data Landscape, v 3.0, analyzed; Where to Learn Deep Learning.
- Top stories for May 25-31 - Jun 1, 2014.
New Poll: Analytics, Data Mining, Data Science Software Used? Where to Learn Deep Learning - Courses, Tutorials, Software; Interview: Martin Hack, CEO, Skytree on Industrializing Machine Learning for Big Data; Data Mining and Analysis: Fundamental Concepts and Algorithms.
- Additions to KDnuggets Directory in May - Jun 2, 2014.
ClearVu Analytics, OpenNN neural net library from Intelnics, QIWare, Vowpal Wabbit software for fast learning, Analytics Vidhya, 17 new Big Data meetings, companies, Latin American Data Science education, and more.
Webcasts and Webinars
- Webinar: Data Mining: Failure to Launch [June 11] - Jun 4, 2014.
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is June 11.
- Upcoming Webcasts on Analytics, Big Data, Data Science - Jun 9 and beyond - Jun 9, 2014.
Data Mining FTL, Analytically Speaking with Dan Ariely, Solr, Hadoop, Cloud BI, Employee Churn, and more.
- Webcast - Analytically Speaking Featuring Michael Schrage - Jun 3, 2014.
MIT Research Fellow Michael Schrage helps you answer the key question for becoming a successful innovator. His insights will give you a new perspective on how to create value.
Courses
- Vendor-Neutral Hands - On Training in Data Mining [Denver-CO, July | Wash-DC, Sep] - Jun 9, 2014.
Successful analytics in the big data era does not start with data and software, but with immersive hands-on training and goal-driven strategy. Get this training from The Modeling Agency.
Meetings and Reports
- PAW: Predictive Analytics World Boston, Oct 5-9 - Jun 8, 2014.
Join predictive analytics experts from leading organizations at Predictive Analytics World Boston (Oct 5-9, 2014) to increase your knowledge and get insights into the ever-evolving field of analytics. Get KDnuggets discount.
- HR & Workforce Analytics Innovation Summit 2014 Chicago: Day 2 Highlights - Jun 5, 2014.
Highlights from the presentations by HR leaders from Caterpillar, Coca-Cola, Pfizer, and Marriott International on day 2 of HR & Workforce Analytics Innovation Summit 2014 in Chicago.
- Jun-Oct 2014 Meetings in Analytics, Big Data, Data Mining, and Data Science - Jun 4, 2014.
Coming soon: Big Data Innovation Summits, Useful Business Analytics, PAW and PAW-MFG in Chicago, GigaOM Structure, INFORMS Business of Big Data, MMDS, GraphLab Conference, TDWI World Conf in Boston, and KDD-2014 in NYC.
- HR & Workforce Analytics Innovation Summit 2014 Chicago: Day 1 Highlights - Jun 2, 2014.
Highlights from the presentations by HR leaders from Wells Fargo, Sears Holdings, Johnson Controls, Trulia on day 1 of HR & Workforce Analytics Innovation Summit 2014 in Chicago.
- Big Data Innovation Summit 2014 London: Highlights - May 31, 2014.
Highlights from the presentations by Big Data technology practitioners from Sears Holdings, Microsoft, Ticketmaster during Big Data Innovation Summit 2014 in London.
- Gaming Analytics Innovation Summit: Day 2 Highlights - May 30, 2014.
Highlights from the presentations by Gaming Analytics experts from Ubisoft, Electronic Arts, Sega on Day 2 of Gaming Analytics Summit 2014.
- Data Discovery to Real Business Value - INFORMS Conference, June 22-24, San Jose - May 30, 2014.
Learn about how to achieve return on investment and real business value from your Big Data investments through the real-life case studies, insightful presentations, and lot more at the INFORMS Conference in San Jose.
- Big Data for Executives 2014: Day 2 Highlights - May 29, 2014.
Highlights from the presentations by Big Data experts from McKinsey Solutions, SAP, Techfetch, Weather Analytics on Day 2 of Big Data for Executives 2014.
- Gaming Analytics Summit 2014: Day 1 Highlights - May 29, 2014.
Highlights from the presentations by Gaming Analytics experts from Activision, Valve, Microsoft and Broken Bulb Studios on Day 1 of Gaming Analytics Summit 2014.
Jobs
- Megaputer Intelligence: Data Analysis Consultant - Jun 9, 2014.
Create data analysis and reporting solutions for Megaputer customers with the help of PolyAnalyst(tm) platform: experimental, proof-of-concept, implementation, and production projects. Develop successful long-term relationships with customers.
- The Doctors: Manager, Marketing Data & Analytics - Jun 8, 2014.
Managing all market data analysis and reporting systems and activities for the enhancement of market insights, demand generation campaigns, and retention analytics.
- Stanford University: Data Analyst - Jun 6, 2014.
Work with wide-range of challenges by analyzing unique expenditure datasets, produce insights to help reduce spending and improve reimbursements, payments, and contractors payments.
- Apple: Sr. Software Engineer, Machine Learning - Jun 5, 2014.
Apply advanced techniques and algorithms to improve an ad network, develop and implement ad algorithms, yield optimization solutions and network data processes, deep understanding of the ad network behavior.
- Chubb: Analytics Director - May 30, 2014.
Leading analytic initiatives to address complex business challenges using SAS, including data extraction, integration, preparation, business rules, and quality control.
- Thomson Reuters: Data Scientist - May 29, 2014.
Proven record of building data driven solutions. Creative, understand complex business problems, develop prototypes. Have expertise in the data life cycle, from data collection to analysis and presentation.
Publications
- Big Data Assessment - Key Business Drivers, Expected Benefits and Common Challenges - Jun 5, 2014.
Recent survey on Big Data outlook reports increasing interest in Big Data for more accurate and timely decision-making; and concerns about project costs and ability to scale.
Top Tweets
- Top KDnuggets tweets, Jun 6-8 - Jun 9, 2014.
A tutorial on statistical learning with with scikit-learn Data science vs the hunch: When data contradicts manager gut instinct Stanford University: Data Analyst Data Lakes vs Data Warehouses.
- Top KDnuggets tweets, Jun 4-5 - Jun 6, 2014.
How does "Practical Data Science with R" book stand out ? Top 5 cities for #BigData jobs: San Francisco, McLean, Boston, St. Louis, and Toronto< Big jump in #BigData applications, code built with Apache Spark 76 Startup Failure Post-Mortems.
- Top KDnuggets tweets, Jun 2-3 - Jun 4, 2014.
SAS vs R vs SPSS - Statistical Language Wars - a giant infographic Very useful - R Refcard for Data Mining Mind-boggling - The Internet in Real-Time - how quickly data is generated #BigData, Open Data, and Open Govt - Venn Diagram.
- Top KDnuggets tweets, May 30 - Jun 1 - Jun 2, 2014.
Tutorial: Step-by-Step Guide to Setting Up an R - #Hadoop System 100+ Interesting Data Sets for Statistics (and Data Science) #BigData sets available for free - big list from Data Science Central Twitter to release all tweets to scientists - a research boon and an ethical dilemma.
- Top KDnuggets tweets, May 28-29 - May 30, 2014.
SAS University Edition offers free #SAS software for higher education, teaching Ultra-cool! Google "Quantum Computing Playground" - fiddle with quantum algorithms Thomson Reuters: Data Scientist Realtime Personalization and Recommendation with Stream Mining.
- Top KDnuggets tweets, May 26-27 - May 28, 2014.
Machine Learning Algorithms Tour: Regression, kNN, Regularization, Decision Tree Where to Learn Deep Learning - Courses, Tutorials, Software 9 Courses on Data Science, R, Machine Learning start on Coursera.
CFP - Calls for Papers
- DataWiz 2014: 1st Int. Workshop on Data Visualization, due Jun 6
- Ethics: Data Ethics Workshop, due Jun 13
- DMNLP : Interactions between Data Mining and Natural Language Processing, due Jun 20
- CSSW: Computational Social Science: Social Contagion, Collective Behaviour, and Networks, due Jun 22
- KDIR: Knowledge Discovery and Information Retrieval, due Jun 23
- BIOKDD'14: 13th Int. Workshop on Data Mining in Bioinformatics, due Jun 23
- ICCBR-DM: Synergies between CBR and Data Mining, due Jun 23
- STANDARDS: Standards in Predictive Analytics Workshop , due Jun 24
- IEEE ICDM 2014: IEEE Int. Conf. on Data Mining, due Jun 24
- ICTAI 2014: 26th IEEE Int. Conf. on Tools with Artificial Intelligence, due Jul 14
- ISM 2014: IEEE Int. Symposium on Multimedia, due Jul 18
- DMS2014: Data Mining for Service, due Aug 1
- COMAD 2014: 20th Int. Conf. on Management of Data, due Aug 11