KDnuggets™ News 14:n10, Apr 30
Features (9) | Opinions (5) | Software (3) | News (6) | Webcasts (1) | Courses (3) | Meetings (4) | Jobs (10) | Academic (2) | Publications (4) | Tweets (6) | CFP (9) | Quote
Features
- Cartoon: Data Scientist Salary Negotiation - Apr 29, 2014.
New KDnuggets Cartoon looks at Data Scientist Salary Negotiation situation.
- New Poll: What data types/sources you analyzed in the past 12 months? - Apr 29, 2014.
New KDnuggets Poll is asking: What data types/sources you analyzed in the past 12 months? Please vote on www.kdnuggets.com .
- Elusive Data Scientists Driving High Salaries - Apr 21, 2014.
Recent study tracks experience, salary, industry and location of Data scientists, finds they are earning base salaries over $200K. Download free report.
- Data Mining Medicare Data - What Can We Find? - Apr 24, 2014.
Medicare released detailed reimbursement data for 2012: $77 billion paid to more than 880,000 health care providers, by doctor and procedure.We take an initial look and find large variances and potential indicators of fraud.
- PAW: Predictive Analytics World Toronto - New Data Paradigm - Apr 29, 2014.
2014 means big changes for big data. Get ready for a fierce debate on the new data paradigm at Predictive Analytics World Toronto on May 15. Special KDnuggets discount.
- Big Data Innovation Summit 2014: Highlights of Keynote Speeches on Day 2 - Apr 23, 2014.
Highlights from keynote speeches by big data experts from Facebook, RedPoint Global, Quintiles, Samsung, GMU, PayPal, and others on Day 2 of Big Data Innovation Summit 2014 in Santa Clara.
- Big Data Innovation Summit 2014: Highlights of Keynote Speeches on Day 1 - Apr 23, 2014.
Highlights from keynote speeches by big data technology leaders from industry and academia on first day of Big Data Innovation Summit 2014 in Santa Clara.
- Big Data Innovation Summit 2014 Santa Clara: Highlights of Selected Talks on Day 2 - Apr 29, 2014.
Highlights from the presentations by big data technology practitioners from NYSE, Glassdoor, Slice and Paychex on day 2 of Big Data Innovation Summit 2014 in Santa Clara.
- Big Data Innovation Summit 2014 Santa Clara: Highlights of Selected Talks on Day 1 - Apr 25, 2014.
Highlights from the presentations by big data technology practitioners from eBay, YarcData, LinkedIn, Trulia, and other leading companies on day 1 of Big Data Innovation Summit 2014 in Santa Clara.
Opinions and Interviews
- Exclusive Interview: Michael Brodie, Leading Database Researcher, Industry Leader, Thinker - Apr 21, 2014.
We discuss the most important database research advances, industry developments, role of relational, NoSQL, Graph databases, Computing Reality, and more.
- KDnuggets Interview: Michael Brodie on Data Curation, Cloud Computing, Startup Quality, Verizon (part 2) - Apr 28, 2014.
The second part of our exclusive interview focuses on Data Curation, Cloud Computing, Data Tamer and Jisto startups, and his experience as a chief Scientist of Verizon - and how that relates to teenager never tidying a room for 60 years.
- Exclusive Interview: David Stringfellow, Chief Economist, State Utah Auditor - Apr 25, 2014.
We discuss Analytics for Public Policy decisions, responsibilities of Utah Chief Data Officer, crowdsourcing analytics for resolving Government problems and most important skills for data science practitioners.
- Are Big Data and Privacy at odds? FICO Interview - Apr 23, 2014.
We discuss privacy, FICO scores, balancing predictive power and non-discrimination, whether technology bringing big data and privacy closer, and most important privacy issues for FICO.
- Exclusive Interview: Peter Bruce, President Statistics.com - Apr 17, 2014.
We discuss the mission of Statistics.com, selection of analytics courses and certificates, the future of analytics education, MOOCs, are Statistics disconnected from Big Data, the role of a data scientist, and more.
Software
- MLTK: Machine Learning Toolkit in Java - free download - Apr 27, 2014.
MLTK is a collection of machine learning algorithms in Java, supporting Generalized Linear Models: Ridge, Lasso, Elastic Net, Regression Trees, Random Forests, and more. Free download under BSD license.
- Apache Spark, the hot new trend in Big Data - Apr 18, 2014.
Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. Leveraging Hadoop Yarn, Alpine has made it very simple to get started with Spark.
- Examining GoodData Open Analytics Platform - Apr 16, 2014.
KDnuggets examines the main features of GoodData Open Analytics Platform, its users, how it compares to competition, and future plans.
News
- SIGKDD Data Science/Data Mining PhD Dissertation Award - Nominations Due Apr 30 - Apr 23, 2014.
This annual award by ACM SIGKDD seeks to recognize outstanding research by doctoral candidates in the field of data mining, data science, and knowledge discovery. Nominations due Apr 30.
- Big Data Leads Top Paying Skills - Apr 29, 2014.
Big Data related skills led the list of top paying technical skills (six-figure salaries) in 2013. Several other useful insights are available in the Dice Tech Survey Report, available for free download.
- Microsoft Expands Big Data Platform - Apr 21, 2014.
Microsoft expands its data platform with 3 major features: SQL Server 2014 with in-memory technology, Azure Intelligent Systems Service, and Analytics Platform System - SQL Server + Hadoop. New CEO Satya gives low-key but impressive presentation.
- Big Data TechCon - Great How-To Conference - Apr 17, 2014.
The recent BigData TechCon conference in Boston featured practical, how-to classes and tutorials for IT and Big Data professionals. It is the how-to training conference for professionals implementing and analyzing Big Data.
- Top stories for Apr 20-26 - Apr 27, 2014.
Elusive Data Scientists Driving High Salaries; Data Workflows for Machine Learning; New Book: Social Media Mining - free PDF download; Microsoft Expands Big Data Platform.
- Top stories for Apr 13-19 - Apr 20, 2014.
Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Science; Data Analytics Handbook, free download; Apache Spark, the hot new trend in Big Data; GoodData Open Analytics Platform.
Webcasts and Webinars
- Upcoming Webcasts on Analytics, Big Data, Data Science - April 28 and beyond - Apr 28, 2014.
Stuck in Traffic, Beyond Excel, Evolving Your BI Strategy, BLU Acceleration with Cognos, SAS and Cloudera, Analytically Speaking Featuring David Meintrup, Data Mining: Failure to Launch, and more.
Courses
- USC Marshall MS in Business Analytics - Apr 24, 2014.
USC Marshall new MS in Business Analytics will give you the tools to leverage big and unstructured data for effective decision-making - study full or part-time, and customize your degree to your career goals.
- TESC Online, Affordable MBA in Data Analytics - Recharge Your Career - Apr 22, 2014.
This online program enables graduates to use advanced data analytics to drive continuous improvement in business and organizations, and lets you earn credit for professional certifications/expertise.
- UC Berkeley Master of Information and Data Science, Online - Apr 17, 2014.
This online degree is for professionals who want to become leaders in the field of data science. Students benefit from UC Berkeley strong ties to Silicon Valley and multidisciplinary approach that teaches the entire data life cycle.
Meetings
- PAW: Predictive Analytics World Toronto - New Data Paradigm - Apr 29, 2014.
2014 means big changes for big data. Get ready for a fierce debate on the new data paradigm at Predictive Analytics World Toronto on May 15. Special KDnuggets discount.
- U. Cincinnati Analytics Summit 2014, May 23 - Apr 28, 2014.
Keynotes by Eric Siegel (PAW Founder) and Jack Levis (UPS Director of Process Management), Tracks on Predictive Analytics, Descriptive Analytics, Prescriptive Analytics, Social and Mobile Media Analytics.
- KDD 2014 Workshops - the leading edge of Data Science Research - Apr 27, 2014.
KDD 2014 workshops provide the forum for the leading-edge research on topics like Data Science for Social Good, Crowd Sensing, Mobile Health, Stream Mining, Data Ethics, Sports Analytics, Social Networks, and much more. Papers due in June.
- MMDS 2014: Workshop on Algorithms for Modern Massive Data Sets, Berkeley - Apr 22, 2014.
The MMDS 2014 workshop (Berkeley, June 17-20) will bring top researchers to address algorithmic, mathematical, and statistical challenges in modern statistical data analysis. Early registration deadline May 1.
Jobs
- Microsoft: Applied Researcher - Apr 26, 2014.
Be at the forefront of Big Data, command many thousands of machines, process petabytes of data, and not just answer the question given but define what the right question to answer is.
- Paychex: Manager, Risk Modeling & Review - Apr 25, 2014.
Lead efforts to plan, design strategy, build, deploy and monitor predictive models to leverage revenue opportunities, and mitigate risks.
- Great West Casualty Company: Predictive Modeler - Data Analyst - Apr 22, 2014.
Researching, analyzing and developing predictive models in support of company mission - be the premier provider of insurance products and services for truckers.
- Apple: Data Scientist - iTunes - Apr 20, 2014.
Apple has a tremendous amount of data, and we have just scratched the surface in pattern detection, anomaly detection, predictive modeling, and optimization. We encourage scientists to stay abreast of research by attending conferences and working with academy.
- Apple: iAd - Senior Software Engineer - Apr 20, 2014.
Apple advertising is redefining the advertising experience on mobile devices. Be part of a dynamic team building high performance and scalable applications.
- Videology: Data Mining Scientist - Apr 17, 2014.
Videology is a leading technology company in the digital advertising industry. Support data mining and machine learning efforts, both for research and for ad optimization products.
- Amazon: Business Intelligence Engineer, Mobile Business Development - Apr 17, 2014.
Talented BI Engineer who is passionate about using data to drive crucial business decisions regarding our activity in the mobile ecosystem.
- Bosch: Data Mining Engineer - Big Data Infrastructure - Apr 17, 2014.
Bring together disparate technologies and use data mining and analytics to solve business problems in Predictive Maintenance, Health Informatics, Vehicle Diagnostics, Manufacturing, and other domains.
- Best Practice Partners: Principal Consultant, Clinical Innovations and Physician Engagement - Apr 16, 2014.
Lead the development and delivery of the client company clinical quality improvement and provider engagement services to assist clients in validating and ensuring improved outcomes.
- Apple: BI Applications Developer - Apr 16, 2014.
Have a startup mentality rather than a IT shop mentality, develop solutions to create different business analytical reports from vast amount of data.
Academic/Research positions
- NDSU: Informatics Postdocs - Apr 26, 2014.
Join a dynamic team performing groundbreaking research in the area of combinatorial cheminformatics, and propose independent strategic research themes in energy-related computational chemistry.
- Idiap/EPFL: PhD and Internship Positions in Social Computing - Apr 24, 2014.
The Social Computing Research Group at Idiap/EPFL has 3 PhD positions and several internship positions to help research in social media, ubiquitous computing, and computational social science.
Publications
- 9 Free Books for Learning Data Mining and Data Analysis - Apr 29, 2014.
Whether you are learning data science for the first time or refreshing your memory or catching up on latest trends, these free books will help you excel through self-study.
- Where are your users? Geo-localization with KNIME - Apr 28, 2014.
Learn how KNIME can help you improve user understanding through Geo-localization of IP addresses and dynamic visualization. Access free white paper for more details.
- New Book: Social Media Mining - free PDF download - Apr 22, 2014.
Social Media Mining integrates social media, social network analysis, and data mining to enable students, practitioners, researchers, and managers to understand the basics and potentials of this field.
- Data Workflows for Machine Learning - Apr 20, 2014.
Paco Nathan compares several open source frameworks for Machine Learning workflows, including KNIME, IPython Notebook and related libraries, Cascading, Cascalog, and Spark/MLbase, and proposes 9 criteria to evaluate the best alternatives.
Top Tweets
- Top KDnuggets tweets, Apr 25-27 - Apr 28, 2014.
Recommended Tutorials for Data Scientists from PyCon 2014; How One Woman Hid Her Pregnancy from #BigData; MLTK: Machine Learning Toolkit in Java - free download; Deep Learning for Natural Language Processing.
- Top KDnuggets tweets, Apr 23-24 - Apr 25, 2014.
#BigData Cartoon: "It does look similar - but this one is powered by Hadoop"; Great list: 9 Python Machine Learning Books; Why people are bad at technology predictions; Too busy recommending things to experience them.
- Top KDnuggets tweets, Apr 21-22 - Apr 23, 2014.
Sweet! Chocolate Consumption strongly correlated to Nobel Prizes; Cheat Sheets for Data Scientists; New Book: Social Media Mining - free PDF download; Elusive Data Scientists Driving High Salaries.
- Top KDnuggets tweets, Apr 18-20 - Apr 22, 2014.
Cross-validation pitfalls for regression/classification and how to avoid them; Data Workflows for Machine Learning ; Apache Spark, the hot new trend in Big Data ; Visual Analysis Best Practices - download a free guidebook from Tableau.
- Top KDnuggets tweets, Apr 16-17 - Apr 19, 2014.
Scikit-Learn: a great python library for machine learning; A map of where nobody lives in the US; Apache Spark, the hot new trend in Big Data ; NYU @aghose on Est. Demand for Mobile Apps - Learn more: NYU Stern MS in Biz Analytics.
- Top KDnuggets tweets, Apr 14-15 - Apr 16, 2014.
9 Free Books for Learning Data Mining and Data Science; Coursera #DataScience Specialization: 10 courses from JHU; Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Mining, and Data Science; EMC Data Science and Big Data Analytics Offer.
CFP - Calls for Papers
- DS 2014: 17th Int. Conf. on Discovery Science, due May 9
- C3-2014 : Curbing Collusive Cyber-gossips in Social Networks, due May 25
- BMAW 2014: 11th Annual UAI Application Workshop, due Jun 2
- ODD2: Outlier Detection & Description under Data Diversity , due Jun 4
- ACM SIGSPATIAL GIS 2014: SIGSPATIAL Advances in Geographic Information Systems, due Jun 17
- NFMCP: Workshop on New Frontiers in Mining Complex Processes, due Jun 20
- ICDM 2014: IEEE Int. Conf. on Data Mining (ICDM), due Jun 24
- NewsKDD: Data Science for News Publishing , due Jun 24
- BD-DSG: Big Data Journal: Special Issue on Data for Social Good, due Oct 6