Exclusive: Dave Marvit, Innovation Strategy Consultant, Fujitsu on Modern Sentiment Analysis using Ubiquitous Continuous Sensing
We discuss traditional sentiment analysis vs. modern sentiment analysis, role of data science in Human Centric Intelligent Society, mainstream adoption of bio sensors and opportunities created by Big Data from ubiquitous continuous sensing.
on Jun 30, 2014 in Crossing the Chasm, Data Science, Dave Marvit, Fujitsu, Healthcare, Interview, Sentiment Analysis
Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 30 and beyond
CAP Theorem (key Big Data idea), Analytics and Machine Learning, SAS and Hadoop, MongoDB, Data Lakes, Data Visualization, and more.
on Jun 30, 2014 in Analytics, Bloor Group, Hadoop, MongoDB, SAS
Top KDnuggets tweets, Jun 27-29: Google says Hadoop era is over
Google says #Hadoop era is over, Google Cloud Dataflow can do much more; Machine learning, data mining, predictive analysis, and advanced analytics ~ same; Do you need a Masters Degree to become a Data Scientist? ; Larry Page: "if we data mined health care data, we could save 100K lives next year".
on Jun 30, 2014 in Data Scientist, Dataflow, Google, Hadoop, Healthcare, Master of Science
100 Big Data Companies Analyzed
We analyze the CRN Big Data 100 for insights into trends in the future of Big Data companies, including changes in database solutions, active regions, and what industries are undergoing the most change right now.
on Jun 29, 2014 in Big Data, Big Data Vendors, Business Analytics, Companies, CRN, Data Management, Hadoop, NoSQL
The Impact Cycle – how to think of actionable insights
The IMPACT Cycle provides a guiding framework for thinking about the steps for being effective analytical consultant, and can be a tool to help you drive effectiveness through your analytical teams.
on Jun 29, 2014 in Analytics Consultant, Business Analytics, Business Strategy, Data Analytics, Jean-Paul Isson
Top stories for Jun 22-28
Does Deep Learning Have Deep Flaws? What is Text Analytics? Data Science Skills and Business Problems; CRN 50 Emerging Big Data Vendors.
on Jun 29, 2014 in Big Data Vendors, Data Science Skills, Deep Learning, Text Analytics, Top stories
New Beginnings in Facial Recognition
Developments in neural networks and deep learning are bringing great improvements in facial recognition, which could have exciting (and scary) applications on platforms like Google Glass.
on Jun 28, 2014 in AI, Deep Learning, Face Recognition, Google Glass, Image Recognition, Neural Networks, Sean McClure
Do you need a Masters Degree to become a Data Scientist?
Leading analytics experts answer the question: "Do you need a Masters Degree to become a Data Scientist?" Read practical tips and interesting commentary.
on Jun 27, 2014 in Data Science Education, Data Scientist, LinkedIn Groups, Master of Science
Interview: Samaneh Moghaddam, Applied Researcher, eBay on Opinion Mining – Typical Projects and Major Challenges
We discuss typical sentiment analysis problems at eBay, underrated challenges, career motivation, important soft skills and more.
on Jun 27, 2014 in Advice, Challenges, eBay, Interview, Samaneh Moghaddam, Skills
Interview: Ingo Mierswa, RapidMiner CEO on “Predaction” and Key Turning Points
RapidMiner CEO Ingo Mierswa talks about "predaction", reasons for RapidMiner popularity, business source model, analytics to investigate fraud, key turning points, and more.
on Jun 27, 2014 in Ajay Ohri, Fraud analytics, Ingo Mierswa, Open Source, RapidMiner
Top KDnuggets tweets, Jun 25-26: 12 JavaScript Libraries for Data Viz; TF-IDF – key measure for Text Analytics
Very useful: 12 JavaScript Libraries for Data Visualization; Introduction to TF-IDF - key measure for Text Analytics; UC Berkeley new online MS in #DataScience, 18 months, $60K; XLMiner solves Big Data Problems in Excel.
on Jun 27, 2014 in Data Visualization, Javascript, Text Analytics, TF-IDF, UC Berkeley, XLMiner
Data Science Skills and Business Problems
Discover what skills a data scientist benefits from learning and how the concept of a data scientist, and what businesses expect of them, has developed over time.
on Jun 27, 2014 in Alex Jones, Business Analytics, Data Science Skills, DJ Patil, McKinsey, Unicorn
Book: Win With Advanced Business Analytics
Written for the non-technical professional, this definitive guide shows you how to gain the most opportunity and value from every type of advanced business analytics.
on Jun 26, 2014 in Book, Business Analytics, Jean-Paul Isson, Wiley
Menthal – Access Rich Smartphone Dataset
Menthal projects studies smartphone habits and depression, and has an App which collects smartphone interactions and gives user a feedback. Researchers are invited to analyze a large set of volunteered data by visiting Bonn, Germany.
on Jun 26, 2014 in App, Depression, Germany, Menthal, Postdoc, Smartphone
Interview: Samaneh Moghaddam, Applied Researcher, eBay on Aspect-based Opinion Mining
We discuss aspect-based opinion mining, major challenges, cold start items, the need for accurate opinion mining models for cold start items and how factorized LDA can be leveraged.
on Jun 26, 2014 in Challenges, eBay, LDA, Samaneh Moghaddam, Sentiment Analysis, Text Mining
CRN 50 Emerging Big Data Vendors
We examine CRN top 50 Emerging Big Data Vendors, with 65% located in Silicon Valley. The prototypical company is located in San Francisco and develops software for Hadoop analytics platform. Competition will be tough!
on Jun 26, 2014 in Big Data, Big Data Vendors, Companies, CRN, Hadoop, Startups
Domino – A Platform For Modern Data Analysis
Tools that facilitate data science best practices have not yet matured to match their counterparts in the world of software engineering. Domino is a platform built from the ground up to fill in these gaps and accelerate modern analytical workflows.
on Jun 26, 2014 in Business Analytics, Data Analysis, Data Science Platform, Domino, Tools
CRN 25 Big Data Management Companies
We examine top 25 Big Data Management companies, part of CRN Big Data 100, including Actian, Couchbase, and MemSQL. A large fraction of these companies develop NoSQL solutions.
on Jun 26, 2014 in Big Data, Companies, CRN, Data Management, NoSQL
XLMiner solves Big Data Problems in Excel
XLMiner, a part of Analytic Solver Platform integrated software for predictive and prescriptive analytics - forecasting, data mining, optimization and simulation, lets you solve small or Big Data problems in Excel.
on Jun 26, 2014 in Data Mining, Excel, Forecasting, Optimization, XLMiner
Top KDnuggets tweets, Jun 23-24: Machine learning in the cloud: Microsoft Azure; Understanding Data Distribution
Machine learning in the cloud: the brains behind Microsoft Azure; Understanding Data Distribution - key first step in analyzing a new data set; Mapmaking for R Programmers - an introduction. What is Text Analytics?
on Jun 25, 2014 in Machine Learning, Maps, Microsoft Azure, R, Text Analytics
CRN 50 Big Data Business Analytics Companies
We examine CRN top 50 Big Data Business Analytics companies. They are younger (average age is 10), and 44% are founded since 2010.
on Jun 25, 2014 in Big Data, Business Analytics, Companies, CRN
KDnuggets 14:n16, Does Deep Learning have deep flaws? New poll: Largest dataset analyzed?
KDnuggets analytics, data mining, and data science stories, including Features, Software, Opinions, News, Webcasts, Courses, Meetings and Reports, Jobs, Academic, Tweets, CFP, and Quote.
on Jun 25, 2014 in Deep Learning, LIONbook, Netflix, Poll, Twitter
Analytics 2014 Conference – Oct 20-21, Las Vegas
Analytics 2014 features four keynote speakers and more than 40 breakout sessions on hot topics like visual analytics, social media analytics, predictive modeling, and much more! Register by Aug 1 to get the early rates.
on Jun 24, 2014 in Analytics, Conference, Las Vegas-NV, SAS
PAW: Predictive Analytics World Boston, Oct 5-9
Tired of using old predictive modeling tools? Hear the Chief Scientist at Mail Chimp discuss his experience building successful models and what it means to "lead from the back" in predictive modeling.
on Jun 24, 2014 in Boston-MA, PAW, Predictive Analytics World
CRN 25 Big Data Infrastructure Companies
We examine the top 25 Big Data Infrastructure companies, part of CRN Big Data 100, which includes Amazon, IBM, and Microsoft.
on Jun 24, 2014 in Big Data, Companies, CRN, Infrastructure
Chief Data Officer Summit 2014 – Day 1 Highlights
Highlights from the presentations by Data Governance experts from State of Colorado, IBM, Informatica and Sony Pictures Entertainment on day 1 of Chief Data Officer Summit 2014 in San Francisco, CA.
on Jun 24, 2014 in 3Vs of Big Data, Analytics, Chief Data Officer, Conference, Data Governance, IE Group, San Francisco-CA
What is Text Analytics?
Anderson Analytics explains Text Analytics and the difference between First Generation approaches and Next Generation software OdinText.
on Jun 24, 2014 in OdinText, Text Analytics, Tom HC Anderson
KDnuggets Twitter Follower 20,000 – Interview
KDnuggets crosses a milestone of 20,000 Twitter followers. Number 20,000 is a PhD student in Data Mining in China. I ask her about research, data mining, Big Data market leaders in China, and more.
on Jun 23, 2014 in China, KDnuggets Honors, Liyang Tang, Twitter
Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 23 and beyond
Reducing employee churn, Wolfram language, Rise of Machine Learning, Analytics with Hadoop, Social Media Analytics for Healthcare, and more.
on Jun 23, 2014 in Big Data Analytics, Employee Churn, Hadoop, Machine Learning, Wolfram
Top KDnuggets tweets, Jun 20-22: Great visualization of English letters; Good list of R functions to manipulate data
Great visualization: English letters in words; Good list of R functions to manipulate data; Watch: Practical Deep-Learning Lecture: Machine Perception and Applications; Wikipedia Usage Statistics - analyze this 4TB data set in AWS cloud.
on Jun 23, 2014 in Deep Learning, R, Visualization, Wikipedia
LION Resources for Teaching Machine Learning and Optimization
A great collection of resources for "LION: Learning and Intelligent Optimization" textbook includes slides, tutorial movies, exercises, use cases, and LIONoso - an academic version of LIONsolver software.
on Jun 23, 2014 in Data Science Education, LIONbook, LIONsolver, Machine Learning, Optimization
Top stories for Jun 15-21
Does Deep Learning Have Deep Flaws?; Cartoon: Big Data and World Cup Football; Optimizing the Netflix Experience with Data Science; The Cardinal Sin of Data Mining: Overfitting.
on Jun 22, 2014 in Cartoon, Data Science, Deep Learning, Netflix, Overfitting, Soccer, Top stories
KDD Partners with Bloomberg to Unleash Data
Researchers will be able to engage with practitioners, policy makers and activists engaged in NGOs to explore opportunities to utilize data science in socially relevant application domains. Abstracts or other submissions due July 15.
on Jun 21, 2014 in Bloomberg, Claudia Perlich, KDD-2014, New York-NY, SIGKDD
Data Visualization of Census Data with R
This article shows step-by-step how to use R to access US Census Data, visualize it, and plot it on the map.
on Jun 20, 2014 in API, Data Visualization, R, US Census
Top KDnuggets tweets, Jun 18-19: Does Deep Learning Have Deep Flaws? Mode opens GitHub for data
Does Deep Learning Have Deep Flaws? Mode Analytics opens 'GitHub for data'; Data Science Stack Exchange Q&A site. Big Data Analytics with Google Big Query and R.
on Jun 20, 2014 in Big Query, Deep Learning, Google, Mode Analytics, Netflix, R
New Poll: Largest dataset analyzed / data mined?
New KDnuggets Poll is asking: What was the largest dataset you analyzed / data mined? The median answer in 2013 was 40-50 GB. Please vote and we will analyze and publish the trends.
on Jun 19, 2014 in Big Data, Largest, Poll
Optimizing the Netflix Streaming Experience with Data Science
How Netflix uses data science and Big Data analytics to improve the Quality of streaming experience for its members.
on Jun 19, 2014 in Netflix, Stream Mining
Does Deep Learning Have Deep Flaws?
A recent study of neural networks found that for every correctly classified image, one can generate an "adversarial", visually indistinguishable image that will be misclassified. This suggests potential deep flaws in all neural networks, including possibly a human brain.
on Jun 19, 2014 in Artificial Intelligence, Deep Learning, Google, Image Recognition, Neural Networks
The R User Conference, June 30 – July 3, Los Angeles
The open source R language is a leading tool for data scientists. Attend useR! conference, the main annual event of the R community, June 30 - July 3, in Los Angeles.
on Jun 19, 2014 in Los Angeles-CA, Open Source, R
GraphLab Conference, Graph Analytics and Machine Learning, San Francisco July 21
GraphLab Conference (San Francisco, July 21) brings together experts in graph analytics, large scale machine learning, and data science from leading companies, academic institutions and organizations. Special KDnuggets discount.
on Jun 19, 2014 in Graph Analytics, Graph Databases, Graph Visualization, GraphLab, Python, San Francisco-CA
Summer School: Constraint Programming Data Mining, Sicily
"Constraint Programming Meets Data Mining" is an upcoming summer school in Sicily providing intensive training in the state of the art in constraint solving, machine learning, and data mining.
on Jun 18, 2014 in Constraint Programming, Data Mining, Italy, Sicily, Summer School
Top KDnuggets tweets, Jun 16-17: You cannot afford to ignore next #AI wave; 5 Companies doing #BigData Right
You cannot afford to ignore next #AI wave - see early leaders; 5 Companies doing #BigData Right: Amazon, British Airways, eBay, Otto group, Netflix; Data Mining 200 years of Patents shows that invention is combinatorial; Cartoon: Big Data and World Cup Football.
on Jun 18, 2014 in AI, Amazon, Cartoon, eBay, Google, Netflix, Patents, Watson, World Cup
KDnuggets 14:n15, Analytics Software Poll – Analyzed; Cartoon: Big Data and World Cup
Also Data Mining Cardinal Sin, KDnuggets Profile, CAP, and more analytics/data mining features, software, opinions, news, webcasts, courses, jobs, academic positions, publications, tweets, and CFP.
on Jun 18, 2014 in Cartoon, Data Mining Software, INFORMS, Overfitting, Poll, World Cup
IBM, SAS, SAP, Angoss Lead in Advanced Analytics – Hurwitz Report
The Hurwitz Victory Index for Advanced Analytics looks at major trends and ranks top vendors - including IBM, SAS, SAP, Angoss, StatSoft, Revolution Analytics, RapidMiner, and Megaputer - across 4 key dimensions.
on Jun 17, 2014 in Advanced Analytics, Angoss, Hurwitz, IBM, Report, SAP, SAS
KDnuggets Analytics, Data Mining, Data Science Software Poll – Analyzed
We analyze the results of KDnuggets Software Poll, including correlations between tools, and relationships between commercial, free, and Hadoop/Big Data tools. We identify a potential capability gap. Download anonymized data and analyze it yourself.
on Jun 17, 2014 in Data Mining Software, Hadoop, Poll, R, RapidMiner
Cartoon: Big Data and World Cup Football
New KDnuggets Cartoon takes a fresh look on Big Data insights and World Cup 2014 in Soccer. What should a player do when Big Data predicts his behavior?
on Jun 17, 2014 in Big Data, Cartoon, World Cup
INFORMS: CAP® Analytics Certification
Maximize your analytics career with CAP(r) Analytics certification - show that you are well-qualified. Apply free of charge and take CAP exam at over 700 testing centers worldwide.
on Jun 17, 2014 in Analytics, CAP, Certification, INFORMS
Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 16 and beyond
The Marriage of BI and Big Data, Unstructured Data on Hadoop, Future of Decision Making, Stopping employee churn, and more.
on Jun 16, 2014 in Bloor Group, Business Intelligence, Hadoop
Interview: Conal Sathi, Data Scientist, Slice on Creating Value from Mining Shoppers’ e-Receipts
We discuss the relevance of "Purchase Graph", Slice platform, analytical insights from mining all activity around a customer's purchase, experimentation strategy, experience of working as a data scientist and more.
on Jun 16, 2014 in Conal Sathi, Consumer Insights, Experimentation, Graph Analytics, Interview, Machine Learning
NYU Data Science Program – Things to Know Part 2
NYU Data Science program reviewed from inside, including courses on Machine Learning, Big Data, Deep Learning, top professors, great NYC location, and future plans.
on Jun 16, 2014 in Data Science, Deep Learning, New York-NY, NYU, Ran Bi, Yann LeCun
Top KDnuggets tweets, Jun 13-15: Book: Data Classification: Algorithms and Applications
Book: Data Classification: Algorithms and Applications; Top 10 Data Analysis Tools for Business; #BigData companies to watch selected by top analytics experts; The Cardinal Sin of Data Mining and Data Science: Overfitting.
on Jun 16, 2014 in Companies, Data Analysis, Data Classification, Overfitting, Top 10
Top stories for Jun 8-14
KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead; Data Lakes vs Data Warehouses; The First Law of Data Science: Do Umbrellas Cause Rain? Huge Big Data Poster and Reference.
on Jun 15, 2014 in Causation, Data Lakes, Poll, Poster, Top stories
Book: Data Classification: Algorithms and Applications
This new book explores the underlying algorithms of classification and applications in text, multimedia, social network, biological data, and other domains. 25% off with KDnuggets discount.
on Jun 14, 2014 in Algorithms, Book, Charu Aggarwal, Classification, CRC Press
The Cardinal Sin of Data Mining and Data Science: Overfitting
Overfitting leads to public losing trust in research findings, many of which turn out to be false. We examine some famous examples, "the decline effect", Miss America age, and suggest approaches for avoiding overfitting.
on Jun 14, 2014 in Dean Abbott, John Ioannidis, Kirk D. Borne, Overfitting, S&P 500
NYU Data Science Program – Things to Know
Inside summary of NYU Data Science program launched last year, what it is, and what makes it special.
on Jun 13, 2014 in Data Science, Deep Learning, New York-NY, NYU, Ran Bi, Yann LeCun
Top KDnuggets tweets, Jun 11-12: Huge Big Data poster; “Data science” misses half the equation
Huge Big Data Poster and Reference; "Data science" misses half the equation: you also need "decision science"; Proposed ethical guidelines for Twitter data mining: clear objectives, protect anonymity; Great talk at Google! John Ioannidis on why most published research is wrong.
on Jun 13, 2014 in Big Data, Decision Science, Ethics, Poster, Twitter
The Algorithm that Runs the World Can Now Run More of It
The most important algorithm, used for optimizing almost everything, is linear programming. New advances allow linear programming problems to be solved faster using the new commercial parallel simplex solver.
on Jun 13, 2014 in Algorithms, FICO, Linear Programming, Optimization, Qi Huangfu, Simplex
Top 10 Data Analysis Tools for Business
Ten free, easy-to-use, and powerful tools to help you analyze and visualize data, analyze social networks, do optimization, search more efficiently, and solve your data analysis problems.
on Jun 13, 2014 in Data Analysis, Knime, RapidMiner, Tableau, Top 10, Wolfram
Huge Big Data Poster and Reference
A really Big poster "Do You Know Big Data" includes: What it is, Leading tools, What is a Data Scientist, What questions should we ask of databases, Visual techniques, Statistical algorithms, Privacy, and more.
on Jun 12, 2014 in Altamira, Big Data, Bob Gourley, CTOvision, Poster
YARN is All the Rage at Hadoop Summit 2014
Apache YARN, which enables much broader types of computations than MapReduce, is quickly becoming an integral part of Hadoop projects. We review best practices considerations for a YARN cluster.
on Jun 12, 2014 in Apache, Apache Spark, Daniel D. Gutierrez, Hadoop, Summit, YARN
Profile: KDnuggets Serves Analytics and Big Data Fields
A profile of KDnuggets, including an overview, history, and present highlights, is featured on the homepage of INFORMS, a major society for Analytics and Optimization (until June 23, 2014).
on Jun 11, 2014 in Big Data, Gregory Piatetsky, INFORMS, KDD, KDnuggets Honors, Twitter
Request: Apache UIMA Research Partnership in EU
Looking for any EU university department currently working with Apache UIMA developing text analysis software, and interested in research partnership.
on Jun 11, 2014 in Apache, Europe, UIMA
Top KDnuggets tweets, Jun 9-10: Numeric Matrix Manipulation: Cheat Sheet; The First Law of Data Science
Also - The First Law of Data Science: Do Umbrellas Cause Rain? ; Tell Your Kids to be Data Scientists - Not Doctors; DLib Library for Machine Learning
on Jun 11, 2014 in Causation, Cheat Sheet, Correlation, Data Scientist, Julia, numpy
Zipfian Academy 6-week Data Fellowship (Free)
Zipfian Academy is offering a free 6-week data science fellowship to help build a robust skill set and connect with hiring partners. The program begins June 30.
on Jun 11, 2014 in Data Science, Fellowship, San Francisco-CA, Summer School, Zipfian Academy
AlgoMost contest: Predicting future company acquisitions
Develop an algorithm to predict which companies are most likely to be acquired in the current fiscal year.
on Jun 11, 2014 in Acquisitions, AlgoMost, Competition, Data Science, Startups
DLib: Library for Machine Learning
DLib is an open source C++ library implementing a variety of machine learning algorithms, including classification, regression, clustering, data transformation, and structured prediction.
on Jun 10, 2014 in C++, DLib, Machine Learning, Open Source, Tools
Moving from Data-Rich to Decision-Smart: INFORMS Conference The Business of Big Data 2014
We interview the co-chairs of INFORMS Conference The Business of Big Data 2014 (June 22-24, 2014) on Big Data maturity, opportunities assessment, analytics for operations research, conference agenda and more.
on Jun 10, 2014 in Big Data, Diego Klabjan, INFORMS, Margery H. Connor, Predictive Analytics, Trends, Workshops
KDnuggets 14:n14, KDnuggets Analytics/Data Mining Poll Results; First Law of Data Science
Also Datafication, US Open Data Plan, Kirk D. Borne, Raul Valdez-Perez and more interesting Interviews and Opinions, Analytics/Data mining Features, Software, Webcasts, Courses, Meetings and Reports, Jobs, Publications, Tweets, and CFP.
on Jun 10, 2014 in Datafication, Kirk D. Borne, Open Data, Poll, Raul Valdes-Perez
The First Law of Data Science: Do Umbrellas Cause Rain?
Michael Brodie on the first law of data science, the role of data curation in Big Data analysis, and Thomas Piketty economic theories.
on Jun 9, 2014 in Causation, Confirmation Bias, Correlation, Data Curation, Michael Brodie, Piketty
Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 9 and beyond
Data Mining FTL, Analytically Speaking with Dan Ariely, Solr, Hadoop, Cloud BI, Employee Churn, and more.
on Jun 9, 2014 in Analytically Speaking, Dan Ariely, Data Mining, Failure to Launch, Hadoop, Solr
Interview: Lloyd Tabb, Chairman & CTO, Looker on Front-line Analytics and Data Democratization
We discuss the capabilities of Looker, data democratization across organization, change in the tools being used by analytics-savvy business managers, front-line analytics, competitive landscape and more.
on Jun 9, 2014 in Advice, Analytics, Data Democratization, Interview, Lloyd Tabb, Looker, Metrics
May 2014 Analytics, Big Data, Data Mining Acquisitions and Startups Activity
May 2014 acquisitions, startups, and company activity in Analytics, Big Data, Data Mining, and Data Science: 40 events, including ExtraHop, Capptain, Datalogix, "Data Nation", 6Sense, Sumo Logic, DataPad, SeeWhy, Tamr, LiveRamp, and Adometry.
on Jun 9, 2014 in Acquisitions, Datalogix, Google, Startups, Sumo Logic, Tamr
Top KDnuggets tweets, Jun 6-8: Statistical-learning tutorial w. scikit-learn; Data science vs the hunch
A tutorial on statistical learning with with scikit-learn ; Data science vs the hunch: When data contradicts manager gut instinct; Stanford University: Data Analyst ; Data Lakes vs Data Warehouses.
on Jun 9, 2014 in Data Lakes, Data Science, Hunch, scikit-learn, Stanford, Tutorial
Vendor-Neutral Hands-On Training in Data Mining [Denver-CO, July | Wash-DC, Sep]
Successful analytics in the big data era does not start with data and software, but with immersive hands-on training and goal-driven strategy. Get this training from The Modeling Agency.
on Jun 9, 2014 in Data Mining Training, Denver-CO, TMA, Vendor-neutral, Washington-DC
PAW: Predictive Analytics World Boston, Oct 5-9
Join predictive analytics experts from leading organizations at Predictive Analytics World Boston (Oct 5-9, 2014) to increase your knowledge and get insights into the ever-evolving field of analytics. Get KDnuggets discount.
on Jun 8, 2014 in Boston-MA, PAW, Predictive Analytics World
Don Zereski, VP, Local Search & Discovery, HERE (Nokia) on Location Analytics and Architecture Evolution
We discuss trends in location analytics, evolution of HERE's analytics architecture, infrastructure challenges, data governance and more.
on Jun 8, 2014 in Data Governance, Don Zereski, Infrastructure, Interview, Location Analytics, Nokia, Real-time
Top stories for Jun 1-7
New Poll: Analytics, Data Mining, Data Science Software Used? OpenNN, An Open Source Library For Neural Networks; Data Lakes vs Data Warehouses; Stanford University: Data Analyst.
on Jun 8, 2014 in Data Lakes, Data Warehouse, Neural Networks, OpenNN, Poll, Stanford, Top stories
KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead
With over 3,000 data miners taking part in KDnuggets 15th Annual Software Poll, RapidMiner continues to lead. Free software is used much more outside US, and Hadoop usage grows fastest in Asia.
on Jun 7, 2014 in Data Mining Software, Excel, Hadoop, Knime, Poll, Python, R, RapidMiner, SAS, SQL, SQL Server, Weka
Data Lakes vs Data Warehouses
Data Warehouses, traditionally popular for business intelligence tasks, are being replaced by less-structured Data Lakes which allow more flexibility.
on Jun 7, 2014 in Business Intelligence, Data Lakes, Data Science Platform, Data Visualization, Data Warehouse, DataRPM
Interview: Santhosh Adayikkoth, CEO, BigInfo Labs on Big Data perception and learning Big Data skills
We discuss BigInfo Labs' future plans, Big Data perception at C-level in large firms, most effective ways to learn Big Data skills and more.
on Jun 7, 2014 in Interview, Learning from Data, Santhosh Adayikkoth, Skills, Startup
Interview: Santhosh Adayikkoth, CEO, BigInfo Labs on Data Relevance and Intel Partnership
We discuss BigInfo Labs, the concept of "Data Relevance" in Big Data, experience of partnership with Intel, and BigInfo Labs' strategy for competitive differentiation.
on Jun 6, 2014 in Competition, Innovation, Intel, RichRelevance, Santhosh Adayikkoth, Startup
Data Science Last Mile
This post discusses the Data Science "Last Mile", the final work to take the discovered insights and deliver them a highly usable format or integrate into a specific application.
on Jun 6, 2014 in Alpine, Data Science, Joel Horwitz, Predictive Analytics
Top KDnuggets tweets, Jun 4-5: “Practical Data Science with R” stands out; Top 5 cities for #BigData jobs
How does "Practical Data Science with R" book stand out ? Top 5 cities for #BigData jobs: San Francisco, McLean, Boston, St. Louis, and Toronto; Big jump in #BigData applications, code built with Apache Spark ; 76 Startup Failure Post-Mortems.
on Jun 6, 2014 in Apache Spark, Boston-MA, Data Science, Failure, McLean-VA, R, San Francisco-CA, Startups
Exclusive: Raul Valdes-Perez on OnlyBoth, Scientific Discovery, Advice for Winners
Our exclusive interview covers OnlyBoth and Vivisimo startups, Scientific Discovery, legendary Herbert A. Simon, venture capital, Big Data, advice for winners, and more.
on Jun 5, 2014 in Advice, Herbert A. Simon, OnlyBoth, Raul Valdes-Perez, Startups, Vivisimo
HR & Workforce Analytics Innovation Summit 2014 Chicago: Day 2 Highlights
Highlights from the presentations by HR leaders from Caterpillar, Coca-Cola, Pfizer, and Marriott International on day 2 of HR & Workforce Analytics Innovation Summit 2014 in Chicago.
on Jun 5, 2014 in Analytics, Chicago-IL, HR, IE Group, Workforce Analytics
InnovAccer: Simplifying Research and Analysis
Innovaccer cleans and prepares data for analysis by researchers to save time and improve confidence in the quality of the data.
on Jun 5, 2014 in Data Curation, Data Integration, Data Preparation, InnovAccer, Statistical Analysis
Big Data Assessment – Key Business Drivers, Expected Benefits and Common Challenges
Recent survey on Big Data outlook reports increasing interest in Big Data for more accurate and timely decision-making; and concerns about project costs and ability to scale.
on Jun 5, 2014 in Analytics, Big Data, Business Strategy, Challenges, Industry, Report
Top stories in May: New Poll – Analytics, Data Mining Software; Data Science Cheat Sheets
New Poll: Analytics, Data Mining, Data Science Software Used? Guide to Data Science Cheat Sheets; Big Data Landscape, v 3.0, analyzed; Where to Learn Deep Learning.
on Jun 5, 2014 in Big Data, Cheat Sheet, Data Science, Deep Learning, Landscape, Poll, Top stories
Big Data Strategy: Datafication
Datafication of everything enables new ways of creating value and becoming more competitive. Oracle Big Data Strategist Paul Sonderegger explains.
on Jun 5, 2014 in Datafication, Las Vegas-NV, Oracle, Paul Sonderegger, Strategy
Jun-Oct 2014 Meetings in Analytics, Big Data, Data Mining, and Data Science
Coming soon: Big Data Innovation Summits, Useful Business Analytics, PAW and PAW-MFG in Chicago, GigaOM Structure, INFORMS Business of Big Data, MMDS, GraphLab Conference, TDWI World Conf in Boston, and KDD-2014 in NYC.
on Jun 4, 2014 in Boston-MA, Chicago-IL, IE Group, INFORMS, Meetings, PAW
INFORMS, Uniting Operations Research and Analytics
INFORMS is a large professional association which started in operations research and management science. I discuss their evolution to analytics, CAP certification, Big Data and more.
on Jun 4, 2014 in CAP, Certification, Data Science Certificate, Gary Bennett, INFORMS
Webinar: Data Mining: Failure to Launch [June 11]
Learn how to get started with predictive modeling and overcome strategic and tactical limitations that cause data mining projects to fall short of their potential. Next webinar is June 11.
on Jun 4, 2014 in Data Mining, How to start, TMA, Webinar
Top KDnuggets tweets, Jun 2-3: SAS vs R vs SPSS – Statistical Wars; RDataMining R Refcard for Data Mining
SAS vs R vs SPSS - Statistical Language Wars - a giant infographic ; Very useful - R Refcard for Data Mining; Mind-boggling - The Internet in Real-Time - how quickly data is generated ; #BigData, Open Data, and Open Govt - Venn Diagram.
on Jun 4, 2014 in Open Data, R, Real-time, Refcard, SAS, SPSS
Webcast – Analytically Speaking Featuring Michael Schrage
MIT Research Fellow Michael Schrage helps you answer the key question for becoming a successful innovator. His insights will give you a new perspective on how to create value.
on Jun 3, 2014 in Analytically Speaking, JMP, Michael Schrage
Lynn Goldstein, Chief Data Officer, NYU on the Need for Data Governance
We discuss the role of Data Governance, establishing Big Data accountability, impact of Data Governance on Data Quality, and assessing the education available for Data Governance.
on Jun 3, 2014 in Data Governance, Data Quality, Data Science, Lynn Goldstein, NYU
ICON Challenge on Forecasting and Scheduling
ICON is a combined competition with both a machine learning component (predicting energy prices) and an scheduling component (using the predicted prices to schedule tasks on machines).
on Jun 3, 2014 in Competition, Forecasting, Scheduling
HR & Workforce Analytics Innovation Summit 2014 Chicago: Day 1 Highlights
Highlights from the presentations by HR leaders from Wells Fargo, Sears Holdings, Johnson Controls, Trulia on day 1 of HR & Workforce Analytics Innovation Summit 2014 in Chicago.
on Jun 2, 2014 in Advanced Analytics, Chicago-IL, HR, IE Group, Predictive Modeling, Workforce Analytics
OpenNN, An Open Source Library For Neural Networks
OpenNN is an open source class library written in C++ which implements neural networks, and runs on Windows, Apple, or Linux.
on Jun 2, 2014 in Neural Networks, Open Source, OpenNN
Interview: Tom Kern, Risk Modeling Manager, Paychex on Risk Analytics and Sales Anticipation Model
We discuss the role of Risk Analytics at Paychex, strategic importance of Sales Anticipation Model, optimizing business processes by leveraging Big Data, and advice for companies thinking about Big Data as well as aspiring students.
on Jun 2, 2014 in Advice, Big Data, Challenges, Interview, Predictions, Risk Assessment, Tom Kern, Tools
Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 2 and beyond
SQL-on-HaDOOP, BigML, ClearStory, Analytic Maturity with Dean Abbott and TIBCO, Just Enough Math, Analytically Speaking with Dan Ariely, Data Mining FTL, and more.
on Jun 2, 2014 in Analytically Speaking, ClearStory, Hadoop, SQL
Top KDnuggets tweets, May 30 – Jun 1: Guide to Setting Up an R-Hadoop ; 100+ Interesting Data Sets
Tutorial: Step-by-Step Guide to Setting Up an R - #Hadoop System; 100+ Interesting Data Sets for Statistics (and Data Science); #BigData sets available for free - big list from Data Science Central ; Twitter to release all tweets to scientists - a research boon and an ethical dilemma.
on Jun 2, 2014 in Datasets, Hadoop, R, Twitter
Additions to KDnuggets Directory in May
ClearVu Analytics, OpenNN neural net library from Intelnics, QIWare, Vowpal Wabbit software for fast learning, Analytics Vidhya, 17 new Big Data meetings, companies, Latin American Data Science education, and more.
on Jun 2, 2014 in Added to KDnuggets, Blogs, Neural Networks, Venezuela
Top stories for May 25-31
New Poll: Analytics, Data Mining, Data Science Software Used? Where to Learn Deep Learning - Courses, Tutorials, Software; Interview: Martin Hack, CEO, Skytree on Industrializing Machine Learning for Big Data; Data Mining and Analysis: Fundamental Concepts and Algorithms.
on Jun 1, 2014 in Algorithms, Deep Learning, Martin Hack, Poll, Skytree, Top stories
|