The Present and the Future of the KDD Cup Competition
KDD cup is the first and among most prestigious competitions in data science, Among key takeaways from KDD Cup 2015: XGBoost – Gradient Boosted Decision Trees package works wonders in data classification, feature engineering is the king, and team work is crucial.
on Aug 31, 2015 in ACM, Competition, Gradient Boosting, KDD, KDD Cup, KDD-2015
Big Data Influence on Data Driven Advertising
More and more companies relying on big data for their data driven initiatives. In a survey conducted by BlueKai, we are trying to capture what its impact on advertising strategies.
on Aug 31, 2015 in Advertising, Big Data, Kaushik Pal
Gartner 2015 Hype Cycle: Big Data is Out, Machine Learning is in
Which are the most hyped technologies today? Check out Gartner's latest 2015 Hype Cycle Report. Autonomous cars & IoT stay at the peak while big data is losing its prominence. Smart Dust is a new cool technology for the next decade!
on Aug 28, 2015 in Big Data, Citizen Data Scientist, Gartner, Machine Learning
What is the success rate in Hadoop adoption?
Hadoop is no more an unknown term for the big data analytics, it’s to find its value return. Here, we tried to explore on the popular opinions of the Hadoop adopters, we also talk about current challenges for adoption.
on Aug 28, 2015 in Big Data Hype, Hadoop, Kaushik Pal, Success
Data Hierarchy of Needs
Data Hierarchy of Needs helps understand the steps in Big Data processing. Before going to advanced data modeling (top of the pyramid), organizations need to fill huge holes they frequently have in the base of the pyramid, lacking reliable complete data flow.
on Aug 28, 2015 in Data Management, Data-Driven Business, Yanir Seroussi
5 questions to decide if you need a data scientist
Here are 5 questions to answer if you are thinking about hiring a data scientist. It depends not only on a person, but on the company culture, business problem and understanding its potential.
on Aug 26, 2015 in Data Scientist, Hiring, Yanir Seroussi
Data Marts as an indispensable analytical tool
An analytical Data Mart is in effective and user-friendly tool for reporting, analyses and modeling. Explore, how data marts could provide time saving, less error prone and streamline solution for your business problems.
on Aug 26, 2015 in Algolytics, Data Marts
OpenText Data Driven Digest Aug 21: College Majors, Hacking Glory, Innovation Performance
The simple beauty of X-Y coordinates belies the power they hold; indeed, many of the best data visualizations created today rely on, and build upon, on the Cartesian plane concept to show complex data sets. Here are three examples.
on Aug 25, 2015 in Data Visualization, OpenText, P-value
Paradoxes of Data Science
There are many paradoxes, ironies and disconnects in today’s world of data science: pain points, things ignored, shoved under the rug, denied or paid lip.
on Aug 21, 2015 in Data Science, Data Science Skills, Myths, Thomas Ball
OpenText Data-Driven Digest, Aug 14
In three data visualizations, we dive into what you would see looking west or east across the ocean; the contours and makeup of the seabed; and the width of rivers throughout North America.
on Aug 15, 2015 in Data Visualization, Dataflow, Ocean, OpenText
Recycling Deep Learning Models with Transfer Learning
Deep learning exploits gigantic datasets to produce powerful models. But what can we do when our datasets are comparatively small? Transfer learning by fine-tuning deep nets offers a way to leverage existing datasets to perform well on new tasks.
on Aug 14, 2015 in Deep Learning, Image Recognition, ImageNet, Machine Learning, Neural Networks, Transfer Learning, Zachary Lipton
11 things to know about Sentiment Analysis
Seth Grimes, a text analytics guru, shares 11 key observations on what works, what is past, what is coming, and what to keep in mind while doing sentiment analysis.
on Aug 13, 2015 in Affective Computing, Emoji, Sentiment Analysis, Text Analytics
Overcoming Overfitting with the reusable holdout: Preserving validity in adaptive data analysis
Misapplication of statistical data analysis is a common cause of spurious discoveries in scientific research. We demonstrate a new approach for addressing the challenges of adaptivity based on insights from privacy-preserving data analysis.
on Aug 12, 2015 in Holdout, Model Performance, Moritz Hardt, Overfitting, P-value, Vitaly Feldman
Predictive Analytics as an Engine Of R&D and New Product Launches
Predictive analytics is not only the way to discover the underlying patterns, but it can also help you with innovation. Here, we discuss the ways to innovate by combining it with business logic, marketing and bridging demand supply factors.
on Aug 12, 2015 in Innovation, Lana Klein, Predictive Analytics
3D Data Sculptures: a New Way to Visualize Data
3D printing can go beyond printing products like iPod cases, or butterfly earrings, and can offer a sustainable way to understand strategic DATA by printing decision support landscapes.
on Aug 11, 2015 in 3D, China, Data Visualization, Sculpture
R Programming: Who, Where and What
The “sexiest job” has the sexiest demand, and R is one of their leading weapons. Here, we are trying to capture how these unicorns are distributed, and also where you can move if you want to have great opportunities.
on Aug 11, 2015 in India, Programming, R, Salary, USA
Three Essential Components of a Successful Data Science Team
A Data Science team, carefully constructed with the right set of dedicated professionals, can prove to be an asset to any organization,
on Aug 10, 2015 in Business Analyst, Data Engineer, Data Science Team, Machine Learning, Team
World Economic Forum Tech Pioneers & Analytics Winners
World Economic Forum selected its 2015 Tech Pioneers, which included quite a few companies on the cutting edge of Analytics, Big Data, and Machine Learning.
on Aug 8, 2015 in Advanced Analytics, Ayasdi, Dataminr, Domo, World Economic Forum
How Long Should You Stay at Your Analytics Job?
Considering the huge demand for the data scientists many are pondering to switch for a better profile and salary. But, there some things to be pondered about like what should be the interval between two switches, acquiring new skills and your loyalty.
on Aug 7, 2015 in Analytics, Burtch Works, Data Scientist, Hiring
Big Data Analytics Pain Points
Big data analytics is still in infancy, and we haven't yet embraced a data-driven decision making. Here, we discussed the current pain points in it and how you can deal them in better ways.
on Aug 6, 2015 in Big Data Analytics, Challenges, Kaushik Pal, Marketing Analytics
Interview: Stefan Groschupf, Datameer on Why Domain Expertise is More Important than Algorithms
We discuss large-scale data architectures in 2020, career path, open source involvement, advice, and more.
on Aug 6, 2015 in Advice, Algorithms, Architecture, Career, Datameer, Domain Knowledge, Interview, Open Source, Stefan Groschupf
Patterns for Streaming Realtime Analytics
Design patterns are well-known for solving the recurrent problems in software engineering, on similar lines we can have Streaming Realtime Analytics patterns and avoid reinventing the wheel. Here, you can see the major patterns we found out for it.
on Aug 5, 2015 in Frequent Pattern Mining, Realtime Analytics, Streaming Analytics
The Big ‘Big Data’ Question: Hadoop or Spark?
With a considerable number of similarities, Hadoop and Spark are often wrongly considered as the same. Bernard carefully explains the differences between the two and how to choose the right one (or both) for your business needs.
on Aug 5, 2015 in Apache Spark, Bernard Marr, Data Science Tools, Distributed Systems, Hadoop, Machine Learning, Performance, RDD
Interview: Stefan Groschupf, Datameer on Why SQL on Hadoop is a Bad Idea
We discuss the startups landscape in Big Data, valuation of Big Data companies, recognition earned by Datameer, and why SQL on Hadoop is a bad idea.
on Aug 5, 2015 in Datameer, Interview, SQL. Hadoop, Startups, Stefan Groschupf
Interview: Stefan Groschupf, Datameer on Balancing Accuracy and Simplicity in Analytics
We discuss common pain points in Big Data projects, evolution of Datameer technology, department specific solution – Datameer Professional, Datameer 5.0 Smart Execution, tacking over-simplicity and more.
on Aug 4, 2015 in Apache Spark, Data Warehousing, Datameer, Flink, Hadoop, Insights, Interview, MapReduce, Stefan Groschupf
New Standard Methodology for Analytical Models
Traditional methods for the analytical modelling like CRISP-DM have several shortcomings. Here we describe these friction points in CRISP-DM and introduce a new approach of Standard Methodology for Analytics Models which overcomes them.
on Aug 3, 2015 in CRISP-DM, Data Mining, Modeling, Olav Laudy, ROI
Data is Ugly – Tales of Data Cleaning
Whether you want to do business analytics or build the deep learning models, getting correct data and cleansing it appropriately remains the major task. Find out experts opinions on how you can make efficient data cleansing and collection efforts.
on Aug 1, 2015 in Big Data, Data Cleaning, Data Preparation, Data-Driven Business