
Everything a Data Scientist Should Know About Data Management - Oct 22, 2019.
For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.
Data Management, Data Scientist, Hadoop

The Death of Big Data and the Emergence of the Multi-Cloud Era - Jul 11, 2019.
The Era of Big Data is coming to an end as the focus shifts from how we collect data to processing that data in real-time. Big Data is now a business asset supporting the next eras of multi-cloud support, machine learning, and real-time analytics.
Big Data, Cloudera, Hadoop, Multi-cloud, Realtime Analytics
Apache Spark Introduction for Beginners - Oct 18, 2018.
An extensive introduction to Apache Spark, including a look at the evolution of the product, use cases, architecture, ecosystem components, core concepts and more.
Apache Spark, Beginners, Hadoop, R
- Big Data Day Camp: Big Data Tools & Techniques (October 25-26) - Oct 4, 2018.
Learn how to use data to make wise, actionable, data-driven decisions! Our first 2-day camp, Big Data Tools & Techniques, is October 25-26 at Qualcomm Institute, UCSD.
Apache Spark, Big Data, Deep Learning, Hadoop, Kafka
- KDnuggets™ News 18:n35, Sep 19: How Many Data Scientists Out There? Hadoop for Beginners; Data Science of Adele - Sep 19, 2018.
Also Top /r/MachineLearning posts, August 2018: Everybody Dance Now; 10 Big Data Trends You Should Know; You Aren't So Smart: Cognitive Biases are Making Sure of It.
Hadoop, Machine Learning, Reddit
Hadoop for Beginners - Sep 12, 2018.
An introduction to Hadoop, a framework that enables you to store and process large data sets in parallel and distributed fashion.
Beginners, Big Data, Hadoop
Python eats away at R: Top Software for Analytics, Data Science, Machine Learning in 2018: Trends and Analysis - May 22, 2018.
Python continues to eat away at R, RapidMiner gains, SQL is steady, TensorFlow advances pulling along Keras, Hadoop drops, Data Science platforms consolidate, and more.
Anaconda, Data Mining Software, Data Science Platform, Hadoop, Keras, Poll, Python, R, RapidMiner, SQL, TensorFlow, Trends
- Ranking Popular Distributed Computing Packages for Data Science - Mar 20, 2018.
We examined 140 frameworks and distributed programming packages and came up with a list of the top 20 distributed computing packages useful for Data Science, based on a combination of GitHub, Stack Overflow, and Google results.
Apache Spark, Data Science, Distributed Systems, GitHub, Hadoop
- TDWI Chicago, May 6-11: Get Your Hands Dirty With Data – KDnuggets Offer - Mar 2, 2018.
Attend the Hands-on Lab series and bring practical skills back from Chicago. Save 30% through March 16 with priority code KD30.
Chicago, Hadoop, IL, Machine Learning, Python, R, TDWI, Training
- Best Data Science, Machine Learning Courses from Udemy, only $10 until Dec 21 - Dec 14, 2017.
Holiday Dev & IT sale on best courses from Udemy, including Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until Dec 21, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
Did Spark Really Kill Hadoop? - Nov 22, 2017.
A comprehensive survey conducted by iDatalabs shows us the trends of the future of these two Data Science technologies.
Apache Spark, Big Data, Hadoop, iDatalabs
- How (& Why) Data Scientists and Data Engineers Should Share a Platform - Nov 17, 2017.
Sharing one platform has some obvious benefits for Data Science and Data Engineering teams, but technical, language, and process differences often make this difficult. Learn how one company implemented a single cloud platform for R, Python, and other workloads – and some of the unexpected benefits they discovered along the way.
Apache Spark, Cazena, Data Science Platform, Hadoop, Python, R
- Best Data Science, Machine Learning Courses from Udemy, only $10 until Nov 28- Black Friday/Cybermonday sale - Nov 17, 2017.
Black Friday/Cybermonday sale on best courses from Udemy, including Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until Nov 28, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Best Data Science, Machine Learning Courses from Udemy (only $12 until Oct 31) - Oct 27, 2017.
Fall sale on best courses from Udemy, including Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $12 until Oct 31, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Updates & Upserts in Hadoop Ecosystem with Apache Kudu - Oct 27, 2017.
A new open source Apache Hadoop ecosystem project, Apache Kudu completes Hadoop's storage layer to enable fast analytics on fast data.
Apache, Big Data, Data Management, Hadoop, Java, NoSQL
- Best Data Science, Machine Learning Courses from Udemy (only $12 until Sep 20) - Sep 14, 2017.
Back-to-school sale on best courses from Udemy, including Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $12 until Sep 20, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
Are Data Lakes Fake News? - Sep 6, 2017.
The quick answer is yes, and the biggest problem is that the term “Data Lakes” has been overloaded by vendors and analysts with different meanings, resulting in an ill-defined and blurry concept.
Data Lakes, Data Warehouse, ETL, Fake News, Hadoop
- Best Data Science, Machine Learning Courses from Udemy (only $10 or $12 till Aug 10) - Aug 6, 2017.
Back-to-school sale on best courses from Udemy, including Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 or $12 until Aug 10, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Best Data Science Courses from Udemy (only $10 till June 21) - Jun 19, 2017.
Here are some of the best courses in data science from Udemy, covering Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until June 21, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Hadoop as a Data Warehouse: Cracking the Code with Kudu - Jun 15, 2017.
Here we discuss the problems behind replacing an existing Data Warehouse with Hadoop and the available solutions to make this happen. Let's see how.
Cloudera, Data Warehouse, Hadoop, Kudu
- Simplifying Data Pipelines in Hadoop: Overcoming the Growing Pains - May 18, 2017.
Moving to Hadoop is not without its challenges—there are so many options, from tools to approaches, that can have a significant impact on the future success of a business’ strategy. Data management and data pipelining can be particularly difficult.
Data Management, Data Platform, Hadoop, SVDS
- Best Data Science Courses from Udemy (only $10 till May 27) - May 17, 2017.
Here is a list of the best courses in data science from Udemy, covering Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until May 27, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Top Recent Big Data videos on YouTube - May 17, 2017.
Top viewed videos on Big Data since 2015 include Big Data use cases in psychographics, sports, politics and data monetisation.
Big Data, Hadoop, TED, Tutorials, Youtube
HDFS vs. HBase : All you need to know - May 15, 2017.
Hadoop Distributed File System (HDFS) and HBase (Hadoop database) are key components of the Big Data ecosystem. This blog explains the difference between HDFS and HBase with real-life use cases where each fits best.
Big Data, Hadoop, HBase, HDFS
- Hadoop is Not Failing, it is the Future of Data - Apr 27, 2017.
The author disagrees with a previous KDnuggets post on “Why Hadoop is Failing” and argues that the Darwinian Open Source Ecosystem ensures Hadoop is a robust and mature technology platform.
Big Data, Hadoop, Hortonworks, Trends
- Best Data Science Courses from Udemy (only $10 till Apr 29) - Apr 24, 2017.
Here is a list of the best courses in data science from Udemy, covering Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until April 29, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Strata London: Learn about all things data. KDnuggets Offer ends Apr 7 - Apr 3, 2017.
Strata Data Conference returns to London 22-25 May. Early Price ends Friday, 7 April, and you can save up to £479 on your pass with code PCKDNG.
Data Science, Hadoop, London, Strata, UK
- Key Takeaways from Strata + Hadoop World 2017 San Jose, Day 2 - Mar 29, 2017.
The focus is increasingly shifting from storing and processing Big Data in an efficient way, to applying traditional and new machine learning techniques to drive higher value from the data at hand.
AI, CA, DNA, Fake News, Google, Hadoop, Robots, San Jose, Strata
Key Takeaways from Strata + Hadoop World 2017 San Jose, Day 1 - Mar 24, 2017.
The focus is increasingly shifting from storing and processing Big Data in an efficient way, to applying traditional and new machine learning techniques to drive higher value from the data at hand.
CA, Cloudera, Coursera, Hadoop, MapR, Pinterest, San Jose, Strata
- What Top Firms Ask: 100+ Data Science Interview Questions - Mar 22, 2017.
Check this out: a topic-wise collection of 100+ data science interview questions from top companies.
Algorithms, Data Science, Google, Hadoop, Interview Questions, Machine Learning, Microsoft, Statistics, Uber
- Best Data Science Courses from Udemy (only $19 till Mar 31) - Mar 10, 2017.
Here is a list of the best courses in data science from Udemy, covering Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $19 until March 31, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- KDnuggets™ News 17:n09, Mar 8: 7 More Steps to Mastering Machine Learning w. Python; Every Intro to Data Science Course, Ranked - Mar 8, 2017.
Also The Data Science Project Playbook; Hadoop Is Falling - Why? Bokeh Cheat Sheet: Data Visualization in Python
Bokeh, Data Science Education, Data Visualization, Hadoop, Machine Learning, Python
- KDnuggets Free Pass to Strata + Hadoop World London, May 22-25, 2017 - Mar 7, 2017.
Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Win KDnuggets free pass to Strata + Hadoop World London.
Hadoop, London, Strata, UK
- Predictions for Data Science in 2017 - Mar 2, 2017.
Our predictions include: 2017 will be the year of Deep Learning (DL) technology, Artificial General Intelligence is still far away, Software and Hardware Progress will accelerate, and AI will have unexpected socio-political implications.
2017 Predictions, AI, Data Science, Hadoop, Healthcare, Hiring
Hadoop Is Falling – Why? - Mar 1, 2017.
Three years ago, looking beyond Hadoop was insanity, and there was little else that could come close. Recently, adoption of Hadoop has slowed down considerably. We examine why.
Big Data, Big Data Hype, Hadoop
- The 6 Best Data Science Courses from Udemy (only $10 till Feb 28) - Feb 25, 2017.
Here is a list of the best courses in data science from Udemy, covering Data Science, Machine Learning, Python, Spark, Tableau, and Hadoop - only $10 until Feb 28, 2017.
Apache Spark, Hadoop, Machine Learning, Online Education, Python, Tableau, Udemy
- Strata + Hadoop World, May 22-25, London, UK – KDnuggets Offer - Feb 6, 2017.
Strata + Hadoop World is a rich learning experience at the intersection of data science and business. Get best price by Feb 24 and save extra with code PCKDNG.
Hadoop, London, Strata, UK
- Get Early Price for the priceless: Strata + Hadoop World - Jan 19, 2017.
Whatever you want to learn about data, you’ll find it at Strata + Hadoop World. Take a look at the program and see for yourself, and register by midnight January 20 with code PCKDNG and save up to $670 on your pass.
Big Data, CA, Data Science, Hadoop, O'Reilly, San Jose, Strata
50+ Data Science, Machine Learning Cheat Sheets, updated - Dec 14, 2016.
Gear up to speed and have concepts and commands handy in Data Science, Data Mining, and Machine learning algorithms with these cheat sheets covering R, Python, Django, MySQL, SQL, Hadoop, Apache Spark, Matlab, and Java.
Cheat Sheet, Data Science, Django, Hadoop, Java, Machine Learning, MATLAB, Python, R
- Navigating the World of Big Data Analytics - Dec 8, 2016.
Fulcrum Agile Analytics Lab helps our partners test new technologies, new methodologies, and new data sets quickly in an environment that can scale up and down and that meets all of their security and compliance requirements. Read to learn more and schedule a consultation.
Agile, Analytics, Apache Spark, Big Data, Consulting, Hadoop
- Analytica: Informatica PowerCenter Systems Administrator - Dec 5, 2016.
Seeking an Informatica PowerCenter Systems Administrator with proven expertise and experience in managing enterprise scale Informatica architectures and environments.
Analytica, DC, Hadoop, Informatica, Washington
- How Hadoop, Spark, and Data Science are evolving – Nov 10 Webinar - Nov 8, 2016.
Find out how Hadoop and Spark are evolving for Data Science in this Nov 10 webinar and live Q&A with guest speaker, Forrester VP and Principal Analyst Mike Gualtieri.
Apache Spark, Cazena, Data Lakes, Hadoop, Mike Gualtieri
- How to Choose a Data Format - Nov 3, 2016.
In any data analytics project, after the business understanding phase, understanding the data and selecting the right data format and ETL tools are very important tasks. This article explains a useful, practical set of guidelines covering the data format selection and ETL phases of the project lifecycle.
Data Cleaning, Data Engineering, Data Preparation, ETL, Hadoop, HDFS
- Strata Hadoop 2016: Fast Data and Robots - Oct 14, 2016.
Did you miss the Strata Hadoop World conference this year? No worries! Want to know “how exciting it was”? Let's hear it from an expert in her own words.
Carla Gentry, Hadoop, New York City, Robots, Strata, Workforce Analytics
- Apache: Big Data Europe (Nov. 14-16) – Leading Event for Big Data Technologists - Oct 13, 2016.
Apache: Big Data Europe (Nov 14-16, Seville, Spain) will gather together the Apache projects, people and technologies working in Big Data, ubiquitous computing and data engineering and science to educate, collaborate and connect. Register by Nov 3 to save over $250!
Apache, Apache Spark, Big Data, Europe, Hadoop, Spain
- Top KDnuggets tweets, Oct 05-11: Most Active #DataScientists on #Github; Why Not So Hadoop? - Oct 12, 2016.
Most Active #DataScientists, Free Books, Notebooks & Tutorials on #Github; Why Not So Hadoop?; Free #MachineLearning text PDF, from theory to algorithms; Top @reddit #MachineLearning Posts September.
GitHub, Hadoop, Machine Learning, Reddit, Top tweets
- Why Not So Hadoop? - Sep 27, 2016.
Are Big Data and Hadoop synonymous? Not really, but they are often conflated. Has Hadoop lived up to its hype? In this article, we will look at a brief history of Hadoop and see where it stands today.
Big Data, Big Data Hype, Cloudera, Hadoop, Hortonworks
- Big Data Masters Course to Transform Your Career - Sep 27, 2016.
The Simplilearn Online Masters Program ensures that you transform into a Hadoop Architect by acquiring core skill sets, including Hadoop Development, Real time processing using Spark, and NoSQL database technology. Learn more.
Big Data Architect, Hadoop, Online Education, Simplilearn
- O’Reilly Live Training–Real-time. Real experts. Real learning. - Sep 26, 2016.
Get intensive, hands-on training from O'Reilly's expert network on critical data topics - from SQL fundamentals to distributed computing; enterprise strategy to data science at scale.
Apache Spark, Courses, Distributed Systems, Hadoop, O'Reilly, scikit-learn, SQL
- Spark for Scale: Machine Learning for Big Data - Sep 23, 2016.
This post discusses the fundamental concepts for working with big data using distributed computing, and introduces the tools you need to build machine learning models.
Apache Spark, Big Data, Hadoop, HDFS, Machine Learning, MapReduce
- 5 EBooks to Read Before Getting into A Data Science or Big Data Career - Aug 11, 2016.
A short, carefully-curated list of 5 free ebooks to help you better understand what Data Science is all about and how you can best prepare for a career in data science, big data, and data analysis.
Big Data, Free ebook, Hadoop, Programming Languages, Simplilearn, Tableau
- Big Data Key Terms, Explained - Aug 11, 2016.
Just getting started with Big Data, or looking to iron out the wrinkles in your current understanding? Check out these 20 Big Data-related terms and their concise definitions.
3Vs of Big Data, Apache Spark, Big Data, Business Intelligence, Cloud Computing, Data Warehouse, Explained, Hadoop, Key Terms, Predictive Analytics
- Making Data Science Accessible – HDFS - Aug 4, 2016.
This post explains some basic Big Data concepts and offers some insight into when HDFS can be useful, employing basic analogies to do so.
Data Science, Hadoop, HDFS, MapReduce
- KDnuggets Free Bronze Pass to Strata + Hadoop World New York City, Sep 28-29, 2016 - Jul 30, 2016.
Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Win KDnuggets free pass to Strata + Hadoop World New York City.
Big Data, Business, Hadoop, New York City, NY, Strata
- 5 Big Data Projects You Can No Longer Overlook - Jul 21, 2016.
Check out 5 Big Data projects that you are not likely to have seen before, but which may be useful to you, and perhaps even scratch an itch you didn't know you had.
Big Data, Cloud Computing, Google, Hadoop, Javascript, Overlook, Presto, Spotify, SQL
- Take a Risk Free Hadoop Ride. Save up to 80% cost and offload time. - Jul 14, 2016.
The Impetus Data Warehouse Workload Migration product is a proven, cost-effective, and low-risk solution to offload a traditional data warehouse to a Big Data warehouse. Contact us for a proof-of-concept.
Big Data, Data Warehouse, Hadoop, Impetus
- Strata + Hadoop World, New York City, Sep 26-29 – KDnuggets discount - Jun 22, 2016.
Strata + Hadoop World is where cutting-edge science and new business fundamentals intersect and merge. It's a deep dive into emerging techniques and technologies. Get 20% off with code PCKDNG.
Big Data, Hadoop, New York City, NY, Strata
- Hadoop Key Terms, Explained - May 30, 2016.
A straightforward overview of 16 core Hadoop ecosystem concepts. No Big Picture discussion, just the facts.
Apache Spark, Explained, Hadoop, HBase, HDFS, Key Terms, MapReduce, YARN
- UnitedHealth Group: Hadoop Big Data Developer - May 25, 2016.
Seeking a Hadoop Big Data Developer for a large scale Big Data Program which involves ingesting hundreds of files into a Hadoop ecosystem, enriching Hadoop data, and distributing significant amounts of that data to SQL and to dozens of other applications.
Big Data, Boston, Hadoop, MA, UnitedHealth Group
- Webinar: High Performance Hadoop With Python, May 5th - Apr 28, 2016.
On May 5th, Dr. Kristopher Overholt and Dr. Matthew Rocklin of Continuum Analytics will present a webinar on High Performance Hadoop with Python. Reserve your spot today!
Continuum Analytics, Hadoop, Python, Webinar
- Top Data Science Courses on Udemy - Apr 27, 2016.
An overview of the very best that Udemy has to offer in data science education. Includes courses covering machine learning, Python, Hadoop, visualization, and more.
Apache Spark, Brendan Martin, Data Science, Hadoop, Machine Learning, Python, Udemy
- Workshop opportunities – Hadoop, R, Predictive Modeling - Apr 11, 2016.
Spots are limited for the upcoming training workshops at Predictive Analytics World for Business, June 20-23, 2016 in Chicago. Check the new Hadoop workshop and instruction about advanced methods, modeling methods, R, and more – reserve your spot today.
Chicago, Hadoop, IL, PAW, Predictive Analytics World, Predictive Modeling, R
- 100 Active Blogs on Analytics, Big Data, Data Mining, Data Science, Machine Learning - Mar 29, 2016.
Stay on top of your data science skills game! Here’s a list of about 100 of the most active and interesting blogs on Big Data, Data Science, Data Mining, Machine Learning, and Artificial Intelligence.
Big Data, Blogs, Data Science, Deep Learning, Hadoop, Machine Learning
- Strata + Hadoop World San Jose, Keynote Live Streaming, Mar 30-31 - Mar 28, 2016.
Watch Strata + Hadoop San Jose 2016 Conference Keynotes live on March 30 and March 31. Topics include Hadoop at 10, Predictive Analytics for on-demand economy, Real-Time, Summoning the demon of AI, Cybersecurity, The theorem that wouldn't die, and Nonsense science by comedian Paula Poundstone.
Doug Cutting, Hadoop, Keynote Speech, San Jose, Strata
- Simplilearn disrupts Big Data Industry with Masters and Flexi Pass Programs - Mar 8, 2016.
Simplilearn, the largest online certification training company, offers 3 separate Big Data Masters Programs, courses on Hadoop and Spark, its unique CloudLab, and certification.
Big Data, Certification, Hadoop, Master of Science, Online Education, Simplilearn
- Webinar: Driving Data Democracy: Hadoop and Redshift, Mar 16 - Mar 4, 2016.
The Hadoop ecosystem has improved markedly over the past few years. MPP databases allow analytics teams to easily query massive structured data sets. Learn how these pipelines work on March 16.
Amazon Redshift, Hadoop, Looker, MPP Database, SQL
- Apache Big Data, Vancouver, May 9-12, KDnuggets Discount, Early bird ends Mar 6 - Mar 4, 2016.
Apache Big Data brings together the full suite of Big Data open source projects - check the amazing lineup of keynotes and breakout sessions and save with code APBD16KDN20.
Apache, Apache Spark, Big Data, Canada, Doug Cutting, Hadoop, Matei Zaharia, Vancouver
- Top Big Data Processing Frameworks - Mar 3, 2016.
A discussion of 5 Big Data processing frameworks: Hadoop, Spark, Flink, Storm, and Samza. An overview of each is given and comparative insights are provided, along with links to external resources on particular related topics.
Apache Samza, Apache Spark, Apache Storm, Flink, Hadoop
- Why Spark Reached the Tipping Point in 2015 - Feb 26, 2016.
A quantitative look at Spark's breakthrough year in 2015, from 3 different points of view. Will 2016 be an even bigger year for the open source project?
Adoption, Apache Spark, Big Data, Hadoop, Matthew Mayo
- Data Lake Plumbers: Operationalizing the Data Lake - Feb 18, 2016.
Gain insight into data lakes, their benefits, when they are appropriate, and how to operationalize them. How do they compare to the data warehouse?
Data Lake, Data Warehouse, ETL, Hadoop
- KDnuggets Free Pass to Strata + Hadoop World San Jose 2016 - Feb 16, 2016.
Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Win KDnuggets free pass to Strata + Hadoop World San Jose.
Big Data, CA, Hadoop, San Jose, Strata
- Strata Hadoop World London 2016 - Feb 3, 2016.
Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Make plans to join Strata + Hadoop World in London 31 May-3 June 2016. Save 20% with code PCKDNG.
Doug Cutting, Hadoop, London, Strata, UK
- SanDisk: Senior Big Data Engineer/Hadoop Developer - Feb 3, 2016.
Planning and designing next-generation Big Data System architectures, managing the development and deployment of Hadoop applications.
Big Data Engineer, CA, Hadoop, Milpitas, SanDisk
- Simplilearn Special: 30% off on Big Data and Analytics courses - Feb 2, 2016.
Get access to Simplilearn R, Big Data, Hadoop, and other Data Science-related courses at unbeatable prices with code GetAhead. This offer is good until 7 Feb, 2016.
Certification, Hadoop, R, Simplilearn
- Spark and the Remorseless Recrystallization of the Open Source Analytics Ecosystem - Jan 23, 2016.
Apache Spark added robust machine learning, graph, streaming, and in-memory capability to the Hadoop-centric ecosystem. In 2016, we expect adoption in diverse big data, advanced analytics, data science, Internet of Things, and other application domains.
Apache Spark, Hadoop, James Kobielus
- Hadoop and Big Data: The Top 6 Questions Answered - Jan 22, 2016.
6 questions surrounding Hadoop and Big Data are posed and answered, including those related to implementation, management, and practical uses. Find out where Hadoop currently sits in the world of Big Data.
Apache Spark, Big Data, Data Warehouse, Hadoop, Implementation
- SanDisk: Senior Staff Hadoop Developer - Jan 20, 2016.
Planning and designing next-generation Big Data System architectures, managing the development and deployment of Hadoop applications.
CA, Developer, Hadoop, Milpitas, SanDisk
- Kickstart Your Data Initiatives in 2016 – KDnuggets discount - Jan 18, 2016.
The Apache Hadoop, Predictive Analytics and Data Science Innovation Summits will be in San Diego, Feb 18-19. Get 20% off all two-day passes with code KD20.
CA, Data Science, Hadoop, IE Group, Innovation, Predictive Analytics, San Diego, Summit
- Strata + Hadoop World San Jose, Mar 28-31, Best Price ends Jan 15 - Jan 5, 2016.
Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Get KDnuggets discount to Strata + Hadoop World San Jose.
CA, Hadoop, San Jose, Strata
- Course: Big Data Processing with Hadoop & Spark, starts Jan 19, NYC - Jan 5, 2016.
Learn Hadoop and Spark, two key Big Data technologies, with an evening course in New York City, starting Jan 19. Special KDnuggets discount with code 09P8W01CUP7B.
Apache Spark, Hadoop, Metis, New York City, NY
- 2016: The Year of Hadooplooza - Dec 31, 2015.
Bruno Aziza examines the Hadoopalooza effect, how to avoid the poor decisions that bring you back from the party a "Hadoop-loser", and what is needed to get value from data lakes.
2016 Predictions, Bruno Aziza, Data Lakes, Hadoop
- 8 Myths about Virtualizing Hadoop on vSphere Explained - Dec 22, 2015.
This article takes some common misperceptions about virtualizing Hadoop and explains why they are errors in people’s understanding.
Containers, Hadoop, Myths, Virtualization, VMware, vSphere
- Strata + Hadoop World 2015 Singapore – Day 2 Highlights - Dec 15, 2015.
Here are the quick takeaways and valuable insights from selected talks at one of the most reputed conferences in Big Data – Strata + Hadoop World 2015, Singapore, day 2.
Apache Spark, Deep Learning, Deepak Agarwal, Devendra Desale, Doug Cutting, Hadoop, Security, Strata
- Strata + Hadoop World 2015 Singapore – Day 1 Highlights - Dec 11, 2015.
Here are the quick takeaways and valuable insights from selected talks at one of the most reputed conferences in Big Data – Strata + Hadoop World 2015, Singapore.
Apache Spark, Devendra Desale, Hadoop, LinkedIn, Singapore, Strata, Trends
- Predictive Analytics + Hadoop = Business Impact, Dec 15 Webinar - Dec 4, 2015.
Learn how to use Predictive Analytics and Hadoop to Turn the Promise of Big Data into Business Impact in this webinar with RapidMiner Founder and CTO Ingo Mierswa and leading Gartner Analyst Merv Adrian.
Business, Gartner, Hadoop, Ingo Mierswa, Merv Adrian, Predictive Analytics, RapidMiner
- Taming the Elephant: Advice to Director, Big Data Architect - Nov 30, 2015.
Every other day, a new big data software product is released in the market. Which one is the right one to build your product on? Understand how to resolve this conundrum and the role of decision makers.
Big Data, Director, Hadoop
- Career path explained: Big Data Hadoop DEVELOPER to ARCHITECT - Nov 24, 2015.
The path to becoming a Big Data and Hadoop Architect is fraught with major challenges and responsibilities, but here is a handy infographic to help you chart out your path.
Big Data, Big Data Architect, Developer, Hadoop, Simplilearn
- Arabesque Distributed Graph Mining Platform - Nov 23, 2015.
Arabesque provides an elegant solution to the difficult problem of graph mining that lets a user easily express graph algorithms and efficiently distribute the computation.
Big Data, Graph Analytics, Graph Mining, Hadoop
- KDnuggets™ News 15:n38, Nov 18: TensorFlow Disappoints; Spark with Python; Deep Learning; Top 20 Books - Nov 18, 2015.
TensorFlow Disappoints - Google Deep Learning falls shallow; Introduction to Spark with Python; A Statistical View of Deep Learning; Amazon Top 20 Books in Databases & Big Data.
Apache Spark, Books, Convolutional Neural Networks, Coursera, Deep Learning, Hadoop, Jobs, TensorFlow
- How to discover stolen data using Hadoop and Big data? - Nov 11, 2015.
We discuss recent data breaches and present an approach that uses Hadoop and data fingerprint matching techniques to discover stolen data.
Big Data, Fraud Detection, Hadoop
- Morpace: Hadoop Java Design Specialist - Oct 23, 2015.
Morpace is a leading market research and consulting organization that provides quality research and leading-edge technology to its clients. Develop and implement a new full-scale system using the Hadoop ecosystem.
Big Data Engineer, Farmington Hills, Hadoop, Java, MI, Morpace
- Upcoming Webcasts on Analytics, Big Data, Data Science – Oct 20 and beyond - Oct 19, 2015.
Easier Data Prep and Analysis for Data Scientists, Measure and Enhance Analytics Maturity, Amazon QuickSight, Textual Healing, and more.
Amazon QuickSight, Analytic Maturity, Data Preparation, Hadoop, RapidInsight
- Strata + Hadoop World, Singapore, Dec 1-3: 2 for 1 Passes - Oct 13, 2015.
Register with code 2FOR1P and get a free pass of equal value to Strata + Hadoop World in Singapore. Expires Oct 31.
Hadoop, Singapore, Strata
- 90+ Active Blogs on Analytics, Big Data, Data Mining, Data Science, Machine Learning - Oct 8, 2015.
Stay on top of your data science skills game! Here's a list of 90+ active blogs on Big Data, Data Science, Data Mining, Machine Learning, and Artificial intelligence.
Big Data, Blogs, Data Science, Deep Learning, Hadoop, Machine Learning
- Strata NYC Live Streaming info and Expo Hall Plus passes - Sep 28, 2015.
Watch Strata NYC keynotes streaming live on Sep 30 and Oct 1 here; also last-minute info on Expo passes.
Hadoop, New York City, NY, Strata
- Hadoop Maturity Survey: The Tipping Point - Sep 24, 2015.
AtScale's first global Hadoop maturity survey finds that Hadoop's value greatly increases with the number of nodes deployed; its use for ETL is frequently a transition stage to higher-value Data Science applications.
AtScale, Cloudera, Hadoop, Survey, Tableau
- Lavastorm Webinar: Using Data from Hadoop to Improve Your Business, Sep 24 - Sep 16, 2015.
Learn how to easily extract data from Hadoop, create a more comprehensive view of your business, and quickly turn business data into business insights.
Business Value, Hadoop, Lavastorm
- Upcoming Webcasts on Analytics, Big Data, Data Science – Sep 15 and beyond - Sep 14, 2015.
Hadoop Guarantee, Best Practices of Data Science, Forecasting With Predictive Analytics, Data Mining - Failure to Launch, Using Data from Hadoop to Improve Your Business, and more.
Hadoop, Lavastorm, Salford Systems, TMA, Workforce Analytics
- Upcoming Webcasts on Analytics, Big Data, Data Science – Sep 8 and beyond - Sep 7, 2015.
The Future of Data Science, Ensuring Business Value from Analytics, Apache Ignite, Text Analytics, Best Practices of Data Science, Forecasting With Predictive Analytics, and more.
Business Value, Forecasting, Hadoop, IIA, In-Memory Computing, SQL, Text Analytics
- Upcoming Webcasts on Analytics, Big Data, Data Science – Sep 1 and beyond - Aug 31, 2015.
Big Data Certification - which one, Data at the Speed of Business, The Future of Data Science, Ensuring Business Value from Analytics, Data Mining: Failure to Launch, and more.
Forrester, Hadoop, IIA, Looker, Skytree, Trifacta
- O’Reilly Learning Paths – Data Science Training – reduced rate until Sep 2 - Aug 30, 2015.
O'Reilly Learning Paths will help you learn Hadoop, Data Visualization, Data Science with R, Python for Data - until Sep 2 buy them for only $99.
Data Science Education, Data Visualization, Hadoop, Learning Path, O'Reilly, Online Education, Python, R
- What is the success rate in Hadoop adoption? - Aug 28, 2015.
Hadoop is no longer an unknown term in big data analytics; the question now is whether it returns value. Here, we explore the popular opinions of Hadoop adopters and discuss current challenges for adoption.
Big Data Hype, Hadoop, Kaushik Pal, Success
- KDnuggets Free Pass to Strata + Hadoop World, New York City, Sep 29 – Oct 1, 2015 - Aug 21, 2015.
Enter to win a KDnuggets free pass to Strata + Hadoop World NYC - let us know what buzzword will replace "Big Data". Submit your entry by Aug 31, 2015.
Free Pass, Hadoop, New York City, NY, Strata
- RapidMiner Webinar: Taming Hadoop – Extracting Value, Aug 20 - Aug 13, 2015.
Join Dr. Ingo Mierswa, RapidMiner CTO and leading EMA Analysts for a discussion on how to close the loop between predictive insights and action using big data analytics.
Hadoop, Ingo Mierswa, RapidMiner
- KDnuggets™ News 15:n26, Aug 12: Big Data, Data Science top influencers; Hadoop or Spark? Or Flink? - Aug 12, 2015.
Top influencers in Big Data, Data Science; The Big "Big Data" Question: Hadoop or Spark? 3 Key Components of a Successful Data Science Team; Stefan Groschupf, CEO Datameer, on why SQL on Hadoop is a bad idea.
Apache Spark, Big Data Influencers, Hadoop
- Upcoming Webcasts on Analytics, Big Data, Data Science – Aug 11 and beyond - Aug 10, 2015.
How Scale-Out and In-Memory Solve ETL, Data Mining: Failure to Launch, Harnessing the Hadoop Ecosystem, Leveraging Data for Effective Data Visualization, and more.
Gartner, Hadoop, In-Memory Computing, Lavastorm, TMA
- Apache Flink and the case for stream processing - Aug 7, 2015.
Realtime analytics has proven challenging in the past, but with new tools it is possible to set up your pipelines in a relatively short time. Apache Flink is one such framework; find out how you can use it to meet your needs.
API, Flink, Hadoop, Realtime Analytics, Streaming Analytics
- The Big ‘Big Data’ Question: Hadoop or Spark? - Aug 5, 2015.
Because they share a considerable number of similarities, Hadoop and Spark are often wrongly considered the same. Bernard carefully explains the differences between the two and how to choose the right one (or both) for your business needs.
Apache Spark, Bernard Marr, Data Science Tools, Distributed Systems, Hadoop, Machine Learning, Performance, RDD
- Interview: Stefan Groschupf, Datameer on Balancing Accuracy and Simplicity in Analytics - Aug 4, 2015.
We discuss common pain points in Big Data projects, the evolution of Datameer technology, the department-specific solution Datameer Professional, Datameer 5.0 Smart Execution, tackling over-simplicity, and more.
Apache Spark, Data Warehousing, Datameer, Flink, Hadoop, Insights, Interview, MapReduce, Stefan Groschupf
- NYC Data Science Academy courses & bootcamps in Data Engineering, Data Science, R, Python, and Machine Learning - Jul 31, 2015.
Upcoming training from NYC Data Science Academy: 6-Week Intensive Data Engineering Bootcamp, 12-Week Data Science Bootcamp, courses in R, Python, Data Science and Machine Learning, and more.
Apache Spark, Bootcamp, Data Science Education, Hadoop, Machine Learning, New York City, NY, NYC Data Science Academy, Python, R, scikit-learn
- Big Data TechCon, Nov 2-4, Chicago, the HOW-TO Big Data Event - Jul 29, 2015.
Plan now to attend Big Data TechCon in Chicago, to learn HOW-TO manage and analyze Big Data from your Web logs, social media interactions, transactions, sensors, etc. Use BIGDATA for special discount.
Apache Spark, Big Data, Chicago, Data Science Education, Hadoop, IL, Techcon
- Impact of IoT on Big Data Landscape - Jul 29, 2015.
The Internet of Things (IoT) is the next technological revolution, expected to generate over $300 billion by 2020, according to Gartner. The IoT will also generate unprecedented amounts of data, and its impact will be felt across the entire big data universe.
Big Data, Hadoop, IoT, Kaushik Pal
- Interview: Thanigai Vellore, Art.com on Why Big Data vs RDBMS is the Wrong Question - Jul 24, 2015.
We discuss success factors with polyglot architectures, Big Data challenges, recommendations for using Big Data technologies, trends, advice, and more.
Architecture, Art.com, Big Data, Career, Challenges, Hadoop, Interview, RDBMS, Recommendations
- Interview: Thanigai Vellore, Art.com on Delivering Contextually Relevant Search Experience - Jul 23, 2015.
We discuss the role of Analytics at Art.com, the polyglot data architecture at Art.com, the use cases for Hadoop, vendor selection, supporting semantic search and experience with Avro.
Architecture, Art.com, Avro, Hadoop, HBase, Interview, Semantic Analysis, Solr, Thanigai Vellore
- To Code or Not to Code with KNIME - Jul 22, 2015.
Find out how KNIME allows us to integrate analytical languages, such as R and Python, with the visual design of SQL code. Also, learn to integrate your Hadoop, visualization, and ETL systems with KNIME.
Hadoop, Javascript, Knime, Michael Berthold, Python, R, SQL
- Big Data – yes, that’s what the latest Sensational Rap Music Video is all about - Jul 16, 2015.
Music video featuring Big Data and Hadoop (and Map-Reduce and NoSQL) might be all you need to light up your day!
Big Data, Data Scientist, Hadoop, MapReduce, Music, NoSQL, Viacom Velocity
- Top KDnuggets tweets, Jul 7-13: Deep Learning and the Triumph of Empiricism - Jul 14, 2015.
Deep Learning and the Triumph of Empiricism; What can Hadoop do that my data warehouse can't?; Emacs for Data Science; Dataiku DataScience Studio - intuitive solution.
Data Science, Data Warehouse, Dataiku, Deep Learning, Emacs, Hadoop
- 50+ Data Science and Machine Learning Cheat Sheets - Jul 14, 2015.
Get up to speed and keep Data Science & Data Mining concepts and commands handy with these cheat sheets covering R, Python, Django, MySQL, SQL, Hadoop, Apache Spark, and Machine Learning algorithms.
Cheat Sheet, Data Science, Django, Hadoop, Machine Learning, Python, R
- Strata + Hadoop World, New York City, Sep 29 – Oct 1 - Jul 7, 2015.
Strata + Hadoop World in New York sold out last year with more than 5,500 attendees. See what is new for 2015 and get 20% off with code KDNG by July 10.
Hadoop, New York City, NY, Strata
- KDnuggets Interview: Amr Awadallah, CTO & Co-founder, Cloudera on the Secret Sauce of Open Source - Jul 2, 2015.
We discuss the critical success factor for open source projects, entrepreneurial lessons, advice, desired qualities in data scientists and more.
Amr Awadallah, Apache, Cloudera, Data Science Skills, Entrepreneur, Hadoop, Hiring, Interview, Open Source
- KDnuggets Interview: Amr Awadallah, CTO & Co-founder, Cloudera on the Future of Information Architecture Design - Jun 29, 2015.
We discuss Cloudera’s achievements, the story behind the name ‘Cloudera’, the CTO role, and key attributes of an information architecture designed for the future.
Amr Awadallah, Cloudera, Hadoop, Information Management, Interview, Performance, Success
- Interview: Anil Gadre, MapR on 3 Keys for Big Data Success: Reliability, Security, & Scalability - Jun 24, 2015.
We discuss the origin of Apache Myriad, state of security in Big Data, MapR Quick Start Solutions, Hadoop vendor selection criteria, and more.
Anil Gadre, Future, Hadoop, Interview, MapR, Security, Success, Trends, Vendors
- Interview: Anil Gadre, MapR on What it takes to Automate Data-to-Action? - Jun 23, 2015.
We discuss how analytics can impact the business “as-it-happens”, merging business analytics with production operations, transition challenges, and the recently announced partnership with Teradata.
Anil Gadre, Business Analytics, Decision Making, Hadoop, Interview, MapR, Realtime Analytics, Teradata
- Upcoming Webcasts on Analytics, Big Data, Data Science – Jun 23 and beyond - Jun 22, 2015.
Which Data Should You Move to Hadoop, Using Data from Hadoop to Improve Your Business, Tips and Tricks for Logistic Regression, Data Mining: Failure to Launch, and more.
Hadoop, Lavastorm, Logistic Regression, Salford Systems
- Strata + Hadoop World, New York City, Sep 29 – Oct 1, KDnuggets discount - Jun 18, 2015.
Strata + Hadoop World in New York is where big data's most influential business decision makers, strategists, architects, developers, and analysts gather to shape the future. Get 20% off with code KDNG.
Hadoop, New York City, NY, Strata
- Simplilearn: Flat 30% off on 7-course Big Data package until June 22 - Jun 18, 2015.
Simplilearn's updated All-in-one Big Data course packages now include the latest databases like Apache Cassandra, Impala, and Google Bigtable. Use code ONLINE30 to get a 30% discount until June 22, 2015.
Certification, Data Science Education, Hadoop, Online Education, SAS, Simplilearn
- Interview: Beth Smith, General Manager of the IBM Analytics Platform business, on Analytics, Hadoop, Spark - Jun 12, 2015.
We discuss coming Analytics surprises, what has changed, Open Source, Hadoop, Apache Spark, Open Data Platform, new analytics roles, IBM resources for analytics education, and more.
Apache Spark, Beth Smith, Hadoop, IBM
- Which Big Data, Data Mining, and Data Science Tools go together? - Jun 11, 2015.
We analyze the associations between the top Big Data, Data Mining, and Data Science tools based on the results of the 2015 KDnuggets Software Poll. Download the anonymized data and analyze it yourself.
Apache Spark, Data Mining Software, Excel, Hadoop, Knime, Poll, Python, R, RapidMiner, SQL
- Interview: Ranjan Sinha, eBay on Advanced Hadoop Cluster Management through Predictive Modeling - Jun 9, 2015.
We discuss the categorization of e-commerce analytics, opportunities and challenges of Big Data, the Astro predictive model for Hadoop cluster management, and Apache Kylin.
Apache Hive, Apache Kylin, Astro, Customer Experience, eBay, Ecommerce, Hadoop, Interview, Predictive Modeling, Ranjan Sinha
- CRN 2015 Big Data Infrastructure Companies - May 28, 2015.
CRN identifies the top 25 big data infrastructure, tools, and service companies, offering everything from hardware servers to software platforms and applications to cloud-based services. The list includes major players in the big data space like Microsoft, Amazon, and IBM!
Big Data, CRN, Data Infrastructure, Data Platform, Hadoop, Startups