Search results for flume
-
Working with Big Data: Tools and Techniques
Where do you start in a field as vast as big data? Which tools and techniques to use? We explore this and talk about the most common tools in big data.https://www.kdnuggets.com/working-with-big-data-tools-and-techniques
-
Skills to Build for Data Engineering">Skills to Build for Data Engineering
This article jumps into the latest skill set observations in the Data Engineering Job Market which could definitely add a boost to your existing career or assist you in starting off your Data Engineering journey.https://www.kdnuggets.com/2020/06/skills-build-data-engineering.html
-
Practical Apache Spark in 10 Minutes
Check out this series of articles on Apache Spark. Each part is a 10 minute tutorial on a particular Apache Spark topic. Read on to get up to speed using Spark.https://www.kdnuggets.com/2019/01/practical-apache-spark-10-minutes.html
-
Why the Data Scientist and Data Engineer Need to Understand Virtualization in the Cloud
This article covers the value of understanding the virtualization constructs for the data scientist and data engineer as they deploy their analysis onto all kinds of cloud platforms. Virtualization is a key enabling layer of software for these data workers to be aware of and to achieve optimal results from.https://www.kdnuggets.com/2017/01/data-scientist-engineer-understand-virtualization-cloud.html
-
Top 10 Amazon Books in Data Mining, 2016 Edition">Top 10 Amazon Books in Data Mining, 2016 Edition
Given the ongoing explosion in interest for all things Data Mining, Data Science, Analytics, Big Data, etc., we have updated our Amazon top books lists from last year. Here are the 10 most popular titles in the Data Mining category.https://www.kdnuggets.com/2016/11/top-10-amazon-books-data-mining.html
-
Top 12 Interesting Careers to Explore in Big Data
From data driven strategies to decision making, the true worth of Big Data has been realized, and has led to opening up of amazing career choices. Check out these 12 interesting careers to explore in Big Data.https://www.kdnuggets.com/2016/10/top-12-interesting-careers-explore-big-data.html
-
The top 5 Big Data courses to help you break into the industry
Here is an updated and in-depth review of top 5 providers of Big Data and Data Science courses: Simplilearn, Cloudera, Big Data University, Hortonworks, and Courserahttps://www.kdnuggets.com/2016/08/simplilearn-5-big-data-courses.html
-
Apache Spark Key Terms, Explained
An overview of 13 core Apache Spark concepts, presented with focus and clarity in mind. A great beginner's overview of essential Spark terminology.https://www.kdnuggets.com/2016/06/spark-key-terms-explained.html
-
Top Big Data Processing Frameworks
A discussion of 5 Big Data processing frameworks: Hadoop, Spark, Flink, Storm, and Samza. An overview of each is given and comparative insights are provided, along with links to external resources on particular related topics.https://www.kdnuggets.com/2016/03/top-big-data-processing-frameworks.html
-
Data Lake Plumbers: Operationalizing the Data Lake
Gain insight into data lakes, their benefits, when they are appropriate, and how to operationalize them. How do they compare to the data warehouse?https://www.kdnuggets.com/2016/02/data-lakes-plumbers-operationalizing.html
-
Spark SQL for Real-Time Analytics
Apache Spark is the hottest topic in Big Data. This tutorial discusses why Spark SQL is becoming the preferred method for Real Time Analytics and for next frontier, IoT (Internet of Things).https://www.kdnuggets.com/2015/09/spark-sql-real-time-analytics.html
-
R and Hadoop make Machine Learning Possible for Everyone
R and Hadoop make machine learning approachable enough for inexperienced users to begin analyzing and visualizing interesting data to start down the path in this lucrative field.https://www.kdnuggets.com/2014/11/r-hadoop-make-machine-learning-possible-everyone.html
-
18 essential Hadoop tools
Hadoop tools develop at a rapid rate, and keeping up with the latest can be difficult. Here we detail 18 of the most essential tools that work well with Hadoop.https://www.kdnuggets.com/2014/08/18-essential-hadoop-tools.html