Search results for graphx
-
The Benefits & Examples of Using Apache Spark with PySpark
Apache Spark runs fast, offers robust, distributed, fault-tolerant data objects, and integrates beautifully with the world of machine learning and graph analytics. Learn more here.https://www.kdnuggets.com/2020/04/benefits-apache-spark-pyspark.html
-
Everything a Data Scientist Should Know About Data Management">Everything a Data Scientist Should Know About Data Management
For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.https://www.kdnuggets.com/2019/10/data-scientist-data-management.html
-
Learn how to use PySpark in under 5 minutes (Installation + Tutorial)
Apache Spark is one of the hottest and largest open source project in data processing framework with rich high-level APIs for the programming languages like Scala, Python, Java and R. It realizes the potential of bringing together both Big Data and machine learning.https://www.kdnuggets.com/2019/08/learn-pyspark-installation-tutorial.html
-
Practical Apache Spark in 10 Minutes
Check out this series of articles on Apache Spark. Each part is a 10 minute tutorial on a particular Apache Spark topic. Read on to get up to speed using Spark.https://www.kdnuggets.com/2019/01/practical-apache-spark-10-minutes.html
-
Apache Spark Introduction for Beginners">Apache Spark Introduction for Beginners
An extensive introduction to Apache Spark, including a look at the evolution of the product, use cases, architecture, ecosystem components, core concepts and more.https://www.kdnuggets.com/2018/10/apache-spark-introduction-beginners.html
-
Introduction to Apache Spark
This is the first blog in this series to analyze Big Data using Spark. It provides an introduction to Spark and its ecosystem.https://www.kdnuggets.com/2018/07/introduction-apache-spark.html
-
Apache Spark : Python vs. Scala">Apache Spark : Python vs. Scala
When it comes to using the Apache Spark framework, the data science community is divided in two camps; one which prefers Scala whereas the other preferring Python. This article compares the two, listing their pros and cons.https://www.kdnuggets.com/2018/05/apache-spark-python-scala.html
-
Graph Analytics Using Big Data
An overview and a small tutorial showing how to analyze a dataset using Apache Spark, graphframes, and Java.https://www.kdnuggets.com/2017/12/graph-analytics-using-big-data.html
-
Big Data Key Terms, Explained
Just getting started with Big Data, or looking to iron out the wrinkles in your current understanding? Check out these 20 Big Data-related terms and their concise definitions.https://www.kdnuggets.com/2016/08/big-data-key-terms-explained.html
-
Apache Spark Key Terms, Explained
An overview of 13 core Apache Spark concepts, presented with focus and clarity in mind. A great beginner's overview of essential Spark terminology.https://www.kdnuggets.com/2016/06/spark-key-terms-explained.html
-
Introducing GraphFrames, a Graph Processing Library for Apache Spark
An overview of Spark's new GraphFrames, a graph processing library based on DataFrames, built in a collaboration between Databricks, UC Berkeley's AMPLab, and MIT.https://www.kdnuggets.com/2016/03/introducing-graphframes-apache-spark.html
-
Top Spark Ecosystem Projects
Apache Spark has developed a rich ecosystem, including both official and third party tools. We have a look at 5 third party projects which complement Spark in 5 different ways.https://www.kdnuggets.com/2016/03/top-spark-ecosystem-projects.html
-
Research Leaders on Data Mining, Data Science and Big Data key advances, top trends
Research Leaders in Data Science and Big Data reflect on the most important research advances in 2015 and the key trends expected to dominate throughout 2016.https://www.kdnuggets.com/2016/01/research-leaders-data-science-big-data-top-trends.html
-
Exclusive Interview: Matei Zaharia, creator of Apache Spark, on Spark, Hadoop, Flink, and Big Data in 2020
Apache Spark is one the hottest Big Data technologies in 2015. KDnuggets talks to Matei Zaharia, creator of Apache Spark, about key things to know about it, why it is not a replacement for Hadoop, how it is better than Flink, and vision for Big Data in 2020.https://www.kdnuggets.com/2015/05/interview-matei-zaharia-creator-apache-spark.html