Spark with Scala – ACM Professional Development Seminar, Santa Clara, Aug 5

This class will introduce Apache Spark 2, focusing on using it for data analysis. It is taught by Sujee Maniyam on behalf of the local ACM chapter, SFbayACM.

The course will introduce Apache Spark to participants. This is an introductory course, and no previous knowledge of Spark is needed. A detailed course outline is listed below.

When: Sat, August 5, 2017, 8 am - 4 pm

Where: Intel, 2200 Mission College Blvd, Santa Clara, CA 95054

About the Course:

Students will learn how to use Apache Spark for data analysis. We will cover Spark 2, the latest major version.

The course and labs cover:
  • Scala Primer (if needed, optional)
  • Spark ecosystem
  • Installing Spark
  • Spark shell for interactive data analysis
  • Spark data models: RDDs / DataFrames / Datasets
  • Spark Streaming
Labs will cover:
  • text data
  • clickstream data
  • 2016 election contributions
  • Spark commit logs
  • plus bonus labs for people who get done early
Audience: Developers / Analysts / Architects / Engineers

Format: Full-day in-person Saturday workshop + 2-hour online review with Q&A

Workshop (in person) : Lectures + hands-on labs. We do a lot of hands-on exercises to reinforce the concepts.

Prerequisites:
  • Developer background
  • Familiarity with Java, Scala, or Python (labs will be in Scala; a quick Scala primer will be taught to bring students up to speed)
  • Basic understanding of a Linux development environment (command-line navigation, editing files using vi, Emacs, or another text editor)
More details and register here.