StarweaverStarweaver
  • Explore Courses
    • Collections

      • IT Management Frameworks
      • Live Online
      • Software Development
      • Finance Foundations – Capital Markets
      • Finance Foundations – Risk Management
      • Agile, Scrum, SAFe, Kanban…
      • Artificial Intelligence / Machine Learning
      • Business Domains
      • Big Data
      • Cloud
      • Certifications
      • Streaming Courses
  • Free Live Online
  • Get Certified
  • FOR BUSINESS
  • WEBINARS
    • Upcoming (Live)
    • Recorded (past)
  • Login?
    Explore Courses
    X
    • Streaming Courses
    • Live Online
    • Cloud
    • Artificial Intelligence / Machine Learning
    • Big Data
    • Agile, Scrum, SAFe, Kanban…
    • IT Management Frameworks
    • Software Development
    • Finance Foundations – Capital Markets
    • Finance Foundations – Corporate Finance
    • Finance Foundations – Risk Management
    • Certifications
    ⟵
    Python for Data Science & Machine Learning – Certification Boot Camp

    This Python for Data Science & Machine Learning Certification program covers how to use NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-L, and more in Machine Learning. Become a Python guru now!

    Learn More
    Immersion Certification in Continuous Integration and Development Tools

    This six (6) week training program combines live online, live in person and recorded content, instruction, labs, quizzes and tests to ensure delegates have a strong understanding.

    Learn More
    ⟵
    Streaming Courses View All
    Technology & Business
    • Business & Technical Writing Immersion
    • AWS Essentials
    • Cyber Security: Building a CyberWarrior Certification
    • Sales and Relationship Management
    Finance Related
    • Fundamental Financial Math
    • Capital Market Immersion
    • The Securities Trade Lifecycle
    • Commercial Credit Analysis
    ⟵
    Live Online View All
    Top Courses Now
    • Python for Data Science & Machine Learning – Certification Boot Camp
    • Azure Cloud Architect Immersion
    • Data Science & Machine Learning
    • Blockchain Business Consultant & Program Manager
    • AWS Certified Solutions Architect – Associate
    ⟵
    Cloud View All
    • Microservices Business Consultant & Program Manager Certification Program
    • Microservices Developer Certification Program
    • Understanding Kubernetes
    • Advanced Architecting on AWS
    • AWS Business Essentials
    • Big Data on AWS
    • Developing on AWS
    • DevOps Engineering on AWS
    ⟵
    Artificial Intelligence / Machine Learning View All
    Introduction/Intermediate
    • Understanding Machine Learning For Lawyers
    • Machine Learning Essentials
    • Intro to Deep Learning With TensorFlow
    • An Introduction to AI (Artificial Intelligence) and its applications
    Intermediate/Advanced
    • Data Science & Machine Learning – Developer Certification
    • DevOps Engineering on AWS
    • Machine Learning With Apache Spark
    • Machine Learning With Apache Spark
    • Machine Learning with Sagemaker (AWS)
    ⟵
    Big Data View All
    Dig into Data
    • Data Science and Big Data Analytics
    • Introduction to Python Programming
    • Data Analytics  With Python / Python for Data Scientists
    • Mastering Python
    ⟵
    Agile, Scrum, SAFe, Kanban… View All
    Certification Tracks
    • Certified ScrumMaster® (CSM)
    • Certified Scrum Product Owner® (CSPO)
    • SAFe 4.0 Scrum Master Orientation Training
    • Certified ScrumDeveloper® (CSD)
    Agile in Action
    • Introduction to Agile
    • Implementing Agile Test-Driven Development for Non-Programmers
    • Advanced Disciplined Agile Delivery
    • Collaborating and Communicating Agile Requirements
    ⟵
    IT Management Frameworks View All
    • TOGAF 9.1 Certified-Combined Program
    • Foundation Certificate Program – DevOps
    • ITIL ITSM 2011 Foundation Certification
    • Kanban Management Professional
    ⟵
    Software Development View All
    • Microservices Developer Certification Program
    • Data Science & Machine Learning – Developer Certification
    • Advanced Hadoop for Developers
    • Spark V2 For Developers
    ⟵
    Finance Foundations – Capital Markets View All
    Courses
    • Fundamental Financial Math
    • Yield Curve Building Blocks
    • Futures and Options Markets
    • Bonds with Options
    Full Curriculums
    • Capital Market Immersion
    • Capital Market Road Map
    • The Securities Trade Lifecycle
    • Market Risk Management and Capital Markets
    ⟵
    Finance Foundations – Corporate Finance View All
    Courses
    • Capital Asset Pricing Model
    • Principles of Credit Analysis
    • Corporate Financial Strategy & Capitalisation Alternatives
    • VBA Programming for Finance
    • Corporate Finance Modeling, Forecasting, Valuation and Capital Structure
    ⟵
    Finance Foundations – Risk Management View All
    Introduction/Intermediate
    • Financial Institutions and Risks
    • Credit Risk and Risk Management
    • Capital Markets: Products, Risks and Strategies
    Intermediate/Advanced
    • Financial Risk Management Essentials
    • Introduction to Credit Spreads and of the Management of Risk
    • Principles of Credit Analysis
    • Counterparty Credit Risk for Financial Institutions
    ⟵
    Professional View All
    Immersion Certification in Continuous Integration and Development Tools
    Azure Cloud Architect Immersion
    Data Science & Machine Learning Developer
    Blockchain Business Consultant and Program Manager
    • Explore Courses
      • Collections

        • IT Management Frameworks
        • Live Online
        • Software Development
        • Finance Foundations – Capital Markets
        • Finance Foundations – Risk Management
        • Agile, Scrum, SAFe, Kanban…
        • Artificial Intelligence / Machine Learning
        • Business Domains
        • Big Data
        • Cloud
        • Certifications
        • Streaming Courses
    • Free Live Online
    • Get Certified
    • FOR BUSINESS
    • WEBINARS
      • Upcoming (Live)
      • Recorded (past)
    • Login?

    Artificial Intelligence & Data Science

    • Home
    • All courses
    • Artificial Intelligence & Data Science
    • Spark V2 For Developers
    Home / LP Courses / Cutting Edge IT / Big Data / Spark V2 For Developers
    learn for good program

    Spark V2 For Developers

    COURSE 1
    Enquire

    Course Features

    • Duration: 24 hours
    • Skill Level: All levels
    • Language: English
    • Overview

    Overview

    This course will introduce Apache Spark. The students will learn how  to use Spark for data analysis and write Spark applications.

    Completely updated for latest Spark version 2.x!
    Spark version 2 has lots of changes compared to v1.  This course covers the latest Spark v2 features.

    Objective

    Learn Spark eco-system

    What You Will Learn

    • Spark Shell
    • Spark internals
    • Spark Data structures : RDDs, Dataframes, Datasets
    • Spark APIs
    • Spark SQL
    • Spark and Hadoop
    • Spark MLLib
    • Spark Graphx
    • Spark streaming

    Audience

    Developers / Data Analysts

    Prerequisites

    • Familiarity with either Java / Scala / Python language (our labs in Scala and Python – we provide a quick Scala introduction)
    • Basic understanding of Linux development environment (command line navigation / running commands)

    Lab Environment

    We provide the complete lab environment in the cloud.  No need to install Spark on your laptop.

    Detailed Outline

    1. Scala primer
      • A quick introduction to Scala
      • Labs : Getting know Scala
    2. Spark Basics
      • Big Data, Hadoop, Spark
      • What’s new in Spark v2
      • Spark concepts and architecture
      • Spark eco system (core, spark sql, mlib, streaming)
      • Labs : Installing and running Spark
    3. Spark Shell
      • Spark shell
      • Spark web UIs
      • Analyzing dataset – part 1
      • Labs: Spark shell exploration
    4. RDDs (Condensed coverage)
      • RDDs concepts
      • RDD Operations / transformations
      • Labs : Unstructured data analytics using RDDs
    5. Data model concepts
      • Partitions
      • Distributed processing
      • Failure handling
      • Caching and persistence
    6. Spark Dataframes & Datasets
      • Intro to Dataframe / Dataset
      • Programming in Dataframe / Dataset API
      • Loading structured data using Dataframes
      • Labs : Dataframes, Datasets, Caching
    7. Spark SQL
      • Spark SQL concepts and overview
      • Defining tables and importing datasets
      • Querying data using SQL
      • Handling various storage formats : JSON / Parquet / ORC
      • Labs : querying structured data using SQL; evaluating data formats
    8. Spark API programming (Scala / Python)
      • Introduction to Spark  API
      • Submitting the first program to Spark
      • Debugging / logging
      • Configuration properties
      • Labs : Programming in Spark API, Submitting jobs
    9. Spark and Hadoop
      • Hadoop Primer : HDFS / YARN
      • Hadoop + Spark architecture
      • Running Spark on YARN
      • Processing HDFS files using Spark
      • Spark & Hive
    10. Machine Learning (ML / MLib)
      • Machine Learning primer
      • Machine Learning in Spark : MLib / ML
      • Spark ML overview (newer Spark2 version)
      • Algorithms : Clustering, Classifications, Recommendations
      • Labs : Writing ML applications in Spark
    11. GraphX
      • GraphX library overview
      • GraphX APIs
      • Labs : Processing graph data using Spark
    12. Spark Streaming
      • Streaming concepts
      • Evaluating Streaming platforms
      • Spark streaming library overview
      • Streaming operations
      • Sliding window operations
      • Structured Streaming
      • Continuous streaming
      • Spark & Kafka streaming
      • Labs : Writing spark streaming applications
    13. Spark in the real world
      • Highlight some Spark use cases in real world
    Curriculum is empty
    Free

    You May Like

    Python for Data Science & Machine Learning – Certification Boot Camp Read More
    techsupport

    Python for Data Science & Machine Learning - Certification Boot Camp

    Enquire
    Enquire

    Python for Data Science & Machine Learning – Certification Boot Camp

    Artificial Intelligence & Data Science, Live Now!

    50 hours ♦ All levels

    Overview This course will introduce Apache Spark. The students will learn how  to use Spark for data analysis and write Spark applications.Completely updated for latest Spark version 2.x! Spark version 2 has lots of changes compared to v1.  This course covers the latest Spark v2 features. Objective Learn Spark eco-system What You Will LearnSpark Shell Spark internals Spark Data structures : RDDs, Dataframes, Datasets

    More DetailsEnquire Now
    Immersion Certification in Continuous Integration and Development Tools Read More
    techsupport

    Immersion Certification in Continuous Integration and Development Tools

    Enquire
    Enquire

    Immersion Certification in Continuous Integration and Development Tools

    Certifications

    50 hours ♦ All levels

    Overview This course will introduce Apache Spark. The students will learn how  to use Spark for data analysis and write Spark applications.Completely updated for latest Spark version 2.x! Spark version 2 has lots of changes compared to v1.  This course covers the latest Spark v2 features. Objective Learn Spark eco-system What You Will LearnSpark Shell Spark internals Spark Data structures : RDDs, Dataframes, Datasets

    More DetailsEnquire Now
    Azure Cloud Architect Immersion Certification Program Read More
    Paul Siegel

    Azure Cloud Architect Immersion Certification Program

    Enquire
    Enquire

    Azure Cloud Architect Immersion Certification Program

    Certifications, Cloud, Learn Now, Live Now!

    60 hours ♦ All levels

    Overview This course will introduce Apache Spark. The students will learn how  to use Spark for data analysis and write Spark applications.Completely updated for latest Spark version 2.x! Spark version 2 has lots of changes compared to v1.  This course covers the latest Spark v2 features. Objective Learn Spark eco-system What You Will LearnSpark Shell Spark internals Spark Data structures : RDDs, Dataframes, Datasets

    More DetailsEnquire Now

    Latest Courses

    Blockchain and Machine Learning

    Blockchain and Machine Learning

    Blockchain and Machine Learning

    Blockchain, Cutting Edge IT, Machine Learning

    3 Days ♦ All levels

    Blockchain is a forge-proof distributed database, and Machine Learning is a popular technology that allows computers to understand and learn from data. Combined, then give rise to a new class of applications. In this class, the students learn about the technologies and implementation of combined Blockchain + Machine Learning use cases.Goals ● Get a solid foundation in Blockchain, Bitcoin, Ethereum, Hyperledger ●

    More DetailsEnquire Now
    Introduction to Python Programming

    Introduction to Python Programming

    Introduction to Python Programming

    App Development, Big Data, Cutting Edge IT, Data Analysis, Databases, Programming Languages

    24 hours ♦ All Levels

    DESCRIPTION Python has been around for decades, but it's still one of the most versatile and popular programming languages out there. Whether you're relatively new to programming or have been developing software for years, Python is an excellent language to add to your skill set. In this course, you'll learn the fundamentals of programming in Python, and you'll develop applications to

    More DetailsEnquire Now
    Blockchain Business Applications (4 hours)

    Blockchain Business Applications (4 hours)

    $97.50$124.95

    Blockchain Business Applications (4 hours)

    Blockchain, Cloud, Cutting Edge IT, Data Analysis, Machine Learning

    4 Hours ♦ All levels

    Description    This course is for IT consultants and business staff familiar with blockchain basics, who want to know how to apply blockchain in business functions. The course provides a solid foundation and understanding of blockchain technology (including its principles and fundamental operations) as well as a solid understanding the many growing applications of blockchain in business.  The course is

    More Details

    $124.95$97.50

    starweaver-logo-transparent

    795 Folsom Street, San Francisco, California 94107 || +1-415-483-2260 // +44 20 3289 3277

    COMPANY

    About Us
    Jobs & Careers
    Help/Support
    Policies and Terms
    Contact

    PARTNER WITH US

    Instructors & Teachers
    Channel Partners/Affiliates
    Writing and Publishing

     

    COMMUNITY

    Slack Channel
    Alumni

    FOR BUSINESS

    What Customers Say
    Private Classes
    Learning Paths
    Competency Frameworks

    Follow us

    Education you can bank on® ||  People are your most important assets!® || People are the only real asset!®

    © Starweaver Group, Inc. All Rights Reserved.

    Welcome to Starweaver!

    Stay informed and sharpen your skills!

    This website uses cookies, including third party ones, to allow for analysis of how people use our website in order to improve your experience and our services. By continuing to use our website, you agree to the use of such cookies. Click here for more information on our Privacy Policy.

    More information
    Privacy SettingsCookies

    Privacy Settings

    This website uses cookies, including third party ones, to allow for analysis of how people use our website in order to improve your experience and our services. By continuing to use our website, you agree to the use of such cookies. Click here for more information on our Privacy Policy. You may change your settings at any time. Your choices will not impact your visit.

    NOTE: These settings will only apply to the browser and device you are currently using.

    Cookies

    This website uses cookies, including third party ones, to allow for analysis of how people use our website in order to improve your experience and our services. By continuing to use our website, you agree to the use of such cookies.

    Accept