Month End:Flat 15% Offer Ends in:Loading... Grab Now New Course Enquiry : +91-9739305140

Avail Offer 🎉

Month End Offer:

Flat 15% OFF

Please Note : you agree to NearLearn's Terms & Conditions and Privacy Policy.
Logo

CATEGORIES

  • Classroom Trainings
  • Online Course
No courses found matching your search
×

Get up to 15% Off

+ Extra Flat 5% Off on New Course

Code: OFF15
popup image
Machine Learning Using PySpark Classroom Training Bangalore, India

UPCOMING TRAININGS SCHEDULE
Date Days Timings
NOV 23rd Mon - Fri | 7 Weeks 10:00 AM - 12:00 PM IST

Course Description

Learn the latest Big Data Technology - Spark! And learn to use it with one of the most popular programming languages, Python!

Apache Spark is one of the most widely used and supported open-source tools for machine learning and big data.

One of the most valuable technology skills is the ability to analyze huge data sets, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Spark! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Spark to solve their big data problems!

Spark can perform up to 100x faster than Hadoop MapReduce, which has caused an explosion in demand for this skill! Because the Spark 2.0 DataFrame framework is so new, you now have the ability to quickly become one of the most knowledgeable people in the job market!

This course will teach the basics with a crash course in Python, continuing on to learning how to use Spark DataFrames with the latest Spark 2.0 syntax! Once we've done that we'll go through how to use the MLlib Machine Library with the DataFrame syntax and Spark. All along the way you'll have exercises and Mock Consulting Projects that put you right into a real world situation where you need to use your new skills to solve a real problem!

We also cover the latest Spark Technologies, like Spark SQL, Spark Streaming, After you complete this course you will feel comfortable putting Spark and PySpark on your resume!

If you're ready to jump into the world of Python, Spark, and Big Data, this is the course for you!


Requirements

  • Basic understanding of Hadoop ecosystem/SQL/Database Knowledge
  • Some prior programming or scripting experience

Who this course is for:

  • Someone who knows Python and would like to learn how to use it for Big Data
  • Someone who is very familiar with another programming language and needs to learn Spark
  • People with some software development background who want to learn the hottest technology in big data analysis will want to check this out.
  • If your Data scientist job involves, or will involve, processing large amounts of data, you need to know about Spark.
  • If you're training for a new career in data science, machine learning or big data, Spark is an important part of it.

Take Away of Machine Learning Using PySpark

  • Use Python and Spark together to analyze Big Data
  • Learn how to use the new Spark 2.0 DataFrame Syntax
  • Work on Consulting Projects that mimic real world situations!
  • Use Spark's MLlib to create Powerful Machine Learning Models
  • Learn about the DataBricks Platform!
  • Get set up on Amazon Web Services EC2 for Big Data Analysis
  • Learn how to leverage the power of Linux with a Spark Environment like cloudera!
  • Use Spark Streaming to Analyze Tweets in Real Time!
  • And much more ……

Our Pricing


Instructor Bangalore


Brief Info

6+ years of experienced & result oriented data analytics professional possessing a proven track record of successfully applying statistical modeling, algorithm development & machine learning techniques to solve key business problems. Highly skilled in applying advanced machine learning/statistical algorithms such as Neural Network, Random Forests, Ensemble Models, Clustering etc. to the real world problems using Python. Looking forward to applying the acquired a gamut of skills to a challenging role in the data analytics space.


Key Skills
  • Data & Quantitative Analysis
  • Data Science • Big Data Analytics
  • Predictive/Statistical Modelling
  • Data Mining
  • Data Visualisation
  • Machine Learning Algorithms
  • Business Learning Algorithms
  • Business Intelligence
  • Project Monitoring & Project Monitoring & Management

  • Technical Skills

    Data Science Tools: Python, R, Advanced Excel, PySpark, Basic Django Framework, Basic SQL Packages: Numpy,Scipy, Pandas, Seaborn, Matplotlib, Statsmodels, Scikit-Learn, NLTK, NLP, Django, BeautifulSoup, TFLearn, Keras Statistics/Machine Learning:

    Agorithm:- Linear Regression, Logistic Regression , Association Rule Mining, K-Means Clustering, Artificial Neural Network , Support Vector Machine , Random Forest, Naive Bayes , Decision Tree. • CRISP-DM Methodology:CRISP-DM Methodology:- Dimensionality Reduction, Principal Component Analysis , Outlier treatment , Missing Value Treatment, Variance Inflation Factor. • Descriptive and Inferential Descriptive and Inferential

    Statistics:- Central Tendency, Variance ,Standard Deviation, Confidence Level, p value, Hypothesis Testing , ANOVA ,Confusion Matrix, multicollinearity , autocorrelation ,KS table calculation , ROC,AUC. • Text Analytics :Text Analytics :- Auto Summarization, Sentiment Analysis , Lexical Resources , Regular Expression , Topic Analysis , web scraping. • Field Of Analytics: Field Of Analytics:- Customer Segmentation , Market Basket Analysis , RFM , Customer Feedback Analysis ,Repeat Purchase, Customer Churn. • Domain :Domain :- Retail, Telecom, BFSI, etc. • Deep Learning :Deep Learning :- Activation Function , DNN Classifier , tensorflow , DNN Regressor , Learning Rate , Convolution Neural Network(CNN).

    In depth working experience in different statistical tools like Python with Jupyter Notebook, PySpark, Adv. Excel, etc. •Working on different machine learning techniques like Regression, Classification, Clustering , Association Rule Mining, Non-linear Classifier like (ANN, SVM, Random Forest, Naïve Bayesian), Data Mining Techniques like Missing Value and outlier analysis, PCA, Variable binning, etc using matplotlib ,numpy ,pandas, scikit-learn, statsmodel etc. • Experienced in different Big Data techniques like PySpark , Spark-ML , HDFS , Sqoop etc using Cloudera distribution. • Working on text analytics technique by using python package nltk , Beautiful Soup for web scraping, stop words removal, tokenization, lemmatization, frequency on textual data etc. and exposure on neural network , CNN using Tensorflow package.


    Course Outline

    Lecture1.1 Python Object and Data Structure

    Lecture1.2 Python Comparison Operators

    Lecture1.3 Python Statements

    Lecture1.4 Methods and Functions

    Lecture1.5 Milestone Project – 1

    Lecture1.6 Object-Oriented Programming

    Lecture1.7 Modules and Packages

    Lecture1.8 Errors and Exception Handling

    Lecture1.9 Milestone Project – 2

    Lecture1.10 Built-in Functions

    Lecture1.11 Python Decorators

    Lecture1.12 Python Generators

    Lecture1.13 Final Capstone Python Project

    Lecture1.14 Advanced Python Modules

    Lecture1.15 Advanced Python Objects and Data Structures

    Lecture1.16 Advanced OOP

    Lecture1.15 Parallel Processing

    Lecture2.1 What is Scala ?

    Lecture2.2 Why Scala For Spark ?

    Lecture2.3 Uses of Scala in other Framework

    Lecture2.4 Scala REPL

    Lecture2.5 Variables in Scala

    Lecture2.6 Basic Scala Operations

    Lecture2.7 Control Structure and loop in Scala

    Lecture2.8 Functions and Procedures in Scala

    Lecture2.9 Collections and Case Class in Scala

    Lecture3.1 Data Types and Central Tendency

    Lecture3.2 Probability Theory and Hypothesis

    Lecture3.3 ANOVA & CO-Relation Algorithm

    Lecture4.1 Cloudera Single Node VM[Virtual Machine] Cluster set-up and Understanding of Big Data and HDFS commands

    Lecture4.2 Map Reduce Concept and Implementation

    Lecture5.1 Setting up Python with Spark

    Lecture5.2 Local VirtualBox Set-up

    Lecture5.3 AWS EC2 PySpark Set-up [Cloud Computing Framework]

    Lecture5.4 Databricks Setup [Cloud Computing Framework]

    Lecture5.5 AWS EMR Cluster Setup [Cloud Computing Framework]

    Lecture5.6 Spark Data Frame and common operations

    Lecture5.7 Spark SQL

    Lecture5.8 Spark Streaming

    Lecture5.9 Implementations of Machine Learning / Artificial Intelligence using Spark MLLIB/MLLIB in PySpark environment

    Lecture5.9.a Linear Regression Algorithm

    Lecture5.9.b Logistic Regression Algorithm

    Lecture5.9.c D-Tree and Random Forest Algorithm

    Lecture5.9.d K-Means Clustering Algorithm

    Lecture5.9.e Collaborative Filter and Recommendation System Algorithm

    Lecture5.9.f NLP Algorithm

    Lecture6.1 Telecom Project using PySpark and spark-ml for customer churn and client segmentation.

    Lecture6.2 Real Time Sales Transaction using PySpark

    Classroom FAQ’s

    Machine Learning is a highly interdisciplinary subject. Hence, basic understanding of the following Concepts is highly useful:

    • Some background in programming (basic familiarity with variables, functions, loops, etc in any language) is helpful.
    • Linear algebra
    • Probability theory
    • Statistics & hypothesis testing
    • Optimization methods

    NearLearn has a dedicated job Assistance Team, who work with candidates on individual basis in assisting for right Machine Learning job.

    Payment can be made via Cheque / DD / Online Funds transfer / Cash Payment.

    Cheque should be drawn in favour of “Near And Learn Pvt. Ltd.” payable at Bangalore NEFT Payment:

    Account Name: Near And Learn Pvt. Ltd,

    Account Number: 201002051650

    IFSC Code: INDB0000008

    Branch: IndusInd Bank Limited Bangalore

    A/c Type: Current

    09.30am – 06.00pm Weekend(Saturday-Sunday)

    Quick Enquiry





    Card image cap

    Machine Learning With Python

    02nd - 03rd Nov, 2019
    Bangalore

    Read More
    Card image cap

    Machine Learning with Python

    02nd - 03rd Nov, 2019
    Pune

    Read More
    Card image cap

    Machine Learning With Python

    23rd - 24th Nov, 2019
    Mumbai

    Read More
    Card image cap

    Blockchain Workshop

    16th - 17th Nov, 2019
    Bangalore

    Read More
    Card image cap

    Blockchain Workshop

    23rd - 24th Nov, 2019
    Mumbai

    Read More
    Card image cap

    Blockchain Workshop

    23rd - 24th Nov, 2019
    Pune

    Read More

    Classroom Location

    (*tentative)