Customized online classes available.

Course Name     Faculty Name    Starting Date   Time
Data Science    Mr. Krishna     27-MAR-2023     11:00 AM
Data Science    Mr. Srinivas    31-MAR-2023     11:00 AM

Managing and administering data is one of the biggest challenges of today's information explosion. This Data Science course offers in-depth training for data analytics professionals. It lets participants build on their basic database knowledge and apply it at the more advanced level of data science, a skill set much in demand for data analysis in the IT field.
  • To learn about business intelligence and business analysis
  • To understand the descriptive statistics of data analysis
  • To excel at working with Tableau
  • To get introduced to R and data exploration in R
  • To understand how to create decision trees
  • To understand data collection and data mining
  • To know the importance of big data technologies
  • To get a preliminary idea of loop functions and debugging tools
  • To learn to run non-parametric tests



CERTIFYING BODY:

  • Microsoft

CAREER PROSPECTS:

    • Managing huge amounts of data calls for enthusiastic data scientists. Data science is an emerging field within data analysis, closely linked to the new data software being developed to improve data management.

  • Data Analyst
  • Data Scientist
  • ML Engineer
  • Data Science Lead

SYLLABUS:

Introduction to Data science
· What is Data Science?
· How is Data Science different from BI and Data Analysis?
· Who are Data Scientists?
· What skill sets are required to become a Data Scientist?
· What is Machine Learning?
· Statistics and Mathematics for Data Science
Introduction to Python
· Introduction to Anaconda
· Installation of Anaconda Python Distribution – For Windows, Mac OS, and Linux
· Jupyter Notebook Installation
· Jupyter Notebook Introduction
· Variable Assignment
· Basic Data Types: Integer, Float, String, None, and Boolean; Typecasting
· Creating, Accessing, and Slicing Tuples
· Creating, Accessing, and Slicing Lists
· Creating, Viewing, Accessing, and Modifying Dicts
· Creating and using operations on sets
· Basic Operators: ‘in’, ‘+’, ‘*’
· Functions
· Classes and Objects
· Packages
Python Essentials for Data Science
· Introduction to NumPy
· Data Manipulation with Pandas
· Visualization with Matplotlib
· NLTK
· Keras / TensorFlow / Theano
· Scikit-Learn
NumPy
· Introduction to NumPy
· NumPy Arrays
· Quick Note on Array Indexing
· NumPy Array Indexing
· NumPy Operations
· NumPy Exercises Overview
· NumPy Exercises Solutions
· Understanding Data Types in Python
· The Basics of NumPy Arrays
· Computation on NumPy Arrays: Universal Functions
· Aggregations: Min, Max, and Everything In Between
· Computation on Arrays: Broadcasting
· Comparisons, Masks, and Boolean Logic
· Fancy Indexing
· Sorting Arrays
· Structured Data: NumPy’s Structured Arrays
Pandas
· Pandas: Python for Data Analysis
· Introduction to Pandas
· Series
· DataFrames
· Missing Data
· GroupBy
· Merging, Joining, and Concatenating
· Operations
· Data Input and Output
· Introducing Pandas Objects
· Data Indexing and Selection
· Operating on Data in Pandas
· Handling Missing Data
· Hierarchical Indexing
· Combining Datasets: Concat and Append
· Combining Datasets: Merge and Join
· Aggregation and Grouping
· Pivot Tables
· Vectorized String Operations
· Working with Time Series
Seaborn: Python for Data Visualization
· Introduction to Seaborn
· Link to Seaborn Documentation
· Distribution Plots
· Categorical Plots
· Matrix Plots
· Grids
· Regression Plots
· Style and Color
· Seaborn Exercise Overview
· Seaborn Exercise Solutions
Matplotlib
· Simple Line Plots
· Simple Scatter Plots
· Visualizing Errors
· Density and Contour Plots
· Histograms, Binnings, and Density
· Customizing Plot Legends
· Customizing Colorbars
· Multiple Subplots
· Text and Annotation
· Customizing Ticks
· Customizing Matplotlib: Configurations and Stylesheets
· Three-Dimensional Plotting in Matplotlib
Scientific computing with Python (Scipy)
· SciPy and its Characteristics
· SciPy Sub-Packages
· SciPy Sub-Packages – Integration
· SciPy Sub-Packages – Optimize
· Linear Algebra
· SciPy Sub-Packages – Statistics
· SciPy Sub-Packages – Weave
· SciPy Sub-Packages – I/O
· Scikit-Learn Approach
· Scikit-Learn Approach: Built-in Modules
· Scikit-Learn Approach: Feature Extraction
· Scikit-Learn Approach: Model Training
· Scikit-Learn: Grid Search and Multiple Parameters
Mathematical and Business Statistics Concepts for Data Science
· Mean, Median, Mode
· Standard Deviation, Variance, Correlation Analysis, Skewness, Quartiles
· Linear Algebra, Probability, Optimization Theory
· Time Series Data Representation
· Some Common Terms Used in Statistics
· Data Distribution: Central Tendency, Percentiles, Dispersion
· Histogram
· Bell Curve
· Hypothesis Testing
· Chi-Square Test
· Correlation Matrix
· Inferential Statistics
· Data Type
· Categorical Data (Nominal, Ordinal)
· Numerical Data (Discrete, Continuous, Interval, Ratio)
· Why are Data Types Important?
· Statistical Methods
Descriptive Statistics
· Measures of Frequency: Count, Percent, Frequency
· Measures of Central Tendency: Mean, Median, and Mode
· Measures of Dispersion or Variation: Range, Variance, Standard Deviation
· Measures of Position: Percentile Ranks, Quartile Ranks
Sampling
· Different Types of Sampling
· Simple Random Sampling
· Systematic Sampling
· Stratified Sampling
· Data Distribution
· Inferential Statistics
Test of Hypothesis
· Null Hypothesis Formulation
· Alternative Hypothesis
· Type I and Type II Errors
· Power Value
· One Tail and Two Tail
· t-Tests
· ANOVA
· MANOVA
· Chi Square Test
· Kendall Test
· Kruskal-Wallis Rank Test
· Mann-Whitney Test
· Wilcoxon Test
Data Cleaning Process, Quality Check, and Data Profiling
· Unsupervised Data
· PCA Regression Scores for Supervised Data
· Detecting Noisy Data
· Data Cleaning with Regression Residual
· Data Transformation
· Data Wrangling
· Data Mining
· Data Profiling
· Model Validation and Testing
· Data Science & Business Analytics
· Basic Probability for Business Issues
Machine Learning
· What Is Machine Learning
· Key Terminology
· Key Tasks of Machine Learning
· Steps in Developing a Machine Learning Application
· Categories of Machine Learning
· Qualitative Examples of Machine Learning Applications
Predictive Analytics
· Different Types of Predictive Analytics – Prediction, Forecasting, Optimization, Segmentation, etc.
· Supervised Learning
· Unsupervised Learning
· Time Series Analysis – Forecasting
Supervised Learning
· Regression
· Classification
Regression
· Linear & Logistic Regression: A Model-Based Approach
· Linear Regression Theory
· Model Selection Updates for Scikit-Learn
· Linear Regression with Python/R
· Linear Regression Project Solution
· Regression Fundamentals: Data and Models
· Feature Selection in Model Building
· Evaluating Overfitting via Training/Test Split
· Training
· Finding best-fit lines with Linear Regression
· Weighted Linear Regression
· Shrinking Coefficients
· Ridge Regression
· The Bias/Variance Trade-off
· Example: Using Linear Regression
· Tree-based Regression
· Building Trees with Continuous and Discrete Features
· Using CART for Regression
· Tree Pruning
Classification
· Analyzing the Sentiment of Reviews: A Case Study in Classification
· Classification Fundamentals: Data and Models
· Understanding Decision Trees & Naive Bayes
· Feature Selection in Model Building
· Linear Classifiers
· Decision Boundaries
· Training and Evaluating a Classifier
· False Positives, False Negatives, and Confusion Matrices
· Classifying with k-Nearest Neighbours
· Distance Measurements
· Classifying with Decision Trees
· Tree Construction
· Testing and Storing the Classifier
· Example: Using Decision Trees
· Classifying with Bayesian Decision Theory
· Classifying with Conditional Probabilities
· Classification with the AdaBoost Algorithm
· Classification Imbalance
· Recommendation
Clustering
· Clustering System Overview
· Data and Models
· Feature Selection in Model Building
· Clustering and Similarity ML Block Diagram
· Unsupervised Learning – Recommendation
· Recommender Systems ML
Deep Learning
· Deep Learning: Searching for Images
· Searching for Images: A Case Study in Deep Learning
· Learning Very Non-linear Features with Neural Networks
· Application of Deep Learning to Computer Vision
· Deep Learning Performance
· Demo of Deep Learning Model on ImageNet Data
· Deep Learning ML Block Diagram
K Nearest Neighbors
· KNN Theory
· KNN Project Overview
· KNN Project Solutions
Decision Trees and Random Forests
· Introduction to Tree Methods
· Decision Trees and Random Forest with Python
· Decision Trees and Random Forest Project Overview
· Decision Trees and Random Forest Solutions
Support Vector Machines
· SVM Theory
· Support Vector Machines with Python
· SVM Project Overview
· SVM Project Solutions
K Means Clustering
· K Means Algorithm Theory
· K Means with Python
· K Means Project Overview
· K Means Project Solutions
· Bisecting k-means
· EM Algorithm
· Example: Clustering
· The Apriori Algorithm
· Frequent Item Set Generation
· Association Rule Generation
· Finding Association Rules in Voting
· Principal Component Analysis (PCA)
· Understand the Basics of RL and its Applications in AI
· Q-learning Algorithms
Principal Component Analysis
· Principal Component Analysis
· PCA with Python
Natural Language Processing
· Natural Language Processing Theory
· NLP with Python
· NLP Project Overview
· NLP Project Solutions
TensorFlow
· What is TensorFlow?
· Changes with TensorFlow
· TensorFlow Installation
· TensorFlow Basics
· MNIST with Multi-Layer Perceptron
· TensorFlow with ContribLearn
· TensorFlow Project Exercise Overview
· TensorFlow Project Exercise – Solutions
Artificial Intelligence
· Turing Machines & Turing Test
· AI Intelligent Agents & Environments
· AI Learning Types
· AI Problem Solving
· Single-State Problem
· Multi-State Problem
· Water-Jug Problem
· Maze Problem
· Queens Problem
· AI Search Algorithms
· Brute Force Search
· BFS, DFS, Uniform Cost Search
· Heuristic Search
· Hill Climbing Search
· Travelling Salesman Problem
Model Selection & Boosting
· Model Selection
· XGBoost
Time Series
· Date and Time Data Types and Tools
· Time Series Basics
· Date Ranges, Frequencies, and Shifting
· Time Zone Handling
· Periods and Period Arithmetic
· Resampling and Frequency Conversion
· Time Series Plotting
· Moving Window Functions
· Performance and Memory Usage Notes
· Autoregression, Moving Average, ARMA
· Additive and Multiplicative Models
R Programming
· Installing R & RStudio
Data Types
· Vector
· Array
· Matrix
· Data Frame & List
· Factors
R Connection – Interfaces
· Reading Tabular Data
· Textual Data Format
· Compressed Files – gzip, bzip2
· Connection to Web
Control Structure
· If-Else
· For Loop
· While Loop
· Repeat, Next, Break
Functions
· Packages & Libraries
· Writing Custom Functions
Date & Time
R Objects
Loop Functions
· apply
· lapply
· mapply
· tapply
· split
Logistic Regression in R
· Reason for Logistic Regression
· The Logistic Transform
· Logistic Regression Modelling
· Model Optimisation
· Understanding ROC Curve
· Default Modelling using Logistic Regression in R
Decision Trees
· Theory of Entropy & Information Gain
· Stopping Rules
· Cross-Validation for the Overfitting Problem
· Pruning as a Solution for Overfitting
· Ensemble Learning
· Bootstrap Aggregation
· Random Forests
· Intrusion Detection in IT Network
Linear Regression in R
· Covariance and Correlation
· Multivariate Analysis
· Hypothesis Testing
· Limitations of Regression
· Business Case: Managing Credit Risk
· Loss Given Default using Linear Regression
Support Vector Machine
· Classification as a Hyperplane Location Problem
· Motivation for Linear Support Vectors
· Quadratic Optimization
· Non-linear SVM
· Kernel Functions
· Default Modelling using SVM in R
Introduction to Big Data
· What are Big Data and Analytics?
· Leveraging Big Data Platforms for Data Science
· Introduction to Evolving Tools, e.g., Spark
· Machine Learning with Spark
Analytical Visualisation with Tableau and SAS
· Why is it Important for Data Analysts?
· Tableau Workbook Walkthrough
· Instructions for Creating Your Own Workbooks
· Demo of a Few More Workbooks
Introduction to Cloud and Big-Data Computing Over Cloud
· Amazon Guide to Creating an AWS Account
· Quick Note on AWS Security
· EC2 Instance Set-Up
· AWS with ML
PySpark
· Introduction to Spark and Python
· RDD Transformations and Actions
We are Offering
· Mock Interview Questions and Case Studies
· Guidance on Preparing Resumes
· Information on Companies and Industry Trends in Data Science
Workshop
SQL
SAS
Big Data: Hadoop and PySpark
Tableau
Case Studies, Capstone Project.