# Data Science Statistics Training

### Module 1

• Introduction to Statistics
• Five Number Summary
• The Centre of the Data and the Effects of Extreme Values
• The Spread of the Data
• The Shape of the Data
• Categorical Variables
• Some Features of Data
• Relationships Between Quantitative and Categorical Variables
• Examining Relationships Between Two Categorical Variables
• Relationships Between Two Quantitative Variables
• Data Collection
• Data Collection: Sampling
• Data Collection: Observational Studies
• Experiments
• The Need for Probability
• Some Probability Basics
• Probability Distributions
• Long-Run Averages
• Sampling Distributions
• Introduction to Confidence Intervals
• Confidence Intervals for Proportions
• Sample Size for Estimating a Proportion
• Confidence Intervals for Means
• Robustness of Confidence Intervals
• Introduction to Statistical Tests
• The Structure of Statistical Tests
• Hypothesis Testing for Proportions
• Hypothesis Testing for Means
• Power and Type I and Type II Errors
• Connection Between Confidence Intervals and Hypothesis Testing
• Matched Pairs
• Comparing Two Proportions
• Comparing Two Means
• The Linear Regression Formula
• Regression Coefficients, Residuals, and Variances
• Regression Inference and Limitations
• Residual Analysis and Transformations

### Module 2

• Machine Learning in Data Science
• Hypothesis Space and Inductive Bias
• Evaluation and Cross-Validation
• Linear Regression
• Decision Trees
• Bayesian Learning
• Naive Bayes
• k-Nearest Neighbour
• PCA
• Logistic Regression
• Support Vector Machines with Linear and RBF Kernels
• Lasso, Ridge, Elastic Net
• Ensemble Regressors and Ensemble Classifiers

### Module 3: Big Data in Data Science

• HDFS
• Pig
• Sqoop
• Hive
• Flume
• Spark
• NoSQL: HBase
• Kafka
• Cloudera Distribution
• Hortonworks Distribution
• MapR Distribution
• Hue
• Oozie
• Talend, ETL Integration
• Tableau Integration
• NiFi
• Introduction to Tableau
• What is Tableau?
• Tableau User Interface
• Basic Tableau Design Flow
• Basic Visualization Design
• Show Me!: Choosing Mark Types
• Color, Size, and Shape Options
• Shared Axis Charts and Combination Charts
• Measure Names
• Measure Values
• Data Connection
• Connecting to Various Data Sources
• Customizing Your View of the Data
• Sets
• Groups
• Hierarchies
• Extracting Data
• Data Blending
• Top 10 Chart
• Bar Chart, Line Chart
• Area Chart
• Text Table/Cross Tab
• Scatter Plot/Bubble Chart
• Bullet Chart, Box Plot
• Tree Map
• Pie Chart
• Word Cloud
• Interacting With the Viewer
• Quick Filters
• Parameters
• Worksheet Actions
• Tableau Maps
• Geocoded Fields
• Custom Geocoding
• Background Map Options
• Custom Background Images
• Calculated Fields, Table Calculations, and Statistics
• Creating Custom Calculations
• Simple and Advanced Table Calculations
• Using Table Calculations Functions in Custom Calculations
• Reference Lines, Bands, Distributions, and Trend Lines
• Creating Dashboards
• Organizing Worksheets
• Containers, Images, Text, and Web Pages
• Dashboard Actions
• Distributing and Sharing Your Dashboards
• Exporting Worksheets and Dashboards
• Publishing to Tableau Server
• Creating Tableau Server User Filters
• Smartphones and Tablets with iOS and Android
• Calculation Function and Operator Reference
