Descriptive Statistics
Measure of Central Tendency
Measure of Dispersion
Skewness and Kurtosis
Covariance and Correlation
Events
Sets in Probability
Random Variables
Conditional Probability
Expectation and Variance
Discrete Distribution
Uniform
Bernoulli
Binomial
Poisson
Continuous Distribution
Uniform
Normal
Hypothesis formulation
Null Hypothesis
Procedure for testing a hypothesis
Small Sample test
Large Sample test
t-test
chi-square test
z-score
p-value
One-way ANOVA
Data Import and Export
Data Cleaning and Transformation
Combining Datasets
Regression Analysis
Clustering and Classification
Case Study
Installation of R-Studio
Introduction to Various panes of R-Studio
Data Types, Operators
Creation of Vectors
Functions
Data handling and cleaning
Dataframes, Merging Dataframes
Plotting
Case Study
Python Built-in types
Python operators
Python Control flow
Python functions
User defined and Lamda functions
Python classes
Python modules and packages
Numpy
Pandas
Dataframe manipulation
DDL Statements
DML Statements
Aggregate functions
Date functions
Union, Union All
Joins
View, Indexes
Rollup and Cubes, Group by
Introduction to Weka:
Overview of Weka
Installation and setup
Exploring the Weka Interface:
Overview of the graphical user interface
Loading datasets
Basic navigation and functionality
Data Preprocessing:
Handling missing values
Data cleaning and filtering
Transforming and normalizing data
Exploratory Data Analysis (EDA):
Descriptive statistics
Visualization using Weka tools
Classification Algorithms:
Introduction to classification
Decision trees
k-Nearest Neighbors (k-NN)
Support Vector Machines (SVM)
Naive Bayes
Clustering Algorithms:
Introduction to clustering
k-Means clustering
Hierarchical clustering
EM (Expectation-Maximization) clustering
Association Rule Mining:
Market basket analysis
Apriori algorithm
Feature Selection and Dimensionality Reduction:
Importance of feature selection
Principal Component Analysis (PCA)
Model Evaluation and Validation:
Cross-validation
Confusion matrix and performance metrics
ROC curves
Ensemble Learning:
Introduction to ensemble methods
Random Forest
Bagging and boosting
Introduction to Scripting in Weka:
Using Weka through the command line
Writing scripts for automation
Resources and Further Learning:
Weka documentation and resources
Additional readings and tutorials
Assessment:
Assignments and quizzes
Final project or exam
