Learning Plan: Data Science Mathematics
Structured Learning Plan for Data Science Mathematics
This 12-week learning plan breaks down each topic into digestible weekly goals with recommended exercises and applications in data science.
Week 1: Fundamentals of Linear Algebra
π Topics
- Introduction to Vectors and Matrices
- Matrix Operations: Addition, Multiplication, Transpose
- Identity, Inverse, and Special Matrices
π Resources
- Gilbert Strangβs Linear Algebra and Its Applications
- Khan Academy: Linear Algebra Fundamentals
π Exercises
- Solve basic matrix operations and vector manipulations
- Compute determinants and inverses manually and using NumPy
π Applications in Data Science
- Representing datasets as matrices
- Feature scaling and transformations
Week 2: Advanced Linear Algebra for Data Science
π Topics
- Eigenvalues and Eigenvectors
- Singular Value Decomposition (SVD)
- Principal Component Analysis (PCA)
π Resources
- Deep Learning by Ian Goodfellow (Chapter on Linear Algebra)
- 3Blue1Brownβs Essence of Linear Algebra series
π Exercises
- Compute eigenvalues and eigenvectors using Python
- Perform PCA on a dataset using sklearn.decomposition.PCA
π Applications in Data Science
- Dimensionality reduction in high-dimensional datasets
- Image compression using SVD
Week 3: Differential Calculus
π Topics
- Limits and Continuity
- Derivatives and Partial Derivatives
- Chain Rule and Gradient Calculation
π Resources
- Calculus by James Stewart
- MIT OpenCourseWare: Single Variable Calculus
- 3Blue1Brownβs The essence of calculus series
π Exercises
- Manually compute derivatives of polynomial and exponential functions
- Implement gradient computation in Python
π Applications in Data Science
- Compute loss function gradients in machine learning
Week 4: Integral & Vector Calculus
π Topics
- Definite & Indefinite Integrals
- Gradient, Hessian, and Jacobian Matrices
- Applications in Probability
π Resources
- Mathematics for Machine Learning by Deisenroth et al.
π Exercises
- Compute multiple integrals manually and with Python (sympy.integrate()
)
π Applications in Data Science
- Computing expectations in probability distributions
Week 5: Probability Theory
π Topics
- Basics of Probability
- Bayesβ Theorem & Conditional Probability
- Discrete & Continuous Distributions
π Resources
- Probability and Statistics for Engineering and the Sciences by Jay Devore
- Khan Academy: Probability & Statistics
π Exercises
- Solve probability problems manually
- Simulate probability distributions using NumPy
π Applications in Data Science
- NaΓ―ve Bayes classifier for text classification
Week 6: Statistics & Inference
π Topics
- Descriptive Statistics
- Hypothesis Testing (T-tests, Chi-Square)
- Maximum Likelihood Estimation
π Resources
- The Elements of Statistical Learning by Hastie, Tibshirani, Friedman
- Think Stats by Allen B. Downey
π Exercises
- Perform hypothesis testing using scipy.stats
- Compute confidence intervals on sample datasets
π Applications in Data Science
- Feature selection and A/B testing
Week 7: Optimization Techniques
π Topics
- Gradient Descent & Variants (SGD, Adam, RMSprop)
- Lagrange Multipliers
- Convex vs.Β Non-Convex Optimization
π Resources
- Convex Optimization by Boyd & Vandenberghe
π Exercises
- Implement gradient descent from scratch in Python
- Optimize logistic regression parameters using gradient descent
π Applications in Data Science
- Training machine learning models efficiently
Week 8: Numerical Methods
π Topics
- Newtonβs Method
- Matrix Factorization (LU, QR)
- Iterative Methods (Jacobi, Gauss-Seidel)
π Resources
- Numerical Methods for Scientists and Engineers by R.W. Hamming
π Exercises
- Solve non-linear equations using Newtonβs method in Python
π Applications in Data Science
- Efficient computations in large-scale datasets
Week 9: Information Theory & Entropy
π Topics
- Entropy and Mutual Information
- KL Divergence & Cross-Entropy
- Shannonβs Theorem
π Resources
- Information Theory, Inference, and Learning Algorithms by David MacKay
π Exercises
- Compute entropy of a dataset using Python
π Applications in Data Science
- Feature selection in decision trees
Week 10: Graph Theory
π Topics
- Graph Representations & Adjacency Matrices
- Graph Traversal (DFS, BFS)
- PageRank Algorithm
π Resources
- Networks, Crowds, and Markets by Easley & Kleinberg
π Exercises
- Implement DFS and BFS in Python
- Compute PageRank on a sample graph
π Applications in Data Science
- Knowledge graph creation and analysis
Week 11: Time Series Analysis
π Topics
- Stationarity & Differencing
- Autoregressive Models (AR, MA, ARIMA)
- Fourier Transforms
π Resources
- Time Series Analysis and Its Applications by Shumway & Stoffer
π Exercises
- Implement ARIMA models using statsmodels
π Applications in Data Science
- Financial and economic forecasting
Week 12: Final Project & Consolidation
π― Objective: Apply all learned concepts in a capstone project
π Project Ideas
- Build a predictive model using PCA + Regression
- Implement an ML model and optimize it using gradient descent
- Perform statistical hypothesis testing on a real-world dataset