|
|
|
## Machine Learning
|
|
|
|
|
|
|
|
### Machine Learning: The Basics
|
|
|
|
|
|
|
|
Topics to review so you don't get weeded out.
|
|
|
|
* Supervised learning
|
|
|
|
* Unsupervised learning
|
|
|
|
* Semi-supervised learning
|
|
|
|
* Modeling business decisions usually relies on supervised and unsupervised learning.
|
|
|
|
* Classification and regression are the most common machine learning tasks.
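
To make the split concrete, here is a minimal pure-Python sketch (toy data invented for illustration): supervised learning fits a mapping from labeled pairs, while unsupervised learning finds structure without labels.

```python
# Supervised: labels are given; learn a rule that maps x to y.
# Here we recover the slope of y = 2x by least squares.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
slope = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

# Unsupervised: no labels; find structure, e.g. assign each point
# to the nearer of two fixed centroids.
points = [0.1, 0.2, 9.8, 9.9]
centroids = [0.0, 10.0]
labels = [min(range(2), key=lambda k: abs(p - centroids[k])) for p in points]

print(slope)   # 2.0
print(labels)  # [0, 0, 1, 1]
```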
|
|
|
|
|
|
|
|
### Machine Learning: The Full Topics List
|
|
|
|
|
|
|
|
A list of topics focusing on the theoretical components:
|
|
|
|
|
|
|
|
* Supervised Learning
|
|
|
|
* VC Dimension
|
|
|
|
* PAC Learning
|
|
|
|
* Noise
|
|
|
|
* Regression
|
|
|
|
* Model Selection
|
|
|
|
* Dimensions of a Supervised Learning Algorithm
|
|
|
|
|
|
|
|
* Bayesian Decision Theory
|
|
|
|
* Classification
|
|
|
|
* Losses and Risks
|
|
|
|
* Discriminant Functions
|
|
|
|
* Utility Theory
|
|
|
|
* Association Rules
|
|
|
|
|
|
|
|
* Parametric Models
|
|
|
|
* Maximum Likelihood Estimation
|
|
|
|
* Building Estimators
|
|
|
|
* Bayes Estimator
|
|
|
|
* Parametric Classification
|
|
|
|
* Regression
|
|
|
|
* Tuning Model Complexity
|
|
|
|
* Model Selection
|
|
|
|
|
|
|
|
* Multivariate Methods
|
|
|
|
* Multivariate Data
|
|
|
|
* Parameter Estimation
|
|
|
|
* Missing Values
|
|
|
|
* Multivariate Normal
|
|
|
|
* Multivariate Classification
|
|
|
|
* Tuning Complexity
|
|
|
|
* Discrete Features
|
|
|
|
* Multivariate Regression
|
|
|
|
|
|
|
|
* Dimensionality Reduction
|
|
|
|
* Subset Selection
|
|
|
|
* PCA
|
|
|
|
* Factor Analysis
|
|
|
|
* Multidimensional Scaling
|
|
|
|
* LDA
|
|
|
|
* Isomap
|
|
|
|
* Local Linear Embedding
|
|
|
|
|
|
|
|
* Clustering
|
|
|
|
* Mixture Densities
|
|
|
|
* k-Means
|
|
|
|
* Expectation Maximization Algorithm
|
|
|
|
* Mixtures of Latent Variables
|
|
|
|
* Supervised Learning after Clustering
|
|
|
|
* Hierarchical Clustering
|
|
|
|
* Number of Clusters
|
|
|
|
|
|
|
|
* Nonparametric Methods
|
|
|
|
* Decision Trees
|
|
|
|
* Linear Discrimination
|
|
|
|
* Multilayer Perceptrons
|
|
|
|
* Local Models
|
|
|
|
* Kernel Machines
|
|
|
|
* Bayesian Estimation
|
|
|
|
* Hidden Markov Models
|
|
|
|
* Graphical Models
|
|
|
|
* Combining Multiple Learners
|
|
|
|
* Reinforcement Learning
|
|
|
|
* Design and Analysis of Machine Learning Experiments
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
A long and full list of types of models under each sub-heading:
|
|
|
|
|
|
|
|
* Regression: **Model the relationship between variables, iteratively refining the fit with an error measure.**
|
|
|
|
* Linear Regression
|
|
|
|
* Logistic Regression
|
|
|
|
* OLS (Ordinary Least Squares) Regression
|
|
|
|
* Stepwise Regression
|
|
|
|
* MARS (Multivariate Adaptive Regression Splines)
|
|
|
|
* LOESS (Locally Estimated Scatterplot Smoothing)
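
As a concrete sketch of the family, OLS has a closed-form fit via the normal equations; this toy example (made-up, noise-free data) recovers y = 3x + 1 exactly.

```python
import numpy as np

# Toy data generated from y = 3x + 1 (no noise, so the fit is exact).
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 3.0 * x + 1.0

# Design matrix with an intercept column, then solve the normal
# equations X^T X beta = X^T y via least squares.
X = np.column_stack([np.ones_like(x), x])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)

print(beta)  # intercept ~ 1, slope ~ 3
```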
|
|
|
|
|
|
|
|
* Instance Based: **Build up a database of examples and compare new data against it; winner-take-all or memory-based learning.**
|
|
|
|
* k-Nearest Neighbor
|
|
|
|
* Learning Vector Quantization
|
|
|
|
* Self-Organizing Map
|
|
|
|
* Locally Weighted Learning
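
A minimal k-nearest-neighbor sketch (toy 2-D data; `knn_predict` is a name made up for this example): classify a new point by majority vote among the k closest stored examples.

```python
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote of its k nearest neighbours.
    train: list of (feature_vector, label) pairs."""
    dist = lambda a, b: sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    nearest = sorted(train, key=lambda tl: dist(tl[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

train = [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"),
         ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
print(knn_predict(train, (0.2, 0.3)))  # a
print(knn_predict(train, (5.1, 5.2)))  # b
```

Note there is no training step at all: the "model" is the stored data, which is what "memory-based" means.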
|
|
|
|
|
|
|
|
* Regularization: **Extension made to other methods, penalizes model complexity, favors simpler and more generalizable models.**
|
|
|
|
* Ridge Regression
|
|
|
|
* LASSO (Least Absolute Shrinkage and Selection Operator)
|
|
|
|
* Elastic Net
|
|
|
|
* LARS (Least Angle Regression)
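
A sketch of how the penalty works, using ridge regression's closed form (toy data; the `ridge` helper is invented here): the larger the penalty `lam`, the more the weights shrink toward zero.

```python
import numpy as np

def ridge(X, y, lam=1.0):
    """Closed-form ridge regression: (X^T X + lam * I)^{-1} X^T y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ y)

X = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
y = np.array([1.0, 1.0, 2.0])

w_ols = ridge(X, y, lam=0.0)     # no penalty: exact fit, weights [1, 1]
w_ridge = ridge(X, y, lam=10.0)  # heavy penalty shrinks weights toward 0

print(w_ols, w_ridge)
```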
|
|
|
|
|
|
|
|
* Decision Tree: **Construct a model of decisions made on actual values of attributes in the data.**
|
|
|
|
* Classification and Regression Tree
|
|
|
|
* CHAID (Chi-Squared Automatic Interaction Detection)
|
|
|
|
* Conditional Decision Trees
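
A minimal sketch of the idea behind tree induction, reduced to a depth-1 tree (a decision stump) on a single made-up feature: pick the split on an attribute value that minimizes classification error. Real tree learners apply this search recursively, usually with impurity measures like Gini or entropy rather than raw error.

```python
def best_stump(xs, ys):
    """Find the threshold on a 1-D feature that best separates two classes.
    Returns (threshold, error_count) for the rule: predict 1 if x >= t."""
    best = (None, len(ys) + 1)
    for t in sorted(set(xs)):
        preds = [1 if x >= t else 0 for x in xs]
        errors = sum(p != y for p, y in zip(preds, ys))
        if errors < best[1]:
            best = (t, errors)
    return best

xs = [1, 2, 3, 10, 11, 12]
ys = [0, 0, 0, 1, 1, 1]
print(best_stump(xs, ys))  # (10, 0): "x >= 10" classifies perfectly
```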
|
|
|
|
|
|
|
|
* Bayesian: **Methods explicitly applying Bayes' Theorem for classification and regression problems.**
|
|
|
|
* Naive Bayes
|
|
|
|
* Gaussian Naive Bayes
|
|
|
|
* Multinomial Naive Bayes
|
|
|
|
* Bayesian Network
|
|
|
|
* BBN (Bayesian Belief Network)
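
A categorical naive Bayes sketch (toy data and function names invented here; a simple add-one smoothing variant is used): multiply the class prior by per-feature likelihoods, assuming features are conditionally independent given the class.

```python
from collections import Counter, defaultdict

def train_nb(rows, labels):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(labels)
    feat_counts = defaultdict(Counter)  # (feature_index, class) -> value counts
    for row, c in zip(rows, labels):
        for i, v in enumerate(row):
            feat_counts[(i, c)][v] += 1
    return class_counts, feat_counts

def predict_nb(model, row):
    """Pick the class maximizing prior * product of smoothed likelihoods."""
    class_counts, feat_counts = model
    total = sum(class_counts.values())
    best_c, best_p = None, -1.0
    for c, cc in class_counts.items():
        p = cc / total
        for i, v in enumerate(row):
            counts = feat_counts[(i, c)]
            p *= (counts[v] + 1) / (cc + len(counts) + 1)  # add-one smoothing
        if p > best_p:
            best_c, best_p = c, p
    return best_c

rows = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"), ("rainy", "cold")]
labels = ["play", "play", "stay", "stay"]
model = train_nb(rows, labels)
print(predict_nb(model, ("sunny", "mild")))  # play
```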
|
|
|
|
|
|
|
|
* Clustering: **Centroid-based and hierarchical modeling approaches; groups of maximum commonality.**
|
|
|
|
* k-Means
|
|
|
|
* k-Medians
|
|
|
|
* Expectation Maximization
|
|
|
|
* Hierarchical Clustering
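
A sketch of k-means (Lloyd's algorithm) on toy 2-D data, assuming NumPy is available: alternate between assigning points to the nearest centroid and moving centroids to their cluster means.

```python
import numpy as np

def kmeans(points, k, iters=10, seed=0):
    """Lloyd's algorithm: alternate assignment and centroid update."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), k, replace=False)]
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        d = np.linalg.norm(points[:, None] - centroids[None, :], axis=2)
        labels = d.argmin(axis=1)
        # Move each centroid to the mean of its assigned points.
        centroids = np.array([points[labels == j].mean(axis=0) for j in range(k)])
    return labels, centroids

pts = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.1, 4.9]])
labels, cents = kmeans(pts, k=2)
print(labels)  # first two points share one label, last two the other
```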
|
|
|
|
|
|
|
|
* Association Rule Algorithms: **Extract rules that best explain relationships between variables in data.**
|
|
|
|
* Apriori Algorithm
|
|
|
|
* Eclat Algorithm
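
A brute-force sketch of the underlying quantities (toy transactions invented here): support counts how often an itemset occurs, and confidence measures how often a rule's consequent follows its antecedent. Apriori's contribution is pruning this search using the fact that a set can only be frequent if all of its subsets are frequent.

```python
from itertools import combinations

transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

def support(itemset):
    """Fraction of transactions containing every item in `itemset`."""
    return sum(itemset <= t for t in transactions) / len(transactions)

# Brute-force frequent itemsets of size 1 and 2 at minimum support 0.5.
items = sorted(set().union(*transactions))
frequent = [set(c) for r in (1, 2) for c in combinations(items, r)
            if support(set(c)) >= 0.5]

# Confidence of the rule {bread} -> {milk}: P(milk | bread).
confidence = support({"bread", "milk"}) / support({"bread"})
print(len(frequent), confidence)
```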
|
|
|
|
|
|
|
|
* Neural Networks: **Inspired by the structure and function of biological neural networks; used for regression and classification problems.**
|
|
|
|
* Radial Basis Function Network (RBFN)
|
|
|
|
* Perceptron
|
|
|
|
* Back-Propagation
|
|
|
|
* Hopfield Network
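
A minimal perceptron sketch (learning logical AND on a toy encoding with targets in {-1, +1}; the helper name is invented here): update the weights only when the current prediction is wrong.

```python
def train_perceptron(data, epochs=10, lr=1.0):
    """Rosenblatt's perceptron rule on linearly separable data.
    data: list of (inputs, target) pairs with targets in {-1, +1}."""
    n = len(data[0][0])
    w, b = [0.0] * n, 0.0
    for _ in range(epochs):
        for x, t in data:
            pred = 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1
            if pred != t:  # update only on mistakes
                w = [wi + lr * t * xi for wi, xi in zip(w, x)]
                b += lr * t
    return w, b

# Learn logical AND (with -1 as "false").
data = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), 1)]
w, b = train_perceptron(data)
preds = [1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1
         for x, _ in data]
print(preds)  # [-1, -1, -1, 1]
```

Back-propagation generalizes this idea to multilayer networks by propagating the error gradient through the layers.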
|
|
|
|
|
|
|
|
* Deep Learning: **Neural networks that exploit cheap and abundant computational power; often semi-supervised, trained on lots of data.**
|
|
|
|
* Convolutional Neural Network (CNN)
|
|
|
|
* Recurrent Neural Network (RNN)
|
|
|
|
* Long-Short-Term Memory Network (LSTM)
|
|
|
|
* Deep Boltzmann Machine (DBM)
|
|
|
|
* Deep Belief Network (DBN)
|
|
|
|
* Stacked Auto-Encoders
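
The workhorse of CNNs is the convolution; a "valid" 1-D version can be sketched in a few lines of NumPy (toy signal and kernel invented here).

```python
import numpy as np

def conv1d(signal, kernel):
    """'Valid' 1-D convolution (really cross-correlation, as in most
    deep learning libraries): slide the kernel and take dot products."""
    n = len(signal) - len(kernel) + 1
    return np.array([signal[i:i + len(kernel)] @ kernel for i in range(n)])

x = np.array([0.0, 0.0, 1.0, 1.0, 1.0, 0.0])
edge = np.array([-1.0, 1.0])  # responds to upward steps in the signal
print(conv1d(x, edge))  # [ 0.  1.  0.  0. -1.]
```

In a real CNN the kernel weights are not hand-picked like `edge` above; they are learned by gradient descent.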
|
|
|
|
|
|
|
|
* Dimensionality Reduction: **Find inherent structure in data, in an unsupervised manner, to describe data using less information.**
|
|
|
|
* PCA
|
|
|
|
* t-SNE
|
|
|
|
* PLS (Partial Least Squares Regression)
|
|
|
|
* Sammon Mapping
|
|
|
|
* Multidimensional Scaling
|
|
|
|
* Projection Pursuit
|
|
|
|
* Principal Component Regression
|
|
|
|
* Partial Least Squares Discriminant Analysis
|
|
|
|
* Mixture Discriminant Analysis
|
|
|
|
* Quadratic Discriminant Analysis
|
|
|
|
* Regularized Discriminant Analysis
|
|
|
|
* Linear Discriminant Analysis
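
A PCA sketch via the SVD of the centered data matrix (toy, nearly collinear points; the `pca` helper is a name invented for this example): projecting onto the top principal component keeps most of the variance while halving the dimensionality.

```python
import numpy as np

def pca(X, n_components):
    """Project centered data onto its top principal components via SVD."""
    Xc = X - X.mean(axis=0)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T  # scores in the reduced space

# Points lying (almost) on the line y = x: one component captures them.
X = np.array([[0.0, 0.1], [1.0, 0.9], [2.0, 2.1], [3.0, 2.9]])
Z = pca(X, 1)
print(Z.shape)  # (4, 1)
```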
|
|
|
|
|
|
|
|
* Ensemble: **Models composed of multiple weaker models, independently trained, that provide a combined prediction.**
|
|
|
|
* Random Forest
|
|
|
|
* Gradient Boosting Machines (GBM)
|
|
|
|
* Boosting
|
|
|
|
* Bootstrapped Aggregation (Bagging)
|
|
|
|
* AdaBoost
|
|
|
|
* Stacked Generalization (Blending)
|
|
|
|
* Gradient Boosted Regression Trees
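
A bagging sketch (toy 1-D data; the 1-nearest-neighbour base learner and function names are invented for this example): train each base learner on a bootstrap resample of the data, then combine their predictions by majority vote.

```python
import random
from collections import Counter

def nn_label(sample, query):
    """1-nearest-neighbour base learner: label of the closest point."""
    return min(sample, key=lambda xy: abs(xy[0] - query))[1]

def bagged_predict(train, query, n_models=15, seed=0):
    """Bootstrap Aggregation: fit each base learner on a resample
    of the data, then combine predictions by majority vote."""
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(n_models):
        sample = [rng.choice(train) for _ in train]  # bootstrap resample
        votes[nn_label(sample, query)] += 1
    return votes.most_common(1)[0][0]

train = [(0.0, "a"), (0.5, "a"), (1.0, "a"), (9.0, "b"), (9.5, "b"), (10.0, "b")]
print(bagged_predict(train, 0.7))  # a
print(bagged_predict(train, 9.2))  # b
```

Random Forest follows the same recipe with decision trees as base learners, plus random feature subsampling at each split; boosting differs in that its learners are trained sequentially, each focusing on the previous learners' mistakes.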
|
|
|
|
|