当前位置: 首页 > news >正文

Introduction to Machine Learning

1 Tradition

1.1 Perceptron

Formula.

Error Bound Theorem.

Multiclass.

1.2 Linear Regression

MLE & MAP.

Formula.

Regularization.

1.3 kNN

Graph.

Overfit lines.

1.4 Logistic Regression

MLE & MAP.

Softmax & sigmoid.

Multiclass.

1.5 SVM

1.5.1 Primal

Formula.

1.5.2 Dual

Formula.

1.5.3 Feature Map

Quadratic example.

1.5.4 Kernel Machine

Formula.

Gram Matrix.

Complexity analysis.

1.6 Bagging and Boosting

1.6.1 Bagging and OOB

Definition.

1.6.2 Boosting Algorithms

Definition.

1.7 GMM

1.7.1 k-Means and ++

1.7.2 E-M

1.8 Gradient Descent Methods

1.8.1 Projection

1.8.2 ADAG

Also called Adaptive Gradient.

1.8.3 RMSProp

1.8.4 ADAM

Also called Adaptive Moment.

1.9 Decision Trees

1.9.1 Random Forest

1.9.2 Hyperparameters

2 Modern

2.1 Back Propagation

2.2 Common NN

2.2.1 CNN

Optimize.

2.2.2 RNN

Also the first sequence model.

2.2.3 GRU

2.3 Generative Models

2.3.1 GAN

An adversarial setting.

2.3.2 VAE

A latent variable method, with \(\text{dim}(z)\ll \text{dim}(x)\).

We optimize ELBO.

Reparameterize.

2.3.3 Normalizing Flow

Also a latent variable method, but \(\text{dim}(z)= \text{dim}(x)\).

Assume the flow \(x=z_0\rightarrow z_1\rightarrow\cdots\rightarrow z_K=z\), \(f: x\rightarrow z\),

\[\log_{p_X}(x)=\log_{p_Z}(z_K)+\sum_{k=1}^K \log|\det (J_{f_k}(z_{k-1}))|. \]

Sometimes, can also notate \(T^{-1}:=\bigcup f\) with triangular assumption,

\[\log_{p_X}(x)=\log_{p_Z}(T^{-1}(x))-\sum_{j=1}^d\log\partial_j T_j(T^{-1}(x)). \]

2.3.4 An Expectation View

MLE (Maximum Likelihood Estimation) and MAP (Maximum A Posteriori):

  • MLE is frequentist, MAP is bayesian.

  • MLE is a special MAP.

  • Regularization = MAP.

  • CE and OLS are MLE.

Derivation of CE:

\[H(p,q)=\mathbb E_{y|x\sim p}[-\log q(y|x)]=-\sum_{k=1}^K p_k\log q_k. \]

Note \(H(p,q)\neq H(q,p)\).

Derivation of KL-divergence:

\[D_{\text{KL}}(p\| q)=H(p,q)-H(p)=\sum_{k=1}^K p_k\log\frac{p_k}{q_k}. \]

Note \(D_{\text{KL}}(p\| q)\neq D_{\text{KL}}(q\| p)\).

Derivation of F-divergence:

\[D_f(p\| q)=\mathbb E_{x\sim q}\left [f\left(\frac{p(x)}{q(x)}\right)\right]. \]

Note \(D_{f}(p\| q)\neq D_{f}(q\| p)\), \(f\) is convex such as \(f(t)=t\log t\).

2.4 Transformer Models

2.4.1 Examples

2.4.2 Scaling Law

2.5 Differential Privacy and Robustness

3 Tips

  • What's the difference between online vs. offline?

Online gains dataset one by one not in once, but offline does.

  • What is Statistical Learning?

\[\underset{w}{\text{argmin}}\;\;\mathbb E_{(x,y)\sim P}[\ell_w(X,Y)]. \]

  • What is an optimizer?

Generally, for first-order methods, answer GD is the optimizer. Or, say like AGD, APGD, ADAM..(they are all optimizers).

  • Common curves.

ReLU, sigmoid, softmax, hinge.

  • Remember MultiGaussian.

\[\frac1{(2\pi)^{\frac d2}\sqrt{|\Sigma|}}\exp \left(-\frac12 (\bm x-\bm \mu)^\top \Sigma^{-1} (\bm x-\bm \mu)\right). \]

  • What is a weighted-sum?

Such as Attention, Divergence and its variants.

  • What is the difference between discriminative and generative?

\[\underset{\theta}{\text{argmax}}\;\;\mathbb E_{(x,y)\sim p}[\log(p_\theta (y\mid x))]\quad \text{and}\quad \underset{\theta}{\text{argmax}}\;\;\mathbb E_{x\sim p}[\log p_\theta (x)]. \]

  • What is the difference between \(p\) and \(q\)?

\(p\) is real distribution, \(q\) is fake distribution.

http://www.gsyq.cn/news/82010.html

相关文章:

  • 抛丸机生产厂家排名有哪些?2025广东抛丸机生产厂家推荐 - 栗子测评
  • 2025精选车间净化设计公司权威排行 - 栗子测评
  • Calibre-Web 用户创建功能中的存储型XSS漏洞详解
  • 2025年下半年内蒙古坑道钻机企业推荐榜单:前十名企业深度解析 - 2025年11月品牌推荐榜
  • 2025年下半年内蒙古勘探钻机企业综合推荐榜单:十大优质厂商深度解析 - 2025年11月品牌推荐榜
  • 2025年下半年内蒙古探水钻机企业综合评估与选购指南 - 2025年11月品牌推荐榜
  • Java 在现代企业级开发中的核心价值再认识:从语言特性到架构实践的全景解析
  • C++ 信号处理
  • 寻求“完整性理性”:AI元人文构想与价值-技术的统一场论
  • 团队作业5——测试与发布(Alpha版本)
  • IO多路复用
  • 251210今天居然被妹妹鼓励到了
  • AI元人文构想:从统一描述语言到文明的操作系统
  • STM32-bootloader引导程序跳转机制笔记 - 教程
  • Debian 13安装rime中文输入法
  • 中介者模式
  • [LabVIEW随笔-10] -基础知识3
  • linux进线程
  • Claude使用体验
  • 适合 Linux 强者的方式 freertos学习
  • 真相:大量硬件场景 根本跑不了 Linux
  • 2025最新玻璃胶推荐!国内优质玻璃胶品牌权威榜单发布,资质与性能双优助力高效密封 - 全局中转站
  • AshPostgres 政策绕过漏洞:空原子更新操作可能触发副作用
  • Supabase 实战指南:从零开始搭建数据库、配置 Auth 并接入 Resend 邮件服务
  • 2025 最新水泥基渗透结晶型防水涂料厂家 TOP5 评测!技术创新 + 工程实证专业榜单发布,构筑混凝土长效防护屏障 - 全局中转站
  • python: 用os库判断进程是否在运行中?
  • 2025.12.10总结
  • Enhance European/American Repairs with OTOFIX D1 Plus 1-Year Update Subscription
  • [Git] [GitHub] 如何在将本地代码推送到github
  • 2025最新AWHFVC防腐厂家TOP5评测!混凝土防腐品牌年度榜单,技术创新+长效防护,守护工业基建安全防线 - 全局中转站