雅克比矩阵学习笔记
创始人
2025-06-01 01:59:34

前置

假设f:Rn→Rmf:R_n\to R_mf:Rn​→Rm​是从nnn维欧氏空间线性映射到到mmm维欧氏空间的函数,这个函数由mmm个实函数组成,记作:
{y1=f1(x1,x2,...,xn)y2=f2(x1,x2,...,xn)...ym=fn(x1,x2,...,xn)\left\{ \begin{array}{lcl} y_1=f_1(x1,x2,...,x_n)\\ y_2=f_2(x1,x2,...,x_n)\\ ...\\ y_m=f_n(x1,x2,...,x_n)\\ \end{array} \right. ⎩⎧​y1​=f1​(x1,x2,...,xn​)y2​=f2​(x1,x2,...,xn​)...ym​=fn​(x1,x2,...,xn​)​

我们对f(Z),Z=(x1,x2,...,xn)f(Z),Z=(x_1,x_2,...,x_n)f(Z),Z=(x1​,x2​,...,xn​)进行一阶泰勒展开,
f(Z)=f(Z0)+Jf(Z0)(Z−Z0)f(Z)=f(Z_0)+J_f(Z_0)(Z-Z_0)f(Z)=f(Z0​)+Jf​(Z0​)(Z−Z0​)

其中f(Z0)f(Z_0)f(Z0​)就是f(Z)f(Z)f(Z)在Z0Z_0Z0​的导数,在这里就是f(Z)f(Z)f(Z)的雅克比矩阵。
值得注意的是,由于只展开到一阶,因此存在误差,故这里的等号并不是严格意义上的相等。

定义

m∗nm*nm∗n的雅克比矩阵:
[∂f1∂x1∂f1∂x2...∂f1∂xn∂f2∂x1∂f2∂x2...∂f2∂xn............∂fm∂x1∂fm∂x2...∂fm∂xn]\left[ \begin{matrix} \frac{\partial f_1}{\partial x_1}&\frac{\partial f_1}{\partial x_2}&...&\frac{\partial f_1}{\partial x_n}\\ \frac{\partial f_2}{\partial x_1}&\frac{\partial f_2}{\partial x_2}&...&\frac{\partial f_2}{\partial x_n}\\ ...&...&...&...\\ \frac{\partial f_m}{\partial x_1}&\frac{\partial f_m}{\partial x_2}&...&\frac{\partial f_m}{\partial x_n}\\ \end{matrix} \right]​∂x1​∂f1​​∂x1​∂f2​​...∂x1​∂fm​​​∂x2​∂f1​​∂x2​∂f2​​...∂x2​∂fm​​​............​∂xn​∂f1​​∂xn​∂f2​​...∂xn​∂fm​​​
当然也可以写成行矩阵的形式:
[∂f∂x1∂f∂x2...∂f∂xn]\left[ \begin{matrix} \frac{\partial f}{\partial x_1}&\frac{\partial f}{\partial x_2}&...&\frac{\partial f}{\partial x_n}\\ \end{matrix} \right][∂x1​∂f​​∂x2​∂f​​...​∂xn​∂f​​]
这也正是梯度矩阵的转置矩阵,即Jf(Z)=∇f(Z)TJ_f(Z)=\nabla f(Z)^TJf​(Z)=∇f(Z)T。

雅克比行列式

当n=mn=mn=m时,雅克比矩阵就变为了:
[∂f1∂x1∂f1∂x2...∂f1∂xn∂f2∂x1∂f2∂x2...∂f2∂xn............∂fn∂x1∂fn∂x2...∂fn∂xn]\left[ \begin{matrix} \frac{\partial f_1}{\partial x_1}&\frac{\partial f_1}{\partial x_2}&...&\frac{\partial f_1}{\partial x_n}\\ \frac{\partial f_2}{\partial x_1}&\frac{\partial f_2}{\partial x_2}&...&\frac{\partial f_2}{\partial x_n}\\ ...&...&...&...\\ \frac{\partial f_n}{\partial x_1}&\frac{\partial f_n}{\partial x_2}&...&\frac{\partial f_n}{\partial x_n}\\ \end{matrix} \right]​∂x1​∂f1​​∂x1​∂f2​​...∂x1​∂fn​​​∂x2​∂f1​​∂x2​∂f2​​...∂x2​∂fn​​​............​∂xn​∂f1​​∂xn​∂f2​​...∂xn​∂fn​​​

对上面的展开式进行移项,得:
f(Z)−f(Z0)=Jf(Z0)(Z−Z0)f(Z)-f(Z_0)=J_f(Z_0)(Z-Z_0)f(Z)−f(Z0​)=Jf​(Z0​)(Z−Z0​)

记Z−Z0=Δx,f(Z)−f(Z0)=ΔyZ-Z_0=\Delta x,f(Z)-f(Z_0)=\Delta yZ−Z0​=Δx,f(Z)−f(Z0​)=Δy,于是有:
Δy=Jf(Z0)Δx\Delta y=J_f(Z_0)\Delta xΔy=Jf​(Z0​)Δx

展开,有:
[dy1dy2...dyn]=[∂f1∂x1∂f1∂x2...∂f1∂xn∂f2∂x1∂f2∂x2...∂f2∂xn............∂fn∂x1∂fn∂x2...∂fn∂xn]∗[dx1dx2...dxn]\left[ \begin{matrix} \mathrm{d}y_1\\ \mathrm{d}y_2\\ ...\\ \mathrm{d}y_n\\ \end{matrix} \right]= \left[ \begin{matrix} \frac{\partial f_1}{\partial x_1}&\frac{\partial f_1}{\partial x_2}&...&\frac{\partial f_1}{\partial x_n}\\ \frac{\partial f_2}{\partial x_1}&\frac{\partial f_2}{\partial x_2}&...&\frac{\partial f_2}{\partial x_n}\\ ...&...&...&...\\ \frac{\partial f_n}{\partial x_1}&\frac{\partial f_n}{\partial x_2}&...&\frac{\partial f_n}{\partial x_n}\\ \end{matrix} \right]* \left[ \begin{matrix} \mathrm{d}x_1\\ \mathrm{d}x_2\\ ...\\ \mathrm{d}x_n\\ \end{matrix} \right]​dy1​dy2​...dyn​​​=​∂x1​∂f1​​∂x1​∂f2​​...∂x1​∂fn​​​∂x2​∂f1​​∂x2​∂f2​​...∂x2​∂fn​​​............​∂xn​∂f1​​∂xn​∂f2​​...∂xn​∂fn​​​​∗​dx1​dx2​...dxn​​
继续展开,有:
[dy1dy2...dyn]=[∂f1∂x1dx1+∂f1∂x2dx2+...+∂f1∂xndxn∂f2∂x1dx1+∂f2∂x2dx2+...+∂f2∂xndxn...∂fn∂x1dx1+∂fn∂x2dx2+...+∂fn∂xndxn]\left[ \begin{matrix} \mathrm{d}y_1\\ \mathrm{d}y_2\\ ...\\ \mathrm{d}y_n\\ \end{matrix} \right]= \left[ \begin{matrix} \frac{\partial f_1}{\partial x_1}\mathrm{d}x_1+\frac{\partial f_1}{\partial x_2}\mathrm{d}x_2+...+\frac{\partial f_1}{\partial x_n}\mathrm{d}x_n\\ \frac{\partial f_2}{\partial x_1}\mathrm{d}x_1+\frac{\partial f_2}{\partial x_2}\mathrm{d}x_2+...+\frac{\partial f_2}{\partial x_n}\mathrm{d}x_n\\ ...\\ \frac{\partial f_n}{\partial x_1}\mathrm{d}x_1+\frac{\partial f_n}{\partial x_2}\mathrm{d}x_2+...+\frac{\partial f_n}{\partial x_n}\mathrm{d}x_n \end{matrix} \right]​dy1​dy2​...dyn​​​=​∂x1​∂f1​​dx1​+∂x2​∂f1​​dx2​+...+∂xn​∂f1​​dxn​∂x1​∂f2​​dx1​+∂x2​∂f2​​dx2​+...+∂xn​∂f2​​dxn​...∂x1​∂fn​​dx1​+∂x2​∂fn​​dx2​+...+∂xn​∂fn​​dxn​​
正交化一下,有:
[dy10...00dy2...0............00...dyn]=[∂f1∂x1dx1∂f1∂x2dx2...∂f1∂xndxn∂f2∂x1dx1∂f2∂x2dx2...∂f2∂xndxn............∂fn∂x1dx1∂fn∂x2dx2...∂fn∂xndxn]\left[ \begin{matrix} \mathrm{d}y_1&0&...&0\\ 0&\mathrm{d}y_2&...&0\\ ...&...&...&...\\ 0&0&...&\mathrm{d}y_n\\ \end{matrix} \right]= \left[ \begin{matrix} \frac{\partial f_1}{\partial x_1}\mathrm{d}x_1&\frac{\partial f_1}{\partial x_2}\mathrm{d}x_2&...&\frac{\partial f_1}{\partial x_n}\mathrm{d}x_n\\ \frac{\partial f_2}{\partial x_1}\mathrm{d}x_1&\frac{\partial f_2}{\partial x_2}\mathrm{d}x_2&...&\frac{\partial f_2}{\partial x_n}\mathrm{d}x_n\\ ...&...&...&...\\ \frac{\partial f_n}{\partial x_1}\mathrm{d}x_1&\frac{\partial f_n}{\partial x_2}\mathrm{d}x_2&...&\frac{\partial f_n}{\partial x_n}\mathrm{d}x_n \end{matrix} \right]​dy1​0...0​0dy2​...0​............​00...dyn​​​=​∂x1​∂f1​​dx1​∂x1​∂f2​​dx1​...∂x1​∂fn​​dx1​​∂x2​∂f1​​dx2​∂x2​∂f2​​dx2​...∂x2​∂fn​​dx2​​............​∂xn​∂f1​​dxn​∂xn​∂f2​​dxn​...∂xn​∂fn​​dxn​​

取两边的行列式(特别注意的是,由于{dx},{dy}\{\mathrm{d}x\},\{\mathrm{d}y\}{dx},{dy}均为正数,因此行列式需取绝对值),有:
dy1⋅dy2⋅...⋅dyn=∣∣∂f1∂x1∂f1∂x2...∂f1∂xn∂f2∂x1∂f2∂x2...∂f2∂xn............∂fn∂x1∂fn∂x2...∂fn∂xn∣∣⋅dx1⋅dx2⋅...⋅dxn\mathrm{d}y_1\cdot \mathrm{d}y_2\cdot...\cdot\mathrm{d}y_n=\left| \begin{vmatrix} \frac{\partial f_1}{\partial x_1}&\frac{\partial f_1}{\partial x_2}&...&\frac{\partial f_1}{\partial x_n}\\ \frac{\partial f_2}{\partial x_1}&\frac{\partial f_2}{\partial x_2}&...&\frac{\partial f_2}{\partial x_n}\\ ...&...&...&...\\ \frac{\partial f_n}{\partial x_1}&\frac{\partial f_n}{\partial x_2}&...&\frac{\partial f_n}{\partial x_n}\\ \end{vmatrix} \right|\cdot\mathrm{d}x_1\cdot\mathrm{d}x_2\cdot...\cdot\mathrm{d}x_ndy1​⋅dy2​⋅...⋅dyn​=​∂x1​∂f1​​∂x1​∂f2​​...∂x1​∂fn​​​∂x2​∂f1​​∂x2​∂f2​​...∂x2​∂fn​​​............​∂xn​∂f1​​∂xn​∂f2​​...∂xn​∂fn​​​​⋅dx1​⋅dx2​⋅...⋅dxn​

即:
dy1⋅dy2⋅...⋅dyn=∣∣Jf(Z)∣∣⋅dx1⋅dx2⋅...⋅dxn\mathrm{d}y_1\cdot \mathrm{d}y_2\cdot...\cdot\mathrm{d}y_n=||J_f(Z)||\cdot\mathrm{d}x_1\cdot\mathrm{d}x_2\cdot...\cdot\mathrm{d}x_ndy1​⋅dy2​⋅...⋅dyn​=∣∣Jf​(Z)∣∣⋅dx1​⋅dx2​⋅...⋅dxn​

上式在n=1n=1n=1时,也即在一元函数中,可以理解为原线段的长度经过∣∣Jf(Z)∣∣||J_f(Z)||∣∣Jf​(Z)∣∣的缩放得到新线段的长度;
上式在n=2n=2n=2时,也即在二元函数中,可以理解为原平面图形的面积经过∣∣Jf(Z)∣∣||J_f(Z)||∣∣Jf​(Z)∣∣的缩放得到新平面图形的面积;
上式在n=3n=3n=3时,也即在三元函数中,可以理解为原平面图形的体积经过∣∣Jf(Z)∣∣||J_f(Z)||∣∣Jf​(Z)∣∣的缩放得到新平面图形的体积;
在n>3n>3n>3时直观上不好描述其几何意义,姑且不做讨论。
换言之,雅可比矩阵的行列式可以理解为原几何图形所确定的某种几何关系经过线性变化得到新几何图形的一种缩放比例。这也正是仿射变换

应用

通过仿射变换解决圆锥曲线中一些问题(高中数学常用);
和黑塞矩阵一起作为各种牛顿法的基础,也是梯度下降等算法的基础(稍后会写);
和机器人以及运动学有关(大雾~~)。

相关内容

热门资讯

白俄罗斯向乌克兰移交114人 当地时间13日,白俄罗斯向乌克兰移交114人。乌克兰国防部情报总局局长布达诺夫到场,迎接被移交人员。...
高峰对话:Agent能走多快,... 要让Agent成为工作助手,需要破解哪些环节?文|《中国企业家》记者 闫俊文编辑|何伊凡见习编辑|李...
马竞2:1险胜瓦伦西亚!主场1... 在西甲联赛第16轮的焦点对决中,马德里竞技坐镇主场迎战瓦伦西亚。最终,凭借科克和格列兹曼的进球,马竞...
如何重构新消费?5位消费公司掌... 好产品才能赢得消费者的心。文|《中国企业家》记者 胡楠楠编辑|米娜图片来源|中企图库12月6~7日,...
12亿存款“失踪”案,最新进展 昨天(12月12日),据证监会发布,经查,浙江优策投资管理有限公司(以下简称优策投资)及实际控制人存...