1. Derivative
Derivative can be defined on functions of a single variable ( f : R → R f: \mathbb{R} \rightarrow \mathbb{R} f:R→R).
2. Gradient
Gradient is a multi-variable generalization of the derivative ( f : R n → R f: \mathbb{R}^{n} \rightarrow \mathbb{R} f:Rn→R).
While a derivative can be defined on functions of a single variable, for scalar functions of several variables, the gradient takes its place.
∇ f ( p ) = [ ∂ f ∂ x 1 ( p ) ⋮ ∂ f ∂ x n ( p ) ] \nabla f(p)=\left[\begin{array}{c} \frac{\partial f}{\partial x_{1}}(p) \\ \vdots \\ \frac{\partial f}{\partial x_{n}}(p) \end{array}\right] ∇f(p)=⎣⎢⎡∂x1∂f(p)⋮∂xn∂f(p)⎦⎥⎤
3. Jacobian
雅克比矩阵实际上是对于梯度的一种泛化,对于值为标量的多变量的函数 f ( x ) f(x) f(x),我们使用梯度,对于值为向量的多变量的函数,使用雅可比矩阵.
The Jacobian is the generalization of the gradient for vector-valued functions of several variables( f : R n → R m f: \mathbb{R}^{n} \rightarrow \mathbb{R}^m f:Rn→Rm).
J = [ ∂ f ∂ x 1 ⋯ ∂ f ∂ x n ] = [ ∇ T f 1 ⋮ ∇ T f m ] = [ ∂ f 1 ∂