2. Nonlinear Equation f(x) = 0
2.1 Bisection Method
2.1.1 Root-Finding Problem
Root-Finding or Zero-Finding Problem
Given a function $f(x)$ in one variable $x$, find a root $x$ of an equation of the form $f(x)=0$.
The solution $x$ is called a root of the equation $f(x)=0$, or a zero of the function $f(x)$.
2.1.2 Bisection Algorithm
Prejudgment
By the Intermediate Value Theorem, if
$$f\in C[a,b] \quad \text{and} \quad f(a)f(b)<0,$$
then there exists at least one point $x^*\in (a,b)$ such that $f(x^*)=0$.
Algorithm Description

- INPUT: endpoints $a, b$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $c$ or message of failure.
- Step 1: Set $k = 1$, $FA=f(a)$.
- Step 2: While $k\leq N$, do Steps 3–6.
  - Step 3: Set $c=\frac{a+b}{2}$ and compute $FC=f(c)$.
  - Step 4: If $FC=0$ or $\frac{|b-a|}{2}<TOL$, then output $c$ (procedure completed successfully) and stop.
  - Step 5: If $FA\cdot FC<0$, then set $b=c$; else set $a=c$ and $FA=FC$.
  - Step 6: Set $k=k+1$.
- Step 7: OUTPUT "Method failed after $N$ iterations." STOP.
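The steps above can be sketched as a small Python function (the name `bisect` and the default tolerances are illustrative, not from the text):

```python
def bisect(f, a, b, tol=1e-8, max_iter=100):
    """Bisection method: find a root of f in [a, b] with f(a)*f(b) < 0."""
    fa = f(a)
    if fa * f(b) > 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    for _ in range(max_iter):
        c = (a + b) / 2                    # Step 3: midpoint
        fc = f(c)
        if fc == 0 or (b - a) / 2 < tol:   # Step 4: stopping test
            return c
        if fa * fc < 0:                    # Step 5: root lies in [a, c]
            b = c
        else:                              # root lies in [c, b]
            a, fa = c, fc
    raise RuntimeError(f"Method failed after {max_iter} iterations")

# Root of f(x) = x**2 - x - 2 in [1, 4] is x* = 2.
print(bisect(lambda x: x * x - x - 2, 1, 4))
```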
Other Stopping Criteria
Other stopping criteria for iterative procedures, given a tolerance $\varepsilon >0$:
$$|p_n-p_{n-1}|<\varepsilon,\qquad \frac{|p_n-p_{n-1}|}{|p_n|} < \varepsilon,\qquad |f(p_n)|< \varepsilon.$$
2.1.3 Convergence Analysis
Theorem
Suppose that $f\in C[a,b]$ and $f(a)f(b)<0$. The Bisection method generates a sequence $\{p_n\}_{n=1}^\infty$ approximating a zero $p$ of $f$ with
$$|p_n-p|\leq \frac{b-a}{2^n},\quad n\geq1.$$
Proof
By the procedure, we know that
$$|b_1-a_1|=|b-a|,\quad |b_2-a_2|=\frac{|b_1-a_1|}{2}=\frac{|b-a|}{2},\quad \dots,\quad |b_n-a_n|=\frac{|b_{n-1}-a_{n-1}|}{2}=\frac{|b-a|}{2^{n-1}}.$$
Since $p_n=\frac{a_n+b_n}{2}$ and $p\in (a_n,p_n]$ or $p\in [p_n,b_n)$ for all $n\geq 1$, it follows that
$$|p_n-p|\leq \frac{|b_n-a_n|}{2}=\frac{b-a}{2^n}.$$
Convergence Rate
Since
$$|p_n-p|\leq \frac{|b_n-a_n|}{2}=\frac{|b-a|}{2^n},$$
the sequence $\{p_n\}_{n=1}^\infty$ converges to $p$ with rate of convergence $O\left(\frac{1}{2^n}\right)$, that is,
$$p_n=p+O\left(\frac{1}{2^n}\right).$$
Other Property
Bisection is certain to converge, but does so slowly.
Given a starting interval $[a,b]$, the length of the interval after $k$ iterations is $\frac{b-a}{2^k}$, so achieving an error tolerance of $\varepsilon$ (i.e. $\frac{b-a}{2^k}<\varepsilon$) requires $k\approx\left\lceil \log_2\frac{b-a}{\varepsilon}\right\rceil$ iterations, regardless of the function $f$ involved.
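This iteration-count estimate can be checked in a couple of lines (a small sketch; the function name is illustrative):

```python
import math

def bisection_iterations(a, b, eps):
    """Number of bisection steps needed so that (b - a) / 2**k < eps."""
    return math.ceil(math.log2((b - a) / eps))

# Halving [0, 1] down to a width below 1e-6 takes 20 steps, since 2**20 > 1e6.
print(bisection_iterations(0, 1, 1e-6))  # → 20
```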
2.2 Fixed Point Method
2.2.1 Introduction
- A fixed point of a given function $g:\mathbb{R}\rightarrow \mathbb{R}$ is a value $x^*$ such that $x^*=g(x^*)$.
- Many iterative methods for solving nonlinear equations use a fixed-point iteration scheme of the form
$$x_{k+1}=g(x_k),$$
where the fixed points of $g$ are solutions of $f(x)=0$.
Example

If $f(x)=x^2-x-2$, it has two roots, $x^*=2$ and $x^*=-1$. Then the fixed points of each of the functions
$$g(x)=x^2-2,\qquad g(x)=\sqrt{x+2},\qquad g(x)=1+\frac{2}{x},\qquad g(x)=\frac{x^2+2}{2x-1}$$
are solutions to the equation $f(x)=0$.
2.2.2 Algorithm
Definition

- To approximate the fixed point of a function $g(x)$, we choose an initial approximation $p_0$ and generate the sequence $\{p_n\}^\infty_{n=0}$ by letting
$$p_n = g(p_{n-1}),\quad n=1,2,\dots$$
- If the sequence $\{p_n\}^\infty_{n=0}$ converges to $p$ and $g(x)$ is continuous, then we have
$$p = \lim_{n\rightarrow \infty}p_n=\lim_{n\rightarrow \infty}g(p_{n-1})=g\left(\lim_{n\rightarrow \infty}p_{n-1}\right)=g(p),$$
and a solution to $x=g(x)$ is obtained.
- The technique described above is called fixed-point iteration (or functional iteration).
Example
Pseudo-Code

- INPUT: initial approximation $p_0$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $p$ or message of failure.
- Step 1: Set $n = 1$.
- Step 2: While $n\leq N$, do Steps 3–5.
  - Step 3: Set $p= g(p_0)$.
  - Step 4: If $|p-p_0|<TOL$, then output $p$ (procedure completed successfully) and stop.
  - Step 5: Set $n=n+1$, $p_0=p$.
- Step 6: OUTPUT "Method failed after $N$ iterations." STOP.
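A minimal Python sketch of this pseudo-code (the name `fixed_point` and its defaults are assumptions):

```python
def fixed_point(g, p0, tol=1e-8, max_iter=100):
    """Fixed-point iteration: solve x = g(x) starting from p0."""
    for _ in range(max_iter):
        p = g(p0)                 # Step 3: one iteration step
        if abs(p - p0) < tol:     # Step 4: successive-iterate test
            return p
        p0 = p                    # Step 5: prepare next iteration
    raise RuntimeError(f"Method failed after {max_iter} iterations")

# g(x) = sqrt(x + 2) has the fixed point x* = 2, a root of x**2 - x - 2.
print(fixed_point(lambda x: (x + 2) ** 0.5, 1.5))
```

Note that $|g'(2)|=\frac{1}{4}<1$ here, which is why this particular $g$ converges.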
2.2.3 Existence and Uniqueness
Theorem: Sufficient Conditions for the Existence and Uniqueness of a Fixed Point
- 【Existence】 If $g(x)\in C[a,b]$ and $g(x)\in [a,b]$ for all $x\in [a,b]$, then $g(x)$ has a fixed point in $[a,b]$. (Here $g(x)\in [a,b]$ means $\max\{g(x)\}\leq b$ and $\min\{g(x)\}\geq a$.)
- 【Uniqueness】 If, in addition, $g'(x)$ exists on $(a,b)$ and a positive constant $k<1$ exists with $|g'(x)|\leq k$ for all $x\in (a,b)$, then the fixed point in $[a,b]$ is unique.
Proof for Existence
- If $g(a)=a$ or $g(b)=b$, then $g(x)$ has a fixed point at an endpoint.
- Suppose not; then it must be true that $g(a)>a$ and $g(b)<b$.
- Thus the function $h(x)=g(x)-x$ is continuous on $[a,b]$, and we have
$$h(a)=g(a)-a>0,\qquad h(b)=g(b)-b<0.$$
- The Intermediate Value Theorem implies that there exists $p\in (a,b)$ with $h(p)=0$.
- Thus $g(p)-p=0$, and $p$ is a fixed point of $g(x)$.

The proof above rewrites $g(x)=x$ as $h(x)=g(x)-x$, turning the fixed-point problem into a zero-existence problem.
Proof for Uniqueness
The proof is by contradiction.

- Suppose, in addition, that $|g'(x)|\leq k< 1$ and that $p$ and $q$ are both fixed points in $[a,b]$ with $p\not=q$.
- Then by the Mean Value Theorem, a number $\xi$ exists between $p$ and $q$, hence in $[a,b]$, with
$$\frac{g(p)-g(q)}{p-q}=g'(\xi).$$
- Then
$$|p-q|=|g(p)-g(q)|=|g'(\xi)|\,|p-q|\leq k|p-q|<|p-q|,$$
which is a contradiction.
- So $p=q$, and the fixed point in $[a,b]$ is unique.
2.2.4 Convergence Analysis
Theorem
If the two conditions above are met, then for any number $p_0\in[a,b]$, the sequence $\{p_n\}_0^\infty$ defined by
$$p_n=g(p_{n-1}),\quad n\geq 1,$$
converges to the unique fixed point $p$ in $[a,b]$.
Proof
The existence and uniqueness of the fixed point $p$ follow from the Intermediate Value Theorem argument above.

Using the fact that $|g'(x)|\leq k$ and the Mean Value Theorem, we have
$$|p_n-p|=|g(p_{n-1})-g(p)|=|g'(\xi)|\,|p_{n-1}-p|\leq k|p_{n-1}-p|,$$
where $\xi \in (a,b)$.

Applying this inequality inductively shows
$$|p_n-p|\leq k|p_{n-1}-p|\leq k^2|p_{n-2}-p|\leq \dots\leq k^n|p_{0}-p|.$$

Since $k<1$,
$$\lim_{n\rightarrow\infty}|p_n-p|\leq \lim_{n\rightarrow\infty}k^n|p_0-p|=0,$$
and $\{p_n\}_0^\infty$ converges to $p$.
Convergence Rate
Since
$$|p_n-p|\leq k^n|p_0-p|\leq k^n|b-a|,$$
the sequence $\{p_n\}_{n=0}^\infty$ converges to $p$ with rate of convergence $O(k^n)$, $k<1$; that is,
$$p_n=p+O(k^n),\quad k<1.$$
2.2.5 Bounds for the Error
Corollary
If the solution of $g(x)=x$ exists and is unique, then the bounds for the error involved in using $p_n$ to approximate $p$ are given by
$$|p_n-p|\leq k^n \max\{p_0-a,\,b-p_0\}$$
and
$$|p_n-p|\leq \frac{k^n}{1-k}|p_1-p_0|,$$
for all $n\geq 1$.
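The second bound yields an a-priori iteration count: the smallest $n$ with $\frac{k^n}{1-k}|p_1-p_0|<\varepsilon$. A small illustrative sketch (the contraction constant `k` and the first-step size are made-up inputs, not from the text):

```python
import math

def iterations_needed(k, first_step, eps):
    """Smallest n with k**n / (1 - k) * first_step < eps (assumes 0 < k < 1)."""
    # Solve k**n < eps * (1 - k) / first_step for n.
    return math.ceil(math.log(eps * (1 - k) / first_step) / math.log(k))

print(iterations_needed(0.5, 1.0, 1e-6))  # → 21
```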
Proof
$$|p_n-p_{n-1}|= |g(p_{n-1})-g(p_{n-2})|\leq k|p_{n-1}-p_{n-2}|\leq \dots\leq k^{n-1}|p_1-p_0|.$$

Let $m>n$. Using the triangle inequality $|a+b|\leq |a|+|b|$, we have
$$|p_m-p_n|\leq |p_m-p_{m-1}|+|p_{m-1}-p_{m-2}|+\dots+|p_{n+1}-p_n|\leq (k^{m-1}+k^{m-2}+\dots+k^n)|p_1-p_0|,$$
so
$$|p_m-p_n|\leq k^n(1+k+\dots+k^{m-n-1})|p_1-p_0|= k^n\,\frac{1-k^{m-n}}{1-k}|p_1-p_0|.$$

Letting $m\rightarrow\infty$, we have
$$|p-p_n|=\lim_{m\rightarrow\infty}|p_m-p_n|\leq \frac{k^n}{1-k}|p_1-p_0|.$$
2.3 Newton’s Method
2.3.1 Introduction
Status
The Newton-Raphson (or simply Newton's) method is one of the most powerful and well-known numerical methods for solving a root-finding problem $f(x)=0$.
Rough Description
(1) Suppose that $f\in C^2[a,b]$, and $x^*$ is a solution of $f(x)=0$.
(2) Let $\hat{x}\in [a,b]$ be an approximation to $x^*$ such that $f'(\hat{x})\not=0$ and $|\hat{x}-x^*|$ is "small".

Consider the first Taylor polynomial for $f(x)$ expanded about $\hat{x}$:
$$f(x)=f(\hat{x})+(x-\hat{x})f'(\hat{x})+\frac{(x-\hat{x})^2}{2}f''(\xi(x)),$$
where $\xi(x)$ lies between $x$ and $\hat{x}$.

Setting $x=x^*$ and using $f(x^*)=0$ gives
$$0=f(x^*)\approx f(\hat{x})+(x^*-\hat{x})f'(\hat{x}).$$

Solving for $x^*$ gives
$$x^*\approx \hat{x}-\frac{f(\hat{x})}{f'(\hat{x})}.$$
2.3.2 Algorithm
Definition
- Start with an initial approximation $x_0$.
- Define the iteration scheme by
$$x_n=x_{n-1}-\frac{f(x_{n-1})}{f'(x_{n-1})},\quad \forall n\geq 1.$$
- This scheme generates the sequence $\{x_n\}_0^\infty$.
Example
Pseudo-Code
The function $f$ is differentiable and $p_0$ is an initial approximation.

- INPUT: initial approximation $p_0$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $p$ or message of failure.
- Step 1: Set $n = 1$.
- Step 2: While $n\leq N$, do Steps 3–5.
  - Step 3: Set $p= p_0-\frac{f(p_0)}{f'(p_0)}$.
  - Step 4: If $|p-p_0|<TOL$, then output $p$ (procedure completed successfully) and stop.
  - Step 5: Set $n=n+1$, $p_0=p$.
- Step 6: OUTPUT "Method failed after $N$ iterations." STOP.
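A runnable Python sketch of this pseudo-code (function name and defaults are illustrative):

```python
def newton(f, df, p0, tol=1e-10, max_iter=50):
    """Newton's method: solve f(x) = 0 given the derivative df."""
    for _ in range(max_iter):
        p = p0 - f(p0) / df(p0)   # Step 3: Newton update
        if abs(p - p0) < tol:     # Step 4: stopping test
            return p
        p0 = p                    # Step 5
    raise RuntimeError(f"Method failed after {max_iter} iterations")

# Root of f(x) = x**2 - x - 2 near 3 is x* = 2.
print(newton(lambda x: x * x - x - 2, lambda x: 2 * x - 1, 3.0))
```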
2.3.3 Convergence
Theorem
(1) $f\in C^2[a,b]$.
(2) $p\in [a,b]$ is such that $f(p)=0$ and $f'(p)\not=0$.

Then there exists a $\delta>0$ such that Newton's method generates a sequence $\{p_n\}_1^\infty$ converging to $p$ for any initial approximation $p_0\in[p-\delta,p+\delta]$.
Proof
Newton's method is fixed-point iteration $p_n=g(p_{n-1})$ with
$$g(x)=x-\frac{f(x)}{f'(x)}.$$

What needs to be proved: following the convergence proof for the fixed-point method, we need to find an interval $[p-\delta,p+\delta]$ that $g$ maps into itself and on which $|g'(x)|\leq k<1$ (existence and uniqueness).

Proving process:

- $\exists \delta_1>0$ such that $g(x)\in C[p-\delta_1,p+\delta_1]$.
  Since $f'(p)\not=0$ and $f'$ is continuous, there exists $\delta_1>0$ such that $f'(x)\not= 0$ for all $x\in [p-\delta_1,p+\delta_1]$, so $g$ is defined and continuous on $[p-\delta_1,p+\delta_1]$.
- $g'(x)\in C[p-\delta_1,p+\delta_1]$.
  $$g'(x)=\frac{f(x)f''(x)}{(f'(x))^2}$$
  for all $x\in [p-\delta_1,p+\delta_1]$. Since $f\in C^2[a,b]$, $g'$ is continuous on $[p-\delta_1,p+\delta_1]$.
- $|g'(x)|\leq k< 1$ near $p$.
  Since $f(p)=0$, we have $g'(p)=0$. Since $g'\in C[p-\delta_1,p+\delta_1]$, there exists a $\delta$ with $0<\delta<\delta_1$ such that
  $$|g'(x)|\leq k<1, \quad \forall x\in [p-\delta,p+\delta].$$
- $g:[p-\delta,p+\delta]\mapsto [p-\delta,p+\delta]$.
  By the Mean Value Theorem, if $x\in[p-\delta,p+\delta]$, there exists $\xi$ between $x$ and $p$ with $|g(x)-g(p)|=|g'(\xi)|\,|x-p|$, so
  $$|g(x)-p|=|g(x)-g(p)|=|g'(\xi)|\,|x-p|\leq k|x-p|<|x-p|\leq\delta.$$
  Thus $g$ maps $[p-\delta,p+\delta]$ into itself.
According to the proving process above, all the hypotheses of the Fixed-Point Theorem are satisfied for
$$g(x)=x-\frac{f(x)}{f'(x)}.$$
Therefore, the sequence $\{p_n\}_{n=1}^\infty$ defined by
$$p_n=g(p_{n-1}),\quad \forall n\geq 1,$$
converges to $p$ for any $p_0\in[p-\delta,p+\delta]$.
2.3.4 Secant Method
Background
For Newton's method, each iteration requires evaluating both the function $f(x_k)$ and its derivative $f'(x_k)$, which may be inconvenient or expensive.
Improvement
The derivative is approximated by a finite difference using two successive iterates, so the iteration becomes
$$x_{k+1}=x_k-f(x_k)\,\frac{x_k-x_{k-1}}{f(x_k)-f(x_{k-1})}.$$
This method is known as the secant method.
Example
Pseudo-Code
- INPUT: initial approximations $p_0, p_1$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $p$ or message of failure.
- Step 1: Set $n = 2$, $q_0=f(p_0)$, $q_1=f(p_1)$.
- Step 2: While $n\leq N$, do Steps 3–5.
  - Step 3: Set $p= p_1-q_1\,\frac{p_1-p_0}{q_1-q_0}$. (Compute $p_n$.)
  - Step 4: If $|p-p_1|<TOL$, then output $p$ (procedure completed successfully) and stop.
  - Step 5: Set $n=n+1$, $p_0=p_1$, $p_1=p$, $q_0=q_1$, $q_1=f(p)$.
- Step 6: OUTPUT "Method failed after $N$ iterations." STOP.
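The same pseudo-code in Python (names and defaults assumed):

```python
def secant(f, p0, p1, tol=1e-10, max_iter=50):
    """Secant method: Newton-like updates with a finite-difference slope."""
    q0, q1 = f(p0), f(p1)
    for _ in range(max_iter):
        p = p1 - q1 * (p1 - p0) / (q1 - q0)   # Step 3: secant update
        if abs(p - p1) < tol:                  # Step 4: stopping test
            return p
        p0, p1, q0, q1 = p1, p, q1, f(p)       # Step 5: shift iterates
    raise RuntimeError(f"Method failed after {max_iter} iterations")

# Root of f(x) = x**2 - x - 2 between 1 and 3 is x* = 2.
print(secant(lambda x: x * x - x - 2, 1.0, 3.0))
```

Note that only one new function evaluation, `f(p)`, is needed per iteration; the other value is reused.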
2.3.5 False Position Method
Definition
To find a solution of $f(x)=0$ for a given continuous function $f$ on the interval $[p_0,p_1]$, where $f(p_0)$ and $f(p_1)$ have opposite signs:
$$f(p_0)f(p_1)<0.$$

The approximation $p_2$ is chosen in the same manner as in the secant method, as the $x$-intercept of the line joining $(p_0,f(p_0))$ and $(p_1,f(p_1))$.

To decide which secant line to use to compute $p_3$, we check the sign of $f(p_2)f(p_1)$ or $f(p_2)f(p_0)$. If $f(p_2)f(p_1)$ is negative, we choose $p_3$ as the $x$-intercept of the line joining $(p_1,f(p_1))$ and $(p_2,f(p_2))$.

In a similar manner, we obtain a sequence $\{p_n\}_2^\infty$ that approximates the root.
Example
Pseudo-Code
- INPUT: initial approximations $p_0, p_1$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $p$ or message of failure.
- Step 1: Set $n = 2$, $q_0=f(p_0)$, $q_1=f(p_1)$.
- Step 2: While $n\leq N$, do Steps 3–6.
  - Step 3: Set $p= p_1-q_1\,\frac{p_1-p_0}{q_1-q_0}$. (Compute $p_n$.)
  - Step 4: If $|p-p_1|<TOL$, then output $p$ (procedure completed successfully) and stop.
  - Step 5: Set $n=n+1$, $q=f(p)$.
  - Step 6: If $q\cdot q_1<0$, set $p_0=p_1$, $q_0=q_1$. Then set $p_1=p$, $q_1=q$. (The newest point always replaces $p_1$, so the test in Step 4 compares successive computed points.)
- Step 7: OUTPUT "Method failed after $N$ iterations." STOP.
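A Python sketch of false position (name `false_position` assumed), following the common variant in which the newest point always replaces $p_1$ so that the Step 4 test compares successive computed points:

```python
def false_position(f, p0, p1, tol=1e-10, max_iter=100):
    """False position: secant-style updates that keep the root bracketed."""
    q0, q1 = f(p0), f(p1)
    if q0 * q1 > 0:
        raise ValueError("f(p0) and f(p1) must have opposite signs")
    for _ in range(max_iter):
        p = p1 - q1 * (p1 - p0) / (q1 - q0)   # x-intercept of the secant line
        if abs(p - p1) < tol:
            return p
        q = f(p)
        if q * q1 < 0:          # sign change between p and p1: drop p0
            p0, q0 = p1, q1
        p1, q1 = p, q           # the newest point always replaces p1
    raise RuntimeError(f"Method failed after {max_iter} iterations")

print(false_position(lambda x: x * x - x - 2, 1.0, 3.0))
```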
2.4 Error Analysis for Iteration Methods
2.4.1 The Rate of Sequence Convergence
Definition
Suppose $\{p_n\}_{n=0}^\infty$ is a sequence that converges to $p$, with $p_n\not=p$ for all $n$. If positive constants $\lambda$ and $\alpha$ exist with
$$\lim_{n\rightarrow\infty} \frac{|p_{n+1}-p|}{|p_n-p|^\alpha}=\lambda,$$
then $\{p_n\}_{n=0}^\infty$ converges to $p$ of order $\alpha$, with asymptotic error constant $\lambda$.
Properties
- A sequence with a high order of convergence converges more rapidly than a sequence with a lower order.
- The asymptotic constant affects the speed of convergence but is not as important as the order.
Example
- If $\alpha=1$, the sequence is linearly convergent.
- If $\alpha=2$, the sequence is quadratically convergent.
Summary
The Mean Value Theorem is used to prove linear convergence, and Taylor's Theorem to prove quadratic convergence when $g'(p)=0$.
2.4.2 Convergent Order of Fixed-Point Iteration (Improved)
Convergent Order of Fixed-Point Iteration
(1) $g\in C[a,b]$ and $g(x)\in[a,b]$ for all $x\in[a,b]$.
(2) $g'(x)$ is continuous on $(a,b)$ and a positive constant $k<1$ exists with $|g'(x)|\leq k$ for all $x\in(a,b)$.

If $g'(p)\not=0$, then for any number $p_0$ in $[a,b]$, the sequence $p_n=g(p_{n-1})$, for $n\geq 1$, converges only linearly to the unique fixed point $p$ in $[a,b]$.
Proof
$$p_{n+1}-p=g(p_n)-g(p)=g'(\xi_n)(p_n-p),$$
where $\xi_n$ is between $p_n$ and $p$.

Since $\{p_n\}_{n=0}^\infty$ converges to $p$, and $\xi_n$ is between $p_n$ and $p$, the sequence $\{\xi_n\}_{n=0}^\infty$ also converges to $p$. Thus,
$$\lim_{n\rightarrow\infty}\frac{|p_{n+1}-p|}{|p_n-p|}=\lim_{n\rightarrow\infty}|g'(\xi_n)|=|g'(p)|,$$
so fixed-point iteration exhibits linear convergence with asymptotic error constant $|g'(p)|$ whenever $g'(p)\not=0$. This also implies that higher-order convergence for fixed-point methods can occur only when $g'(p)=0$.
Quadratic Convergence

Let $p$ be a solution of the equation $x=g(x)$.
(1) $g'(p)=0$.
(2) $g''$ is continuous and strictly bounded by $M$ on an open interval $I$ containing $p$.

Then there exists a $\delta>0$ such that, for $p_0\in [p-\delta, p+\delta]$, the sequence defined by $p_n=g(p_{n-1})$, for $n\geq 1$, converges at least quadratically to $p$.
Moreover, for sufficiently large values of $n$,
$$|p_{n+1}-p|<\frac{M}{2}|p_n-p|^2.$$
Proof
Expanding $g$ in a Taylor polynomial about $p$ and using the two conditions above,
$$g(x)=g(p)+g'(p)(x-p)+\frac{g''(\xi)}{2}(x-p)^2=p+\frac{g''(\xi)}{2}(x-p)^2,$$
where $\xi$ lies between $x$ and $p$.
Thus,
$$p_{n+1}=g(p_n)=p+\frac{g''(\xi_n)}{2}(p_n-p)^2,\qquad p_{n+1}-p=\frac{g''(\xi_n)}{2}(p_n-p)^2,$$
where $\xi_n$ lies between $p_n$ and $p$. Since $\lim_{n\rightarrow\infty}g''(\xi_n)=g''(p)$,
$$\lim_{n\rightarrow\infty}\frac{|p_{n+1}-p|}{|p_n-p|^2}=\lim_{n\rightarrow\infty}\frac{|g''(\xi_n)|}{2}=\frac{|g''(p)|}{2}.$$
Since $g''$ is strictly bounded by $M$ on the interval $[p-\delta,p+\delta]$, for sufficiently large values of $n$,
$$|p_{n+1}-p|<\frac{M}{2}|p_n-p|^2$$
is also derived.
Construct a quadratically convergent fixed-point problem
Let
$$g(x)=x-\phi(x)f(x),\qquad g'(x)=1-\phi'(x)f(x)-\phi(x)f'(x).$$
The condition for quadratic convergence is $g'(p)=0$; since $f(p)=0$, this requires $\phi(p)=\frac{1}{f'(p)}$. A reasonable approach is to let $\phi(x)=\frac{1}{f'(x)}$, which gives Newton's method.
Remarks
- The convergence rate of fixed-point iteration is usually linear, with constant $C=|g'(p)|$.
- But if $g'(p)=0$, then the convergence rate is at least quadratic.
2.4.3 Zero of Multiplicity
Definition
A solution $p$ of $f(x)=0$ is a zero of multiplicity $m$ of $f(x)$ if for $x\not=p$ we can write
$$f(x)=(x-p)^m q(x),$$
where $\lim_{x\rightarrow p}q(x)\not=0$.
Theorem
- $f\in C^1[a,b]$ has a simple zero at $p$ in $(a,b)$ if and only if $f(p)=0$ but $f'(p)\not=0$.
- The function $f\in C^m[a,b]$ has a zero of multiplicity $m$ at $p$ if and only if
$$0=f(p)=f'(p)=f''(p)=\dots=f^{(m-1)}(p),\quad\text{but}\quad f^{(m)}(p)\not=0.$$
Proof
If $f$ has a simple zero at $p$, then $f(p)=0$ and
$$f(x)=(x-p)q(x),\qquad \lim_{x\rightarrow p}q(x)\not=0.$$
Since $f\in C^1[a,b]$,
$$f'(p)=\lim_{x\rightarrow p}f'(x)=\lim_{x\rightarrow p}\left[q(x)+(x-p)q'(x)\right]=\lim_{x\rightarrow p}q(x)\not=0.$$
2.4.4 Convergence of Newton’s Method
Property
Newton's method transforms the nonlinear equation $f(x)=0$ into the fixed-point problem $x=g(x)$ with $g(x)=x-\frac{f(x)}{f'(x)}$.
- If $p$ is a simple root, then $f(p)=0$, $f'(p)\not=0$, and $g'(p)=0$, so the convergence rate is quadratic. (Iterations must start close enough to the root.)
- If $p$ is a root of multiplicity $m>1$, then
$$f(x)=(x-p)^m q(x),\qquad g'(p)=1-\frac{1}{m}\not=0,$$
so the convergence rate is linear.
Method for Avoiding Multiple Roots

$$f(x)=(x-p)^m q(x),\qquad u(x)=\frac{f(x)}{f'(x)}=(x-p)\,\frac{q(x)}{m\,q(x)+(x-p)q'(x)},\qquad u'(p)=\frac{1}{m}\not=0.$$
Thus $p$ is a simple root of $u(x)$. We then apply Newton's method to $u(x)$ instead:
$$g(x)=x-\frac{u(x)}{u'(x)}=x-\frac{f(x)f'(x)}{f'(x)^2-f(x)f''(x)},$$
whose convergence rate is again quadratic.
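A small Python sketch of the modified iteration (names are illustrative). For $f(x)=(x-1)^2$, which has a double root at $x=1$, plain Newton converges only linearly, while here $g(x)$ simplifies to the constant $1$, so the root is found in one step:

```python
def modified_newton(f, df, ddf, p0, tol=1e-12, max_iter=100):
    """Newton's method applied to u(x) = f(x)/f'(x), for multiple roots."""
    for _ in range(max_iter):
        fp, dfp = f(p0), df(p0)
        if fp == 0:               # exact root found
            return p0
        # g(x) = x - f f' / (f'^2 - f f'')
        p = p0 - fp * dfp / (dfp ** 2 - fp * ddf(p0))
        if abs(p - p0) < tol:
            return p
        p0 = p
    raise RuntimeError(f"Method failed after {max_iter} iterations")

# f(x) = (x - 1)**2 has a double root at x = 1.
f = lambda x: (x - 1) ** 2
df = lambda x: 2 * (x - 1)
ddf = lambda x: 2.0
print(modified_newton(f, df, ddf, 3.0))  # → 1.0
```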
2.4.5 Convergence rate of Secant Method
- The convergence rate of the secant method is normally superlinear, with order $r\approx1.618$, which is lower than that of Newton's method.
- The secant method reuses the two most recent function values, so each iteration needs only one new function evaluation and no derivative evaluation.
- Its disadvantage is that it needs two starting guesses close enough to the solution in order to converge.
2.5 Accelerating Convergence
2.5.1 Aitken’s method
Background
The aim is to accelerate the convergence of a sequence that is linearly convergent, regardless of its origin or application. Suppose $\{p_n\}_{n=0}^\infty$ converges linearly to $p$, so that
$$\lim_{n\rightarrow \infty} \frac{p_{n+1}-p}{p_n-p}=\lambda,\quad \lambda\not=0.$$
Thus, when $n$ is sufficiently large,
$$\frac{p_{n+1}-p}{p_n-p}\approx \frac{p_{n+2}-p}{p_{n+1}-p}.$$
Solving this approximation for $p$ gives
$$p\approx\frac{p_n\,p_{n+2}-p_{n+1}^2}{p_{n+2}-2p_{n+1}+p_n}=p_n-\frac{(p_{n+1}-p_{n})^2}{p_{n+2}-2p_{n+1}+p_n}.$$
Aitken's $\Delta^2$ method defines a new sequence $\{\hat{p}_n\}_{n=0}^\infty$:
$$\hat{p}_n=p_n-\frac{(p_{n+1}-p_{n})^2}{p_{n+2}-2p_{n+1}+p_n},$$
whose convergence is faster than that of the original sequence $\{p_n\}_{n=0}^\infty$.
Definition
Given the sequence $\{p_n\}_{n=0}^\infty$, the forward difference $\Delta p_n$ is defined by
$$\Delta p_n=p_{n+1}-p_n,\quad n\geq 0.$$
Higher powers $\Delta^k p_n$ are defined recursively by
$$\Delta^k p_n=\Delta(\Delta^{k-1}p_{n}),\quad k\geq 2.$$
For example,
$$\Delta^2 p_n=\Delta(\Delta p_{n})=\Delta(p_{n+1}-p_n)=\Delta p_{n+1}-\Delta p_n=p_{n+2}-2p_{n+1}+p_n.$$
Thus,
$$\hat{p}_n=p_n-\frac{(\Delta p_n)^2}{\Delta^2 p_n}.$$
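The formula is easy to try numerically; a sketch (names assumed) applying it to the linearly convergent iterates $p_{n+1}=\sqrt{p_n+2}$ from the earlier example, whose fixed point is $p=2$:

```python
def aitken(p):
    """Apply Aitken's Delta^2 formula to a list of iterates."""
    return [
        p[n] - (p[n + 1] - p[n]) ** 2 / (p[n + 2] - 2 * p[n + 1] + p[n])
        for n in range(len(p) - 2)
    ]

# Linearly convergent fixed-point iterates of g(x) = sqrt(x + 2).
p = [1.5]
for _ in range(6):
    p.append((p[-1] + 2) ** 0.5)

print([abs(x - 2) for x in p[:5]])       # original errors
print([abs(x - 2) for x in aitken(p)])   # accelerated errors, much smaller
```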
Theorem
Condition:
$$\lim_{n\rightarrow \infty} \frac{p_{n+1}-p}{p_n-p}=\lambda,\quad \lambda\not=0,\qquad (p_n-p)(p_{n+1}-p)>0.$$

Result: the sequence $\{\hat{p}_n\}_{n=0}^\infty$ converges to $p$ faster than $\{p_n\}_{n=0}^\infty$ in the sense that
$$\lim_{n\rightarrow\infty}\frac{\hat{p}_n-p}{p_n-p}=0.$$
2.5.2 Steffensen’s method
Definition
The function is $p=g(p)$, the initial approximation is $p_0$, and $\hat{p}_0=p_0-\frac{(\Delta p_0)^2}{\Delta^2 p_0}$.
We assume that $\hat{p}_0$ is a better approximation than $p_2$, so we apply fixed-point iteration to $\hat{p}_0$ instead of $p_2$; the computing process is shown below.
Theorem
Suppose that $x=g(x)$ has the solution $p$ with $g'(p)\not=1$.
If there exists a $\delta>0$ such that $g\in C^3[p-\delta,p+\delta]$,
then Steffensen's method gives quadratic convergence for any $p_0\in[p-\delta,p+\delta]$.
Pseudo-Code
- INPUT: initial approximation $p_0$; tolerance $TOL$; maximum number of iterations $N$.
- OUTPUT: approximate solution $p$ or message of failure.
- Step 1: Set $n=1$.
- Step 2: While $n\leq N$, do Steps 3~5.
  - Step 3: Set $p_1=g(p_0)$, $p_2=g(p_1)$, $p=p_0-\displaystyle\frac{(p_1-p_0)^2}{p_2-2p_1+p_0}$.
  - Step 4: If $|p-p_0|<TOL$, then OUTPUT $p$; (Procedure completed successfully.) STOP!
  - Step 5: Set $n=n+1$, $p_0=p$.
- Step 6: OUTPUT "Method failed after $N$ iterations." STOP!
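The pseudo-code above translates directly into Python; this is a sketch, and the fixed-point map $g(x)=\cos x$ is an illustrative choice, not from the text:

```python
import math

def steffensen(g, p0, tol=1e-10, max_iter=100):
    """Steffensen's method: fixed-point iteration accelerated by Aitken's Delta^2."""
    for _ in range(max_iter):
        p1 = g(p0)
        p2 = g(p1)
        denom = p2 - 2 * p1 + p0
        if denom == 0:  # Delta^2 vanished: the iterates have (numerically) converged
            return p0
        p = p0 - (p1 - p0) ** 2 / denom  # Step 3: Aitken's Delta^2 formula
        if abs(p - p0) < tol:            # Step 4: stopping criterion
            return p
        p0 = p                           # Step 5: restart from the accelerated value
    raise RuntimeError("Method failed after max_iter iterations.")

root = steffensen(math.cos, 1.0)  # solves x = cos(x)
```

Note that each outer pass performs two evaluations of $g$ before restarting from the accelerated value, which is what yields the quadratic convergence stated in the theorem.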
2.6 Zeros of Polynomials and Muller’s Method
2.6.1 Polynomial Theorem
2.6.2 Horner’s Method
Background
A more efficient method to calculate $P(x_0)$ and $P'(x_0)$ for a polynomial $P(x)$.
Theorem
Let
$$P(x)=\sum\limits_{i=0}^{n}a_ix^i.$$
- Construction process for $P(x_0)$ (substitute the formulas one by one to verify)

If $b_n=a_n$ and
$$b_k=a_k+b_{k+1}x_0,\quad k\in [0,n-1],$$
then $b_0=P(x_0)$.
Moreover, if
$$Q(x)=\sum\limits_{i=1}^{n}b_ix^{i-1},$$
then
$$P(x)=(x-x_0)Q(x)+b_0.$$
- Construction process for $P'(x_0)$ (substitute the formulas one by one to verify)

$$P(x)=(x-x_0)Q(x)+b_0\\ P'(x)=Q(x)+(x-x_0)Q'(x)\\ P'(x_0)=Q(x_0)$$
Let
$$Q(x)=\sum\limits_{i=1}^{n}b_ix^{i-1}=(x-x_0)R(x)+c_1,$$
where
$$R(x)=\sum\limits_{i=2}^{n}c_ix^{i-2}.$$
Thus
$$c_n=b_n,\\ c_k=b_k+c_{k+1}x_0,\quad k\in[1,n-1],\\ Q(x_0)=c_1=P'(x_0).$$
Pseudo-Code
To compute the values $P(x_0)$ and $P'(x_0)$ for the polynomial $P(x)=\sum\limits_{i=0}^{n}a_ix^i$:

- INPUT: degree $n$; coefficients $a_0,a_1,...,a_n$ of polynomial $P(x)$; point $x_0$.
- OUTPUT: values of $P(x_0)$ and $P'(x_0)$.
- Step 1: Set $y=a_n$ ($b_n$ for $Q$), $z=0$ ($c_{n+1}$ for $R$).
- Step 2: For $j=n-1,n-2,...,0$, set
  - $z=y+z*x_0$ ($c_{j+1}$ for $R$),
  - $y=a_j+y*x_0$ ($b_j$ for $Q$).
- Step 3: OUTPUT $y=P(x_0)$ and $z=P'(x_0)$.
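The steps above can be sketched in Python as follows; the coefficient list stores $a_i$ as the coefficient of $x^i$, and the test polynomial is an illustrative choice, not from the text:

```python
def horner(a, x0):
    """Nested evaluation of P(x0) and P'(x0), where P(x) = sum_i a[i] * x**i."""
    n = len(a) - 1
    y = a[n]   # b_n
    z = 0.0    # c_{n+1}
    for j in range(n - 1, -1, -1):
        z = y + z * x0     # c_{j+1} for R
        y = a[j] + y * x0  # b_j for Q
    return y, z            # P(x0), P'(x0)

# P(x) = 2x^4 - 3x^2 + 3x - 4 evaluated at x0 = -2
val, deriv = horner([-4, 3, 0, -3, 0, 2][:5] if False else [-4, 3, -3, 0, 2], -2)
```

Here `val` is $P(-2)=10$ and `deriv` is $P'(-2)=-49$, which can be checked directly against $P'(x)=8x^3-6x+3$. One pass of the loop produces both values, which is the efficiency gain Horner's method offers over evaluating $P$ and $P'$ separately.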
2.6.3 Deflation Method
Newton's method combined with Horner's method
Deflation Method
Suppose that $x_N$, obtained in the $N$th iteration of the Newton-Raphson procedure, is an approximate zero of $P(x)$; then
$$P(x)=(x-x_N)Q(x)+b_0=(x-x_N)Q(x)+P(x_N)\approx (x-x_N)Q(x).$$
Let $\hat{x}_1=x_N$ be the approximate zero of $P$, and $Q_1(x)=Q(x)$ be the approximate factor; then
$$P(x)\approx (x-\hat{x}_1)Q_1(x).$$
To find the second approximate zero of $P(x)$, we can apply the same procedure to $Q_1(x)$, giving $Q_1(x)\approx(x-\hat{x}_2)Q_2(x)$, where $Q_2(x)$ is a polynomial of degree $n-2$. Thus $P(x)\approx (x-\hat{x}_1)Q_1(x)\approx (x-\hat{x}_1)(x-\hat{x}_2)Q_2(x)$.
Repeating this procedure until we reach $Q_{n-2}(x)$, which is a quadratic polynomial that can be solved by the quadratic formula, we obtain all the approximate zeros of $P(x)$. This method is called the deflation method.
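A minimal sketch of one deflation step, using the synthetic-division relation $P(x)=(x-x_0)Q(x)+b_0$ from Horner's method; the coefficients here are listed highest degree first, and the cubic is an illustrative example, not from the text:

```python
def deflate(a, x0):
    """One deflation step: divide P by (x - x0) via synthetic division.

    a = [a_n, ..., a_0] (highest degree first).
    Returns the coefficients of Q(x) and the remainder b0 = P(x0)."""
    b = [a[0]]
    for coeff in a[1:]:
        b.append(coeff + b[-1] * x0)
    return b[:-1], b[-1]

# P(x) = x^3 - 6x^2 + 11x - 6 with approximate zero 1:
q, rem = deflate([1, -6, 11, -6], 1.0)
# Q(x) = x^2 - 5x + 6 is quadratic, so the remaining zeros (2 and 3)
# follow from the quadratic formula.
```

In practice each $\hat{x}_k$ is only approximate, so the remainder is merely close to zero and the deflated coefficients accumulate error; this is why deflation is usually followed by polishing each root with Newton's method on the original $P$.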