Shell编程三剑客(二)

一、sed 工具

sed(Stream EDitor)是一个强大而简单的文本解析转换工具,可以读取文本,并根据指定的条件对文本内容进行编辑(删除、替换、添加、移动等),最后输出所有行或者仅输出处理的某些行。sed 也可以在无交互的情况下实现相当复杂的文本处理操作,被广泛应用于 Shell 脚本中,用以完成各种自动化处理任务。

二、sed 的工作流程主要包括读取、执行和显示三个过程

1.读取:sed 从输入流(文件、管道、标准输入)中读取一行内容并存储到临时的缓冲区中(又称模式空间,pattern space)。

2.执行:默认情况下,所有的 sed 命令都在模式空间中顺序地执行,除非指定了行的地址,否则 sed 命令将会在所有的行上依次执行。

3.显示:发送修改后的内容到输出流。再发送数据后,模式空间将会被清空。在所有的文件内容都被处理完成之前,上述过程将重复执行,直至所有内容被处理完。

注意:默认情况下,所有的 sed 命令都是在模式空间内执行的,因此输入的文件并不会发生任何变化,除非是用重定向存储输出。

1.sed 命令常见用法

通常情况下调用 sed 命令有两种格式:

(1)“参数”是指操作的目标文件,当存在多个操作对象时用,文件之间用逗号“,”分隔;

(2)而 scriptfile 表示脚本文件,需要用“-f”选项指定,当脚本文件出现在目标文件之前时,表示通过指定的脚本文件来处理输入的目标文件。

sed[选项] '操作' 参数

sed [选项] -f scriptfile 参数

2.常见的 sed 命令选项

 

选项含义
-e或--expression=

表示用指定命令或者脚本来处理输入的文本文件。

-f或--file=

表示用指定的脚本文件来处理输入的文本文件。

-h或--help

显示帮助。

-n、--quiet或silent

表示仅显示处理后的结果。

-i

直接编辑文本文件。

3.“操作”用于指定对文件操作的动作行为,也就是 sed 的命令

通常情况下是采用的“[n1[,n2]]”操作参数的格式。n1、n2 是可选的,不一定会存在,代表选择进行操作的行数,如操作需要在 5~20 行之间进行,则表示为“5,20 动作行为”。

常见的操作包括以下几种:

a

增加,在当前行下面增加一行指定内容。

c

替换,将选定行替换为指定内容。

d

删除,删除选定的行。

i

插入,在选定行上面插入一行指定内容。

p

打印,如果同时指定行,表示打印指定行;如果不指定行,则表示打印所有内容;如果有非打印字符,则以 ASCII 码输出。其通常与“-n”选项一起使用。

s

替换,替换指定字符。

y

字符转换。 

4.sed 命令的基本用法

(1)输出所有内容,等同于 cat test.txt

[root@localhost opt]# sed -n 'p' test.txt 
# This is the main Apache HTTP server configuration file.  It contains the
# configuration directives that give the server its instructions.
# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
# In particular, see 
# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
# for a discussion of each configuration directive.
#
# Do NOT simply read the instructions in here without understanding
# what they do.  They're here only as hints or reminders.  If you are unsure
# consult the online docs. You have been warned.  
#
# Configuration and logfile names: If the filenames you specify for many
# of the server's control files begin with "/" (or "drive:/" for Win32), the
# server will use that explicit path.  If the filenames do *not* begin
# with "/", the value of ServerRoot is prepended -- so 'log/access_log'
# with ServerRoot set to '/www' will be interpreted by the
# server as '/www/log/access_log', where as '/log/access_log' will be
# interpreted as '/log/access_log'.

#
# ServerRoot: The top of the directory tree under which the server's
# configuration, error, and log files are kept.
#
# Do not add a slash at the end of the directory path.  If you point
# ServerRoot at a non-local disk, be sure to specify a local disk on the

(2)输出第 3 行

[root@localhost opt]# sed -n '3p' test.txt 
# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.

(3)输出 3~5 行

[root@localhost opt]# sed -n '3,5p' test.txt 
# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
# In particular, see 
# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>

(4)输出所有奇数行,n 表示读入下一行资料

[root@localhost opt]# sed -n 'p;n' test.txt 
# This is the main Apache HTTP server configuration file.  It contains the
# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
#
# what they do.  They're here only as hints or reminders.  If you are unsure
#
# of the server's control files begin with "/" (or "drive:/" for Win32), the
# with "/", the value of ServerRoot is prepended -- so 'log/access_log'
# server as '/www/log/access_log', where as '/log/access_log' will be

# ServerRoot: The top of the directory tree under which the server's
#
# ServerRoot at a non-local disk, be sure to specify a local disk on the

(5)输出所有偶数行,n 表示读入下一行资料

[root@localhost opt]# sed -n 'n;p' test.txt 
# configuration directives that give the server its instructions.
# In particular, see 
# for a discussion of each configuration directive.
# Do NOT simply read the instructions in here without understanding
# consult the online docs. You have been warned.  
# Configuration and logfile names: If the filenames you specify for many
# server will use that explicit path.  If the filenames do *not* begin
# with ServerRoot set to '/www' will be interpreted by the
# interpreted as '/log/access_log'.
#
# configuration, error, and log files are kept.
# Do not add a slash at the end of the directory path.  If you point

(6)输出第 1~5 行之间的奇数行(第 1、3、5 行)

[root@localhost opt]# sed -n '1,5{p;n}' test.txt 
# This is the main Apache HTTP server configuration file.  It contains the
# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>

(7)输出第 10 行至文件尾之间的偶数行

[root@localhost opt]# sed -n '10,${n;p}' test.txt 
#
# of the server's control files begin with "/" (or "drive:/" for Win32), the
# with "/", the value of ServerRoot is prepended -- so 'log/access_log'
# server as '/www/log/access_log', where as '/log/access_log' will be

# ServerRoot: The top of the directory tree under which the server's
#
# ServerRoot at a non-local disk, be sure to specify a local disk on the

在执行“sed –n‘10,${n;p}’test.txt”命令时,读取的第 1 行是文件的第 10 行,读取的第 2 行是文件的第 11 行,依此类推,所以输出的偶数行是文件的第 11 行、13 行直至文件结尾,其中包括空行。

5.sed 命令结合正则表达式

(1)输出包含the的行

[root@localhost opt]# sed -n '/the/p' test.txt 
# This is the main Apache HTTP server configuration file.  It contains the
# configuration directives that give the server its instructions.
# Do NOT simply read the instructions in here without understanding
# what they do.  They're here only as hints or reminders.  If you are unsure
# consult the online docs. You have been warned.  
# Configuration and logfile names: If the filenames you specify for many
# of the server's control files begin with "/" (or "drive:/" for Win32), the
# server will use that explicit path.  If the filenames do *not* begin
# with "/", the value of ServerRoot is prepended -- so 'log/access_log'
# with ServerRoot set to '/www' will be interpreted by the
# ServerRoot: The top of the directory tree under which the server's
# Do not add a slash at the end of the directory path.  If you point
# ServerRoot at a non-local disk, be sure to specify a local disk on the

(2)输出从第 4 行至第一个包含 the 的行

[root@localhost opt]# sed -n '4,/the/p' test.txt 
# In particular, see 
# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
# for a discussion of each configuration directive.
#
# Do NOT simply read the instructions in here without understanding

(3)输出包含the 的行所在的行号,等号(=)用来输出行号

[root@localhost opt]# sed -n '/the/=' test.txt 
1
2
8
9
10
12
13
14
15
16
21
24
25

(4)输出以PI 开头的行

[root@localhost opt]# sed -n '/^PI/P' test.txt 
PI=3.14159265358979323846264338324990142

(5)输出以数字结尾的行

[root@localhost opt]# sed -n '/[0-9]$/p' test.txt 
PI=3.14159265358979323846264338324990142

(6)输出包含单词wood 的行,\<、\>代表单词边界

[root@localhost opt]# sed -n '/\<wood\>/p' test.txt 
a wood cross!

6.删除符合条件的文本(d)

以下示例分别演示了 sed 命令的几种常用删除用法。

下面命令中 nl 命令用于计算文件的行数,结合该命令可以更加直观地查看到命令执行的结果。

(1)删除第 3 行

[root@localhost opt]# nl test.txt | sed '3d'
     1	# This is the main Apache HTTP server configuration file.  It contains the
     2	# configuration directives that give the server its instructions.
     4	# In particular, see 
     5	# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
     6	# for a discussion of each configuration directive.
     7	#
     8	# Do NOT simply read the instructions in here without understanding

(2)删除第 3~5 行

[root@localhost opt]# nl test.txt | sed '3,5d'
     1	# This is the main Apache HTTP server configuration file.  It contains the
     2	# configuration directives that give the server its instructions.
     6	# for a discussion of each configuration directive.
     7	#
     8	# Do NOT simply read the instructions in here without understanding
     9	# what they do.  They're here only as hints or reminders.  If you are u

(3)删除包含cross 的行,原本的第 8 行被删除

[root@localhost opt]# nl test.txt | sed '/cross/d'
     1	# This is the main Apache HTTP server configuration file.  It contains the
     2	# configuration directives that give the server its instructions.
     3	# See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
     4	# In particular, see 
     5	# <URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
     6	# for a discussion of each configuration directive.
     7	# shan skmks flmdlsks lsmls
     9	# shan yebd osjs ojsd ocok 
    10	# what they do.  They're here only as hints or reminders.  If you are unsure
    11	# consult the online docs. You have been warned.  

(4)删除不包含cross 的行,用!符号表示取反操作,如'/cross/!d'

[root@localhost opt]# nl test.txt | sed '/cross/ ! d'
     8	# Do NOT simply read the instructions cross in here without understanding
    27	a wood cross!

(5)删除以小写字母开头的行

[root@localhost opt]# sed '/^[a-z]/d' test.txt 
This is the main Apache HTTP server configuration file.  It contains the
See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
In particular, see 
URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
Do NOT simply read the instructions cross in here without understanding

Configuration and logfile names: If the filenames you specify for many


ServerRoot: The top of the directory tree under which the server's

Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142

(6)删除以"."结尾的行

[root@localhost opt]# sed '/\.$/d' test.txt 
This is the main Apache HTTP server configuration file.  It contains the
In particular, see 
URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
shan skmks flmdlsks lsmls
Do NOT simply read the instructions cross in here without understanding
shan yebd osjs ojsd ocok 
what they do.  They're here only as hints or reminders.  If you are unsure
consult the online docs. You have been warned  

Configuration and logfile names: If the filenames you specify for many
of the server's control files begin with "/" (or "drive:/" for Win32), the
server will use that explicit path.  If the filenames do *not* begin
with "/", the value of ServerRoot is prepended -- so 'log/access_log'
with ServerRoot set to '/www' will be interpreted by the
server as '/www/log/access_log', where as '/log/access_log' will be


ServerRoot: The top of the directory tree under which the server's

Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!

(7)删除所有空行
 

[root@localhost opt]# sed '/^$/d' test.txt 
This is the main Apache HTTP server configuration file.  It contains the
configuration directives that give the server its instructions.
See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
In particular, see 
URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
for a discussion of each configuration directive.
shan skmks flmdlsks lsmls
Do NOT simply read the instructions cross in here without understanding
shan yebd osjs ojsd ocok 
what they do.  They're here only as hints or reminders.  If you are unsure
consult the online docs. You have been warned  
Configuration and logfile names: If the filenames you specify for many
of the server's control files begin with "/" (or "drive:/" for Win32), the
server will use that explicit path.  If the filenames do *not* begin
with "/", the value of ServerRoot is prepended -- so 'log/access_log'
with ServerRoot set to '/www' will be interpreted by the
server as '/www/log/access_log', where as '/log/access_log' will be
interpreted as '/log/access_log'.
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!

(8)若是删除重复的空行,即连续的空行只保留一个, 执行“ sed –e ‘/^$/{n;/^$/d}’test.txt”命令即可实现。其效果与“cat -s test.txt”相同,n 表示读下一行数据。

[root@localhost opt]# sed -e '/^$/{n;/^$/d}' test.txt 
This is the main Apache HTTP server configuration file.  It contains the
configuration directives that give the server its instructions.
See <URL:http://httpd.apache.org/docs/2.4/> for detailed information.
In particular, see 
URL:http://httpd.apache.org/docs/2.4/mod/directives.html>
for a discussion of each configuration directive.
shan skmks flmdlsks lsmls
Do NOT simply read the instructions cross in here without understanding
shan yebd osjs ojsd ocok 
what they do.  They're here only as hints or reminders.  If you are unsure
consult the online docs. You have been warned  

Configuration and logfile names: If the filenames you specify for many
of the server's control files begin with "/" (or "drive:/" for Win32), the
server will use that explicit path.  If the filenames do *not* begin
with "/", the value of ServerRoot is prepended -- so 'log/access_log'
with ServerRoot set to '/www' will be interpreted by the
server as '/www/log/access_log', where as '/log/access_log' will be
interpreted as '/log/access_log'.

ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.

Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!

7.替换符合条件的文本

在使用 sed 命令进行替换操作时需要用到 s(字符串替换)、c(整行/整块替换)、y(字符转换)命令选项,常见的用法如下:

(1)将每行中的第一个the 替换为 THE

[root@localhost opt]# sed 's/the/THE/' test.txt 
THE this bad
ServerRoot: The top of THE directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at THE end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on THE
PI=3.14159265358979323846264338324990142
a wood cross!

(2)将每行中的第 3 个l 替换为L

[root@localhost opt]# sed 's/l/L/3' test.txt 
the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a Local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!

(3)将文件中的所有the 替换为THE

[root@localhost opt]# sed 's/the/THE/g' test.txt 
THE this bad
ServerRoot: The top of THE directory tree under which THE server's
configuration, error, and log files are kept.
Do not add a slash at THE end of THE directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on THE
PI=3.14159265358979323846264338324990142
a wood cross!

(4)将文件中的所有o 删除(替换为空串)

[root@localhost opt]# sed 's/o//g' test.txt 
the this bad
ServerRt: The tp f the directry tree under which the server's
cnfiguratin, errr, and lg files are kept.
D nt add a slash at the end f the directry path.  If yu pint
ServerRt at a nn-lcal disk, be sure t specify a lcal disk n the
PI=3.14159265358979323846264338324990142
a wd crss!

(5)在每行行首插入#号

[root@localhost opt]# sed 's/^/#/g' test.txt 
#the this bad
#ServerRoot: The top of the directory tree under which the server's
#configuration, error, and log files are kept.
#Do not add a slash at the end of the directory path.  If you point
#ServerRoot at a non-local disk, be sure to specify a local disk on the
#PI=3.14159265358979323846264338324990142
#a wood cross!

(6)在包含the 的每行行首插入#号

[root@localhost opt]# sed '/the/s/^/#/' test.txt 
#the this bad
#ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
#Do not add a slash at the end of the directory path.  If you point
#ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!

(7)在每行行尾插入字符串EOF

[root@localhost opt]# sed 's/$/EOF/' test.txt 
the this badEOF
ServerRoot: The top of the directory tree under which the server'sEOF
configuration, error, and log files are kept.EOF
Do not add a slash at the end of the directory path.  If you pointEOF
ServerRoot at a non-local disk, be sure to specify a local disk on theEOF
PI=3.14159265358979323846264338324990142EOF
a wood cross!EOF

(8)将第 3~5 行中的所有the 替换为 THE

root@localhost opt]# sed '3,5s/the/THE/' test.txt 
the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at THE end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on THE
PI=3.14159265358979323846264338324990142
a wood cross!

(9)将包含the 的所有行中的o 都替换为 O

[root@localhost opt]# sed '/the/s/o/O/g' test.txt 
the this bad
ServerROOt: The tOp Of the directOry tree under which the server's
configuration, error, and log files are kept.
DO nOt add a slash at the end Of the directOry path.  If yOu pOint
ServerROOt at a nOn-lOcal disk, be sure tO specify a lOcal disk On the
PI=3.14159265358979323846264338324990142
a wood cross!

9.迁移符合条件的文本

H,复制到剪贴板;

g、G,将剪贴板中的数据覆盖/追加至指定行;

w,保存为文件;

r,读取指定文件;

a,追加指定内容

(1)将包含the 的行迁移至文件末尾,{;}用于多个操作

[root@localhost opt]# sed '/the/{H;d};$G' test.txt 
configuration, error, and log files are kept.
PI=3.14159265358979323846264338324990142
a wood cross!

the this bad
ServerRoot: The top of the directory tree under which the server's
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the

(2)将第 1~5 行内容转移至第 17 行后

[root@localhost opt]# sed '1,5{H;d};17G' test.txt 
PI=3.14159265358979323846264338324990142
a wood cross!
1
2
3
4
5
6
7
8
9
2

the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
32
43
224
43
142
11
ew
23

(3)将包含the 的行另存为文件out.file

[root@localhost opt]# sed '/the/w out.file' test.txt 
the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!
1
2
3
[root@localhost opt]# cat out.file 
the this bad
ServerRoot: The top of the directory tree under which the server's
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the

(4)将文件/etc/hostname 的内容添加到包含the 的每行以后

[root@localhost opt]# sed '/the/r/etc/hostname' test.txt 
the this bad
localhost.localdomain
ServerRoot: The top of the directory tree under which the server's
localhost.localdomain
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
localhost.localdomain
ServerRoot at a non-local disk, be sure to specify a local disk on the
localhost.localdomain
PI=3.14159265358979323846264338324990142
a wood cross!
1
2
3

(5)在第 3 行后插入一个新行,内容为 New

[root@localhost opt]# sed '3aNEW' test.txt 
the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
NEW
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!
1
2

(6)在包含the 的每行后插入一个新行,内容为 New

[root@localhost opt]# sed '/the/aNEW' test.txt 
the this bad
NEW
ServerRoot: The top of the directory tree under which the server's
NEW
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
NEW
ServerRoot at a non-local disk, be sure to specify a local disk on the
NEW
PI=3.14159265358979323846264338324990142
a wood cross!
1
2

(7)在第 3 行后插入多行内容,中间的\n 表示换行

[root@localhost opt]# sed '3aNew1\nNew2' test.txt 
the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
New1
New2
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
PI=3.14159265358979323846264338324990142
a wood cross!
1
2
3

10.使用脚本编辑文件

使用 sed 脚本,将多个编辑指令存放到文件中(每行一条编辑指令),通过“-f”选项来调用。

(1)将第 1~5 行内容转移至第 17 行后

sed '1,5{H;d};17G' test.txt

以上操作可以改用脚本文件方式:

[root@localhost opt]# vim opt.list
1,5H
1,5d
17G
[root@localhost opt]# sed -f opt.list test.txt 
PI=3.14159265358979323846264338324990142
a wood cross!
1
2
3
4
5
6
7
8
9
2

the this bad
ServerRoot: The top of the directory tree under which the server's
configuration, error, and log files are kept.
Do not add a slash at the end of the directory path.  If you point
ServerRoot at a non-local disk, be sure to specify a local disk on the
32
43
224
43
142
11
ew
23

 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值