Understanding the fork() Function

hyshine2012

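Author: Wang Shanshan (王姗姗), lecturer at 华清远见 Embedded College.

For anyone new to Unix/Linux who is starting to write multi-process programs on Linux, fork is one of the hardest concepts to grasp: it is called once but returns twice.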

First, let us look at the prototype of fork:

    #include <sys/types.h>
    #include <unistd.h>

    pid_t fork(void);

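Return value:

Negative: on error, fork() returns -1. No new process is created and the original process keeps running.

Zero: in the child process, fork() returns 0.

Positive: in the parent process, fork() returns the PID of the child, which is a positive number.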

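Next, let us see how to create a child process with fork.

The boilerplate for creating a child process looks like this:

    pid_t child;

    if ((child = fork()) < 0) {
        /* error handling: no child was created */
    } else if (child == 0) {
        /* code executed only by the child */
    } else {
        /* code executed only by the parent */
    }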

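fork is called once but returns twice: it returns the child's ID to the parent and 0 to the child. The parent receives the child's ID because a parent may have many children and has no other call that looks up their process IDs, so it must keep track of each child through this return value. A child, on the other hand, has exactly one parent, whose ID it can always obtain with getppid.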

Now let us compare two examples.

The first:

    #include <stdio.h>
    #include <sys/types.h>
    #include <unistd.h>

    int main(void)
    {
        pid_t pid;
        int count = 0;

        pid = fork();

        printf("This is first time, pid = %d\n", pid);
        printf("This is second time, pid = %d\n", pid);

        count++;
        printf("count = %d\n", count);

        if (pid > 0)
        {
            printf("This is the parent process,the child has the pid:%d\n", pid);
        }
        else if (!pid)
        {
            printf("This is the child process.\n");
        }
        else
        {
            printf("fork failed.\n");
        }

        printf("This is third time, pid = %d\n", pid);
        printf("This is fourth time, pid = %d\n", pid);

        return 0;
    }

The output is as follows:

[Figure: screenshot of the program output]

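Question:

This result looks strange: why does every printf statement run twice, while count++ appears to have run only once (count is printed as 1 both times)?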

Next, look at this one:

    #include <stdio.h>
    #include <sys/types.h>
    #include <unistd.h>

    int main(void)
    {
        pid_t pid;
        int count = 0;

        pid = fork();

        printf("Now, the pid returned by calling fork() is %d\n", pid);

        if (pid > 0)
        {
            printf("This is the parent process,the child has the pid:%d\n", pid);
            printf("In the parent process,count = %d\n", count);
        }
        else if (!pid)
        {
            printf("This is the child process.\n");
            printf("Do your own things here.\n");
            count++;
            printf("In the child process, count = %d\n", count);
        }
        else
        {
            printf("fork failed.\n");
        }

        return 0;
    }

The output is as follows:

[Figure: screenshot of the program output]

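Now let us answer the question raised above.

To read these programs you first need one concept firmly in mind: before the statement pid = fork(), only one process is executing the code; after it, two processes are. Their code is exactly the same, and both resume at the same point, the return from fork: each stores the return value in pid and then carries on with the statements that follow, including if ( pid>0 ). This is why every statement after fork in the first example, count++ included, is executed by both processes. Each process increments its own copy of count, so both print count = 1.

Of the two processes, the one that already existed is called the "parent process" and the new one is called the "child process". Besides having different process IDs, parent and child also differ in the value of the variable pid, which holds the return value of fork. One remarkable property of fork is that it is called only once yet returns twice, and it has three possible kinds of return value:

1. In the parent process, fork returns the process ID of the newly created child;

2. In the child process, fork returns 0;

3. If an error occurs, fork returns a negative value (-1).

fork can fail for two main reasons: (1) the number of processes has reached the limit imposed by the system, in which case errno is set to EAGAIN; (2) the system is short of memory, in which case errno is set to ENOMEM.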

Next, let us look at how APUE2 (Advanced Programming in the UNIX Environment, 2nd edition) describes fork:

The new process created by fork is called the child process. This function is called once but returns twice. The only difference in the returns is that the return value in the child is 0, whereas the return value in the parent is the process ID of the new child. The reason the child's process ID is returned to the parent is that a process can have more than one child, and there is no function that allows a process to obtain the process IDs of its children. The reason fork returns 0 to the child is that a process can have only a single parent, and the child can always call getppid to obtain the process ID of its parent. (Process ID 0 is reserved for use by the kernel, so it's not possible for 0 to be the process ID of a child.)

In short, the parent needs the child's PID from fork because it may have several children and has no function for looking up their PIDs, whereas the child can do with 0 because it can always ask for its single parent's PID.

　　Both the child and the parent continue executing with the instruction that follows the call to fork. The child is a copy of the parent. For example, the child gets a copy of the parent's data space, heap, and stack. Note that this is a copy for the child; the parent and the child do not share these portions of memory. The parent and the child share the text segment (Section 7.6).

In other words, both the child and the parent go on to execute the code that follows the call to fork, and the child is a copy of the parent: it receives its own copy of the parent's data space, heap and stack rather than sharing that memory with the parent.

Current implementations don't perform a complete copy of the parent's data, stack, and heap, since a fork is often followed by an exec. Instead, a technique called copy-on-write (COW) is used. These regions are shared by the parent and the child and have their protection changed by the kernel to read-only. If either process tries to modify these regions, the kernel then makes a copy of that piece of memory only, typically a "page" in a virtual memory system. Section 9.2 of Bach [1986] and Sections 5.6 and 5.7 of McKusick et al. [1996] provide more detail on this feature.

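Here is the same program again, this time with detailed comments:

    #include <stdio.h>
    #include <sys/types.h>
    #include <unistd.h>

    int main(void)
    {
        pid_t pid;
        int count = 0;  /* after fork, each process has its own copy */

        /* One process calls fork; two processes return from it.
           The parent receives the child's PID, the child receives 0,
           and -1 means that no child was created. */
        pid = fork();

        /* Executed by both processes, each with its own value of pid. */
        printf("Now, the pid returned by calling fork() is %d\n", pid);

        if (pid > 0)
        {
            /* Parent only: pid holds the child's process ID. */
            printf("This is the parent process,the child has the pid:%d\n", pid);
            /* Still 0: the child's count++ touches the child's copy only. */
            printf("In the parent process,count = %d\n", count);
        }
        else if (!pid)
        {
            /* Child only: fork returned 0. */
            printf("This is the child process.\n");
            printf("Do your own things here.\n");
            count++;  /* changes the child's private copy, so it prints 1 */
            printf("In the child process, count = %d\n", count);
        }
        else
        {
            /* fork returned -1: only the original process exists. */
            printf("fork failed.\n");
        }

        return 0;
    }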

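In other words, a process on Linux has three parts in memory: the "code segment", the "stack segment" and the "data segment". The code segment, as its name suggests, holds the program's code; if several processes on the machine are running the same program, they can use the same code segment. The stack segment holds the return addresses of subroutines, their arguments, and the program's local variables. The data segment holds the program's global variables, constants, and dynamically allocated space (for example, memory obtained with malloc and related functions). If the system runs several instances of the same program at once, they cannot use the same stack segment or the same data segment.

A careful analysis then tells us the following.

As soon as a program calls fork, the system prepares these three segments for a new process. First, the new process uses the same code segment as the old one, because the two are still running the same program. The data and stack segments, by contrast, are copied for the new process (logically, at least: with copy-on-write the physical copy is deferred until one side writes). In this way all of the parent's data is handed down to the child. Once the child starts running, however, although it has inherited all of the parent's data, the two sets of data are separate and no longer affect each other. In other words, the two processes no longer share any data.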

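fork() not only creates a child with the same code as the parent; the parent's entire execution context at the point of the fork is also automatically copied into the child, including:

- global and local variables

- open file handles

- shared memory, messages and other synchronization objects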

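If two processes do need to share data, they have to use a separate set of functions (shmget, shmat, shmdt and so on). At this point there are two processes: to the parent, fork returns the child's process ID, and to the child it returns zero, so the program only has to check the return value of fork to know whether it is running in the parent or in the child.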
