A series of glibc deadlock problems introduced by the localtime function, and their solutions

Latest recommended article published on 2023-06-20 10:02:14

虚渊玄

Category: Problems. Tags: C, Linux

Article link: https://blog.csdn.net/qq_17019203/article/details/130225864

Problem scenario:

Thread 1: localtime --> fork -- child process 1 --> localtime

Thread 2: localtime --> fork -- child process 2 --> localtime

Thread x: localtime --> fork -- child process x --> localtime

Problem analysis:

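A colleague running the scenario above hit a deadlock. It occurred inside the localtime call of a child process, and which child was affected varied from run to run. Analysis of the glibc source shows that localtime takes a process-wide lock internally. A child created by fork receives its own copy of the parent's global variables (copied, not shared), including the current state of that lock, but only the thread that called fork exists in the child. If another thread holds the lock at the moment of the fork, the child's copy stays locked and no thread is left to release it, so the following sequence deadlocks: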

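Thread 1: localtime → lock, tzset_lock = 1 ----→ unlock, tzset_lock = 0

↓ scheduled out ↑ scheduled in

Thread 2: fork ----------------→ wait

↓

Child process 2: tzset_lock = 1 (the copied global lock is in the locked state and is not shared with the parent process, so as soon as child process 2 calls localtime and reaches the locking step, it is certain to deadlock)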
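The sequence above can be modeled with an ordinary pthread mutex. What follows is a minimal sketch, assuming Linux with glibc. The names `demo_lock`, `worker`, and `child_trylock_result` are hypothetical stand-ins for tzset_lock and the threads in the diagram; none of them come from glibc. The child uses pthread_mutex_trylock rather than a blocking lock so that the demonstration reports the inherited locked state instead of hanging. Strictly speaking, POSIX only guarantees async-signal-safe calls in the child of a multithreaded process, so treat this as an illustration rather than a pattern to copy.

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>
#include <semaphore.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical stand-in for glibc's internal tzset_lock. */
static pthread_mutex_t demo_lock = PTHREAD_MUTEX_INITIALIZER;
static sem_t lock_taken;   /* worker -> main: "demo_lock is held now" */
static sem_t may_release;  /* main -> worker: "you may unlock"        */

static void *worker(void *arg)
{
    (void)arg;
    pthread_mutex_lock(&demo_lock);    /* like Thread 1 entering localtime */
    sem_post(&lock_taken);
    sem_wait(&may_release);
    pthread_mutex_unlock(&demo_lock);  /* releases the parent's copy only  */
    return NULL;
}

/* Returns 1 if the child found its copy of the lock already held,
   0 if the child could take it, and -1 on a setup error. */
int child_trylock_result(void)
{
    pthread_t tid;
    int status = 0;

    if (sem_init(&lock_taken, 0, 0) != 0 || sem_init(&may_release, 0, 0) != 0)
        return -1;
    if (pthread_create(&tid, NULL, worker, NULL) != 0)
        return -1;
    sem_wait(&lock_taken);             /* fork only while the lock is held */

    pid_t pid = fork();
    if (pid == 0) {
        /* Only the forking thread exists here; the worker was not copied,
           so nothing in this process will ever unlock demo_lock. */
        int rc = pthread_mutex_trylock(&demo_lock);
        _exit(rc == EBUSY ? 1 : 0);
    }

    sem_post(&may_release);            /* the parent's lock is freed normally */
    assert(pthread_join(tid, NULL) == 0);
    if (pid < 0 || waitpid(pid, &status, 0) != pid)
        return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Calling `child_trylock_result()` from `main` and printing the result shows whether the child inherited a held lock. A blocking pthread_mutex_lock at the same point in the child would wait forever, which is the situation the child reaches inside localtime.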

The relevant glibc source is shown below; localtime actually calls the __tz_convert function:

/* This locks all the state variables in tzfile.c and this file.  */
__libc_lock_define_initialized (static, tzset_lock)

struct tm *
__tz_convert (const time_t *timer, int use_localtime, struct tm *tp)
{
  long int leap_correction;
  int leap_extra_secs;

  if (timer == NULL)
    {
      __set_errno (EINVAL);
      return NULL;
    }

  __libc_lock_lock (tzset_lock);    // tzset_lock is effectively a global mutex

  /* Update internal database according to current TZ setting.
     POSIX.1 8.3.7.2 says that localtime_r is not required to set tzname.
     This is a good idea since this allows at least a bit more parallelism.  */
  tzset_internal (tp == &_tmbuf && use_localtime, 1);

  if (__use_tzfile)
    __tzfile_compute (*timer, use_localtime, &leap_correction,
		      &leap_extra_secs, tp);
  else
    {
      if (! __offtime (timer, 0, tp))
	tp = NULL;
      else
	__tz_compute (*timer, tp, use_localtime);
      leap_correction = 0L;
      leap_extra_secs = 0;
    }

  if (tp)
    {
      if (! use_localtime)
	{
	  tp->tm_isdst = 0;
	  tp->tm_zone = "GMT";
	  tp->tm_gmtoff = 0L;
	}

      if (__offtime (timer, tp->tm_gmtoff - leap_correction, tp))
        tp->tm_sec += leap_extra_secs;
      else
	tp = NULL;
    }

  __libc_lock_unlock (tzset_lock);

  return tp;
}
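Because __tz_convert holds tzset_lock for the whole conversion, one possible mitigation is to make fork wait until no thread is inside a time conversion. The sketch below is an assumption about how that could be done with pthread_atfork. It is not taken from glibc or from the scenario above. `time_guard`, `guarded_localtime_r`, `install_time_guard`, and `run_guarded_forks` are hypothetical names. It only helps if every time conversion in the program goes through the wrapper; calls made directly by third-party libraries are not covered.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <time.h>
#include <unistd.h>

#define MAX_WORKERS 16

/* Hypothetical application-level lock taken around every conversion. */
static pthread_mutex_t time_guard = PTHREAD_MUTEX_INITIALIZER;
static atomic_bool stop_workers;

static void guard_lock(void)   { pthread_mutex_lock(&time_guard); }
static void guard_unlock(void) { pthread_mutex_unlock(&time_guard); }

/* Wrapper that callers use instead of localtime()/localtime_r(). */
struct tm *guarded_localtime_r(const time_t *t, struct tm *out)
{
    guard_lock();
    struct tm *res = localtime_r(t, out);
    guard_unlock();
    return res;
}

/* Register once at startup: fork() then acquires time_guard first, so no
   thread is inside the wrapper during the fork, and both the parent and
   the child leave fork() with the guard released. */
int install_time_guard(void)
{
    return pthread_atfork(guard_lock, guard_unlock, guard_unlock);
}

static void *hammer(void *arg)
{
    (void)arg;
    struct tm tm;
    while (!atomic_load(&stop_workers)) {
        time_t now = time(NULL);
        guarded_localtime_r(&now, &tm);
        sched_yield();                 /* give the forking thread a chance */
    }
    return NULL;
}

/* Forks nforks children while nthreads workers convert times. Returns how
   many children completed a conversion, or -1 if the workers failed to start. */
int run_guarded_forks(int nthreads, int nforks)
{
    pthread_t tids[MAX_WORKERS];
    int started = 0, ok = 0;

    assert(nthreads >= 1 && nthreads <= MAX_WORKERS);
    atomic_store(&stop_workers, false);
    for (; started < nthreads; started++)
        if (pthread_create(&tids[started], NULL, hammer, NULL) != 0)
            break;

    for (int i = 0; i < nforks && started == nthreads; i++) {
        pid_t pid = fork();
        if (pid == 0) {
            struct tm tm;
            time_t now = time(NULL);
            _exit(guarded_localtime_r(&now, &tm) ? 0 : 1);
        }
        int status;
        if (pid > 0 && waitpid(pid, &status, 0) == pid &&
            WIFEXITED(status) && WEXITSTATUS(status) == 0)
            ok++;
    }

    atomic_store(&stop_workers, true);
    for (int i = 0; i < started; i++)
        pthread_join(tids[i], NULL);
    return started == nthreads ? ok : -1;
}
```

A program would call `install_time_guard()` once before creating threads and then use `guarded_localtime_r` everywhere. `run_guarded_forks` exists only to exercise the pattern under contention.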