lua之诡异的tonumber

最新推荐文章于 2024-03-04 16:23:42 发布

用生命写代码--码农的命

最新推荐文章于 2024-03-04 16:23:42 发布

阅读量1.6w

点赞数 2

分类专栏： lua 文章标签： lua tonumber 源码

本文链接：https://blog.csdn.net/qq_19079937/article/details/51906159

版权

lua 专栏收录该内容

3 篇文章 0 订阅

订阅专栏

昨晚翻看lua源码的时候，发现在luaO_str2num有特殊处理，确切来说是在l_str2int(字符串转换成整形),lstr2d(字符串转换成double类型)

size_t luaO_str2num (const char *s, TValue *o) {
  lua_Integer i; lua_Number n;
  const char *e;
  if ((e = l_str2int(s, &i)) != NULL) {  /* try as an integer */
    setivalue(o, i);
  }
  else if ((e = l_str2d(s, &n)) != NULL) {  /* else try as a float */
    setfltvalue(o, n);
  }
  else
    return 0;  /* conversion failed */
  return (e - s) + 1;  /* success; return string size */
}

我们以l_str2int为例，看看有哪些特殊的处理：

static const char *l_str2int (const char *s, lua_Integer *result) {
  lua_Unsigned a = 0;
  int empty = 1;
  int neg;
  while (lisspace(cast_uchar(*s))) s++;  /* 跳过‘空格’ */
  neg = isneg(&s);
  if (s[0] == '0' &&
      (s[1] == 'x' || s[1] == 'X')) {  /* 十六进制处理 */
    s += 2;  /* skip '0x' */
    for (; lisxdigit(cast_uchar(*s)); s++) {
      a = a * 16 + luaO_hexavalue(*s);
      empty = 0;
    }
  }
  else {  /* 十进制处理 */
    for (; lisdigit(cast_uchar(*s)); s++) {
      a = a * 10 + *s - '0';
      empty = 0;
    }
  }
  while (lisspace(cast_uchar(*s))) s++;  /*跳过尾部‘空格’*/
  if (empty || *s != '\0') return NULL;  /* empty为真，或者跳过‘空格’后没有到达字符串末尾，则转换失败 */
  else {
    *result = l_castU2S((neg) ? 0u - a : a);
    return s;
  }
}

上面在空格加了单引号，此空格非真的‘ ’空格。我们来看看lisspace的实现就知道真相了：

#define ALPHABIT	0
#define DIGITBIT	1
#define PRINTBIT	2
#define SPACEBIT	3
#define XDIGITBIT	4 


#define MASK(B)		(1 << (B))

#define testprop(c,p)	(luai_ctype_[(c)+1] & (p))

#define lislalpha(c)	testprop(c, MASK(ALPHABIT))
#define lislalnum(c)	testprop(c, (MASK(ALPHABIT) | MASK(DIGITBIT)))
#define lisdigit(c)	testprop(c, MASK(DIGITBIT))
#define lisspace(c)	testprop(c, MASK(SPACEBIT))
#define lisprint(c)	testprop(c, MASK(PRINTBIT))
#define lisxdigit(c)	testprop(c, MASK(XDIGITBIT))

const lu_byte luai_ctype_[UCHAR_MAX + 2];

luai_ctype是一个258的uchar数组，其luai_ctype[1:256]分表对应ascii值。通过MASK宏位移后，‘空格’表示8(1<<3)，那么luai_ctype中的值&8不为0，就表示是空格。红色部分已经标出是空格部分。分别对应ASCII表中的\t,\n,\v,\f,\r,空格(ascii为32，真正空格)。

const lu_byte luai_ctype_[UCHAR_MAX + 2] = {
  0x00,  /* EOZ */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* 0. */
  0x00,  <span style="color:#ff0000;">0x08,  0x08,  0x08,  0x08,  0x08</span>,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* 1. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  <span style="color:#ff0000;">0x0c</span>,  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,	/* 2. */
  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,
  0x16,  0x16,  0x16,  0x16,  0x16,  0x16,  0x16,  0x16,	/* 3. */
  0x16,  0x16,  0x04,  0x04,  0x04,  0x04,  0x04,  0x04,
  0x04,  0x15,  0x15,  0x15,  0x15,  0x15,  0x15,  0x05,	/* 4. */
  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,
  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,	/* 5. */
  0x05,  0x05,  0x05,  0x04,  0x04,  0x04,  0x04,  0x05,
  0x04,  0x15,  0x15,  0x15,  0x15,  0x15,  0x15,  0x05,	/* 6. */
  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,
  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,  0x05,	/* 7. */
  0x05,  0x05,  0x05,  0x04,  0x04,  0x04,  0x04,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* 8. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* 9. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* a. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* b. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* c. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* d. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* e. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,	/* f. */
  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,  0x00,
};

从上面分析可知，在tonumber这样的函数中，会跳过‘空格’。验证如下：

这只是代码中的一部分，校验的部分没有写出来。我准备通读lua代码，有问题欢迎讨论。QQ群：570924676

用生命写代码--码农的命

关注

2
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
lua之诡异的tonumber

昨晚翻看lua源码的时候，发现在luaO_str2num有特殊处理，确切来说是在l_str2int(字符串转换成整形),lstr2d(字符串转换成double类型)size_t luaO_str2num (const char *s, TValue *o) { lua_Integer i; lua_Number n; const char *e; if ((e = l_str2int(
复制链接

扫一扫