造轮子：补码实现与若干分析

最新推荐文章于 2021-01-24 20:10:46 发布

幺零做点正事吧

最新推荐文章于 2021-01-24 20:10:46 发布

阅读量897

点赞数 1

分类专栏：数学文章标签：计算机编码数学

本文链接：https://blog.csdn.net/zccz14/article/details/50858656

版权

数学专栏收录该内容

3 篇文章 0 订阅

订阅专栏

本文详细探讨了补码编码的数码转换和运算，包括按位取反函数的代数代换、补码加法、减法的证明。作者遵循特定约束，如不使用类型和特定函数，而是基于C语言实现补码运算。通过数学证明和算法分析，阐述了补码如何在位运算中表示和操作整数，特别是非0数的补码取反加一等于其相反数的补码这一特性。

摘要由CSDN通过智能技术生成

这周计算机原理课收到楼sir的一个作业：要自己实现一套整数编码的数码转换与若干运算，并分析。
我拿到的是补码，其他的队友分别要实现一套原码、移码或者自行设计一套“帅码”（666）。

编码之前…

编程语言： C

听说规定要用C语言造这个轮子，真是遗憾，要是C++还可以各种运算符重载可以很优雅。

约束

那好吧，在编码之前，先做个约束。

不能使用任何形式的 int 类型，包括中间变量，因为 int 本身就是一个补码实现。
看起来，这次的任务就是基于 unsigned int 实现一个 int 类型（32位）。
- 使用 unsigned int 作为二进制串 word 的容器结构。
  typedef unsigned int word;
- 不能使用 %d 输出 word ，而要使用码转数函数 mtoa 将 word 转化为 char*。
  当然也不能用 %d 读入 word，而要使用数转码函数 atom 将 char* 转化为 word 。
  本质上使用 %d 就是一个补码数码转换的实现，因此我们要避免使用它。

函数原型

函数签名	注释
`word atom(char*)`	从字符串中以带符号十进制的格式读入，转换为补码
`char* mtoa(word)`	将补码转换为带符号十进制的字符串格式
`word madd(word, word)`	补码加法
`word msub(word, word)`	补码减法
`word mmul(word, word)`	补码乘法
`word mdiv(word, word)`	补码整除

以上函数原型均为楼sir一手定义，于是我就照着来咯。

数码转换

atom函数与mtoa函数在数学上互逆的，即

(m t o a \circ a t o m) (x) \equiv x, x 是 数 (a t o m \circ m t o a) (x) \equiv x, x 是 码

$(mtoa \circ atom)(x) \equiv x , \text{$x$ 是数} \\ (atom \circ mtoa)(x) \equiv x , \text{$x$ 是码}$

然而在计算机中受到机器精度限制，并不能好好地做运算。

考虑到这点，我们可以给函数加上定义域与值域的限制。

特别地，对于32位补码，定义：

$atom : [-2^{31}, 2^{31}) \to [0, 2^{32})$
$mtoa : [0, 2^{32}) \to [-2^{31}, 2^{31})$

a t o m (x) = {x, 232 + x, 0 \leq x < 231 - 231 \leq x < 0

$atom(x) = \begin{cases} x, & 0 \le x < 2^{31}\\ 2^{32} + x, & -2^{31} \le x < 0 \end{cases}$
构成双射函数，于是求这个函数的反函数得到：

m t o a (x) = {x, - 232 + x, 0 \leq x < 231 231 \leq x < 232

$mtoa(x) = \begin{cases} x, & 0 \le x < 2^{31}\\ -2^{32} + x, & 2^{31} \le x < 2^{32} \end{cases}$
atom，mtoa的定义是等价的，都可以称为补码的定义。

接下来看一下在0-1串内部的一些位运算：

二进制码按位取反函数的代数代换

设有0-1串

m = (m 1 m 2 . . . m n) 2 \in [0, 2 n)

$m=(m_1m_2...m_n)_2 \in [0, 2^n)$
则有

N o t (m) = (N o t (m 1) N o t (m 2) . . . N o t (m n)) 2

$Not(m)=(Not(m_1)Not(m_2)...Not(m_n))_2$
显然有

m + N o t (m) = (11...1) 2 = 2 n - 1

$m+Not(m)=(11...1)_2 = 2^n-1$
于是

N o t (m) = 2 n - 1 - m \in [0, 2 n)

$Not(m) = 2^n -1-m \in [0, 2^n)$

按位取反运算：Not(m)在 $[0,2^n)$ 中封闭。

补码：取相反数的算法证明

设 $m \ne 0$ 是一个 $n$ 位的0-1串，证明： $mtoa(Not(m)+1) = -mtoa(m)$ 。
证：
代入按位取反公式：

m t o a (N o t (m) + 1) = m t o a (2 n - 1 - m + 1)

$mtoa(Not(m)+1)= mtoa(2^n-1-m+1)$
简单化简：

m t o a (2 n - 1 - m + 1) = m t o a (2 n - m)

$mtoa(2^n-1-m+1)=mtoa(2^n-m)$

若 $m \in (0, 2^{n-1}) \to 2^n-m \in (2^{n-1}, 2^n)$
根据补码的定义

$- m t o a (m) = - m m t o a (2 n - m) = - 2 n + (2 n - m) = - m$ $-mtoa(m)=-m\\ mtoa(2^n-m)=-2^n+(2^n-m)=-m$
所以：
$m t o a (2 n - m) = - m t o a (m)$ $mtoa(2^n-m)=-mtoa(m)$
若 $m \in [2^{n-1},2^n) \to 2^n-m \in (0,2^{n-1}]$
根据补码的定义

$- m t o a (m) = - (- 2 n + m) = 2 n - m m t o a (2 n - m) = 2 n - m$ $-mtoa(m)=-(-2^n + m)=2^n-m\\ mtoa(2^n-m)=2^n-m$
所以：
$m t o a (2 n - m) = - m t o a (m)$ $mtoa(2^n-m)=-mtoa(m)$

综上所述，

m t o a (N o t (m) + 1) = - m t o a (m)

$mtoa(Not(m)+1) = -mtoa(m)$
证毕。

所以，非0数的补码按位取反再加一即为其相反数的补码。
特别地，0的相反数的补码是其本身，这个可以单独验证。

补码加法

对于n位0-1串 $m_1,m_2$ ，证明：

m t o a (m 1) + m t o a (m 2) \equiv m t o a ((m 1 + m 2) mod 2 n) (mod 2 n)

$mtoa(m_1)+mtoa(m_2) \equiv mtoa((m_1+m_2) \mod 2^n) (\mod 2^n)$ 。

引理2： $mtoa(x) \equiv x ( \mod 2^n), x \in [0,2^n)$
证明：
1. 若 $x \in [0,2^{n-1})$
$mtoa(x) = x \equiv x ( \mod 2^n)$
2. 若 $x \in [2^{n-1} , 2^n)$
$mtoa(x) = -2^n + x \equiv x ( \mod 2^n)$
证毕。

证：
由引理2：

m t o a (m 1) \equiv m 1 (mod 2 n) m t o a (m 2) \equiv m 2 (mod 2 n) m t o a ((m 1 + m 2) mod 2 n) \equiv m 1 + m 2 (mod 2 n)

$mtoa(m_1) \equiv m_1 (\mod 2^n)\\ mtoa(m_2) \equiv m_2 (\mod 2^n)\\ mtoa((m_1+m_2)\mod 2^n) \equiv m_1+m_2 (\mod 2^n)$
由同余定理得

m t o a (m 1) + m t o a (m 2) \equiv m 1 + m 2 (mod 2 n)

$mtoa(m_1)+mtoa(m_2)\equiv m_1+m_2 (\mod 2^n)$
所以，

m t o a (m 1) + m t o a (m 2) \equiv m t o a ((m 1 + m 2) mod 2 n) (mod 2 n)

$mtoa(m_1)+mtoa(m_2) \equiv mtoa((m_1+m_2) \mod 2^n) (\mod 2^n)$
证毕。

这个部分证明了：两数的和的补码与两数的补码的和(高位丢弃)对 $2^n$ 同余。

补码减法

对于n位0-1串 $m_1,m_2$ ，证明 $mtoa(m_1)-mtoa(m_2) \equiv mtoa((m_1-m_2) \mod 2^n) (\mod 2^n)$ 。
证：
由引理2：

m t o a (m 1) \equiv m 1 (mod 2 n) m t o a (m 2) \equiv m 2 (mod 2 n) m t o a ((m 1 - m 2) mod 2 n) \equiv m 1 - m 2 (mod 2 n)

$mtoa(m_1) \equiv m_1 (\mod 2^n) \\ mtoa(m_2) \equiv m_2 (\mod 2^n) \\ mtoa((m_1 - m_2) \mod 2^n) \equiv m_1-m_2 (\mod 2^n)$
由同余定理：

m t o a (m 1) - m t o a (m 2) \equiv m 1 - m 2 (mod 2 n)

$mtoa(m_1)-mtoa(m_2)\equiv m_1-m_2 (\mod 2^n)$
所以，

m t o a (m 1) - m t o a (m 2) \equiv m t o a ((m 1 - m 2) mod 2 n) (mod 2 n)

$mtoa(m_1)-mtoa(m_2) \equiv mtoa((m_1-m_2) \mod 2^n) (\mod 2^n)$

这个部分证明了：两数的差的补码与两数的补码的差对 $2^n$ 同余。

补码乘法

终于到乘法了，这个部分的数学部分就比较麻烦了，二进制串(向量)乘法本质是一个离散卷积。关于其算法，普通可以使用 $O(n^2)$ 的朴素做法，也可以用FFT1算法进行优化到 $O(n \log (n))$ 。

// TODO
鉴于楼sir尚未要求实现乘法/除法，我暂时先不做这部分的工作。

编码实现

数码转换

数转码：

word atom(char* str){
    word res;
    sscanf(str, "%d", &res);
    return res;
}

码转数：

char* mtoa(word w){
    char* res = (char*) malloc(sizeof(char) * 12);
    sprintf(res, "%d", w);
    return res;
}

以上是错误示范。尽管完美地完成了补码的数码转换，但是这样就违反了一开始的约束了，用了 C 标准库中的轮子。
mtoa中给res开的空间是比较宽松的。32位整数的十进制表示不会超过11位字符，加上字符串结束符\0 最多12位。

数转码

正如之前的错误示范的正确思路，扫描字符串即可。

word atom(char* str){
    word res = 0, flag = 0;
    for(word i = 0; str[i]; i++){
        if('0' <= str[i] && str[i] <= '9')
            res = 10 * res + str[i] - '0';
        else if(i == 0 && str[i] == '-')
            flag = 1; // negative flag
        else break; // illegal character
    }
    // if res != 0 and negative flag 
    // consider about the input "-0"
    if(res && flag) 
        res = ~res + 1;
    return res;
}

码转数

这个需要将码逐位取出（或者一次取多位）然后转化为字符串即可。

char* mtoa(word w){
    char* res = (char*) malloc(sizeof(char) * 12);
    word flag = w >> 31; // get the topest digit
    word cur = 0;
    if(flag) { // if negative
        res[cur++] = '-';
        w = ~w + 1;
    }
    word startCur = cur;
    while(w){
        res[cur++] = w % 10 + '0';
        w /= 10;
    }
    word endCur = cur - 1;
    // let's reverse 
    while(startCur < endCur){
        char temp = res[startCur];
        res[startCur++] = res[endCur];
        res[endCur--] = temp;
    }
    if(cur == 0) 
        res[cur++] = '0'; // if 0
    res[cur] = '\0';
    return res;
}

这个操作相对低效一些并没有太大的关系，这个通常跟I/O相关，I/O瓶颈带来的时间代价远大于此函数。

补码加/减法

word madd(word a, word b){
    return a + b;
}
word msub(word a, word b){
    return a - b;
}

……没错，就是这么简单，补码就是这么方便。

附录：源代码

complement.c

Compile Mode : C99

#include <stdio.h>
#include <stdlib.h>

typedef unsigned int word;

word atom(char* str){
    word res = 0, flag = 0;
    for(word i = 0; str[i]; i++){
        if('0' <= str[i] && str[i] <= '9')
            res = 10 * res + str[i] - '0';
        else if(i == 0 && str[i] == '-')
            flag = 1; // negative flag
        else break; // illegal character
    }
    // if res != 0 and negative flag 
    // consider about the input "-0"
    if(res && flag) 
        res = ~res + 1;
    return res;
}

char* mtoa(word w){
    char* res = (char*) malloc(sizeof(char) * 12);
    word flag = w >> 31; // get the topest digit
    word cur = 0;
    if(flag) { // if negative
        res[cur++] = '-';
        w = ~w + 1;
    }
    word startCur = cur;
    while(w){
        res[cur++] = w % 10 + '0';
        w /= 10;
    }
    word endCur = cur - 1;
    // let's reverse 
    while(startCur < endCur){
        char temp = res[startCur];
        res[startCur++] = res[endCur];
        res[endCur--] = temp;
    }
    if(cur == 0) 
        res[cur++] = '0'; // if 0
    res[cur] = '\0';
    return res;
}

word madd(word a, word b){
    return a + b;
}

word msub(word a, word b){
    return a - b;
}


int main(){
    int t, x, y;
    char S[1000], X[100], Y[100];
    while(scanf("%d", &t))
        switch(t){
            case 0:
                scanf("%s", S);
                puts(mtoa(atom(S)));
                break;
            case 1:
                scanf("%s%s", X, Y);
                puts(mtoa(madd(atom(X), atom(Y))));
                break;
            case 2:
                scanf("%s%s", X, Y);
                puts(mtoa(msub(atom(X), atom(Y))));
                break;
            default:
                return 0;
        }
}

结语

数学真有趣。

快速傅里叶变换。 ↩

幺零做点正事吧

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
造轮子：补码实现与若干分析

这周计算机原理课收到楼sir的一个作业：要自己实现一套整数编码的数码转换与若干运算，并分析。我拿到的是补码，其他的队友分别要实现一套原码、移码或者自行设计一套“帅码”（666）。编码之前… 编程语言： C听说规定要用C语言造这个轮子，真是遗憾，要是C++还可以各种运算符重载可以很优雅。约束那好吧，在编码之前，先做个约束。不能使用任何形式的 int 类型，包括中间变量，因为 int 本身就
复制链接

扫一扫

专栏目录