Chap3.Arithmetic for computer¶

Introduction

Numbers¶

Signed and Unsigned Numbers Possible Representations¶

Example

Sign Magnitude. Positive & Negative Zero (Problem !)
One's Complement : 取反. Positive & Negative Zero (Problem !)
Two's Complement : 取反+1.

Def

To know a negative num's definite value : also invert and plus one

Biased notation

1000 0000 \(=\) minimal negative value(\(-2^7\))
0111 1111 \(=\) maximal positive value (\(2^7-1\))

Sign Extention lbu vs. lb

Operations¶

Signed integer

slt Set when less than

slti Set when less than immediate

Unsigned integer

sltu Set when less than

sltiu Set when less than immediate

"sltu"和"sltiu"都是MIPS汇编语言中的指令，但它们之间有一些重要的区别：

sltu：

"sltu"代表"set less than unsigned"，用于无符号比较。

它将两个寄存器中的无符号整数进行比较，如果第一个寄存器中的值小于第二个寄存器中的值，则结果为1；否则结果为0。

sltiu：

"sltiu"代表"set less than immediate unsigned"，也用于无符号比较。

与"sltu"类似，但是它将第一个寄存器中的值与一个立即数（常数）进行比较，而不是与另一个寄存器中的值进行比较。

Example

New RISC V instructions

lbu load byte unsigned :

Loads a byte into the lowest 8 bit of a register

Fills the remaining bits with 0

Lb load byte (signed)

Loads a byte into the lowest 8 bit of a register

Extends the highest bit into the remaining 24 bits

sltu: set on less than unsigned
slti: set on less than immediate
sltiu: set on less than unsigned immediate

Logical Operations

Arithmetic¶

Addition & subtraction¶

Overflow¶

\(XOR\) Most significant bit ^ Carry bit
Actually we should \([MSB\)^\(Carry] and \ Operation_{add\ or\ sub}\)

Overflows in signed arithmetic instructions cause exceptions:

addadd immediate (addi)

subtract (sub)

Overflows in unsigned arithmetic instructions don’t cause exceptions

add unsigned (addu)

add immediate unsigned (addiu)

Subtract unsigned (subu)

Constructing an ALU¶

Half Adder:

\(S=X\oplus Y\\ C=XY\)

Full Adder :

\(Sum = X\oplus Y\oplus Carry_{in}\)

\(Carry_{out} = XY +(X+Y)Carry_{in}\)

\(X\oplus Y\) only different from \(X+Y\) when \(XY=1\)

Overflow Detection¶

Overflow V = \(C_n\oplus C_{n-1}\)

module alu(A, B, ALU_operation, res, zero, overflow );
    input [31:0] A, B;
    input [2:0] ALU_operation;
    output [31:0] res;
    output zero, overflow ;
    wire [31:0] res_and,res_or,res_add,res_sub,res_nor,res_slt;
    reg [31:0] res;
    parameter one = 32'h00000001, zero_0 = 32'h00000000;
       assign res_and = A&B;
     assign res_or = A|B;
     assign res_add = A+B;
     assign res_sub = A-B;
     assign res_slt =(A < B) ? one : zero_0;
     always @ (A or B or ALU_operation)
            case (ALU_operation)
                3'b000: res=res_and;    
                3'b001: res=res_or; 
                3'b010: res=res_add;    
                3'b110: res=res_sub;    
            3'b100: res=~(A | B);
            3'b111: res=res_slt;
            default: res=32'hx;
            endcase
     assign zero = (res==0)? 1: 0;
endmodule

Overflow code ?
What is the difference The codes in the Synthesize?

Speed up¶

\(P_i = A_i\oplus B_i \ \ \ \ G_i = A_iB_i\)

\(S_i = P_i\oplus C_i\ \ \ \ C_{i+1} = G_i + P_iC_i\)

Improvement -- Reduce FAN-OUT

Carry Skip Adder¶

Carry Select Adder (CSA)¶

Already caclulate(parallel) different situations,once the \(C_0\) is delivered, the result can be output.

Multiplexer¶

Look at current bit position

If multiplier is \(1\) then add multiplicand Else add 0
shift multiplicand left by 1 bit

Example

Multiplexter V1

Multiplexter V2

Multiplexter V3

Booth's Algorithm¶

Idea:

If you have a sequence of 1s subtract at first '1' in multiplier
shift for the sequence of '1s'
add where prior step had last '1‘

Action

1 0 : subtract multiplicand from left
1 1 no arithmetic operation
0 1 add multiplicand to left half
0 0 no arithmetic operation

\(Bit_{-1} = 0\)

Arithmetic shift right:

keeps the leftmost bit constant
no change of sign bit !

仍然是放在左边一路右移
注意signed shift right
least significant bit初始加个零

Division¶

Division V1

Reduction of Divisor and ALU width by half
Shifting of the remainderSaving 1 iteration

Division V2

Division V3

4.1 已经结束了除法操作，此时的高位就是我们的余数，但是这最后一次的结果还没有放回到 Reminder 中，因此我们需要再往左移一位为商留出空间，放入后，再把高位余数往右移动以抵消影响

Signed division¶

除零会产生溢出，由软件检测

Floating point numbers¶

Standardized format IEEE 754

Single precision 8 bit exp, 23 bit significant
Double precision 11 bit exp, 52 bit significant
Both formats are supported by MIPS

Leading '1' bit of significand is implicit
M: 尾数. 即默认.xxx是1.xxx(因为科学计数法，没有0.xxx)
Exponent is biased(移码)

Bias 127 for single precision

Bias 1023 for double precision

Have to be transfered back,but treated like unsigned inside

NOTE :\((-1)^{sign} • (1 + significand) • 2^{exponent - bias}\)

Limitations¶

Overflow: &. Underflow

EXAMPLE

Floating Point Addition¶

Alignment
The proper digits have to be added
Addition of significant
Normalization of the result [重新规格化]
Rounding

EXAMPLE

Floating Point multiplication¶

Algorithm

Add exponents - bias.
Multiply the significands.
Normalize.
Over- underflow.
Rounding.
Sign.

EXAMPLE

float devision¶

Subtraction of exponents
Division of the significants
Normalisation
Rounding
Sign

Accurate Arithmetic¶

IEEE 754 always keeps two extra bits on the right during intermediate additions, called guard and round

为了保证四舍五入的精度,结果没有,只在运算的过程中保留

EXAMPLE

sticky bit

A bit used in rounding in addition to guard and round that is set whenever there are nonzero bits to the right of the round bit.

allows the computer to see the difference between \(0.50 ... 00_{ten}\) and \(0.50 ... 01_{ten}\) when rounding.

EXAMPLE for guard,round and sticky bit.

Round Modes¶

EXAMPLE

Parallelism and Computer Arithmetic: Associativity¶

if \(x + (y+ z) = (x + y) + z ?\)

\(x = -1.5_{ten} \times 10^{38},y= 1.5_{ten} \times 10^{38}, and\ z = 1.0\)
\(x + (y + z) = 0.0\)
\((x+y) + z = 1.0\)

Exercise¶

已知\(F(n) = Σ2^i = 2^{n+1} -1 = 111…1B\)

n+1个1计算F(n)的C语言函数f1如下

int  f1（ unsigned n){   
    int  sum = 1, power = 1;
    for ( unsigned i=0;  i<=n-1;  i++ ){    
         power * = 2;
        sum += power;
   }
   return sum;
}

将f1中的int都改为float，可得到计算f(n)的另一个函数f2.

假设unsigned和int型数据都占32位，float采用IEEE 754单精度标准(IEEE 754采用的是最近舍入round to nearest的方式) , 回答一下问题：

当n=0时，\(f1\)会出现死循环，为什么？若将\(f1\)中变量\(i\)和\(n\)都定义为int型，则\(f1\)是否还会出现死循环？为什么？
- unsigned 0 -1 -- 大数，i<=n-1 永真，所以死循环.
- 若i和n改为int类型，不会死循环:n=0时，n-1的值为-1，故i<=n-1 不成立，退出循环
f1(23)和f2(23) 的返回值是否相等？机器数各是什么（用十六进制表示)

Signle presion bias = 127

\(f1(23)=00FFFFFFh\)

\(f2(23)=0_{sign}10010110_{exponent(bias)}....=0100\ 1011\ 0111\ 1111\ 1111\ 1111\ 1111\ 1111=47BFFFFFH\)
\(f1(24)\)和\(f2(24)\)的返回值分别为\(33554431\)和\(33554432.0\)，为什么不相等？

当\(n=24\)时，\(f(24)=11….1B\),float只有24位有效位，舍入后数值增大，所以\(f2(24)\)比\(f1(24)\)大\(1\).
\(f(31) = 2^{32} -1\), 而\(f1(31)\)的返回值却为\(-1\)，为什么？若使\(f1(n)\)的返回值与\(f(n)\)相等，最大的n是多少？

\(F(31)\)超出int数据的表示范围，用\(f1(31)\)实现时得到机器数为32个1，作为int解释时其值为-1

因为int最大可表示数为0后跟31个1，所以\(f1(n)\)的返回值与\(f(n)\)相等的最大\(n\)值是30。
\(f2(127)\)的机器数为\(7F80 0000H\)，对应的值是什么？若使\(f2(n)\)的结果不溢出，则最大的n是多少？若使\(f2(n)\)的结果精确（无舍入），则最大的\(n\)是多少？
- IEEE用阶码全1,尾数为0表示无穷大
F2返回值为float，机器数为\(7F80 0000H\)对应的值是\(+∞\)
- n=126时，对应阶码为253，尾数部分舍入后阶码加1，最终阶码为254，不溢出的最大n值为126.
N=23时，f(23)为24位1，float有24位有效位，所以不需舍入，结果精确

故使f2获精确结果的最大n值为 23.

最后更新: 2024年3月29日 14:37:25
创建日期: 2024年3月12日 23:24:38