## NATIONAL UNIVERSITY OF SINGAPORE

## CS2100 - COMPUTER ORGANISATION

(Semester 2: AY2021/22)

Time Allowed: 2 Hours

## INSTRUCTIONS TO STUDENTS

1. This assessment paper consists of SEVENTEEN (17) questions in THREE (3) parts and comprises of THIRTEEN (13) printed pages.
2. Answer ALL questions on the ANSWER SHEETS.
3. This is an OPEN BOOK assessment.
4. Write your answers only on the ANSWER SHEETS. You may write in pen or pencil. You are to write within the space provided. No extra pages should be submitted.
5. Printed/written materials are allowed. Apart from calculators, electronic devices are not allowed.
6. Page 13 contains the MIPS Data Reference sheet.
7. The maximum mark of this assessment is 100 .

| Question | Max. mark |
| :---: | :---: |
| Part A: Q1 - 6 | 12 |
| Part B: Q7 - 12 | 18 |
| Part C: Q13 | 12 |
| Part C: Q14 | 16 |
| Part C: Q15 | 13 |
| Part C: Q16 | 13 |
| Part C: Q17 | 16 |
| Total | $\mathbf{1 0 0}$ |

## --- END OF INSTRUCTIONS ---

Part A: Multiple-Choice Questions [Total: 6×2=12 marks]
Each multiple-choice question (MCQ) is worth TWO marks and has exactly one correct answer. Please write your answers in CAPITAL LETTERS.

1. What is $(20.22)_{4}$ in decimal?
A. 8.75
B. 8.625
C. 8.5
D. 8.25
E. None of the above.
2. Consider the following IEEE 754 single-precision floating point number written in hexadecimal: $0 \times 42022000$. What is the number in decimal?
A. 130.125
B. 65.0625
C. 32.53125
D. 16.265625
E. None of the above
3. Given that the first print (i.e., printf \#1) is "123 321" (without the quotes) and the second print (i.e., printf \#2) is "123 654" (without the quotes). What is the correct definition of the struct data_t assuming that data_t data variable is correctly declared and initialised at the start of main function?
```
#include <stdio.h>
typedef struct { /* blank for question */ } data_t;
void foo(data_t);
int main() {
    /* data t data is declared correctly here */
    printf("%d %d\n", data.x[0], data.y[0]); /* printf #1 */
    foo(data);
    printf("%d %d\n", data.x[0], data.y[0]); /* printf #2 */
    return 0;
}
void foo(data_t data) {
    data.x[0] = 456;
    data.y[0] = 654;
}
    A. typedef struct { int x[1]; int y[1]; } data_t;
    B. typedef struct { int *x ; int y[1]; } data_t;
    C. typedef struct { int *x ; int *y ; } data_t;
    D. typedef struct { int x[1]; int *y ; } data_t;
    E. None of the above.
```

4. If we were to add "NAND $\$ \mathbf{r d}$, $\$ \mathbf{r s}, \$ \mathbf{r t}$ " operation into our MIPS instructions, given our processor implementation, what will be the ALUControl signal needed?
A. 1101
B. 1110
C. 1100
D. 1111
E. None of the above.
5. Given the Boolean function $\boldsymbol{F}(\mathbf{A}, \mathbf{B}, \mathbf{C}, \boldsymbol{D})=\Sigma \boldsymbol{\Sigma}(\mathbf{1}, \mathbf{3}, \mathbf{6}, \mathbf{8}, \mathbf{9}, \mathbf{1 1}, \mathbf{1 5})+\Sigma \boldsymbol{x}(\mathbf{7}, \mathbf{1 4})$ where $x$ denotes don'tcare, which of the following is not an EPI in the K-map of $F$ ?
A. C•D
B. $B \cdot C$
C. $B^{\prime} \cdot D$
D. $A \cdot B^{\prime} \cdot C^{\prime}$
E. None of the above.
6. You are given a single $2 \times 4$ decoder with 1-enable and active high output, and no other devices or logic gates. Which of the following expressions cannot be implemented with this single decoder, where $A, B$ and $C$ are Boolean variables?
A. $A \cdot B \cdot C$
B. $A^{\prime} \cdot B \cdot C$
C. $A \cdot B^{\prime} \cdot C$
D. $A^{\prime} \cdot B^{\prime} \cdot C^{\prime}$
E. None of the above.

Part B: Multiple-Response Questions [Total: 6×3=18 marks]
Each multiple-response question (MRQ) is worth THREE marks and may have one answer or multiple answers. Write out all correct answers. For example, if you think that $A, B, C$ are the correct answers, write $A, B, C$. Please write in CAPITAL LETTERS.

Only if you get all the answers correct will you be awarded three marks. No partial credit will be given for partially correct answers.
7. Which of the following is equivalent to the length of the string $s$ as returned by strlen(s)?
A. The index of the first null character in the string $s$.
B. The number specified during the definition of variable $s(e . g ., 10$ in char $s[10]$ ).
C. The amount of memory allocated to the string s.
D. The difference of the address of the first null character in the string $s$ and value of string $s$.
E. Twice the average of the address of the first null character in the string $s$ and the value of string s .
8. Aiken is running the following MIPS program. What are the possible number of instructions executed in total assuming that the program does not result in an error, if you can start with any starting value of $\$$ s1 and $\$ \mathbf{s} 2$ and with any possible memory content?

A. 9
B. 14
C. 15
D. 43
E. 63
9. Consider the instruction "lb \$rt, imm(\$rs)" in general. What is true about this instruction?
A. The value specified by imm must be a multiple of 4 .
B. The value in the register specified by $\$ \mathbf{r s}$ must be a multiple of 4 .
C. The sum of $\mathbf{i m m}$ and the value in the register specified by $\$ \mathbf{r s}$ must be a multiple of 4 .
D. The sum of $\mathbf{i m m}$ and the value in the register specified by $\$ \mathbf{r s}$ can be any value.
E. The value in the register specified by $\$ \mathbf{r s}$ can be any bit pattern.
10. A "no-operation" typically written as nop is an instruction that tells the processor to do nothing. However, we can simulate this using other instructions. Instead of telling the processor to do nothing, what we want is simply to have the processor produce "no effect". In other words, no registers or memory addresses can have their value changed (i.e., if the value before the operation is $n$, then the value after the operation is also $n$ ).

Which of the following operations are valid and produce no effect? You may assume that any label is valid.
A. bne \$0, \$0, label
B. addi \$0, \$0, 0
C. sll \$2, \$2, 0
D. add \$0, \$0, \$0
E. srl \$4, \$4, 0
11. Which of the following statements are true about the control signal in our processor implementation and our supported MIPS instructions in Lecture 12?
A. If RegWrite $=0$, it is actually possible to have a different implementation where there are 3 other control signals having the value of "don't care" for at least 1 instruction.
B. It is possible to have the value of MemRead equals to MemWrite in some instructions.
C. If the value of MemRead is not equal to MemWrite, then the value of ALUSrc is always 1.
D. There is an instruction where Branch is the only control signal that is equal to 1 .
E. sw instruction may have the value of MemRead changed to X since the result is not going to be written into a register.
12. Consider a machine with word size of 4 bytes and 16-bit fixed-length instructions with three types of instructions as shown below. Note that although Type A and Type B have the same number of bits for opcode, they are considered different instruction types because they have different kinds of operands.

- Type A: 4-bit opcode, 1 address and 1 register;
- Type B: 4-bit opcode and 1 immediate;
- Type C: 6-bit opcode and 2 registers.

Assume all bits are utilised and all instructions types exists, each address holds a word instead of a byte, the immediate field is in 2 s complement, all addresses are used and all registers can be encoded. Which of the following statements are true about the instruction?
A. The maximum number of Type A instructions is 15 .
B. The maximum number of Type C instructions is 56 .
C. There are a total of 512 bytes of memory.
D. The maximum value of the immediate field is 2047.
E. The maximum number of registers is 16 .

Part C: There are 5 questions in this part [Total: 70 marks]

## Q13. Sequential circuits [12 marks]

(a) A sequential circuit with 6 states: state $1\left(A B C=001_{2}\right)$ through state $6\left(A B C=110_{2}\right)$ is implemented using three $J K$ flip-flops as shown below. Complete the state diagram on the Answer Sheets. One of the transitions on the state diagram has been drawn for you.

(b) Is the circuit self-correcting? Explain. (Mark will not be awarded if no explanation is given or the explanation given is incomplete/incorrect.)
(c) Redesign the circuit using only $\boldsymbol{T}$ flip-flops. You do not have to follow where the unused states transit to in the given circuit above. That is, you only need to make sure that the transitions from the used states follow the above circuit. You do not need to draw your new design. Write out the flip-flop input functions $T A, T B$ and $T C$ so that your new design can be implemented with the fewest number of logic gates other than the flip-flops.
[6 marks]

## Q14. Combinational circuits [16 marks]

Note that logical constants 0 and 1 are available, but not complemented literals.
(a) Given the following circuit which comprises a $2 \times 4$ decoder with 1 -enable and active high outputs and a 4:1 multiplexer, write out the sum-of-minterms expression of $F(A, B, C, D)$ in the $\Sigma m$ notation.
[4 marks]

(b) The circuit below comprises a 2-bit parallel adder and 2 XOR gates. The circuit is housed inside a chip so the only connections available are those that extend out of the dotted box.
Using this circuit and one additional logic gate (inverter, AND, OR, NAND, NOR, XOR, or XNOR), implement the function $F$ in part (a) above.
[4 marks]

(c) [Total: 8 marks]

Given this Boolean function: $G(A, B, C, D)=\Sigma \mathrm{m}(0,4,6,8,9,10,12,13,14,15)$,
(i) How many PIs and EPIs are there in the K-map of $G$ ?
(ii) Write the simplified SOP expression for $G$.
(iii) Implement $G$ using at most two 4 -bit magnitude comparators and a 2 -input OR gate. The block diagram of a 4-bit magnitude comparator is shown below.


## Q15. MIPS [13 marks]

Study the following MIPS code on integer arrays $A$ and $B$, with array $B$ containing twice as many elements as array $A$. The following are the variable mappings:

- $\$ \mathrm{aO}=$ size (number of elements in array $A$ )
- \$a1 = base address of array $A$
- \$a2 = base address of array $B$

```
.data
size: .word 5
A: .word 1, 2, 3, 4, 5
B: .word 5, 3, 4, 5, 3, 8, 2, 5, 9, 4
.text
main: la $t0, size # $t0 is the address of size
    lw $a0, 0($t0) # $a0 is the content of size
    la $a1, A # $al is the base address of array A
    la $a2, B # $a2 is the base address of array B
# -------------------------------------------------------------------
    add $t0, $0, $0 # I1; i = 0
    addi $t1, $a1, 0 # I2; $t1 = &A[0]
    addi $t2, $a2, 0 # I3; $t2 = &B[0]
    sll $t3, $a0, 2 # I4
loop: slt $t4, $t0, $t3 # I5
    beq $t4, $0, end # I6
    lw $s1, 0($t1) # I7
    lw $s2, 0($t2) # I8
    slt $t4, $s1, $s2 # I9
    beq $t4, $0, skip # I10
    add $t9, $s1, $0 # I11
    add $s1, $s2, $0 # I12
    add $s2, $t9, $0 # I13
skip: sw $s1, 0($t1) # I14
    sw $s2, 0($t2) # I15
    addi $t0, $t0, 4 # I16
    addi $t1, $t1, 4 # I17
    addi $t2, $t2, 8 # I18
    j loop # I19
# ---------------------------------------------------------------------
end: li $v0, 10 # system call code for exit
    syscall
```

Q15. (continue...)
(a) Write out the contents of array B after the execution of the MIPS code.
(b) Using the variable names (size, $A, B$ ) shown in the variable mappings above, write an equivalent $C$ code corresponding to instructions I1 to I19 in the above MIPS code. You may use additional variable(s) if needed.

Do not do a line-by-line direct translation of the MIPS code. You do not need to declare the variables in your C code.
(c) Write the instruction encoding of instruction I5 (s7t \$t4, \$t0, \$t3) in hexadecimal. The value in shamt for s 7 t instructions is zero.
(d) Assuming that I19 ( j 100 p ) is stored at address $0 \times 0040$ 0084, (i) calculate the address of I5 (s7t \$t4, \$t0, \$t3) and (ii) write the instruction encoding of I19 (j 1oop) in hexadecimal.
(e) Change the code from I11 to I15 to make the code more efficient. Make the changes on the Answer Sheets. Except for moving the labels if necessary, you are not to change the code outside of I11 to I15.

## Q16. Pipelining [13 marks]

Refer to the following MIPS code which is the same as the one in question 15. Here, we look only at instructions I1 to I19. Pay attention to the assumptions (underlined) given below.


Assuming a 5 -stage MIPS pipeline, and all elements in array $A$ are smaller than all elements in array $B$, answer the parts below. You need to count until the last stage of instruction I19.
(a) How many cycles does this code segment take to complete its execution in the first iteration (I1 to I19) in an ideal pipeline, that is, one with no delays?

For parts (b) to (d) below, given the assumption for each part, how many additional cycles does this code segment (I1 to I19) take to complete its execution in the first iteration as compared to an ideal pipeline computed in (a)? Note that the jump instruction (j) computes the target address to jump to in its ID stage (stage 2). No delayed branching is used.

Write the total number of additional delay cycles for each of the parts (b) to (d). For example, if part (a) takes 10 cycles and part (b) takes 30 cycles, then you should write +20 for part (b).
(b) Assuming without forwarding and branch decision is made at MEM stage (stage 4). No branch prediction is made.
(c) Assuming with forwarding and branch decision is made at MEM stage (stage 4). No branch prediction is made.
(d) Assuming with forwarding and branch decision is made at ID stage (stage 2). Branch is predicted not taken.
(e) Assuming the setting in part (b) above (without forwarding and branch decision at MEM stage), without affecting the correctness of the code, is it possible to move one instruction to somewhere else to reduce the number of delay cycles? If so, indicate which instruction to move, where to move it to, and how many delay cycles are reduced by moving it. If it is not possible, explain.

## Q17. Cache [16 marks]

Refer to the following MIPS code which is the same as the one in question 15 . Here, we look only at instructions I1 to I19. The data segment in the MIPS code in question 15 no longer applies here as the arrays contain a lot more elements in this question.


For parts (a) and (b): You are given a 2-way set-associative instruction cache with 16 words in total. The replacement policy is LRU (least recently used). You may assume the following:

- There are $\mathbf{1 0 0}$ elements in array $A$ and twice as many elements in array $B$.
- All elements in array $A$ are smaller than all elements in array $B$.
- Instruction I1 is stored at address 0x4488 FFFC.
- Consider only instructions I1 to I19 in the execution of the code.
(a) Assume that each block contains 2 words.
(i) How many bits are there in the set index field? In the byte offset field?
(ii) How many misses in total are there in the execution of the code?
(b) Assume that each block contains 4 words.

How many misses in total are there in the execution of the code?

## Q17. Cache (continue...)

For parts (c) to (e): You are given a direct-mapped data cache with 1024 words in total and each block contains 16 words. Recall that array $B$ contains twice as many elements as array $A$. You may assume the following:

- Array A starts at address 0xFFFF 0000.
- Array $B$ follows immediately after array $A$ in the memory. That is, if the last element of array $A$ is at address $x$, then the first element of array $B$ is at address $(x+4)$.
- Only lw instructions are considered for the calculation of hits and misses; sw instructions are to be excluded from the calculation.
(c) How many bits are there in the index field? In the byte offset field?
(d) Assuming that array $A$ contains 512 elements, how many data access hits in total are in the data cache in the execution of the code (i) for array $A$ and (ii) for array $B$ ?
(e) Assuming that array $A$ contains 1024 elements, how many data access hits in total are in the data cache in the execution of the code (i) for array $A$ and (ii) for array $B$ ?

| CORE INSTRUCTION SET |  |  |  |  | OPCODE |
| :---: | :---: | :---: | :---: | :---: | :---: |
|  |  | FORMAT | OPERATION (in Verilog) |  | / FUNCT (Hex) |
| Add | add | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]+\mathrm{R}[\mathrm{rl}]$ |  | $0 / 20_{\text {tex }}$ |
| Add Immediate | adar | 1 | $\mathrm{R}[\mathrm{rr}]=\mathrm{R}[\mathrm{rs}]+$ SignExtlmm | $(1,2)$ | 8 bex |
| Add Imm. Unsigned | adalu | 1 | $\mathrm{R} \mid \mathrm{rt}]=\mathrm{R}\|\mathrm{rs}\|+$ SignExilmm | (2) | 9 bex |
| Add Unsigned | addu | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]+\mathrm{R}[\mathrm{rl}]$ |  | $0 / 21_{\text {hex }}$ |
| And | and | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]$ \& $\mathrm{R}[\mathrm{rt}]$ |  | $0 / 24_{\text {hex }}$ |
| And Immediate | andr | 1 | $\mathrm{R}[\mathrm{rr]}] \mathrm{R}[\mathrm{rs}]$ \& ZeroExllmm | (3) | $\mathrm{c}_{\text {hex }}$ |
| Branch On Equal | beq | 1 | $\mathrm{if}(\mathrm{R}[\mathrm{r})]=\mathrm{R}[\mathrm{rt}])$ <br> PC- $\mathrm{PC}+4+$ BranchAddr | (4) | $4_{\text {hex }}$ |
| Branch On Not Equal | bne | I | $\begin{aligned} & \mathrm{if}(\mathrm{R}\|[\mathrm{r}]\|] \mathrm{R} \mid \mathrm{rt]} \\ & \mathrm{PC}=\mathrm{PC}+4+\mathrm{Branch} \mathrm{Addr} \end{aligned}$ | (4) | 5 hex |
| Jump | J | J | $\mathbf{P C}=$ JumpAddr | (5) | 2 hex |
| Jump And Link | Jal | J | $\mathrm{R}[31]=\mathrm{PC}+8$; $\mathrm{PC}=$ JumpAddr | (5) | $3_{\text {bex }}$ |
| Jump Register | Jr | R | $\mathrm{PC}=\mathrm{R}[\mathrm{rs}]$ |  | $0 / 08_{\text {hex }}$ |
| Load Byte Unsigned | 1 bu | 1 |  |  | 24 b |
| Load Halfword Unsigned | thu | 1 | $\begin{aligned} \mathrm{R}[\mathrm{rt]}]= & {[16 \mathrm{~b} 0, \mathrm{M}[\mathrm{R}[\mathrm{rs}]} \\ & + \text { SignExtImm] } 15: 0)\}\end{aligned}$ | (2) | 25 |
| Load Linked | 11 | 1 | $\mathrm{R}[\mathrm{rt}]=\mathrm{M}[\mathrm{R}[\mathrm{rs}]+\mathrm{SignExtImm}]$ | $(2,7)$ | 30 |
| Load Upper Imm. | 101 | 1 | $\mathrm{R}[\mathrm{rt}]=\left\{\mathrm{imm}, 16^{\circ} \mathrm{b} 0\right\}$ |  | $\mathrm{f}_{\text {hex }}$ |
| Load Word | 1w | 1 | $\mathrm{R}[\mathrm{rr}]=\mathrm{M}[\mathrm{R}[\mathrm{rs}]+$ SignExtImm $]$ | (2) | $23_{\text {bex }}$ |
| Nor | nor | R | $\mathrm{R}[\mathrm{rd}]=\sim(\mathrm{R}[\mathrm{rs}] \mid \mathrm{R}[\mathrm{rt]}]$ |  | $0 / 27_{\text {hex }}$ |
| Or | or | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{s}] \mid \mathrm{R}[\mathrm{rr}]$ |  | $0 / 25_{\text {hex }}$ |
| Or Immediate | r1 | 1 | $\mathrm{R}[\mathrm{rt}]=\mathrm{R}[\mathrm{rs}] \mid$ ZeroExtImm | (3) | $\mathrm{d}_{\text {hex }}$ |
| Set Less Than | sit | R | $\mathrm{R}[\mathrm{rd}]=(\mathrm{R}[\mathrm{rs}]<\mathrm{R}[\mathrm{rt}])$ ? $1: 0$ |  | $0 / 2 a_{\text {hex }}$ |
| Set Less Than Imm. | 141 | 1 | $\mathrm{R}\left[\mathrm{rt]}=\right.$ ( $\mathrm{R}[\mathrm{rs}]<$ SignExtImm) ${ }^{\text {a }}$ | (2) | $\mathrm{a}_{\text {hex }}$ |
| Set Less Than Imm. Unsigned | sitiu | 1 | $\begin{gathered} \mathrm{R}[\mathrm{rt}]=(\mathrm{R}[\mathrm{rs}]<\text { SignExtImm }) \\ ? 1: 0 \end{gathered}$ | $(2,6)$ | $b_{\text {bee }}$ |
| Set Less Than Unsig. | situ | R | $\mathrm{R}[\mathrm{rd}]=(\mathrm{R}[\mathrm{rs}]<\mathrm{R}[\mathrm{rt}])$ ? $1: 0$ | (6) | $0 / 2 \mathrm{~b}_{\text {hex }}$ |
| Shift Left Logical | s11 | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rt}] \ll$ shamt |  | $0 / 00_{\text {hex }}$ |
| Shift Right Logical | $\mathrm{srl}^{1}$ | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rt}] \ggg$ shamt |  | $0 / 02_{\text {hex }}$ |
| Store Byte | sb | 1 | $\begin{gathered} \mathrm{M}[\mathrm{R}[\mathrm{r}]+\text { SignExtImm }](7: 0)= \\ \mathrm{R}[\mathrm{r}](7: 0) \end{gathered}$ | (2) | 28 hex |
| Store Conditional | sc | 1 | $\begin{aligned} & \mathrm{M}[\mathrm{R}[\mathrm{rs}]+\mathrm{SignExtmm}]=\mathrm{R}[\mathrm{r}] ; \\ & \mathrm{R}[\mathrm{r}]=(\text { atomic }) ? ~ \end{aligned}$ | $(2,7)$ | $38_{\text {bex }}$ |
| Store Halfword | sh | 1 | $\mathrm{M}[\mathrm{R}[\mathrm{ss}]+$ SignExtImm](15:0) $=$ $\mathrm{R}\|\mathrm{rt}\|(15: 0)$ |  | ${ }^{29}$ bex |
| Store Word | sw | 1 | $\mathrm{M}[\mathrm{R}[\mathrm{rs}]+$ SignExtimm $]=\mathrm{R}[\mathrm{rl}]$ | (2) | $2 \mathrm{~b}_{\text {hex }}$ |
| Subtract | sub | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]-\mathrm{R}[\mathrm{r}]$ | (1) | $0 / 22_{\text {hex }}$ |
| Subtract Unsigned | subu | R | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]-\mathrm{R}[\mathrm{rt}]$ |  | $0 / 23_{\text {hex }}$ |

(1) May cause overllow exception
(2) SignExtImm $=\{16\{$ immediate [15] $\}$, immediate $\}$
(3) ZeroExtImm $=\left\{16\left\{1 \mathrm{~b}^{\circ} 0\right\}\right.$, immediate $\}$
(4) Branch $\Lambda$ ddr $-\{14$ (immediate 151$\}$, immediate, $\left.2^{\circ} \mathrm{b} 0\right\}$
(5) JumpAddr $=\{\mathrm{PC}+4[31: 28]$, address, 2 ' B 0$\}$
(6) Operands considered unsigned numbers (vs. 2's comp.)
(7) Atomic test\&set pair, R|ri] 11 if pair atomic, 0 ir not atomic

## BASIC INSTRUCTION FORMATS



ARITHMETIC CORE INSTRUCTION SET
 Branch On FP False be1r FI if(tFPcond)PC=PC+4+BranchAddr(4) 11/8/0/-

 Divide Unsigned divu R Lo=R[rs]/R[rt]; Hi=R[rs]\%R[rt] (6) $0 / \ldots /-/ 1 \mathrm{lb}$ FP Add Single add.s $\mathrm{FR} \quad \mathrm{F}[\mathrm{fd}]=\mathrm{F}[\mathrm{fs}]+\mathrm{F}[\mathrm{ft}] \quad 11 / 10 /-/ 0$
FPAdd $\quad$ add. $\quad$ FR $\{F[f d], F[f d+1]\}=\{[\mathrm{F}[\mathrm{fs}], \mathrm{F}[\mathrm{fs}+1]\}+$
Double $\{\mathrm{F}[\mathrm{t}], \mathrm{F}[\mathrm{t}+1]\}$
FPCompare Single cex. $\mathrm{s}^{*}$ FR FPcond - (F|fslop F|fID ? $1: 0$ FP Compare cxa $\quad$ FR $\quad$ FPcond $=(\{\mathrm{F}[\mathrm{fs}], \mathrm{F}[\mathrm{fs}+1])$ op Double
$*(x$ is eq, 1 L , or 1 e$)(o p$ is,$-<$ or $<-)(y$ is $32,3 \mathrm{c}$, or 3 e$)$
FP Divide Single div.s FR F[fd] $=\mathrm{F}[\mathrm{fs}] / \mathrm{F}[\mathrm{ft}] \quad 11 / 10 /-/ 3$

FP Divide div.d FR $\{\mathrm{F}[\mathrm{fd}], \mathrm{F}[\mathrm{fd}+1]\}=\{\mathrm{F}[\mathrm{f}][\mathrm{F}[\mathrm{fs}+1]\} /$
Double $\{\mathrm{F}[\mathrm{T}] \mathrm{F}[\mathrm{n}+1]$
$11 / 11 /-/ 3$
FPMultiply Single mu1.s $\mathrm{FR} \quad \mathrm{F}[\mathrm{Id}]-\mathrm{F}[\mathrm{Is} \mid * \mathrm{~F}[\mathrm{II}]$
FPMultiply mul.a $\operatorname{PR}\{\mathrm{F}[\mathrm{fd}], \mathrm{F}[\mathrm{fd}+1]\}=\{\mathrm{F}[\mathrm{fs}], \mathrm{F}[\mathrm{fs}+1]\} *$
Double \{F[f],F[ft+1]\}
FP Subtract Single
FP Subtract sub. d FR $\{\mathrm{F}[\mathrm{fd}], \mathrm{F}[\mathrm{d}+1]\}=\{\mathrm{F}[\mathrm{ss}], \mathrm{F}[\mathrm{ss}+1]\}$
Double
Load FP Single
Load FP
Double $1 \mathrm{dc} 1 \quad \mathrm{~F}[\mathrm{rt}]=\mathrm{M}[\mathrm{R}[\mathrm{rs}]+$ SignExtImm $]$
Move From Hi $\quad \mathrm{F}[\mathrm{Tt}+1]=\mathrm{M}[\mathrm{R}[$ [rs $]+$ SignExtImm +4$]$
Move From Lo mrio R R[rd] Hi
Move From Control mfco $\begin{array}{lll}\mathrm{R} & \mathrm{R}[\mathrm{rd}]=\mathrm{Lo} & 0 /-/-/ 12 \\ & \mathrm{R} & 0 /[\mathrm{d}]=\mathrm{CR}[\mathrm{rs}]\end{array}$
$\begin{array}{lllll}\text { Move From Control } & \text { mfco } & \mathrm{R} & \mathrm{R}[\mathrm{rd}]=\mathrm{CR}[\mathrm{rs}] & 10 / 0 /-/ 10 \\ \text { Multiply } & \text { mult } & \mathrm{R} & \{\mathrm{Hi}, \mathrm{Lo}\}=\mathrm{R}[\mathrm{rs}] * \mathrm{R}[\mathrm{rt}] & 0 \%-/-/ 18\end{array}$
Multiply Unsigned mu1tu $\mathrm{R} \quad\{\mathrm{Hi}, \mathrm{L} 0\}=\mathrm{R}[\mathrm{rs}] * \mathrm{R}[\mathrm{rt}]$ (6) $0 /-(-/ 19$
Shift Right Arith. sra $\quad \mathrm{R} \quad \mathrm{R}[\mathrm{ra}]=\mathrm{R}[\mathrm{r}]] \ggg$ shamt $0 /-/-13$


FLOATING-POINT INSTRUCTION FORMATS


PSEUDOINSTRUCTION SET

| NAME | MNEMONIC | OPERATION |
| :---: | :---: | :---: |
| Branch Less Than | bit | if(R[rs] $<$ R [rt] $) \mathrm{PC}=$ Label |
| Branch Greater Than | bgt |  |
| Branch Less Than or Equal | ble | $\mathrm{if}(\mathrm{R}[\mathrm{rs}] \leqslant=\mathrm{R}[\mathrm{rt}]) \mathrm{PC}=$ Label |
| Branch Greater Than or Equal | bqe | $\mathrm{if}(\mathrm{R}[\mathrm{rs}])-\mathrm{R}[\mathrm{rt}]) \mathrm{PC}=$ Label |
| Load Immediate | 11 | $\mathrm{R} \mid \mathrm{rd}]=$ immediate |
| Move | move | $\mathrm{R}[\mathrm{rd}]=\mathrm{R}[\mathrm{rs}]$ |

REGISTER NAME, NUMBER, USE, CALL CONVENTION

| NAME | NUMBER | USE | PRESERVEDACROSS A CALL? |
| :---: | :---: | :---: | :---: |
| Szero | 0 | The Constant Value 0 | N.A. |
| Sat | 1 | Assembler Temporary | No |
| \$v0-\$v1 | 2-3 | Values for Function Results and Expression Evaluation | No |
| \$a0-543 | 4-7 | Arguments | No |
| \$10-\$17 | 8-15 | Temporaries | No |
| \$s0-\$s7 | 16-23 | Saved Temporaries | Yes |
| \$t8-\$t9 | 24-25 | Temporaries | No |
| \$k0-Sk1 | 26-27 | Reserved for OS Kernel | No |
| \$gp | 28 | Global Pointer | Yes |
| \$sp | 29 | Stack Pointer | Yes |
| \$1p | 30 | Frame Pointer | Yes |
| \$ra | 31 | Return Address | No |

Copyright 2009 by Elsevier, Inc., All rights reserved. From Patterson and Hennessy, Computer Organtzation and Design, 4th ed.

