#### エーナエロリ

# Introduction to Binary System and Codes; Digital Systems:

- Digital systems can be sneber to the present technology parial as digital age.
- Digital aystery are used in communication, business Evansactions, traffic control, internet and so on.
- we have digital telephones, digital televisions, digital versatile discs, digital cameras, & also digital computors
  - These devices have graphical user intertaces (GUIs), which enable them to execute commands that appear to the user to be simple.
    - It involves in precise execution of sequence of complex internal instructions
    - The mast striking properly of digital computer is its generality.
    - It can tollow a sequence of instructions, called a program, that operates on a given data.
    - one characteristic of digital system is their ability to represent and manipulate discrete elements of intermation.
    - Any set that is nestricted to a finite number of elements ontain disconete information.

89: 10 decimal digits.

26 letters at alphabet.

52 playing cools.

. 64 squares to a chess board,

- In this case, the discrete elements were the digits.
- From, this the term digital computer emerged.

- A digital system is simply one. Hat specieves input, processes (a) controls activity, and outputs information in a disconete (a) noncontinuous manner.
- The intermation may be "encoded (or)

  it can be in the tramiliar back to number cyclem or in some other number cyclem, such as back 2 (binary)
- Digital design is the application of set of rules and techniques tor developing degital circuits and subsystems to create a solution for some problem.

Attended to the religion linked to prompt good the same of

described adjust the receiving the restly as where it is the

that peralle or a green data.

on home with a light plan is then it has it is

whenite & though stoops oldgren by the go

the state of the state of the day got

Sodaly of the same

root motor of the

the triber .

in your of a charact.

the it all something

- Discorete elements of intermation one superesented in digital system by physical quantities called agrals.
- Electrical cignals such as voltage and current are most common.
- Electronic (circuits) devices called transistery predominate in the circuitry that implement these signals.
- The agnaly in dielectronic digital systems we just two discorde values and one thousand to be binary.
- A binary digit called a bit, has two values: 0 and 1
- Discorete elements of intermation are prepresented with group of bits alled binary rades.
  - Eq: Decimal digits a to 9 are superesented with town bits.
- the general-purpose digital computer is the best-known example of digital system.
- The major poals of a computer are a memory unit, a central processing unit, and input-output units.
- A digital computer is a powerful instrument that an pertorm not only withmetic computations, but also logical operations.
  - In addition, it can be programmed to make decisions based on internal and external commands.
  - A digital gytern is an interconnection of digital modules.
  - To understand the operation of each digital module, it is necessary to have a basic knowledge of digital circuits and their logical function.

- 1 min trad in ligital design methodology; the cut of HOL

- An HDL gesembles a programming language and is suitable for describing digital circuits in textual form.
  - It is used to simulate a digital system to vonity its appointm betore hardware is built.

#### the seal of the season of the seal of the Binary Numbery:

- A decimal number such as 7,392 suppresents a quantity equals to 7000+ 7 thousands + 3 hundred + 9 tens + 2 units. to be well well as a second
- The thousands, hundreds, etc. are powers of 10 implied by position of the coefficients (symbols) in the number. 5t can be expressed of commend in last

- In general, a number with a decimal point is siepresented by a reries of coefficients.

- The coefficients of are any of the adigits (0,1,2,3,...,9), and the subscript value i gives the place value and, hence, the power to by which the octobiciont must be multiplied.
  - Thus, a decimal number can be expressed of

10° as + 10° ay + 10° az + 10° a, + 10° a, + 10° a, + 10° az + 10° az with a==7, a==30, a,=9, a==2

- A decimal number system is said to be of base or radix 10, because It was 10 digits and weatherients are multiplied by powers of 10.

- The binary number gutern is a different number gutern.
- The coefficients of the binary number given have only two possible values: o and 1
- Each coefficient as is multiplied by a power of the radix, of 2 and the sneults are added to obtain the decimal equivalent of number.

Eq: Binary number 11010:11 & equivalent to decimal number 26.75

1x24+1x2+0x2+1x2+0x2+2x1+22x1 = 26.75 16+8+2+0.5+0.25

- There are different number systems. In general, a number expressed in base - & system has coefficients multiplied by power of r.

an. 8 + and 17 + and 1 + ... + a, 7 + a, 8 + a, 8 + ...

man almin - the coefficients as range in values from a to 7-1.

Eq: bak-5 number

(4021.2) = 4x5 + 0x5 + 2x5 + 1x5 + 2x5 + 125xy 15 500 + 10 + 1 + 0.4

The coefficient values for box 5 can be only 0,1,2,3,4 - The octal number geten is a baje-8 system that has 8 digits: 0,1,2,3,4,5,6,7.

5: (127.84)8 = 188+288+7×80 + 4×85 - 64+16+,7+0.5 = (87.5)

Note: The digits 8 and 9 can't appear in a octal number.

- It is customary to bottom the needed or digits don the coefficients from the decimal system when the boxe of the number is less than 10.
- The letters so the alphabet one used to supplement the 10 decimal digits when the base of the number is greater than 10.

· Copple apach a newly of a in piles. for eq. En the hexadecimal (base-16) number eyetem,

The first 10 digits one boomsed from the decimal system. The letter A, B, C, D, E and f one used for the digits 10, 11, 12, 13, 14 and 15

Eq: A Hexadecimal number

(B65F)16 = 11×163+6×162+5×16+15×16°

= (46,687)10

The hexadecimal system is used to suppresent long strings of bits in the addresses, instructions, and duta in digital gutans. B65F is used to supresent : hospillo 0110 0101 1111

The conversion from binary to decimal can be obtained by adding only the numbers with powers of two corresponding to the bits that are equal to 1.

For Eq:  $(110101)_2 = 32 + 16 + 4 + 1 = (53)_{10}$ 

- In computer work

 $2^{10} = K \text{ (Hild)} = 10^3 \quad 2^{20} = M \text{ (mega)} = 10^6 \quad 2^{30} = G \text{ (giga)} = 10^9$   $2^{40} = T \text{ (Tera)} = 10^{12} \quad 2^{50} = P \text{ (Peta)} = 10^{15} \quad 2^{50} = X = 10^{18}$ 

the number of his them to

- Computer capacity is usually given in bytes.
- A byte is equal to eight bits and can prepresents the ade of one keyboard character.
- A computer hard disk with 4 G1B ob storage has a capacity of  $4Gr = 2^2 \cdot 2^2 = 2^{32} = Capproximately 4 billian bytes)$
- Arithmetic operations with binary numbers in box or tollow the same rules as for decimal numbers.

Eq: addition, substraction and multipliplication of two binary numbers as tollows.

Augend 101101. minuend: 101101 multiplicand: 1011

Addend + 100111, substrational:-100111 multiplier: 101

1010100 101010

- Addition:
   The own of two binary numbers is calculated by the same realisticant as in decimal except that the digits to seem in any significant preition can be only o or 1.
  - Any carry obtained in a given significant position is used by the pair of digits one cignificant position higher.

#### substraction:

- Substraction is more complicated.
- The ruley are still the same of in decimal, except that the borrow in a given significant position adds 2 to a minuered digit.
- A boow in the decimal system adds to to a minueral digity Multiplication!
  - Multiplication is simple.
  - The multiplier digits are always I or o.
  - Theretore, the partial products one equal either to a shitted copy of multiplicand or to 0.

### Number - Base Convenions

- -Representation of numbers in a different madix are said to be equivalent it they have same decimal suppresentation.
  - for eq: (0011) and (001), are equivalent, both have same decimal value. 9.
  - The conversion of a number in base-r to decimal is done by expanding the number in a power series and adding all the numbers

A down 1 tolon

and accumulating the enemainders.

Eq: convert decimal 41 to binary

$$2 \frac{41}{20 - 1}$$
 $2 \frac{10 - 0}{25 - 0}$ 
 $2 \frac{5 - 0}{1 - 0}$ 

Eq: Convert decimal (153) to octal (1,1)8

of ve Lorchapthon of the

Eq! Convert (41; to octal

Eg: Convort (0.6875), to binary

Eq: 
$$(0.518)_{10}$$
 to octal

(0.513)  $\times 8 = 4.104$ 

0.104  $\times 8 = 0.832$ 

0.832  $\times 8 = 6.656$ 

0.656  $\times 8 = 5.248$ 

0.248  $\times 8 = 1.984$ 

(0.513)<sub>10</sub> = (0.40651...)<sub>8</sub>

$$-\frac{1}{2} - \frac{(41.6875)}{(153.513)}_{10} = \frac{(101001.1011)}{2}$$

$$-\frac{1}{2} - \frac{(153.513)}{80} = \frac{(231.406517)}{8}$$

# octal and Hexadecimal Numbers

- The conversions from and to binary, octal and hexadecimal plays an important role in digital computer, because shaler pattern of hex charactery one easier to necognite than long patterns of o's 4 is - since ≥0 ≥2=8 4 ≥4=16, each octal digit corresponds to three binary digits and each hexadecimal digit corresponds to four binary digits ⇒ The first 16 numbers in the decimal, binary, octal thexadecimal as

| Decimal  | Binary   | octal       | Hexadecimal |
|----------|----------|-------------|-------------|
| (bak 10) | (base 2) | (bak 8)     | (baje 16)   |
| 0        | 0000     | 00          | 2 3         |
| 2 3      | 0100     | 03          | 4 5         |
| 6 1      | 0101     | 06          | m /sacc not |
| 7 11     | 01161    | TIME OF THE | 8           |
| 8        | 1000     | 10          | 9           |

- The conversion from binary to octal is accomplished by Partitioning the binary number into group of three digits each, starting from the binary point and proceeding to the littered to the right.
- The corresponding actal digit is assigned to each grap.
  - E9: (10 110 001 101 011. 111 100 000 110)
    - =) (26153.7406)<sub>8</sub>
- Conversion from binary to hoxadecimal is similar, except that the binary number is divided into groups of four digits.

- (2 C 6 B. F 0 6),6
- Enversion from octal (or) hexadecimal to binary is done by supresenting the each octal digit to its three-digit binary equivalent.

- similarly, each hexadecimal digit is converted to its town-digit binary equivalent.

 $(673.124)_{8} = (10 111 011.001 010 100)_{2}$   $(306.0)_{16} = (0011 0000 0110.1101)_{2}$ 

### Compliments St Numbers:

- Compliments are used in digital computers to simplify the substraction operation and too logic operation.
- simplifying operations leads to simpler, less expensive circuits to implement the operations.

There are two types of complements for each base- & system.

- 1. Radix Complement (7's complement)
- 2. Diminished Rodex Complement. ((1-1)'s complement)
- The two types netword to as the e's' complement and is complement for binary numbers of 10's complement and 9's complement too decimal numbers.

# Diminished Radix Complement:

Given a number N in base r having n digits

The (8-1)'s complement of N is (8n-1)-N

for decimal number: 8=10 r-1=9So, (Diminished Rodix Complement) 9's complement of N is  $(0^{n}-1)-N$ 

planting 189 (2. it man = 4) (1) 7

(104-1)-N => 9999-N

-: 9's complement to a decimal number is obtained by substracting

of oring the est

The 9's complement 8t 547600 is
999999-547600 => 452399

The 9's complement of 012398 & 987601

the state of the first of the same of the same

For binary numbers Y=2 Y=1=1I's complement of  $N \neq (2^n-1)-N$ if N=4  $2^n=2^n=(10000)_2$   $4^n=(1111)_2$ i's complement of  $N \neq (2^n-1)-N$  = (1111)-N

Thus, the is complement of binary number is dotained by substracting each digit from 1.

we can have either 1-0=1(a) 1-1=0

is complement of 1011000 y 0100111

i's complement of 0101101 is 1010010 in the i's complement of a binary number is formed by changing is to 0's and 0's to 1's.

- (8-1)'s complement of octal or hexadecimal number is obtained by substracting each digit from 7 (a) F (decimal is), respectively

Rodox Complement

The 8's amplement 8t an n-digit number N in base r is defined as  $8^{\circ}-M$  for N \$0 \$ as a for N=0.

For decimal r=10

Annalysis Also is re-

 $y_0 - M \Rightarrow \omega_0 - M$ 

: 10's complement st N can be termed by Leaving all least significant o's unchanged, substracting the tisut non zero least significant digit from 10, and substracting all higher digits from a They

10's complement of 012398 4 987.602

Similarly 2's complement of N can be tormed by leaving all least significant o's and the tiset I undranged and supplacing i's with o's and o's with i's in all other higher significant digits.

2's complement of 1101100 & 0010100 e's complement of 0110111 & 100 1001

- It the original number N contains the radix point, the point should be snemoved temporarily in order to form the r's 4(r-1)'s complement.
- complement.

   The godix point is then nestored to the complemented number.
- The complement of the complement mentures the number to its

Substraction with complements

unsigned The aubstraction of two n-digit numbers M-N in bage of can be done as follows.

- 1. hold the minued 14 to the r's complement of the substrachend, Mathematically  $M + (r^2 - H) = M - N + 8'$
- 2. It M > N, the sum will produce an end away r, which on be dynaded; what left is the result 19-N.
- 3. It MICH, the sum does not produce and only and y equal to r'-(N-M), which is is complement of (N-M). . To obtain the answer in a tramiliar term, take the r's complement so sun & place a regative sign in tront.

Eg: wing 10's complement, substract 72532-3250

M = 72532

10's complement &N = 96750

Sun = 169282

Discord end any 105 = 100000 69285

59: using 10's complement, substract 3250 - 72532

M : 03250

10's complement of N = 27468 30718 they

Eq: Given the two binary numbers x=1010100 & y=1000011, perform substraction as x-y 4 b) & y-x by wing 2's complement

2's completed y = 0 11111.01

Discord end awyst: 10000000

D) 7= 1000011

2's complement & x = 01011.00

There is no ord away y-x=-(2') complement of 1101111)

= \_000001

#### Examply to work:

1) Convert the tollowing to decimal and then binary

a) (1876),6 b) (AB22),6 c) (1212), d) (1556),

2) anvert the tollowing hexa decimal numbers into an equivalent binary numbers

(i) 58 (ii) 234 (iii) ABC (iv) FB18

3) Substract the tollowing decimal numbers using 9's complent
(i) 347-265 (ii) 49-84 (iii) 349-436 (iv) 9-4

- 4) Convert the trollowing to octal and Hetadecimal
- is the state of the part of the state of the state of a) 100 101101011 b) 10 1101000001012
  - c) 791,0 d) 1600,0 e) 197,0

Eg: using is complement

I The state of the State of the

9 x= 1010100

is complement of y: 0111100

Sum = 10 010000

End-around carry:+

00 1 000 1

b) y-x= 1000001- 1010100

7:1000011

There is no end-coory. .: Y= x= - (is complete of 1101110) the talkers have derived and it are required in

- In \$ (8-1)'s amplement enemoving the end around avoing and to the sum is sufferred to as an end-around avery
- The procedure with end-orand coory is also applicable to

### Signed Binary Mumber:

- Possitive integers (including zero) can be represented as unsigned nois.
  - In ordinary arithmetic, a negative no is indicated by minus sign and a positive number by a plus sign.
- Because of hardware limitations, computers must represent everything with binary digits.
- in the lebt most position ob the number.
- The convention is to make the eign bit o for positive f
- Both signed and unsigned binary numbers consults of a string of bits when suppresented in a computer.
  - The user determines whether the number is signed or not
- It the binary number 4 signed, then the lebt mat bit represents the sign and the riest of the bits represent number.
  - It the binary number is unsigned, then the left most bit it the most significant bit of the number.
    - Eq: the string of bits 01001 is considered as

      9 unsigned number

      + 9 signed number.
- the string of bits 11001 is considered as

  25 unsigned number

  -9 Signed number.

a symbol (+ or -) or a bit (o or 1) indicating the sign.

Signed Complement System

when arithmetic operations are implemented in a computer, it is more constraint to use a different system, substrated to as the signed complement system, for supresenting negative numbers.

In this system a negative number is indicated by its complement whereas the signed magnitude system negates a number by changing its sign, the signed complement system negates a number by taking its complement.

Eq: Consider the number q, suppresented in binary with eight bit

Birray equivalent of +9 is 00001001 there are 3 ways to suppresent -9

Signed magnitude representation 10001001

Signed is complement representation 11110110

Signed 2's complement representation 1111021

Representation st signed numbery using 2's on is complement method.

1. It the number is possitive, the magnitude is nepresented in its true binary term and a sign bit o is placed in trent to the most 1958.

| Table ! |        |         |
|---------|--------|---------|
| Signed  | Birary | Numbery |

| Decimal   | signed - 2's<br>complement | signed-1's amplement | signed magnitude | 1     |
|-----------|----------------------------|----------------------|------------------|-------|
| 47        | 0111                       | 0 111                | 0000111          | 4     |
|           | 0110                       | 0110                 | 0110             |       |
| +6        | 0 101                      | 0101                 | 0101             | 1,.   |
| +5        |                            | 0100                 | 0 100            |       |
| +4        | 0100                       | 0011                 | 0011             |       |
| +2        | 0010                       | 0010                 | 0010             |       |
| +1        | 10001                      | 0001                 | 0 00 1           |       |
| +0        | 0000                       | 0000                 | 0 000            |       |
|           | -                          | 1 1 1 1              | 1000             |       |
| 20 = 1    | 11101                      | 10110                | 1001             |       |
| 1 52      | - AV - To                  | 1101                 | 1010             |       |
| -3        | 214/1021                   | 1100                 | the 1011 1       | or i- |
| -H-Marile | 1100                       | 1011                 | 1100             |       |
| -5        | 1011                       | 1010                 | 1.1010 m         |       |
| -6        | 1010                       | 1110                 | 1110             |       |
| -7        | 1001                       | 1000                 | and a little     |       |
| -8        | 1000                       | £ —                  | -                |       |

special case in e's complement! whenever a signed number has a 1 in the sign bit and all or for magnitude bits, the decimal equivalent is -e? sign bit and all or for magnitude bits, the decimal equivalent is -2°.

Arithmetic Addition

The addition of two signed binary numbery with negative numbery suppresented in signed - e's - complement form is obtained from the addition of the two numbers, including their sign bits.

A coory out of the sign-bit position is discorded.

+6 00000110 00001101 00010011

+6 00000110 -13 11110091

11111010 1 1100000T T

1111010 -13 11110011

carry has to be

hay to be

discorded.

# Arithmetic Substraction

- Substraction of two signed binary numbers when negative numbers are in e's complement from is simple and can be stated as Adlaws.

Take the e's complement of the substrational Cincluding the sign bit) and add it to the minuend (including the sign bit) A carry out of the sign-bit position is discarded.

This procedure is adopted because a substraction operation can be changed to an addition operation it the sign of the substratand is changed, as is demonstrated by the tollowing robationship.

$$(\pm A) - (\pm B) = (\pm A) + (-B)$$
  
 $(\pm A) - (-B) = (\pm A) + (\pm B)$ 

Eq! (-6)-(-13)=+7

Mad 1 1 1 1 1 0 10 - 11110011

11111010 000001101 700000111 away has to be discarded.

### Binary Codes

- A n-bit binary ande is a group of n-bits that can have upto en distinct combinations of o's and 1's.
- Each combination represents one element of the set that is being coded.
  - A set of 4 elements can be raded with two bits, with each element assigned one of the bit combinations: 00,01,10,11.
  - A set ob 8 elements sieguires a three bit cade and a set ob
  - The bit combination of an n-bit cade is determined from the count in binary from 0 to  $2^n-1$ .
- Each element must be assigned a unique binary bit ambination, and no two elements can have the same value; otherwise the code assignment will be ambiguous.
  - The minimum number of bits prequired to cade 2° diffinct quantities is no there is no maximum number of bits that prequired to code of may be used for a binary ande.

# Binary-Coded Decimal Code:

- A BCD code is one, in which the digits so a decimal number are encoded one at a time into groups so trown binary digits.
- There codes combine the treatures of decimal and binary numbers.
- In order to suppresent decimal digits 0,1,2,...,9, it 4

- such a sequence of binary digits which suppresents a decimal digit is called a code wad.
- A binary code will have some unaugned bit combinations it the number of elements in the set is not a multiple power of e.
  - The 10 decimal digits from such a set.
- A binary code that distinguishes among to elements must contain at least four bits, but 6 out 86 16 possible combinations remain un-agged kugered.
  - This schome is called Binary Goded Decimal and is commonly netroned to as BCD.

the tollowing table gives the tour-bit code for one decimal doit A number with K bits decimal digits will require 4k bits in

otiful, and

#### Binary Coded Decimal (BCD)

dried mile

| Dec          | imal symbol | BCD Digit |
|--------------|-------------|-----------|
| James J.     | do L        | 0000      |
|              | Line to the | 0001      |
|              | 2           | 0010      |
|              | 3           | 0011      |
| provide Land | 4. 1 80 00  | 0100      |
|              | 5           | 0101      |
| The same     | 6           | 0110      |
|              | 7           | 0111      |
|              | 8           | 1000      |
|              | 9           | 1001      |

- the binary combinations 1010 through 1111 are not used and have no meaning in RCD.

Eq: consider decimal (185), 4 its corresponding value in BCD & birary

(185),0 = (0001 1000 0101) BCD = (10111001)2

### BCD Addition:

- Consider the addition of two decimal digits in BCD, together with a possible carry from a previous less significant pair of digits.
- since each digit does not exceed 9, sum cannot be greater than  $9+9+1=19, \quad \text{with 1 being a previous corry.}$
- Suppose we add the BCD digits as it they were binary numbers.
- Then the binary wern will produce a signific in the range trom o to 19.
- In binary, this stange will be trom occo to 100.11. But in RCD, it is trom occo to 1 1001, with the birut 1 being a carry and the next trour bily being the BCD Rum.
  - when the birary rum is equal to a less than 1001 (without covery), the corresponding BCD digit is correct.
- when the binary rum is greater than a equal to 1010,

the small is an invalid BCD digit.

- The addition sto 6 = (0110) to the binary cum (11-1)
converts it to the correct digit and also produces a carry of nequired.

- This because a more in the most significant bit position and

Consider the tollowing BCD additions.

The addition to two n-digit unsigned BCD numbers tollows the same procedure

consider 
$$184+576=760$$
 in BCD

BCD  $0001 1000 0100 184$ 
 $0101 0111 0110 +576$ 
 $0110 0110 0110$ 
 $0110 0110$ 
 $0110 0110$ 

# BCD Substraction

- The BCD substraction is performed by substracting the digits to each 4-bit group of the substrated trom the corresponding 4-bit group of the minuend in binary starting from the LSD.

- It there is no bottom from the next higher group then no correction is enequived.

CThy is done to skip 6 illegal states)

# other Decimal codes

4 de Jujos I a

- Binary codes for decimal digits enequire a minimum of town bits
- Many sown different rodes can be termulated by avianging four bits into 10 distinct combinations.
- Each code vers only 10 out of a possible 16. bit combinations that can be accoranged with town bits
  - The other six unused combinations have no meaning and should be avoided.
- BCD and the 2421 code one examples of weighted rodes.
- In a weighted code, each bit position is assigned a weighting tactor in such a way that each digit can be evaluated by adding the weights of all the is in the coded combination.

#### M.JOSHNA

- The bit assignment 0110, for eq: is interpreted by weights to suppresent decimal 6. because 8x0+4x1+2x1+0x1=6.
- Note that some digits can be coded in two possible ways in 2421 code.
- for invarie, decimal 4 can be coded to bit combination 0100 as 1010, since both combinations add upto a total weight of 4.

Table: Some different Binary Codes for decimal digits.

| Decimal<br>Digit | BCD<br>8421 | 2421  | Exces-3 | 8,4,-2,-1 |
|------------------|-------------|-------|---------|-----------|
| 0                | 0000        | 0000  | 0011    | 00 00     |
| 1                | 0001        | 0001  | 0100    | 0111      |
| 2                | 0010        | 0010  | 0101    | 0110      |
| 3                | 0011        | 0011  | 0110    | 0101      |
| 4                | 0100        | 01.00 | 0111    | 0100      |
| 5                | 0.101       | 1011  | 1000    | 1011      |
| 6                | 0110        | 1,100 | 1001    | 1010      |
| 7                | 0 111       | 1101  | 1010    | 1001      |
| 8                | 1000        | 1110  | 1011    | 1000      |
| 9                | 1001        | 1111  | 1100    | 5. (II.)  |
| nund             | 1010        | 0101  | 0000    | 000 1     |
| codes            | 1011        | 0110  | 0001    | 0010      |
| codes            | 1100        | ou    | 0000    | 001)      |
| 4 phones         |             | 1000  | 1101    | 1100      |
|                  | 1101        | 1001  | 1110    | 1101      |
|                  | 1110        | 1010  | 1111    | 1110      |

- 2421 and the excess-3 are examples of self-complementing codes.
- such codes have the property that the 9's complement to a decimal number is obtained directly by changing is to o's and o's to i's

Eq: Decimal 395 excess-3 2421

9's compt of 395 y 604 1009 0011 0111 1000 0000 0100

- The excess-s code has been used in some older computers because of its self-complementing property.
  - Excers-3 is an unweighted code in which each coded combination is obtained from the corresponding binary value plus 3.
  - The 8,4,-2,-1 code is an example of anigning both positive and negative weights to a decimal code.

# Gray Code:

- the output data ob many physical systems are quantities that are continuous.
- These data must be converted into digital form before they are applied to a digital systems.
- Continuous con analog intermedian is converted into digital form by means so analog-to-digital converter.
- The advantage of Gray Code over the draight binary numbers sequence is that only one bit in the code group changes in going brom one number to the next number.
  - Eq: in going from 7 to 8, the gray code changy from 0100 to 1100.

    only fight bit changy from 0 to 1; then three bits memain same.

    By contrast, with binary numbery the change from 7 to 8 will

- the Gray rade is a non-weighted rade, and is not suitable for arithmetic operations.
- An N-bit Gray code can be obtained by retriecting an N-1 bit code about an axis at the end of the code, and pulling the MSB of a above the axis and the MSB of 1 below the axis.

  Retriection of Gray-Codes is shown in tollowing table.

|             | Gray    | Code     | e         |       | . 1-1-                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |             | Decim | al     | 4-bit binou |
|-------------|---------|----------|-----------|-------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------|-------|--------|-------------|
| 1-bit       | 2       | -bit     | 3-bit     | · ak  | 4-bit                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | net<br>Sett |       | . F    | 1-11        |
| 0           |         | 0        | 000       | Price |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | -, 41       | K     | 1-01-1 | 0000        |
| has gotting | 0       | propo    | 001       |       | 0 001                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |             | N8    | ri a   | 0000        |
|             | - 1     | -1       | 011       |       | the state of the s | . 1         | 4     | n 100  | 0100        |
|             | 1       | 0        | 010       |       | 0110                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |             | 5     |        | 0101        |
| ro fall o   |         |          | 1 10      |       | 0101                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | Hz          | 7     | 1 1    | 0110        |
|             |         |          | 101       |       | 0100                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |             | 8     |        | 1000        |
| pth m. h    | all or  | र्वेटी ल | 100       | 2.    |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |             | ly sw | dala   | 1010        |
|             | X 1 1 1 |          |           |       | 1 100                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | 100         | 9     |        |             |
|             | 0.33    |          | i Indyawa | 20 15 | 1111                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |             | III   |        | 1100        |
|             |         |          |           |       | 1710                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |             |       |        | 1101        |
| antic For   |         |          |           |       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |             |       |        |             |
|             | W. H.   | Tallance | dod.      | note. | 1011                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | K           | 25    | day to | 1111        |
| n et out    |         | herony   |           |       | 1001                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | ~           | 106   |        | 0.50        |
| of chemics  |         |          |           |       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | 4:1         | 1 7   | WE .   |             |
|             |         |          |           |       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | -10         | 41-4  |        |             |



### ASCII character code

- Many applications of digital computors naquires the handling of not only of numbers, but also or other chanacters or symbols, such as letters or alphabets.
  - An alphanumeric shoracter set is a set to elements that includes the so decimal digits, 26 letters of the alphabet, and a number to special characters.
  - The standard binary code for the alphanumeric charactery if the American standard code for Information Enterchange (ASCII), which was seven bits to code 128 chanactery.
  - The seven bits of the rade one designated by b, through by with by the most significant bit.
  - The ascir code antains 94 graphic characters that can be printed of 34 non printing abaracters used for various control functions.

M.JOSHN*A* 

and se special printable characters, such as 1. , + , and f

- ASCII is a 7-bit rade, but most computers manipulate an eight his quantity as a single unit called a byte.
- . ASCII character most often are stored one per byte.
- The cebra bit is sometimes used for other purposes, depending on the application.
- for eq: some printery necognize eight-bit ASCII characters with the most significant bit set to o.
- An additional 128 eight-bit characters with the most significant bit set to 1 are used for other symbols, such as the Greek. alphabet or italic type font.

### Earla - Detecting Code

- To detect oursy in data Communication and proceeding an eighth bit is sometimes added to the ASCII character to indicate its painty.
  - A parity bit is an order bit included with a message to make the total number of is either oven (a) add.

Consider the trollawing two charactery and their even and odd parity.

ASCII = 1000001 01000001 11000001

ASCII = 1010100 11010100 01010100

- The parity bit is helpful in detecting words during the transmission of intermation from one location to another.
- This trunction is handled by generating an even parity bit at the sending end for each docacter.
- The eight bit charactery that include painty bits one bransmitted

- the parity of each character is checked at the overexing end.
- It the parity of the secreted character is not even, than at least one bit has changed value during the transmission.
- This method detects one, three, or any odd combination of exists in each character that is transmitted.
  - An even combination to cooky, however, goes undetected, and additional and detection codes may be needed to take cone of that possibility.
  - After delecting an expiratione parability is to sequent statement won of the message on the assumption that the cool was random and will not occur again.
  - Thus, it the species a detects a parity early, it sends back the ASCIT NAK antrol character ansyting of an even-parity eight bits
  - It no could is delected, the energial sends back an Ack control diaracter, namely, 00000110.
  - Et, other a number of attempts, the transmission is still in early, a message can be sent to the operator to check for maltrunctions

# Kepresentation

Due to limitations of hardware, computers must suppresent everything with o's f's, including the aign of a number. As a consequence, it is customary to represent the agn bit pulsced in the lebb-most position to the number. The convention is to make the uga bit equal to o ter positive numbers 4 to 1 for negative numbers.

In addition to the ugn, a number may have a binary (or decimal) Point. The position of binary point is needed to supresent tractions, integers, (or) mixed integer-fraction numbers.

There are two ways or specitizing the position of the binary point in a siegiller.

- 2. Floating Point Representation.
- Fixed-Point Representation assumes that the binory point is fixed in one position.

The two positions most widely und are

- (i) a binary point in the extreme left of the negiter to make the World number a traction.
- (ii) a binary point in the extreme, right of the significan to make the stored number an integer.
- The floating-Point Supprepentation vy a record Suggister to 1880 a number that designates the parition of decimal point in the first negister.

# Floating-Point Representation

The floating point representation to a number has two Parts. The tirut part sepretents a agned, fixed point number called the mantissa. The second part designates the position of binory (or decimal) point and y called exponent. The fixed point mantissa may be a fraction (ar) an integra

12: Gossider the decimal number. + 6132.789

Floating point Depresentation 4

Fraction Exponent +0.6132789 +04

The value of the exponent indicates the actual position of the decimal point is town positions to the right of the indicated docimal point in the fraction.

Notation 4: +0.6132789 X 10

Floating-Point is always interpreted to supresent a number mxre in the tollawing form.

m - mantissa e - exponent (including their kign) and of all drawny - radies and and interest to the

Eg: Consider a binary number + 1001.11 is supresented with 8-bit traction & 6-bit exponent as tabus.

> Fraction Exponent 0 100/110 000100

: Floating point number is mx2=+(.1001110) x2

#### Normalization

A floating Point number is said to be normalited, it the most agniticant digit of the mantissa is non zero.

for eg: Desimal number 350 y normalized but 00035 y not. Moundized numbers provide the manumum possible precises

for the bloating point number v

Clandard forms st floating Point numbers

1. ANSI (Amoural National Standords Institute)

2. IEEE (Institute of Electrical & Electronic Engineery)

The ANSI 32-bit floating point numbery in byte format. is given below.

Byle 4 Byte format SEEE . HIMMMMMM MMMMMMM - MMMMMMMM Martissa

Binooy Point

S - Sign of Mantissa

E - Exponent bits in 2's complement 1' man ? 5000 47

- Mantiga bits.

I con sol - soll CHARZO

Eg:

= 0 0000 100 11010000 00000000 00000000

$$-17 = -10001 = -0.10001 \times 2^{5}$$

= 10000101 10001000 00000000 000000000

= 1111110 | 0000000 0 0000000 00000000

. Almer for minds

# or appear to a man and all all and to add the hard and REEE format

1. Single Precision 32-bits

| sign  | Exponent | Mantissa                           |
|-------|----------|------------------------------------|
| 1-Bit | S-Bih    | of production this product tide of |

$$= 1.010101001 \times 2^{6}$$

$$(2^{n+1})$$

Bland Exponent: 127+6=133=10000101

IEEE 754 single Precision Representation is

# 0 10000101 010101001000000000000000

Hexa decimal y



64-bit

|      | 6        | 4 bits - |
|------|----------|----------|
| xgn  | Exponent | Mantissa |
| -BIF | 11-64    | 52-bit   |

母:

(85.125) = 1.010101001 X26

Biand Exponent: 1023+6=1029=10000000101

singn bit : 0

SEEE 754 Double precision &

0 10000000101 010101001 0.....0 49 bits - pad zero

Hexa decimal value

4055480000000000

| M.JOSHNA |  |
|----------|--|
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |
|          |  |

## **UNIT I-PART-II**

<u>Syllabus:</u> Basic Structure Of Computers: Functional unit, Basic Operational concepts, Bus structures, System Software, Performance, The history of computer development.

<u>Computer:</u> Computer is a fast electronic calculating machine that accepts digitized input information, processing it according to a list of internally stored instructions and produces the resulting output information. The list of instructions is called as a Computer program and the internal storage is called as Computer memory.

<u>Types of Languages:</u> Just as humans use language to communicate, and different regions have different languages, computers also have their own languages that are specific to them. Different kinds of languages have been developed to perform different types of work on the computer. Basically, languages can be divided into two categories according to how the computer understands them.

- 1. **Low-Level Languages:** A language that corresponds directly to a specific machine. Low-level computer languages are either machine codes or are very close them. A computer cannot understand instructions given to it in high-level languages or in English. It can only understand and execute instructions given in the form of machine language i.e. binary. There are two types of low-level languages:
  - 1. Machine Language: a language that is directly interpreted into the hardware. Machine language is the lowest and most elementary level of programming language and was the first type of programming language to be developed. Machine language is basically the only language that a computer can understand and it is usually written in hex. It is represented inside the computer by a string of binary digits (bits) 0 and 1. The symbol 0 stands for the absence of an electric pulse and the 1 stands for the presence of an electric pulse. Since a computer is capable of recognizing electric signals, it understands machine language.

## **Advantages:**

Machine language makes fast and efficient use of the computer.

lt requires no translator to translate the code. It is directly understood by the computer.

## **Disadvantages:**

All operation codes have to be remembered

All memory addresses have to be remembered.

It is hard to amend or find errors in a program written in the machine language.

2. Assembly Language: A slightly more user-friendly language that directly corresponds to machine language. Assembly language was developed to overcome some of the many inconveniences of machine language. This is another low-level but very important language in which operation codes and operands are given in the form of alphanumeric symbols instead of 0's and 1's.

These alphanumeric symbols are known as mnemonic codes and can combine in a maximum of five-letter combinations e.g. ADD for addition, SUB for subtraction, START, LABEL etc. Because of this feature, assembly language is also known as 'Symbolic Programming Language.'

## **Advantages:**

Assembly language is easier to understand and use as compared to machine language.

lt is easy to locate and correct errors.

It is easily modified.

## **Disadvantages:**

- Like machine language, it is also machine dependent/specific.
- Since it is machine dependent, the programmer also needs to understand the hardware.
- 2. **High-Level Languages:** Any language that is independent of the machine. High-level computer languages use formats that are similar to English. The purpose of developing high-level languages was to enable people to write programs easily, in their own native language environment (English).

High-level languages are basically symbolic languages that use English words and/or mathematical symbols rather than mnemonic codes. Each instruction in the high-level language is translated into many machine language instructions that the computer can understand.

## **Advantages:**

- 1. High-level languages are user-friendly
- 2. They are easier to learn.
- 3. They are easier to maintain
- 4. A program written in a high-level language can be translated into many machine languages and can run on any computer
  - 5.programs developed in a high-level language can be run on any computer text

## **Disadvantages:**

6. A high-level language has to be translated into the machine language by a translator, which takes up time

<u>Computer Types:</u> Basing capacity, technology used and performance of computer, they are classified into two types

- → According to computational ability
- → According to generation

#### According to computational ability (Based on Size, cost and performance):

There are mainly 4 types of computers. These include:

- 1.Micro computers
- 2. Mainframe computers
- 3. Mini computers
- 4. Super computer

#### 1. Micro computers: -

Micro computers are the most common type of computers in existence today, whether at work in school or on the desk at home. These computers include:

- 1. Desktop computer
- 2. Personal digital assistants (more commonly known as PDA's)
- 3. Palmtop computers
- 4. Laptop and notebook computers

Micro computers were the smallest, least powerful and least expensive of the computers of the time. The first Micro computers could only perform one task at a time, while bigger computers ran multi-tasking operating systems, and served multiple users. Referred to as a personal computer or "desktop computer", Micro computers are generally meant to service one user (person) at a time. By the late 1990s, all personal computers run a multi-tasking operating system, but are still intended for a single user.

#### 2. Mainframe Computers :-

The term Mainframe computer was created to distinguish the traditional, large, institutional computer intended to service multiple users from the smaller, single user machines. These computers are capable of handling and processing very large amounts of data easily and quickly. A mainframe speed is so fast that it

is measured in millions of tasks per milliseconds (MTM). While other computers became smaller, Mainframe computers stayed large to maintain the ever growing memory capacity and speed.

Mainframe computers are used in large institutions such as government, banks and large corporations. These institutions were early adopters of computer use, long before personal computers were available to individuals. "Mainframe" often refers to computers compatible with the computer architectures established in the 1960's. Thus, the origin of the architecture also affects the classification, not just processing power.

## 3. Mini Computers / Workstation :-

**Mini computers, or Workstations**, were computers that are one step above the micro or personal computers and a step below mainframe computers. They are intended to serve one user, but contain special hardware enhancements not found on a personal computer. They run operating systems that are normally associated with mainframe computers, usually one of the variants of the UNIX operating system.

## 4. **Super Computer**:-

A Super computer is a specialized variation of the mainframe. Where a mainframe is intended to perform many tasks, a Super computer tends to focus on performing a single program of intense numerical calculations. Weather forecasting systems, Automobile design systems, extreme graphic generator for example, are usually based on super computers.

| Туре              | Word<br>length | Memory | Processing speed  | Application                                                                                     |
|-------------------|----------------|--------|-------------------|-------------------------------------------------------------------------------------------------|
| Super<br>computer | 64-96<br>bits  | 256MB  | 400-<br>10000mips | Sophisticated Scientific problems,<br>Weather forecasting, Aerodynamics,<br>Atomic Research etc |
| Main<br>Frame     | 48-64<br>bits  | 128mb  | 30-100mips        | Large industries, banks, airlines, NGO's.                                                       |

| Mini  | 32bits       | 96mb | 10-30mips            | Interactive and multi user environment.                                          |
|-------|--------------|------|----------------------|----------------------------------------------------------------------------------|
| Micro | 8-32<br>bits | 64MB | 1 <sub>f</sub> 5MIPS | General purpose calculations,<br>Industrial Control, Office Automation,<br>e.t.c |

## **According to Generations of Computers:**

The history of computer development is often referred to in reference to the different generations of computing devices. Each generation of computer is characterized by a major technological development that fundamentally changed the way computers operate, resulting in increasingly smaller, cheaper, more powerful and more efficient and reliable devices.

#### 1. First Generation (1940-1956): Vacuum Tubes:

The first computers used vacuum tubes for circuitry and magnetic drums for memory, and were often enormous, taking up entire rooms. They were very expensive to operate and in addition to using a great deal of electricity, generated a lot of heat, which was often the cause of malfunctions.

First generation computers relied on machine language, the lowest-level programming language understood by computers, to perform operations, and they could only solve one problem at a time. Input was based on punched cards and paper tape, and output was displayed on printouts.

Example: The UNIVAC and ENIAC computers are examples of first-generation computing devices. The UNIVAC was the first commercial computer delivered to a business client, the U.S. Census Bureau in 1951.

#### b) Second Generation (1956-1963): Transistors:-

Transistors replaced vacuum tubes and ushered in the second generation of computers. The transistor was invented in 1947 but did not see widespread use in computers until the late 1950s. The

transistor was far superior to the vacuum tube, allowing computers to become smaller, faster, cheaper, more energy-efficient and more reliable than their first-generation predecessors. Though the transistor still generated a great deal of heat that subjected the computer to damage, it was a vast improvement over the vacuum tube. Second-generation computers still relied on punched cards for input and printouts for output.

Second-generation computers moved from cryptic binary machine language to symbolic, or assembly, languages, which allowed programmers to specify instructions in words. High-level programming languages were also being developed at this time, such as early versions of COBOL and FORTRAN. These were also the first computers that stored their instructions in their memory, which moved from a magnetic drum to magnetic core technology.

The first computers of this generation were developed for the atomic energy industry.

## 1. Third Generation (1964-1971): Integrated Circuits

The development of the integrated circuit was the hallmark of the third generation of computers. Transistors were miniaturized and placed on silicon chips, called semiconductors, which drastically increased the speed and efficiency of computers.

Instead of punched cards and printouts, users interacted with third generation computers through keyboards and monitors and interfaced with an operating system, which allowed the device to run many different applications at one time with a central program that monitored the memory. Computers for the first time became accessible to a mass audience because they were smaller and cheaper than their predecessors.

## 2. Fourth Generation (1971-Present): Microprocessors

The microprocessor brought the fourth generation of computers, as thousands of integrated circuits were built onto a single silicon chip. What in the first generation filled an entire room could now fit in the palm of the hand. The Intel 4004 chip, developed in 1971, located all the components of the computer— from the central processing unit and memory to input/output controls—on a single chip.

In 1981 IBM introduced its first computer for the home user, and in 1984 Apple introduced the Macintosh. Microprocessors also moved out of the realm of desktop computers and into many areas of life as more and more everyday products began to use microprocessors.

As these small computers became more powerful, they could be linked together to form networks, which eventually led to the development of the Internet. Fourth generation computers also saw the development of GUIs, the mouse and handheld devices.

## 3. Fifth Generation (Present and Beyond): Artificial Intelligence)

Fifth generation computing devices, based on artificial intelligence, are still in development, though there are some applications, such as voice recognition, that are being used today. The use of parallel processing and superconductors is helping to make artificial intelligence a reality. Quantum computation and molecular and nanotechnology will radically change the face of computers in years to come. The goal of fifth-generation computing is to develop devices that respond to natural language input and are capable of learning and self-organization.

## Functional Unit (Or) Structure of a Computer System:

Every Digital computer systems consist of five distinct functional units. These units are as follows:

- 1. Input unit
- 2. Memory unit
- 3. Arithmetic logic unit
- 4. Output unit
- 5. Control Unit



Figure 1.1 Basic functional units of a computer.

These units are interconnected by electrical cables to permit communication between them. A computer must receive both data and program statements to function properly and be able to solve problems. The method of feeding data and programs to a computer is accomplished by an input device. Computer input devices read data from a source, such as magnetic disks, and translate that data into electronic impulses for transfer into the CPU. Example for input devices are a keyboard, a mouse, or a scanner. Central Processing Unit The brain of a computer system is the central processing unit (CPU). The CPU processes data transferred to it from one of the various input devices. It then transfers either an intermediate or final result of the CPU to one or more output devices. A central control section and work areas are required to perform calculations or manipulate data. The CPU is the computing center of the system. It consists of a control section, an arithmetic-logic section, and an internal storage section (main memory). Each section within the CPU serves a specific function and has a particular relationship with the other sections within the CPU.

<u>Input Unit</u>: An input device is usually a **keyboard or mouse**, the input device is the conduit through which data and instructions enter a computer.

- 1. The most common input device is the *keyboard*, which accepts letters, numbers, and commands from the user.
- 2. **Another important type of input device is** *the mouse***, which lets you select options from on- screen menus**. You use a mouse by moving it across a flat surface and pressing its buttons. A variety of other input devices work with personal computers, too:
- 3. The <u>trackball</u> and touchpad are variations of the mouse and enable you to draw or point on the screen.

The joystick is a swiveling lever mounted on a stationary base that is well suited for playing video games

**Memory unit:** memory is used to store programs and data. There are two classes of storage, called primary and secondary.

<u>Primary storage</u>: It is a fast memory that operates at electronic speeds. Programs must stay in memory while they are being executed. The memory contains a large number of semiconductor storage cells, each capable of storing one bit of information. To provide easy access to any word in the memory, a distinct address is associated with each word location. Addresses are numbers that identify successive locations. A given word is accessed by specifying its address and issuing a control command.

The number of bits in each word is referred as the word length of the computer. Typical word lengths range from 16 to 64 bits.

Programs must reside in the memory during execution. Instructions and data can be written into the memory or read out under the control of the processor.

- 1. Memory in which any location can be reached in a short and fixed amount of time after specifying its address is called *random access memory* (RAM).
- 2. The time required to access one word is called the *memory access time*.
- 3. The small, fast, Ram units are <u>called caches</u>. They are tightly coupled with the processor and are often contained on the same integrated circuit chip to achieve high performance.
- 4. The largest and slowest units are referred to as the *main memory*.

<u>Secondary storage</u>: Secondary storage is used when large amounts of data and many programs have to be stored, particularly for information that is accessed infrequently.

Examples for secondary storage devices are Magnetic Disks, Tape and Optical disks.

<u>Arithmetic-Logic Unit:</u>- The arithmetic-logic section performs arithmetic operations, such as addition, subtraction, multiplication, and division.

Arithmetic-Logic Unit usually called the ALU is a digital circuit that performs two types of operations— arithmetic and logical.

**Arithmetic operations** are the fundamental mathematical operations consisting of addition, subtraction, multiplication and division.

**Logical operations** consist of comparisons. That is, two pieces of data are compared to see whether one is equal to, less than, or greater than the other. The ALU is a fundamental building block of the central processing unit of a computer.

<u>Out put Unit:</u> An **output device** is any piece of computer hardware equipment used to communicate the results of data processing carried out by an information processing system (such as a computer) to the outside world.

In computing, input/output, or I/O, refers to the communication between an information processing system (such as a computer), and the outside world. Inputs are the signals or data sent to the system, and outputs are the signals or data sent by the system to the outside.

Examples of output devices:

- 1. Speaker
- 2. Headphones
- 3. Screen
- 4. Printer

<u>Control Unit</u>: All activities inside the machine are directed and controlled by the control unit. Control Unit is the part of the computer's central processing unit (CPU), which directs the operation of the processor. A control unit works by receiving input information to which it converts into control signals, which are then sent to the central processor

## The Basic Operational Concepts of a Computer:-

- 1. The program contains of a list of instructions is stored in the memory.
- 2. Individual instructions are brought from the memory into the processor, which execute the specified operations.
- 3. Data to be used as operands are also stored in the memory.

#### **Add R1,R2,R3**

In This instruction add is the operation perform on operands R1,R2 and place the result stored in R3.

The top level view of the computer is as follows:

## 1. Instruction register (IR):

- 1. The instruction register holds the instruction that is currently being executed.
- 2.Its output is available to the control circuits, which generate the timing signals that control the various processing elements involved in executing the instruction.

## 2. Program counter (PC):

- 1. The program counter is another specialized register.
- 2.It keeps track of the execution of a program.
- 3.It contains the memory address of the next instruction to be fetched and executed.
- 4. During the execution of an instruction, the contents of the PC are updated to correspond to the address of the next instruction to be executed



## 3. Memory address register (MAR) & Memory data register (MDR):-

- 1. These two registers facilitate communication with the memory.
- 2. The MAR holds the address of the location to be accessed.
- 3. The MDR contains the data to be written into or read out of the addressed location.

#### 4. Operating steps for Program execution (or) Instruction Cycle:

- 1. Execution of the program (stored in memory) starts when the PC is set to point to the first instruction of the program.
- 2. The contents of the PC are transferred to the MAR and a Read control signal is sent to the memory.
- 3. The addressed word is read out of the memory and loaded into the MDR. Next, the contents of the MDR are transferred to the IR. At this point, the instruction is ready to be decoded and executed.
- 4.If the instruction involves an operation to be performed by the ALU, it is necessary to obtain the required operands.

- 5.If an operand resides in memory (it could also be in a general purpose register in the processor), it has to be fetched by sending its address to the MAR and initiating a Read cycle.
- 6. When the operand has been read from the memory into the MDR, it is transferred from the MDR to ALU.
- 7. After one or more operands are fetched in this way, the ALU can perform the desired operation.
- 8. If the result of the operation is to be stored in the memory, then the result is ent to the MDR.
- 9. The address of the location where the result is to be stored is sent to the MAR, and a write cycle is initiated.
- 10. At some point during the execution of the current instruction, the contents of the PC are incremented so that the PC pints to the next instruction to be executed.
- 11. Thus, as soon as the execution of the current instruction is completed, a new instruction fetch may be started.
- 12. In addition to transferring data between the memory and the processor, the computer accepts data from input devices and sends data to output devices. Thus, some machine instructions with the ability to handle I/O transfers are provided.

## **Bus Structures:-**

1. BUS:A group of lines(wires) that serves as a connecting path for several devices of a computer is called a bus.

The following are different types of busses:

1. Address Bus

2. Data Bus

3. Control Bus

<u>The Data bus</u> Carries(transfer) data from one component (source) to other component (destination) connected to it. The data bus consists of 8, 16, 32 or more parallel signal lines. The data bus lines are bi-directional. This means that CPU can read data on these lines from memory or from a port, as well as send data out on these lines to a memory location.

<u>The Address bus</u> is the set of lines that carry(transfer) address information about where in memory the data is to be transferred to or from. It is an unidirectional bus. **The address bus consists of 16, 20, 24 or more parallel signal lines.** On these lines CPU sends out the address of the memory location.

<u>The Control Bus</u> carries the Control and timing information. Including these three the following are various types of busses. They are

System Bus: A System Bus is usually a combination of address bus, data bus, and control bus respectively.

**Internal Bus:** The bus that operates **only with the internal circuitary of the CPU**.

External Bus: Buses which connects computer to external devices is nothing but external bus.

Back Plane: A Back Plane bus includes a row pf connectors into which system modules can be plugged in.

I/O Bus: The bus used by I/O devices to communicate with the CPU is usually reffered as I/O bus.

**Synchronous Bus:** While using Synchronous bus, data transmission between source and destination units takes place in a **given timeslot** which is already known to these units.

**Asynchronous Bus:** In this case the data transmission is governed by a special concept. That is handshaking control signals.

#### The Bus interconnection Scheme:-



## Single bus structure :-

- 1. A group of lines(wires) that serves as a connecting path for several devices of a computer is called a bus.
- 2. In addition to the lines that carry the data, the bus must have lines for address and control purposes.
- 3. The simplest way to interconnect functional units is to use a single bus, as shown below.



- 4. All units are connected to this bus. Because the bus can be used for only one transfer at a time, only two units can actively use the bus at any given time.
- 5. Bus control lines are used to arbitrate multiple requests for use of the bus.

#### **ADVANTAGE:**

Its is low cost and its flexibility for attaching peripheral devices

## **DISADVANTAGE:**

**low performance** because at time only one transfer

<u>Traditional / Multiple bus Structure:</u> There is a local bus that connects the processor to cache memory and that may support one or more local devices. There is also a cache memory controller that connects this cache not only to this local bus but also to the system bus.

On the system, the bus is attached to the main memory modules. In this way, I/O transfers to and from the main memory across the system bus do not interfere with the processor's activity. An expansion bus interface buffers data transfers between the system bus and the I/O controllers on the expansion bus. Some typical I/O devices that might be attached to the expansion bus include: Network cards (LAN), SCSI (Small Computer System Interface), Modem, Serial Com etc..



Advantages: better performance

Disadvantage: increased cost.

## **Software:-**

A total computer system includes both software and Hardware.

- 1. Hardware consists of physical components and all associated equipment.
- 2. **Software refers to the collection programs that are written for the computer** and writing a program for a computer consists of specifying, directly or indirectly a sequence of machine instructions.
- 3. The computer software consists of the instructions and data that the computer manipulates to perform various data processing tasks.

## **Types**:

- 1. Application software,
- **2.** System software

System software: System software is used to run application software.

System software is a collection of programs that are executed as needed to perform functions such as

- 1. Receiving and interpreting user commands.
- 2. Entering and editing application programs and sorting them as files in secondary storage devices.(Editor)
- 3. Managing the storage and retrieval of files in secondary storage devices.
- 4. Running standard application programs such as word processors, spread sheets, or games, with data supplied by the user.
- 5. Controlling I/O units to receive input information and produce output results.
- 6. Translating programs from high level language to low level language.(Assemblers)
- 7. Linking and running user-written application program with existing standard library routines, such as numerical computation packages.(Linker)

**Application software: Application software** allows end users to accomplish one or more specific (not directly computer development related) tasks. Its usually written in high level languages, such as c ,c++, java. Typical applications include:

- 1. Word processing
- 2. spreadsheet
- 3. computer games
- 4. databases
- 5. industrial automation
- 6. business software
- 7. quantum chemistry and solid state physics software
- 8. telecommunications (i.e., the internet and everything that flows on it)
- 9. educational software
- 10. medical software
- 11. military software
- 12. molecular modeling software
- 13. image editing
- 14. simulation software
- 15. Decision making software

Compiler:- A compiler is a computer program (or set of programs) that transforms source code written in a computer language (the *source language*) into another computer language (the *target language*, often having a binary form known as *object code*). The most common reason for wanting to transform source code is to create an executable program. The name "compiler" is primarily used for programs that translate

source code from a high-level programming language to a lower level language (e.g., assembly language or machine code). A program that translates from a low level language to a higher level one is a *decompiler*. A program that translates between high-level languages is usually called a *language translator*, *source to source translator* 

Linker: - Linker is a program in a system which helps to link a object modules of program into a single object file. It performs the process of linking. Linker are also called link editors. Linking is process of collecting and maintaining piece of code and data into a single file. Linker also link a particular module into system library. It takes object modules from assembler as input and forms an executable file as output for loader. Linking is performed at both compile time, when the source code is translated into machine code and load time, when the program is loaded into memory by the loader. Linking is performed at the last step in compiling a program.

Assembler: - An assembler is a program that converts assembly language into machine code. It takes the basic commands and operations from assembly code and converts them into binary code that can be recognized by a specific type of processor. Assemblers are similar to compilers in that they produce executable code. However, assemblers are more simplistic since they only convert low-level code (assembly language) to machine code. Since each assembly language is designed for a specific processor, assembling a program is performed using a simple one-to-one mapping from assembly code to machine code.

**Loader:-** A loader is a major component of an operating system that ensures all necessary programs and libraries are loaded, which is essential during the startup phase of running a program. It places the libraries and programs into the main memory in order to prepare them for execution.

#### **Performance**

<u>Performance: -</u> The most important measure of the performance of a computer is how quickly it can compute programs. The speed with which a computer executes programs is affected by the design of its hardware and its machine language instructions. To represent the performance of a processor, we should consider only the periods during which the processor is active.

At the start of execution, all program instructions and the required data are stored in the memory as shown below. As execution proceeds, instructions are fetched one by one over the bus into the processor, and a copy is placed in the cache. When the execution of instruction calls for data located in the main memory, the data are fetched and a copy is placed in the cache. Later, if the same instruction or data item is needed a second time, it is read directly from the cache.



Figure 1.5. The processor cache.

Computer performance is often described in terms of clock speed (usually in MHz or GHz). This refers to the cycles per second of the main clock of the CPU. Performance of a computer depends on the following factors.

#### 1. Processor clock:-

- 1.Processor circuits are controlled by a timing signal called a clock. A clock is a microchip that regulates speed and timing of all computer functions.
- 2.Clock Cycle is the speed of a computer processor, or CPU, which is the amount of time between two pulses of an oscillator. Generally speaking, the higher number of pulses per second, the faster the computer processor will be able to process information
- 3.CPU clock speed, or clock rate, is measured in Hertz generally in gigahertz, or GHz. A CPU's clock speed rate is a measure of how many clock cycles a CPU can perform per second
- 4.To execute a machine instruction, the processor divides the action to be performed into a sequence of basic steps, such that each step can be completed in one clock cycle.
- 5. The length P of one clock cycle is an important parameter that affects processor performance.
- 6. Its inverse is the clock rate, R = 1/P, which is measured in cycles per second.
- 7.If the clock rate is 500(MHz) million cycles per second, then the corresponding clock period is 2 nanoseconds.
- **2. Basic performance equation:-** The **Performance Equation** is a term used in computer science. It refers to the calculation of the performance or speed of a central processing unit (CPU).

Basically the *Basic Performance Equation [BPE]* is an equation with 3 parameters which are required for the calculation of "Basic Performance" of a given system. It is given by

## T = (N\*S)/R

Where 'T' is the *processor time* [Program Execution Time] required to execute a given program written in some high level language. The compiler generates a machine language object program corresponding to the source program.

'N' is the total number of steps required to complete program execution. 'N' is the actual number of instruction executions, not necessarily equal to the total number of machine language instructions in the object program. Some instructions are executed more than others (loops) and some are not executed at all (conditions).

'S' is the average number of basic steps each instruction execution requires, where each basic step is completed in one clock cycle. We say average as each instruction contains a variable number of steps depending on the instruction.

'R' is the clock rate [In cycles per second]

#### 3. Pipelining and Super scalar operation:-

- 1.A substantial improvement in performance can be achieved by overlapping the execution of successive instructions, using a technique called pipelining.
- 2. Consider the instruction
- 3.Add R1, R2, R3
- 4. Which adds the contents of registers R1 and R2, and places the sum into R3
- 5. The contents of R1 and R2 are first transferred to the inputs of the ALU.
- 6. After the add operation is performed, the sum is transferred to R3.
- 7. Processor can read the next instruction from the memory while the addition operation is being performed.
- 8. Then, if that instruction also uses the ALU, its operands can be transferred to the ALU inputs at the same time that the result of add instruction is being transferred to R3.
- 9. Thus, pipelining increases the rate of executing instructions significantly.

## 4. Super scalar operation:-

- 1.A higher degree of concurrency can be achieved if multiple instruction pipelines are implemented in the processor.
- 2. This means that multiple function units are used, creating parallel paths through which different instructions can be executed in parallel.
- 3. With such an arrangement, it becomes possible to start the execution of several instructions in every clock cycle.
  - 4. This mode of execution is called super scalar operation.

#### 5. Clock rate:-

- 1. There are two possibilities for increasing the clock rate, R.
- 2. First, improving the Integrated Circuit technology makes logic circuit faster, which reduces the needed to complete a basic step. This allows the clock period, P, to be reduced and the clock rate, R, to be increased.
- 3. Second, reducing the amount of processing done in one basic step also makes it possible to reduce the clock period, P.

## 6. Instruction set: CISC and RISC:-

- 1. The terms CISC and RISC refer to design principles and techniques.
- 2.RISC: Reduced instruction set computers.
- 3. Simple instructions require a small number of basic steps to execute.
- 4. For a processor that has only simple instructions, a large number of instructions may by need to perform a given programming task. This could lead to a large value of N and a small value for S.
- 5.It is much easier to implement efficient pipelining in processors with simple instruction sets.
- 6.CISC: Complex instruction set computers.
- 7. Complex instructions involve a large number of steps.
- 8.If individual instructions perform more complex operations, fewer instructions will be needed, leading to a lower value of N and a larger value of S.
- 9. Complex instructions combined with pipelining would achieve good performance.

#### 7. Optimizing Compiler:-

- 1.A compiler translates a high-level language program into a sequence of machine instructions.
- 2.To reduce N, we need to have a suitable machine instruction set and a compiler that makes good use of it.
- 3.An optimizing compiler takes advantage of various features of the target processor to reduce the product N \* S.
- 4. The compiler may rearrange program instructions to achieve better performance.

#### 8. Performance measurement:-

- 1.SPEC rating.
- 2.A nonprofit organization called" System Performance Evaluation Corporation" (SPEC) selects and publishes representative application programs for different application domains.
- 3. The SPEC rating is computed as follows.
- 4.SPEC rating = Running time on the reference computer
  Running time on the computer under test.
- 5. Thus SPEC rating of 50 means that the computer under test is 50 times faster than the reference computer for these particular benchmarks.
- 6. The test is repeated for all the programs in the SPEC suite, and the geometric mean of the results is computed.
- 7.Let SPEC, be the rating for program 'i' in the suite.

The overall SPEC rating for the computer is given by

SPEC rating =  $\prod_{i=1}^{n}$  (SPEC<sub>i</sub>)

## UNIT-2

<u>Syllabus: Machine Instruction and Programs:</u> Instruction and Instruction sequencing: Register Transfer notation, Assembly Language Notation, Basic Instruction types, Addressing Modes, Basic Input / Output operations, the role of Stacks and queues in computer programming equation. Component of instructions: logic instructions shift and rotate instructions.

<u>Instructions and Instruction Sequencing:</u> computer programming consists of a sequence of small steps, such as adding two numbers, testing for particular condition, reading the character from keyboard and sending a character to be displayed on screen.

A computer must have instructions capable of performing four types of operations:

- 1. Data transfers between the memory and the processor registers
- 2. Arithmetic and logic operations on data
- 3. Program sequencing and control
- 4. I/O transfers

**Register Transfer Notation:** It is used to transfer information from one location to other location inside the computer. In RTN, source is always a value specified on right hand side of "← ". Destination is always a processor register, specified on left hand side.

## Syntax:

## **Register** ← Source

The right hand side of RTN is always denotes a value and the left hand side is the name of a location where the value is to be placed. Source can be processor register, I/O register, memory location, but destination register is always a processor register. RTN uses square brackets to indicate content of location. These braces are always placed only around the Source. For example,

1.  $R3 \leftarrow [R1] + [23]$ 

This operation that adds the contents of registers R1 and R2, and places their sum into register R3

2.  $R_2 \leftarrow [LOC]$ , means that the contents of memory location LOC are transferred into processor register R2.

Assembly Language Notation: Assembly Language Notation is a type of notation which is used to represent machine instructions and programs.

For example:

LOAD LOC, R2

a generic instruction that causes the transfer, from memory location <u>LOC</u> to <u>processor</u> <u>register R2</u>, is specified by the statement

The contents of LOC are unchanged by the execution of this instruction, but the old contents of register R2 are overwritten. The name Load is appropriate for this instruction, because the contents read from a memory location are *loaded* into a processor register.

The second example :

ADD R4, R2, R3

Adding two numbers contained in processor registers R2 and R3 and placing their sum in R4 can be specified by the assembly-language statement

In this case, registers R2 and R3 hold the source operands, while R4 is the destination.

An *instruction* specifies an operation to be performed and the operands involved. In the above examples, we used the English words Load and Add to denote the required operations. **In the assembly-language instructions of actual (commercial) processors, such operations are defined by using** *mnemonics***, which are typically abbreviations of the words describing the operations. For example, the operation Load may be written as LD, while the operation Store, which transfers a word from a processor register to the memory, may be written as STR or ST. Assembly languages for different processors often use different mnemonics for a given operation.** 

**Basic Instruction types:** An instruction is of various lengths depending upon the number of addresses it contains. Generally CPU organization is of three types on the basis of number of address fields:

- 1. Single Accumulator organization
- 2. General register organization
- 3. Stack organization
  - 1. In first organization operation is done involving a special register called accumulator.
  - 2.In second on multiple registers are used for the computation purpose.
  - 3.In third organization the work on stack basis operation due to which it does not contain any address field. On the basis of number of addresses instructions are classified as:

Three address Instructions: This instruction has three operands(address fields) to specify a register or a memory location.

1. Syntax

: operation source1, source2,

EX: Add A, B, C [C < -[A] + [B])

Where A, B are called source operands, C is called destination operand.

- 2. **Two address Instructions**: This instruction has two operands (address fields) to specify a register or a memory location.
- 3. Syntax: operation source, destination.

For example,

Add  $A, C \quad (C \leftarrow [A] + [C])$ 

4.One address Instructions: This instruction has one operand (address field) to specify a register or a memory location. This use a implied Accumulator(AC) Register for data manipulation. One operand is in AC and other is in register or memory location. Implied means that the CPU already know that one operand is in AC so there is no need to specify it. For example,

```
LOAD A (AC \leftarrow [A])

ADD B (AC \leftarrow [AC] + [B])

STORE C (C \leftarrow [AC])
```

5.Zero address Instructions: This instruction has zero address fields. A stack based computer do not use address field in instruction. It uses stack operations PUSH and POP to perform operations. To evaluate a expression first it is converted to revere Polish Notation i.e. Post fix Notation. For example,

```
Push A (TOS \leftarrow [A])

Push B (TOS \leftarrow [B])

Add (TOS \leftarrow [A] + [B])

Pop C (C \leftarrow [TOS])
```

Example: evaluate X = (A + B) \* (C + D)

## **Three Address:**

Add A, B, R1  $(R1 \leftarrow [A] + [B])$ Add C, D, R2  $(R2 \leftarrow [C] + [D])$ 

Mul R1, R2, X  $(X \leftarrow [R1] * [R2])$ 

## Two Address:

Move A, R1 (  $R1 \leftarrow [A]$  )

Add B, R1  $(R1 \leftarrow [R1] + [B])$ 

Move C, R2  $(R2 \leftarrow [C])$ 

Add D, R2  $(R2 \leftarrow [R2] + [D])$ 

 $Mul R1, R2 \qquad (R2 \leftarrow [R1] * [R2])$ 

Move R2,  $X \leftarrow [R2]$ 

#### **One Address:**

Load A  $(AC \leftarrow [A])$ 

Add B  $(AC \leftarrow [AC] + [B])$ 

Store T1  $(T1 \leftarrow [AC])$ 

Load C  $(AC \leftarrow [C])$ 

Add D  $(AC \leftarrow [AC] + [D])$ 

Mul T1  $(AC \leftarrow [AC] * [T1])$ 

Store X  $(X \leftarrow [AC])$ 

#### **Zero address:**

Push A  $(TOS \leftarrow [A])$ 

Push B  $(TOS \leftarrow [B])$ 

Add  $(TOS \leftarrow [A] + [B])$ 

Push C  $(TOS \leftarrow [C])$ 

Push D  $(TOS \leftarrow [D])$ 

Add  $(TOS \leftarrow [C] + [D])$ 

Mul  $(TOS \leftarrow ([A] + [B]) * ([C] + [D])$ 

Pop X  $(X \leftarrow [A] + [B]) * ([C] + [D])$ 

**Instruction Execution and Straight-Line Sequencing:** To begin **executing** a program, the address of its first **instruction** (I in our example) must be placed into the PC. Then, the processor control circuits use the information in the PC to fetch and **execute instructions**, one at a time, in the order of increasing addresses. This is called **straight-line sequencing**. For example, consider the following assembly instruction which add contents of two memory locations. i.e.  $C \leftarrow [A] + [B]$ . The following diagram shows a possible program segment for this task as it appears in the memory of a computer.



Figure 2.8 A program for  $C \leftarrow [A] + [B]$ .

The four instructions of the program are in successive word locations, starting at location i. Since, each instruction is 4 bytes long, the second, third, and fourth instructions are at addresses i + 4, i + 8, and i + 12. The processor contains a register called the *program counter* (PC), which holds the address of the next instruction to be executed. To begin executing a program, the address of its first instruction (i in our example) must be placed into the PC. Then, the processor control circuits use the information in the PC to fetch and execute instructions, one at a time, in the order of increasing addresses. This is called *straight-line sequencing*. During the execution of each instruction, the PC is incremented by 4 to point to the next instruction. Thus, after the Store instruction at location i + 12 is executed, the PC contains the value i + 16, which is the address of the first instruction of the next program segment.

Executing a given instruction is a two-phase procedure. In the first phase, called *instruction fetch*, the instruction is fetched from the memory location whose address is in the PC. This instruction is placed in the *instruction register* (IR) in the processor. At the start of the second phase, called *instruction execute*, the instruction in IR is examined to determine which operation is to be performed. The specified operation is then performed by the processor. This involves a small number of steps such as fetching operands from the memory or from processor registers, performing an arithmetic or logic operation, and storing the result in the destination location. At some point during this two-phase procedure, the contents of the PC are advanced to point to the next instruction. When the execute phase of an instruction is completed, the PC contains the address of the next instruction, and a new instruction fetch phase can begin.

**Branching:** Normally, the instructions executed in linear fashion through the program, and the address of the instructions is obtained from PC in the control unit. This sequence is interrupted when a branch instruction is executed, at such a time the address field of the Branch instruction is inserted into the PC and the process continues. Consider the task of adding a list of n numbers. The following diagram shows straight line sequencing program to add list of n numbers.

|       | Move        | NUM1,R0 |
|-------|-------------|---------|
| +4    | Add         | NUM2,R0 |
| +8    | Add         | NUM3,R0 |
|       |             | :       |
| +4n-4 | Add         | NUMm,R0 |
| +4n   | Move        | R0,SUM  |
|       |             | :       |
| UM T  |             |         |
| UMI   | -0-072/50-0 |         |
| TUM2  | 11272       |         |
|       |             | •       |
| UMn - |             |         |

Figure 2.9 A straight-line program for adding n numbers.

The addresses of the memory locations containing the n numbers are symbolically given as NUM1, NUM2, . . . , NUMn, and separate Add instructions is used to add each number to the contents of register R0. After all the numbers have been added, the result is placed in memory location SUM.

Instead of using a long list of Add instructions, it is possible to implement a program loop in which the instructions read the next number in the list and add it to the current sum. To add all numbers, the loop has to be executed as many times as there are numbers in the list. The following shows the structure of the desired program. The body of the loop is a straight-line sequence of instructions executed repeatedly. It starts at location LOOP and ends at the instruction Branch > 0. During each pass through this loop, the address of the next list entry is determined, and that entry is fetched and added to R0.

Assume that the number of entries in the list, n, is stored in memory location N, as shown. Register R1 is used as a counter to determine the number of times the loop is executed. Hence, the contents of location N are loaded into register R1 at the beginning of the program. Then, within the body of the loop, the instruction

Decrement R1

reduces the contents of R1 by 1 each time through the loop. Execution of the loop is repeated as long as the contents of R1 are greater than zero.

Next use branch instruction. This type of instruction loads a new address into the program counter. As a result, the processor fetches and executes the instruction at this new address, called the *branch target*, instead of the instruction at the location that follows the branch instruction in sequential address order. A *conditional branch* instruction causes a branch only if a specified condition is satisfied. If the condition is not satisfied, the PC is incremented in the normal way, and the next instruction in sequential address order is fetched and executed.



Figure 2.10 Using a loop to add n numbers.

The instruction

#### **Branch > 0 LOOP**

is a conditional branch instruction that causes a branch to location LOOP if the contents of register R1 are greater than zero. This means that the loop is repeated as long as there are entries in the list that are yet to be added to R0. At the end of the *n*th pass through the loop, the Subtract instruction produces a value of zero in R2, and, hence, branching does not occur. Instead, the Store instruction is fetched and executed. It moves the final result from R0 into memory location SUM.

<u>Condition Codes:</u> The processor keeps track of information about the results of various operations for use by subsequent conditional branch instructions. This is done by recording the required information in individual bits, often called condition code flags. These flags are grouped together in a special processor register called condition code register or status register. Individual condition code flags are set to 1 or cleared to 0, depending on outcome of operation performed. Four commonly used flags are

- 1. **N (negative):** Set to 1 if the result is negative; otherwise, cleared to 0.
- 2. **Z** (zero): Set to 1 if the result is zero; otherwise, cleared to 0.
- 3. V (overflow): Set to 1 if arithmetic overflow occurs; otherwise, cleared to 0.
- 4. C (carry): Set to 1 if a carry-out results from the operation; otherwise, cleared to 0.

<u>Addressing Modes:</u> The different ways for specifying the locations of instruction operands are known as *addressing modes*.

| Table 2.1 Generic addressing modes |                  |                      |  |  |  |
|------------------------------------|------------------|----------------------|--|--|--|
| Name                               | Assembler syntax | Addressing function  |  |  |  |
| Immediate                          | #Value           | Operand = Value      |  |  |  |
| Register                           | R <i>i</i>       | EA = Ri              |  |  |  |
| Absolute (Direct)                  | LOC              | EA = LOC             |  |  |  |
| Indirect                           | (Ri)             | EA = [Ri]            |  |  |  |
|                                    | (LOC)            | EA = [LOC]           |  |  |  |
| Index                              | X(Ri)            | EA = [Ri] + X        |  |  |  |
| Base with index                    | (Ri,Rj)          | EA = [Ri] + [Rj]     |  |  |  |
| Base with index                    | X(Ri,Rj)         | EA = [Ri] + [Rj] + X |  |  |  |
| and offset                         |                  |                      |  |  |  |
| Relative                           | X(PC)            | EA = [PC] + X        |  |  |  |
| Autoincrement                      | (Ri)+            | EA = [Ri];           |  |  |  |
|                                    |                  | Increment Ri         |  |  |  |
| Autodecrement                      | -(R <i>i</i> )   | Decrement Ri;        |  |  |  |
|                                    |                  | EA = [Ri]            |  |  |  |
| EA = effective address             | \/alue           | = a signed number    |  |  |  |

- 1. **Implementation of Variables and constants:** In assembly language, a variable is represented by allocating a register or memory location to hold its value. Thus, the value can be changed as needed using appropriate instructions. **Variables** can be represented by register and absolute addressing modes.
  - 1. **Immediate mode: Immediate mode:** The operand is given explicitly in the instruction. For example, the instruction

MOV #200, R0

Moves the value 200 to register R0. Constants are frequently used in high level languages. For example, the statement A = B + 6. This statement can be represented as

MOV B, R1 ADD #6, R1 MOV R1, A

- 2. **Register mode: The operand is the contents of a processor register**; the name of the register is given in the instruction.
  - 1. For example, instruction

## Add R1, R2, R3

Uses the Register mode for all three operands. Registers R1 and R2 hold the two source operands, while R3 is the destination.

2. **Absolute mode(direct): The operand is in a memory location**; the address of this location is given explicitly in the instruction. The Absolute mode is used in the instruction

#### Add A.B.C

Uses the Register mode for all three operands. Registers A and B hold the two source operands, while C is the destination.

1. Indirection and Pointers: Here, the instruction does not give the operand or its address explicitly. Instead, it provides information from which memory address of the operand can be determined. This information is called as Effective address.

**Indirect mode:** In this mode, the effective address of an operand is the contents of a register or memory location whose address appears in the instruction. Indirection can be represented by placing name of the register or the memory address given in the instruction in parenthesis. For example, to execute the **Add instruction shown below the processor uses the value B which is in register R1**, **as the effective address of operand**. It requests a read operation from memory to read the contents of location B. the value read is the required operand, which adds to the contents of register R0.



Figure 2.11 Indirect addressing.

Indirect addressing through memory location is also possible as shown above. In this case, the processor first reads the contents of memory location A, and then requests a second read operation using the value B as an address to obtain the operand. The register or memory location that contains the address of an operand is called a pointer.

Consider a program for adding a list of numbers using indirect addressing as shown below.

| Address | Contents  | III.     | 1500000          |
|---------|-----------|----------|------------------|
|         | Move      | N,R1     | 1                |
|         | Move      | #NUM1,R2 | > Initialization |
|         | Clear     | R0       | I                |
| → LOOP  | Add       | (R2),R0  |                  |
|         | Add       | #4,R2    |                  |
|         | Decrement | R1       |                  |
|         | Branch>0  | LOOP     |                  |
|         | Move      | RO,SUM   |                  |

Here, register R2 is used as a pointer to the numbers in the list, and the operands are accessed indirectly through R2. The initialization section of the program loads the counter value n from memory location N to R1 and uses the immediate addressing mode to place the address value NUM1, which is the address of first number in the list, into R2. Then it clears R0 to 0. The instruction ADD (R2), R0 fetches the operand at location NUM1 and adds it ro R0. The second ADD instruction adds 4 to the contents of the pointer R2, so that it will contain the address value NUM2, when the above instruction is executed in the second pass through the loop and son on.

Consider the C-language statement A = \*B; where B is a pointer variable. This statement may be compiled into

MOVE B, R1 MOVE (R1), A

Using indirect addressing through memory, the same action can be achieved with

MOVE (B), A

- 2. **Indexing and Arrays:** It is useful in dealing with lists and arrays.
- 1. **Index Mode**, the effective address of the operand is generated by adding a constant value to the contents of a register. The register used may be either a general purpose register or index register. Symbolically index mode can be represented as

$$X(R_i)$$

Where X denotes the constant value contained in the instruction and R<sub>i</sub> is the name of the register involved. The effective address of the operand is given by

$$EA = X + [R_i]$$

The following shows the way of using Index mode.



Figure 2.13 Indexed addressing.

In first case, the index register R1 contains the address of a memory location and the value x defines an offset from this address to the location where the operand is found.

In second case, the constant X corresponds to a memory address and the contents of the index register define the offset to the operand. In either case, the effective address is the sum of two values: one is given explicitly in the instruction and the other is stored in a register.

Consider a simple example involving a list of test scores for students taking a given course. Assume that the list of scores beginning at location LIST as shown below.



Figure 2.14 A list of students' marks.

A four word memory block comprises a record that stores the relevant information for each student. Each record consists of the students ID, followed by the scores the student earned on three tests. There are n students in the class, and the value n is stored in location N immediately in front of the list. Suppose our aim is to find the sum of all scores obtained on each of the tests and store these sums in memory locations SUM1, SUM2, and SUM3. A possible program for this task is given below.

On the first pass through the loop, test scores of the first student are added to the running sums held in registers R1, r2, and R3, which are initially cleared to zero. These scores are accessed using index addressing modes 4(R0), 8(R0), and 12(R0). The index registers then incremented by 16 to point to the ID location of the second student. Register R4, which is initialized to the value n is decremented by 1 at the end of each pass through the loop. When the contents of R4 reached to 0, all the student records have been accessed, and the loop terminates. Until then, the conditional branch instruction transfer control back to the start of the loop to process the next record. The last three instructions transfer the accumulated sums from R1, R2, and R3 into memory locations SUM1, SUM2, and SUM3.

2. **Base with Index Mode**: In this mode, effective address is generated by adding the contents of base register with the contents of index register. It is represented as shown below.

Add 
$$(R_i, R_j)$$

The effective address is the sum of the contents of registers  $R_i$ , and  $R_j$ . The second register is usually called the base register.

$$EA = [R_i] + [R_i]$$

3. **Base with Index and Offset:** In this mode, effective address is the sum of the constant X and the contents of registers R<sub>i</sub> and R<sub>j</sub>. It is represented as sown below.

$$X(R_i, R_j)$$

$$EA = X + [R_i] + [R_j]$$



- 3. **Relative Addressing:** In Relative Addressing, the Program counter is used instead of a general purpose register.'
  - 1. **Relative Mode:** In this mode, the effective address is determined by adding the contents of program counter to offset value. It is represented symbolically as

The effective address of the operand is given by

$$EA = X + [PC]$$

This mode can be used to access data operands. It's most common use is to specify the target address in branch instructions. An instruction such as

Causes program to go to branch location identified by the name LOOP if the branch condition is satisfied. This location can be computed by specifying it as n offset from the current value of the program counter. For example, suppose that the relative mode is used to generate the branch target address LOOP in the branch instruction of the program using indirect addressing. Assume that the four instructions of the loop body, starting at LOOP, are located at memory locations 1000, 1004, 1008, and 1012. Hence, the updated contents of the PC at the time the branch target address is generated will be 1016. To branch to location LOOP (1000), the offset value needed is X = -16.

- 4. **Additional Modes:** The additional modes like auto increment and auto decrement are useful for accessing data items in successive locations in memory.
  - 1. **Auto increment Mode:** The effective address of the operand is the contents of a register specified in the instruction. **After accessing the operand, the contents of this register are automatically incremented to point to the next item in a list.** It is represented as

$$(R_i) +$$

It normally increments 1, but in byte sized operands or byte addressable memory. Thus, the increment is 1 for 8 bit operands, 2 for 16 bit and 4 for 32 bit operands. The effective address of the operand is

$$EA = [R_i]; Increment R_i$$

2. Auto decrement Mode: The contents of a register specified in the instruction are first automatically decremented and are then used as effective address of the operand. It is represented as

$$-(R_i)$$

The effective address of the operand is

Decrenebt Ri;

$$EA = [R_i]$$

The following program describes how to use auto increment mode



**Basic Input / Output Operations**: Input / Output operations are essential which has a significant effect on performance of a computer. **An I/O device is connected to the interconnection network by using a circuit, called the** *device interface***,** which provides the means for data transfer and for the exchange of status and control information needed to facilitate the data transfers and govern the operation of the device. The interface includes some registers that can be accessed by the processor.

Consider a task that reads in **character input from a keyboard** and **produces character output on a display screen.** A simple way of performing such I/O tasks is to use a method known as Program controlled I/O. A solution to this problem is as follows: On output, the processor sends the first character and then waits for a signal from the display that the character has been received. It then sends the second character, and so on. Input is sent from the keyboard in a similar way; the processor waits for a signal indicating that a character key has been struck and that its code is available in some buffer register associated with the keyboard. Then the processor proceeds to read that code.

The keyboard and the display are separate devices as shown below.



Figure 2.19 Bus connection for processor, keyboard, and display.

Consider the problem of moving a character from the keyboard to the processor. Striking a key stores the corresponding character code in an 8-bit buffer register DATAIN which is associated with the keyboard. To inform the processor that a valid character is in DATAIN, a status control flag, SIN, is set to 1 (initially SIN=0).

A program monitors SIN, and when SIN is set to 1, the processor reads the contents of DATAIN.

When the character is transferred to the processor, SIN is again set to 0 and the process repeats.

is

To transfer the characters from processor to display, a buffer register, DATAOUT, and a status control flag, SOUT, are used.

When SOUT equals 1, the display is ready to receive a character. Under program control, the processor monitors SOUT, and when SOUT is set to 1, the processor transfers a character code to DATAOUT clears SOUT to 0; when the display device is ready to receive a second character, SOUT is again set to 1.

The processor can monitor the keyboard status flag SIN and transfer a character from DATAIN to register R1 by the **following sequence of operations**.

# READWAIT Branch to READWAIT if SIN = 0 MOVE DATAIN, R1

The contents of the keyboard character buffer DATAIN can be transferred to processor register instruction is

## MoveByte DATAIN,R1

An analogous sequence of operations is used for transferring output to display as shown below.

# WRITEWAIT Branch to WRITEWAIT if SOUT = 0 MOVE R1, DATAOUT

The contents of the processor register R1 transferred to display buffer DATAOUT instruction

# MoveByte R1,OUTDATA

The following program explains how to read sequence of characters and display it.

|      | Move     | #LOC,R0      | Initialize pointer register R0 to point to the<br>address of the first location in memory<br>where the characters are to be stored. |
|------|----------|--------------|-------------------------------------------------------------------------------------------------------------------------------------|
| READ | TestBit  | #3,INSTATUS  | Wait for a character to be entered                                                                                                  |
|      | Branch=0 | READ         | in the keyboard buffer DATAIN.                                                                                                      |
|      | MoveByte | DATAIN,(R0)  | Transfer the character from DATAIN into<br>the memory (this clears SIN to 0).                                                       |
| ECHO | TestBit  | #3,OUTSTATUS | Wait for the display to become ready.                                                                                               |
|      | Branch=0 | ECHO         |                                                                                                                                     |
|      | MoveByte | (R0),DATAOUT | Move the character just read to the display<br>buffer register (this clears SOUT to 0).                                             |
|      | Compare  | #CR,(R0)+    | Check if the character just read is CR<br>(carriage return). If it is not CR, then                                                  |
|      | Branch≠0 | READ         | branch back and read another character.  Also, increment the pointer to store the next character.                                   |

Here assume that  $3^{\rm rd}$  bit in registers INSTATUS and OUTSTATUS corresponds to SIN and SOUT respectively.

The role of STACKS and QUEUES in computer programming equation: In order to organize the control and information linkage between the main program and the subroutine, a data structure called a stack is used. A stack is the list of data elements, usually words or bytes, with the accessing restriction that elements can be added or removed at one end of the list only. This end is called as top of the stack, and another end is called as bottom. The structure is sometimes referred as pushdown stack. Stack follows Last-In-First-Out (LIFO) strategy, where elements inserted last will be the element removed first. Two basic operations that can be performed on stack are PUSH and POP, which add and remove elements from top of the stack respectively.

Data stored in the memory of a computer can be organized as a stack, with successive elements occupying successive memory locations. Assume that first element is placed in location BOTTOM, and when new elements are pushed onto the stack, they are placed in successive lower address locations. The following diagram shows a stack of word data items in the memory of a computer.



Figure 2.21 A stack of words in the memory.

It contains numerical values, with 43 at bottom and -28 at the top. A processor register is used to keep track of the address of the element of the stack that is at the top at any given time. This register is called the Stack Pointer (SP). It could be one of the general purpose registers or a register dedicated to this function. Assume a byte addressable memory with 32-bit word length, the PUSH operation can be implemented as

Subtract #4,SP Move NEWITEM,(SP)

Where the Subtract instruction subtracts the source operand 4 from the destination operand contained in SP and places the result in SP. These two instructions move the word from location NEWITEM onto the top of the stack, decrementing the stack pointer by 4 before the move.

Move (SP),ITEM Add #4,SP

if the processor has the auto increment and auto decrement address modes the PUSH operation can be performed by the instruction:

Move NEWITEM, -(SP)

The POP operation can be implemented as:

Move (SP)+,ITEM

SAFEPOP

Compare

These two instructions move the top value from the stack into location ITEM and then increment the stack pointer by 4, so that it points to the new top element as shown below.



(a) After push from NEWITEM

#2000,SP

(b) After pop into ITEM

Check to see if the stack pointer contains

Suppose that a stack runs from location 2000 (BOTTOM) down no further than 1500. The stack pointer is loaded initially with the address value 2004. SP is decremented by 4 before new data are stored on the stack. Hence, an initial value of 2004 means that the first item pushed on to the stack will be at location 2000. To prevent either pushing an item on a full stack or popping an item off an empty stack, the single instruction PUSH and POP operations can be replaced by the following instruction sequences.

|          | Branch>0            | EMPTYERROR                                                        | an address value greater than 2000. If it<br>does, the stack is empty. Branch to the<br>routine EMPTYERROR for appropriate<br>action.                                                      |
|----------|---------------------|-------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|          |                     | Otherwise, pop the top of the stack into<br>memory location ITEM. |                                                                                                                                                                                            |
|          |                     | a safe pop                                                        | operation                                                                                                                                                                                  |
| SAFEPUSH | Compare<br>Branch≤0 | #1500,SP<br>FULLERROR                                             | Check to see if the stack pointer<br>contains an address value equal<br>to or less than 1500. If it does, the<br>stack is full. Branch to the routine<br>FULLERROR for appropriate action. |
|          | Move                | NEWITEM,-(SP                                                      | <ol> <li>Otherwise, push the element in memory<br/>location NEWITEM onto the stack.</li> </ol>                                                                                             |

## a safe push operation

Another useful data structure that is similar to the stack is called a Queue. Data are stored in and retrieved from a queue on a First-In-First-Out (FIFO) basis. Thus, if queue grows in the direction of increasing addresses in memory, new data are added at the back (high address end) and retrieved from the front (low address end) of the queue.

There are two important differences between stack and queue. One end of the stack is fixed (the bottom), while the other end raises and falls as data are pushed and popped. A single pointer is needed to point to the top of the stack at any given time. On the other hand, both ends of a queue move to higher addresses as data are added at the back and removed from the front. So, two pointers are needed to keep track of two ends of the queue.

Another difference between stack and queue is that, a queue would continuously move through memory of a computer in the direction of higher addresses. This can be avoided by using Circular Queue.

# **Additional Instructions (Or) Component of Instructions:**

# Additional Instructions (Or) Component of Instructions are two types

- 1.Logic Instructions
- 2. Shift and Rotate Instructions

Logic Instructions: Logic operations such as AND, OR, and NOT applied to individual bits, are the basic building blocks of digital circuits. It is also useful to be able to perform logic operations in software. For example,

## 1's complement:

NOT dst

NOT dst complements all bits contained in the destination operand, changing 0's to 1 and 1's to 0.

## 2's complement:

The following two instructions calculates 2's complement of a number

NOT R0

ADD #1, R0

Many computers have a single instruction for 2's complement..i.e. NEGATE R0 Logic Instructions AND, OR, and NOT represented as bits in a table as follows:

AND table:

| Operand 1 | Operand 2 | AND |
|-----------|-----------|-----|
| 0         | 0         | 0   |
| 0         | 1         | 0   |
| 1         | 0         | 0   |
| 1         | 1         | 1   |

OR table

| Operand 1 | Operand 2 | OR |
|-----------|-----------|----|
| 0         | 0         | 0  |
| 0         | 1         | 1  |
| 1         | 0         | 1  |
| 1         | 1         | 1  |

NOT table

| Operand | NOT |
|---------|-----|
| 0       | 1   |
| 1       | 0   |

**Shift and Rotate Instructions:** There are many applications that require the bits of an operand to be shifted right or left some specified number of bit positions. There are two types of shift instructions.

- 1. **Logical Shift Instructions**((LShiftL, LShiftR)
- 2. Arithmetic Shift Instructions
- 1. **Logical Shift Instructions**: These instructions shift an operand over a number of bit positions specified in a count operand in the instruction. There are two types of Logical Shift Instructions: Logical left shift (LShiftL) and Logical right shift ((LShiftR)

Logical left shift (LShiftL): The general form of Logical Shift Left Instruction is

#### LShiftL count, DST

The count operand may be an immediate operand or it may be contained in a processor register. For example,

LShiftL #2, R0, this is represented as



It shifts the contents of register R0 left by two positions. Vacated positions are filled with 0's.

1.**Logical right shift (LShiftR)**: The general form of Logical Shift Right Instruction is **LShiftR count, DST** 

The count operand may be an immediate operand or it may be contained in a processor register. For example,

LShiftR #2, R0, this is represented as



It shifts the contents of register R0 right by two positions. Vacated positions are filled with 0's.

- 2. Arithmetic Shift Instructions: There are two types of Arithmetic Shift Instructions:
  - 1. **Arithmetic Shift Left (AShiftL):** A Left Arithmetic shifts a binary number by specified number of positions towards left. The vacated positions are filled with 0's. It can be represented as

#### AShiftL count, DST

For example, AShiftL #2, R0 shifts contents of R0 by 2 positions and vacated positions are filled with zeros as shown below.



2. **Arithmetic Shift Right (AShiftR):** A Right Arithmetic shifts a binary number by specified number of positions towards right. The vacated positions are filled with copies of the original MSB bit. It can be represented as

#### AShiftR count, DST

For example, AShiftR #2, R0 shifts contents of R0 by 2 positions and vacated positions are filled with copies of the original MSB bit as shown below.



**Rotate Instructions:** In shift operations, the bits shifted out of the operand are lost, except for the last bit shifted out which is in the carry flag C. To preserve all bits, a set of rotate instructions can be used. They move the bits that are shifted out of one end of the operand back into the other end. Different rotate instructions are

- 1. Rotate Left without Carry
- 2. Rotate Left with Carry
- 3. Rotate Right without Carry
- 4. Rotate Right with Carry

They are represented as follows.







## **Digit-Packing Example:**

Consider a task which uses both shift and logic operations. Suppose two decimal digits represented in ASCII code are located in memory at byte locations LOC and LOC + 1. Our goal is to represent each of these digits in the 4-bit BCD code and store both of them in a single byte location PACKED. The result is said to be in packed-BCD format.

To do this, extract the lower order 4 bits in LOC and LOC + 1 and concatenate them into the single byte at PACKED as shown below.

| Move     | #LOC,R0   | R0 points to data.             |  |
|----------|-----------|--------------------------------|--|
| MoveByte | (R0)+,R1  | Load first byte into R1.       |  |
| LShiftL  | #4,R1     | Shift left by 4 bit positions. |  |
| MoveByte | (R0),R2   | Load second byte into R2.      |  |
| And      | #\$F,R2   | Eliminate high-order bits.     |  |
| Or       | R1,R2     | Concatenate the BCD digits.    |  |
| MoveByte | R2,PACKED | Store the result.              |  |

Here, R0 is a pointer to the ASCII characters in memory, and use the registers R1 and R2 to develop the BCD codes. MoveByte instruction transfers a byte between memory and a 32-bit processor register. The And instruction is used to mask out all except the four rightmost bits in R2. Immediate operand \$F, interpreted as a 32-bit pattern, has 28 zero's in most significant bit positions.

Vector Dot Product Program: Let A and B be two vectors of length n. Their dot product can be defined as

Dot Product 
$$=\sum_{i=0}^{n-1} A(i) \times B(i)$$

The following program computes dot product and storing it in memory location DOTPROD. The first elements of each vector, A(0), and B(0), are stored at memory locations AVEC and BVEC.

| M.JOSHNA |      |           |          |                                 |
|----------|------|-----------|----------|---------------------------------|
|          |      | Move      | #AVEC,R1 | R1 points to vector A.          |
|          |      | Move      | #BVEC,R2 | R2 points to vector B.          |
|          |      | Move      | N,R3     | R3 serves as a counter.         |
|          |      | Clear     | RO       | R0 accumulates the dot product. |
| L        | LOOP | Move      | (R1)+,R4 | Compute the product of          |
|          |      | Multiply  | (R2)+,R4 | next components.                |
|          |      | Add       | R4,R0    | Add to previous sum.            |
|          |      | Decrement | R3       | Decrement the counter.          |
|          |      | Branch>0  | LOOP     | Loop again if not done.         |
|          |      |           |          |                                 |

Move

Figure 2.33 A program for computing the dot product of two vectors.

Byte Sorting Program: Consider a program for sorting a list of bytes stored in memory into ascending alphabetic order. Assume that the list consists of n bytes. Let the list stored in memory locations LIST through LIST + n - 1, and let n be a 32-bit value stored at address N. To do this, use straight selection sort algorithm. First, the largest number is found and placed at the end of the list in location LIST + n - 1. Then the largest number in the remaining sub list of n - 1 numbers is placed at the end of the sub list in location LIST + n - 2. The procedure is repeated until the list is sorted.

R0,DOTPROD Store dot product in memory.

|       | Move        | #LIST.R0   | Load LIST into base register R0.                                 |  |
|-------|-------------|------------|------------------------------------------------------------------|--|
|       | Move        | N,R1       | Initialize outer loop index                                      |  |
|       | Subtract    | #1,R1      | register R1 to $j = n - 1$ .                                     |  |
| OUTER | Move        | R1,R2      | Initialize inner loop index                                      |  |
|       | Subtract    | #1,R1      | register R2 to $k = j - 1$ .                                     |  |
|       | MoveByte    | (R0,R1),R3 | Load LIST(j) into R3, which holds<br>current maximum in sublist. |  |
| INNER | CompareByte | R3,(R0,R2) | If LIST( $k$ ) $\leq$ [R3],                                      |  |
|       | Branch≤0    | NEXT       | do not exchange.                                                 |  |
|       | MoveByte    | (R0,R2),R4 | Otherwise, exchange $LIST(k)$                                    |  |
|       | MoveByte    | R3,(R0,R2) | with $LIST(j)$ and load                                          |  |
|       | MoveByte    | R4,(R0,R1) | new maximum into R3.                                             |  |
|       | MoveByte    | R4,R3      | Register R4 serves as TEMP.                                      |  |
| NEXT  | Decrement   | R2         | Decrement index registers R2 and                                 |  |
|       | Branch≥0    | INNER      | R1, which also serve as                                          |  |
|       | Decrement   | R1         | as loop counters, and branch                                     |  |
|       | Branch>0    | OUTER      | back if loops not finished.                                      |  |

# (b) Assembly language program for sorting

**Linked Lists:** Suppose to maintain list of student records in consecutive memory locations in increasing order of student ID numbers, use a data structure called Linked List as shown below. To insert a record between i and i+1, the link address in record i is coped into the link field in the new record and then the address of the new record is written into the link field of record i. To delete record i, the address in its link field is copied into the link field of record i-1.





(b) Inserting a new record between Record 1 and Record 2

Figure 2.35 Linked-list data structure.

A subroutine for performing insertion and deletion are shown below.



Figure 2.37 A subroutine for inserting a new record into a linked list.



Figure 2.38 A subroutine for deleting a record from a linked list.

## **FREQUENTLY ASKED QUESTIONS**

- **1.** What is a register transfer language?
- 2. Differentiate the instruction execution for adding 'n' numbers using Straight line sequencing and branching.
- **3.** Write short notes on shift and rotate instructions.
- **4.** Write about various means by which data are transferred between memory of a computer and outside world.
- 5. What is register transfer notation? Write and explain these notations to three-address, two-address, single address and zero-address instruction types.
- 6. In how many ways the location of an operand is specified in an instruction? Explain each mode with suitable examples.
  - 7. What are different addressing modes? Explain.
  - 8. Give example for left and right shift operations.
  - 9.List basic input and output operations.
  - 10. Write short notes on additional addressing modes
  - 11. With an example write about relative addressing.
  - 12. Illustrate the concept of assembly directives with an assembly language program
  - 13. Illustrate with examples rotate instruction?
  - 14. Differentiate between shift and rotate instructions.

# UNIT - 3

Types of Instructions: Arithmetic and Logic Instructions, Branch Instructions, Addressing modes, Input / Output Instructions.

**Arithmetic and Logic Instructions:** The ARM instruction set has a number of instructions for **arithmetic operations** on operands that are either contained in the general-purpose registers or given as immediate operands in the instruction itself. The basic assembly-language format for arithmetic and logic instructions is shown below.

# opcode Rd, Rn, Rm

where the operation specified by the opcode is performed on the source operands in general-purpose registers Rn and Rm. The result is placed in destination register Rd.

Arithmetic Instructions: Different types of Arithmetic instructions are ADD, SUB, MUL and MLA.

For example, the instruction

**ADD R0, R2, R4** 

performs the operation

 $R0\leftarrow [R2] + [R4]$ 

The instruction

**SUB R0, R6, R5** 

performs the operation

 $R0 \leftarrow [R6] - [R5]$ 

The second source operand can be specified in the immediate mode. Thus,

ADD R0, R3, #17

performs the operation

$$R0 \leftarrow [R3] + 17$$

The immediate operand is an 8-bit value contained in bits b7-0 of the encoded machine instruction. It is an unsigned number in the range 0 to 255. The assembly language allows negative values to be used as immediate operands. If the instruction

ADD R0, R3, #-17

is used in a program, the assembler replaces it with the instruction

SUB R0, R3, #17

When the second source operand is specified as the contents of a register, they can be shifted or rotated before being used in the operation. Logical shift left (LSL), logical shift right (LSR), arithmetic shift right (ASR), and rotate right (ROR). For example, the instruction

**ADD R0, R1, R5, LSL #4** 

is executed as follows. The second source operand, which is contained in register R5, is shifted left 4 bit positions (equivalent to [R5] × 16), then added to the contents of register R1. The sum is placed in register R0. The carry bit, C, is not involved in these operations. The shift or rotation amount can also be specified as the contents of a fourth register.

Two basic versions of a multiply instruction are provided. The first version multiplies the contents of two registers and places the low-order 32-bits of the product in a third register. The high-order bits of the product are discarded. If the operands are 2's-complement numbers, and if their product can be represented in 32 bits, then the retained low-order 32 bits of the product represent the correct result. For example, the instruction

MUL R0, R1, R2

Performs the operation

$$R0 \leftarrow [R1] \times [R2]$$

The second version of the basic Multiply instruction specifies a fourth register whose contents are added to the product before the result is stored in the destination register. Hence, the instruction

MLA R0, R1, R2, R3

Performs the operation

$$R0 \leftarrow ([R1] \times [R2]) + [R3]$$

This is called a Multiply-Accumulate operation. It is often used in signal-processing applications.

<u>Logic Instructions</u>: Different types of Logic Instructions are AND, OR, XOR, BIC and MVC. For example, The AND instruction

AND Rd, Rn, Rm

performs a bitwise logical AND of the operands in registers Rn and Rm and places the result in register Rd. For example, if register R0 contains the hexadecimal pattern 02FA62CA and R1 contains the pattern 0000FFFF, then the instruction

AND R0, R0, R1  $(R0 \leftarrow [R0] \land [R1])$ 

will result in the pattern 000062CA being placed in register R0.

The Bit Clear instruction, BIC, is closely related to the AND instruction. It complements each bit in operand Rm before ANDing them with the bits in register Rn. Using the same R0 and R1 bit patterns as in the above example, the instruction

BIC R0, R0, R1  $(R0 \leftarrow [R0] \land [NOT R1])$ 

results in the pattern 02FA0000 being placed in R0.

The OR instruction

OR R0,R0,R1  $(R0 \leftarrow [R0] \lor [R1])$ 

Performs OR operation between contents of R0, R1 registers.

The XOR instruction

XOR R0,R0,R1  $(R0 \leftarrow [R0] \bigcirc [R1])$ 

Performs XOR operation between contents of R0, R1 registers.

**The Move Negation instruction**, with the opcode Mnemonic MVN, complements the bits of the source operand and places the result in Rd. for example,

**MVN R0, R3.** 

If the contents of R3 are the hexadecimal pattern 0F0F0F0F, then it places the result F0F0F0F0 in the register R0.

The following ARM program merges two BCD digits into a byte.

| R0, POINTER     | Load address LOC into R0.                             |
|-----------------|-------------------------------------------------------|
| R1,[R0]         | Load ASCII characters                                 |
| R2,[R0,#1]      | into R1 and R2.                                       |
| R2,R2,#&F       | Clear high-order 28 bits of R2.                       |
| R2,R2,R1,LSL #4 | Or [R1] shifted left into [R2].                       |
| R2,PACKED       | Store packed BCD digits into PACKED.                  |
|                 | R1,[R0]<br>R2,[R0,#1]<br>R2,R2,#&F<br>R2,R2,R1,LSL #4 |

Figure 3.5 An ARM program for packing two 4-bit decimal digits into a byte.

The first instruction in the program loads the address LOC into register R0. The two ASCII characters containing the BCD digits in their low-order four bits are loaded into the low-order byte positions of registers R1 and R2 by the next two Load instructions. The AND instruction clears the high-order 2 bits of R2 to zero, leaving the second BCD digit in the four low-order bit positions. The '&' character in this instruction signifies hexadecimal notation for the immediate value. The ORR instruction then shifts the first BCD digit in R1 to the left four positions and places it to the left of the second BCD digit in R2. The two digits packed into the low-order byte of R2 are then stored into location PACKED.

**Branch Instructions:** Conditional branch instructions contain a signed, 2's complement, 24-bit offset that is added to the updated contents of the program counter to generate the branch target address. The format for the branch instruction is shown below.



Figure 3.2 ARM instruction format.

The BEQ instruction (Branch if Equal to 0) causes a branch if the Z flag is set to 1. The higher order 4 bits  $b_{31-28}$ , of the instruction word determines whether or not branching should takes place. At the time the branch target address is computed, the contents of the PC will have been updated to contain the address of the instruction that is two words beyond the Branch instruction itself. This is due to pipelined instruction execution. If the Branch instruction is at address location 1000 and the branch target address is 1100, as shown in Figure D.6, then the offset is 92, because the contents of the updated PC will be 1000 + 8 = 1008 when the branch target address 1100 is computed.

**Setting condition codes:** The Compare and Test instructions always update the condition code flags. Some instructions, such as compare, given by CMP Rn, Rm performs the operation [Rn] - [Rm]. The arithmetic, logic, and Move instructions affect the condition code flags only if explicitly specified to do so by a bit in the OP-code field. This is indicated by appending the suffix S to the assembly language OP-code mnemonic. For example, the instruction ADDS R0, R1, R2 sets the condition code flags, but ADD R0, R1, R2 does not.

The following is an ARM program for adding n numbers. Here, Location N contains the number of entries in the list, and location SUM is used to store the sum. The Load and Store operations performed by

the first and last instructions use the Relative addressing mode. This assumes that the memory locations N and SUM are within the range reachable by offsets relative to the PC. The address NUM1 of the first of the numbers to be added into register R2 by the second instruction. The Post-indexed addressing mode, which includes writeback, is used in the first instruction of the loop.

|      | LDR  | R1,N       | Load count into R1.        |
|------|------|------------|----------------------------|
|      | LDR  | R2,POINTER | Load address NUM1 into R2. |
|      | MOV  | R0,#0      | Clear accumulator R0.      |
| LOOP | LDR  | R3,[R2],#4 | Load next number into R3.  |
|      | ADD  | R0,R0,R3   | Add number into R0.        |
|      | SUBS | R1,R1,#1   | Decrement loop counter R1. |
|      | BGT  | LOOP       | Branch back if not done.   |
|      | STR  | RO,SUM     | Store sum.                 |
|      |      | (A)        |                            |

Figure 3.7 An ARM program for adding numbers.

# **Input / Output Instructions:**

**Input / Output Instructions:** The ARM architecture uses memory mapped I/O. Input / Output operations are essential which has a significant effect on performance of a computer. An I/O device is connected to the interconnection network by using a circuit, called the *device interface*, which provides the means for data transfer and for the exchange of status and control information needed to facilitate the data transfers and govern the operation of the device. The interface includes some registers that can be accessed by the processor. One register may serve as a buffer for data transfers, another may hold information about the current status of the device, and yet another may store the information that controls the operational behavior of the device. These *data*, *status*, and *control* registers are accessed by program instructions as if they were memory locations.

Consider a task that reads in character input from a keyboard and produces character output on a display screen. A simple way of performing such I/O tasks is to use a method known as Program controlled I/O. A solution to this problem is as follows: On output, the processor sends the first character and then waits for a signal from the display that the character has been received. It then sends the second character, and so on. Input is sent from the keyboard in a similar way; the processor waits for a signal indicating that a character key has been struck and that its code is available in some buffer register associated with the keyboard. Then the processor proceeds to read that code.

The keyboard and the display are separate devices as shown below.



Figure 2.19 Bus connection for processor, keyboard, and display.

Consider the problem of moving a character from the keyboard to the processor. Striking a key stores the corresponding character code in an 8-bit buffer register DATAIN which is associated with the keyboard. To inform the processor that a valid character is in DATAIN, a status control flag, SIN, is set to 1 (initially SIN=0). A program monitors SIN, and when SIN is set to 1, the processor reads the contents of DATAIN. When the character is transferred to the processor, SIN is again set to 1 and the process repeats.

To transfer the characters from processor to display, a buffer register, DATAOUT, and a status control flag, SOUT, are used. When SOUT equals 1, the display is ready to receive a character. Under program control, the processor monitors SOUT, and when SOUT is set to 1, the processor transfers a character code to DATAOUT clears SOUT to 0; when the display device is ready to receive a second character, SOUT is again set to 1.

Suppose that bit 3 in each of the device status registers INSTTUS and OUTSTATUS contains the respective control flags SIN and SOUT. Also assume that the keyboard DATAIN and display DATAOUT registers are located at addresses INSTATUS + 4 and OUTSTATUS + 4. The READ and WRITE wait loops can be implemented as follows.

READWAIT LDR R3,[R1]
TST R3,#8
BEQ READWAIT
LDRB R3,[R1,#4]
WRITEWAIT LDR R4,[R2]
TST R4,#8
BEQ WRITEWAIT
STRB R3,[R2,#4]

Assume that the address INSTATUS has been loaded into register R1. The instruction sequence reads a character into register R3 when a key has been pressed on the keyboard. The test (TST) instruction performs the bitwise logical AND operation on its two operands and sets the condition code flags based on the result. The immediate operand 8 (0000 1000) has a single 1 in the bit 3 position. Therefore, the result of the TST operation will be zero if bit 3 of INSTATUS is zero and will be non zero if bit 3 is one, signifying that a character is available in DATAIN. The BEQ instruction branches back to READWAIT if the result is zero.

Assuming that the address OUTSTATUS has been loaded into register R2, the instruction sequence sends the character in register R3 to the DATAOUT register when display is ready to receive it.

The following ARM program reads line of characters.



Figure 3.9 An ARM program that reads a line of characters and displays it.

# **ARM Addressing Modes:**

| Name                        | Assembler syntax             | Addressing function                   |
|-----------------------------|------------------------------|---------------------------------------|
| With immediate offset:      |                              |                                       |
| Pre-indexed                 | [Rn, #offset]                | EA = [Rn] + offset                    |
| Pre-indexed                 |                              |                                       |
| with writeback              | [Rn, #offset]!               | EA = [Rn] + offset;                   |
|                             |                              | $Rn \leftarrow [Rn] + offset$         |
| Post-indexed                | [Rn], #offset                | EA = [Rn];                            |
|                             |                              | $Rn \leftarrow [Rn] + offset$         |
| With offset magnitude in    | Rm:                          |                                       |
| Pre-indexed                 | $[Rn, \pm Rm, shift]$        | $EA = [Rn] \pm [Rm]$ shifted          |
| Pre-indexed                 |                              |                                       |
| with writeback              | $[Rn, \pm Rm, shift]!$       | $EA = [Rn] \pm [Rm]$ shifted:         |
|                             |                              | $Rn \leftarrow [Rn] \pm [Rm]$ shifted |
| Post-indexed                | $[Rn]$ , $\pm Rm$ , shift    | EA = [Rn];                            |
|                             |                              | $Rn \leftarrow [Rn] \pm [Rm]$ shifted |
| Relative                    | Location                     | EA = Location                         |
| (Pre-indexed with           |                              | = [PC] + offset                       |
| immediate offset)           |                              |                                       |
| EA = effective address      |                              |                                       |
| offset = a signed number co | ntained in the instruction   |                                       |
| shift = direction #integer  |                              |                                       |
| where direction is LS       | SL for left shift or LSR for | right shift; and                      |
| integer is a 5-bit uns      | igned number specifying th   | ne shift amount                       |

The basic method for addressing memory operands is an indexed addressing mode, defined as

1. Pre-indexed mode: The effective address of the operand is the sum of the contents of a base register, Rn, and a signed offset. For example, the Load instruction

# LDR Rd, [Rn, #offset]

specifies the offset (expressed as a signed number) in the immediate mode and performs the operation

## $Rd \leftarrow [Rn] + offset$

The following shows an example of the Pre-indexed mode with the offset contained in register R6 and the base value contained in R5. The Store instruction (STR) stores the contents of R3 into the word at memory location 1200.





## (b) Pre-indexed addressing mode

2. *Pre-indexed with write back mode:* The effective address of the operand is generated in the same way as in the Pre-indexed mode, then the effective address is written back into Rn. The exclamation mark signifies write-back in pre-indexed addressing mode.



## (b) Pre-indexed addressing with writeback

Above shows an example of pushing the contents of register R0, which are 27, onto a programmer-defined stack. Register R5 is used as the stack pointer. Initially, it contains the address 2012 of the current TOS (top-of-stack) element. The Pre-indexed addressing mode with write back can be used to perform the Push operation with the instruction

The immediate offset –4 is added to the contents of R5 and the new value is written back into R5. Then, this address value of the new top of the stack, 2008, is used as the effective address for the Store operation. The contents of register R0 are then stored at this location.

3. **Post-indexed mode**—The effective address of the operand is the contents of Rn. The offset is then added to this address and the result is written back into Rn. The post-indexed mode always involves write back, so the exclamation is not needed.

pre- and post-indexing are distinguished by the way the square brackets are used. When only the base register is enclosed in square brackets, its contents are used as the effective address. The offset is added to the register contents after the operand is accessed. In other words, post-indexing is specified. When both the base register and the offset are placed inside the square brackets, their sum

is used as the effective address of the operand, that is, pre-indexing is used. If writeback is to be performed, it must be indicated by the exclamation character.



(a) Post-indexed addressing

The first time that the Load instruction is executed, the effective address is [R2] = 1000. Therefore, the number 6 at this address is loaded into R1. Then, the write back operation changes the contents of R2 from 1000 to 1100 so that it points to the second number, -17. It does this by shifting the contents, 25, of the offset register R10 left by two bit positions and then adding the shifted value to the contents of R2. The contents of R10 are not changed in this process. The left shift is equivalent to multiplying 25 by 4, generating the required offset of 100. When the Load instruction is executed on the second pass through the loop, the second number, -17, is loaded into R1. The third number, 321, is loaded into R1 on the third pass, and so on.

In all three indexed addressing modes, the offset may be given as an immediate value in the Range  $\pm 4095$ . Alternatively, the magnitude of the offset may be specified as the contents of the Rm register, with the sign (direction) of the offset specified by a  $\pm$  prefix on the register name. For example, the instruction

LDR R0, [R1, -R2]!

**perf**orms the operation

$$R0 \leftarrow [[R1] - [R2]]$$

The effective address of the operand, [R1]-[R2], is then loaded into R1 because write back is specified.

When the offset is given in a register, it may be scaled by a power of 2 before it is used by shifting it to the right or to the left. This is indicated in assembly language by placing the shift direction (LSL for left shift or LSR for right shift) and the shift amount after the register name, Rm. For example, the contents of R2 in the example above may be multiplied by 16 before being used as an offset by modifying the instruction as follows:

LDR R0, [R1, -R2, LSL #4]!

This instruction performs the operation

$$R0 \leftarrow [[R1] - 16 \times [R2]]$$

and then loads the effective address into R1 because write back is specified.

4. Relative Addressing: The program counter PC, may be used as the base register R<sub>n</sub>. In this case, the relative addressing mode is used. The assembler determines the immediate offset as the signed distance

between the address of the operand and the contents of the updated PC. When the effective address is calculated at instruction execution time, the contents of PC will have been updated to the address two words (8 bytes) forward from the instruction containing the relative addressing mode.



(a) Relative addressing mode

The address of the operand, given symbolically in the instruction as ITEM, is 1060. There is no Absolute addressing mode available in the ARM architecture. Therefore, when the address of a memory location is specified by placing an address label in the operand field, the assembler uses the Relative addressing mode. This is implemented by the Pre-indexed mode with an immediate offset, using PC as the base register. As shown in the figure, the offset calculated by the assembler is 52, because the updated PC will contain 1008 when the offset is added to it during program execution. The effective address generated by this instruction is 1060 = 1008 + 52. The operand must be within a distance of 4095 bytes forward or backward from the updated PC. If the operand address is outside this range, an error is indicated by the assembler and a different addressing mode must be used to access the operand.

**68000** Addressing Modes: The 68000 has several addressing modes which are defined as follows.

| Name              | Assembler syntax | Addressing function  Operand = Value     |  |
|-------------------|------------------|------------------------------------------|--|
| Immediate         | #Value           |                                          |  |
| Absolute Short    | Value            | EA = Sign Extended WValue                |  |
| Absolute Long     | Value            | EA = Value                               |  |
| Register          | Rn               | $EA = R_n$<br>that is, Operand = $[R_n]$ |  |
| Register Indirect | (An)             | $EA = [A_n]$                             |  |
| Autoincrement     | (An)+            | $EA = [A_n];$ Increment $A_n$            |  |
| Autodecrement     | -(An)            | Decrement $A_n$ ;<br>$EA = [A_n]$        |  |
| Basic index       | WValue(An)       | $EA = WValue + [A_n]$                    |  |
| Full index        | BValue(An, Rk)   | $EA = BValue + [A_n] + [R_k]$            |  |
| Basic relative    | WValue(PC)       | EA = WValue + [PC]                       |  |
| Full relative     | BValue(PC, Rk)   | $EA = BValue + [PC] + [R_k]$             |  |

- 1. *Immediate mode*: The operand is a constant value that is contained within the instruction. Four sizes of immediate operands can be specified. Small 3-bit numbers can be included in the OP-code word of certain instructions. Byte, word, and long word operands are found in one or two extension words that follow the OP-code word.
- 2. Absolute mode: The memory address of an operand is given in the instruction immediately after the OP-code word. There are two versions of this mode—long and short. In the long mode, a full 24-bit address is specified in two extension words. In the short mode, a 16-bit value is given in one extension word.
  - 3. Register mode: The operand is in a processor register, An or Dn, that is specified in the instruction.
- 4. Register indirect mode: The effective address of the operand is in an address register, An, that is specified in the instruction.
- 5. Auto increment mode: The effective address of the operand is in an address register, An, that is specified in the instruction. After the operand is accessed, the contents of An are incremented by 1, 2, or 4, depending on whether the operand is a byte, a word, or a long word.
- 6. Auto decrement mode: The contents of an address register, An, that is specified in the instruction are first decremented by 1, 2, or 4, depending on whether the operand is a byte, a word, or a long word. The effective address of the operand is then given by the decremented contents of An.
- 7. Basic index mode: A 16-bit signed offset and an address register, An, are specified in the instruction. The offset is sign-extended to 32 bits, and the sum of the sign-extended offset and the 32-bit contents of An is the effective address of the operand.
- 8. Full index mode: An 8-bit signed offset, an address register An, and an index register Rk (either an address or a data register) are given in the instruction. The effective address of the operand is the sum of the signextended offset, the contents of register An, and the signed contents of register Rk.
- 9. *Basic relative mode:* This mode is the same as the Basic index mode, except that the program counter (PC) is used instead of an address register, An.
- 10. *Full relative mode*: This mode is the same as the Full index mode, except that the program counter (PC) is used instead of an address register, An.

**IA-32 Addressing Modes:** The IA-32 architecture has a large and flexible set of addressing modes which are defined as follows.

| Name                                | Assembler syntax         | Addressing function                    |  |
|-------------------------------------|--------------------------|----------------------------------------|--|
| Immediate                           | Value                    | Operand = Value                        |  |
| Direct                              | Location                 | EA = Location                          |  |
| Register                            | Reg                      | EA = Reg<br>that is, Operand = [Reg]   |  |
| Register indirect                   | [Reg]                    | EA = [Reg]                             |  |
| Base with displacement              | [Reg + Disp]             | EA = [Reg] + Disp                      |  |
| Index with displacement             | [Reg * S + Disp]         | $EA = [Reg] \times S + Disp$           |  |
| Base with index                     | [Reg1 + Reg2 * S]        | $EA = [Reg1] + [Reg2] \times S$        |  |
| Base with index<br>and displacement | [Reg1 + Reg2 * S + Disp] | $EA = [Reg1] + [Reg2] \times S + Disp$ |  |

Value = an 8- or 32-bit signed number

Location = a 32-bit address

Reg, Reg1, Reg2 = one of the general purpose registers EAX, EBX, ECX, EDX, ESP, EBP, ESI, EDI, with the exception that ESP cannot be used as an index register.

Disp = an 8- or 32-bit signed number, except that in the Index with displacement mode it can only be 32 bits.

S = a scale factor of 1, 2, 4, or 8

- 1. *Immediate mode:* The operand is contained in the instruction. It is a signed 8-bit or 32-bit number, with the length being specified by a bit in the opcode of the instruction. This bit is 0 for the short version and 1 for the long version.
- 2. Direct mode: The memory address of the operand is given by a 32-bit value in the instruction.
  - 3. Register mode: The operand is contained in one of the eight general-purpose registers specified in the instruction.
  - 4. Register indirect mode: The memory address of the operand is contained in one of the eight general- purpose registers specified in the instruction.
  - 5.Base with displacement mode: An 8-bit or 32-bit signed displacement and one of the eight general- purpose registers to be used as a base register are specified in the instruction. The effective address of the operand is the sum of the contents of the base register and the displacement.
  - 6.Index with displacement mode: A 32-bit signed displacement, one of the eight general purpose registers to be used as an index register, and a scale factor of 1, 2, 4, or 8, are specified in the instruction. To obtain the effective address of the operand, the contents of the index register are multiplied by the scale factor and then added to the displacement.
  - 7. Base with index mode: Two of the eight general-purpose registers and a scale factor of 1, 2, 4, or 8, are specified in the instruction. The registers are used as base and index registers. The effective address of the operand is determined by first multiplying the contents of the index register by the scale factor and then adding the result to the contents of the base register.
  - 8.Base with index and displacement mode: An 8-bit or 32-bit signed displacement, two of the eight general-purpose registers, and a scale factor of 1, 2, 4, or 8, are specified in the instruction. The registers are used as base and index registers. The effective address of the operand is determined by first multiplying the contents of the index register by the scale factor and then adding the result to the contents of the base register and the displacement.

# Concept of Boolean Algebra

# Introduction:

→ In 1854 George Boolean Introduced a Systematic treatement of logic and developed for this purpose of an algebraic System Now Called Boolean Algebra.

→ In 1938 C.E Shanon Introduced a two-valued Boolean
Algebra Called a switching Algebra

Fundamental Postulateds of Boolean Algebra

| 00000 |                                                                                                                                                                           |                                           |
|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------|
| 3.NC  | Postulates                                                                                                                                                                | Comments                                  |
| 1:    | -> Result of Each operator 95 E9ther 0 or 1                                                                                                                               | 1,0 € 8                                   |
| 2.    | a) 0+0=0, 0+1=1+0=1<br>b) 1:1=1; 0:1=1:0=0                                                                                                                                | Identity element "o" for "+" and "1 fox.) |
| 3.    | a) $CA+B$ ) = $CB+A$ )<br>b) $CA\cdot B$ ) = $CB\cdot A$ )                                                                                                                | Commulative law                           |
| 4.    | a) A · (B+c) = (A · B) + (A·c)<br>b) A + (B·c) = (A + B) · (A+c)                                                                                                          | Distributive law                          |
| 5.    | a) $-A+\overline{A} = 1$ , $0+\overline{0} = 1$ , $1+\overline{1} = 1$<br>b) $A.\overline{A} = 0$ , $9+\overline{0}.\overline{0} = 0.1 = 0$<br>$1.\overline{1} = 1.0 = 0$ | Complement                                |
|       | [ 0 = 1 =                                                                                                                                                                 | 1 = 07                                    |

# Basic theorems and propertys

- The pranaple of Duality theorem says that starting with a bodean relation, we can derive another boolean relation by

1. Changing Each OR" sign to an "AND" sign

2. Ehangeng Each "AND" sign to an "OR" sign

3. Any 0 or 1 operating in the Expression of boolean function.

principal of Dualitytherian says Ex: Dual of relation A+ A = 1 98 A.A = 0  $A \cdot \underline{A} = 0$ 

Theorems:

| 110VI 1 10       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Annul ment       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| COUNT FORTA A    |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| SI CHA THA A     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| I dentity        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| STILLY ALTHOUGH  |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| MA DU Idempotent |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| Cauton S         |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|                  |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| Double Nagation  |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|                  |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|                  | SI AM JUAN A Identity  SI AM JUAN A IDENTITY |

|            | M.JOSHNA                                         |              |         |                    |      |
|------------|--------------------------------------------------|--------------|---------|--------------------|------|
|            | Boolean Expr                                     | ression      | history | BA law cor) Rule   | A.   |
|            | $\Rightarrow A+A=1$ $A\cdot A=0$                 |              |         | Complement         |      |
|            | => A+N = 1<br>-A · B =                           | B.A          | 1       | commitative law    | hood |
|            | => <del>***</del> = 7                            | Z-B<br>A+B   |         | De-mosigan         | .1.  |
| 3          | ,                                                | functions :  | pullors |                    | S.   |
|            | Function                                         | Description  |         | -Expression        |      |
|            | 1.                                               | NULL         | VI-A    | 100 00 Con 1       | ×3   |
|            | 2.                                               | Identity     |         | 1 1 A              |      |
| į          | 3.                                               | INput A      |         | A                  |      |
| . <b>!</b> | 4.<br>5. dates                                   | INPUT B      |         | B A Paris In Journ | 6    |
|            | 1                                                | on Ima NOT B |         | Billiamore         |      |
|            | 7.                                               | A AND B      | t       | A·B                |      |
|            | 8.                                               | AND NO       | OT B    | A-8                |      |
|            | 9.                                               | NOTA AN      | DB      | A·B                |      |
|            | 10. 1001                                         | NOT A        | ND)     | A · A+A · C        |      |
| •          | ~ <u>4.7.7.7.7.7.7.7.7.7.7.7.7.7.7.7.7.7.7.7</u> | should A OR  | B (OR)  | A 1 TO             |      |
|            | 12-                                              | A OR         | NOTB    | 1 A+B              |      |

| Description  | Expression ( |
|--------------|--------------|
| NOTA OR B    | 7-1B         |
| NOT OR (NOR) | A+B+AB       |
|              | 7.5          |
|              | NOTA OR B    |

# Theorems:

(3) a) 
$$\overline{A} = A$$

$$A+A = (A+A) \cdot (1)$$

$$= (A+A) \cdot (A+A)$$

$$= (A+A) \cdot (A+A)$$

$$= AA + AA + AA + AA$$

$$= AA + AA$$

$$= A(A+A)$$

$$= A(A+A)$$

$$= A(A+A)$$

$$A \cdot A = A \cdot A + (0)$$

$$= A \cdot A + A \cdot A$$

$$= A (A + A)$$

a) 
$$A+AB=A$$
  
b)  $A(A+B)=A$ 

$$\sum_{i \in A} A = A$$

$$\sum_{i \in A} A = 0; 0 + 0 = 0$$

$$\sum_{i \in A} A = 1; 1 + 1 = 1$$

$$\sum_{i \in A} A = A$$

$$\begin{cases}
A = A = A \\
A = 0 = 0.0 = 0
\end{cases}$$

$$A = 0 = 0.0 = 0$$

$$A = 1 = 0$$

$$A = 0 = 0.0 = 0$$

$$A = 0 = 0.0 = 0$$

Proof: 
$$A+1 = CDA+1$$

$$= (A+\overline{A}) \cdot (A+1)$$

$$\overline{A} + A - \overline{A} + A + A + \overline{A} - A + \overline{A}$$

with made

$$= A.+A+O+\overline{A},$$

$$= A + A + \overline{A}$$

$$=$$
  $A+\overline{A}$ 

$$Proof: A \cdot O = A \cdot (A \cdot \overline{A})$$

$$\bar{A} = A$$

Proof: if 
$$A=0$$
;  $\overline{0}=\overline{1}=0$ 

$$A = A$$

I A hast aras

$$\begin{bmatrix} A \cdot \overline{A} = 0 \\ A + \overline{A} = 1 \end{bmatrix}$$

A A COLON

: hn -1(00)

$$\begin{bmatrix} \overline{O} = 1 \\ \overline{T} = \overline{O} \end{bmatrix}$$

- = A(I+B)+AB
- = A +AB
  - = ACI+B)
    - = A 11
- (5) (a) A+AB = A+B

proof: A+AB = CA+A) (A+B)

= A+B

Command Homen [1 = A-A-3]

ALAR OLDER METRE

was every make to make the first of a

$$(\vec{b}) + (\vec{A} + \vec{B}) = A\vec{B}$$

proof: A(A+B) = A.A + AB = O+AB

1 = AB //

\*\*\* De-Morgan's theoram:

- (9) AB = A+B
- (9) A+B = A-B

| (P) | AB = | 72 | Ā | B | AB      | A +B     |
|-----|------|----|---|---|---------|----------|
|     | A    | b  |   |   | 770     | () ( d + |
|     | 0    | 0  | 1 | 1 |         | 1 .      |
|     | 0    | ١  | 1 | 0 | L. Linn | 10 1 5.  |
| •   | 1    | 0  | 0 | 1 | 1       | 1        |
|     | 1    | 1  | 0 | 0 | 0       | 0        |

(9) A+B = A·B A+B A.B B A B A 1 0 0 AD TA I DI HOA I STA 1 0 0 1 0 0 0 0 0 1341

ONE ACCIAB STREET (STA)

```
> Consensus theoram :
```

The simplification of Boolean Expression, an Expression of the form

AB+AC+BC. The term BC 98 redented and can be Eleman

ated to form the Equilent Expression AB+AC.

The theorem 9s used for this simplification 9s known as Consensus theorem. and It 9s stated as

AB+AC+BC = AB+AC

Proof: L.H.S => AB+AC+BC = AB+AC+B((1)

C: A+A=n

= AB+AC+BC(A+A)

= AB+AC+BCA+BCA

[: 1+c=1]

= AB(1+C)+Ac(1+B)

= AB+AC

\* Solve the given expression using consensus theorem
(9)  $\overline{AB} + Ac + Bc + Bc + AB$ 

=> AB+AC+BC+BC(1)+AB

[A+A=1]

=> AB+AC+BT+BCCA+A)+AB

=) AB+ AC+BC+BCA+BCA+AB

[HC = 1

=> AB (1+0)+Ac (1+B)+BC+AB

=> AB+AC+BC+AB

=> AB+AC+BE+ABCI)

=) AB+AC+BT+AB(GC+T)

=) AB+AC+BC+ABC+ABC

1+A =17

=> AB+AC(1+B)+BT(1+A)

```
AB+AC+BE
 Dual of Consensus theorem:
The Dual form of consensus theorem 9s stated as
  (A+B)(A+c)(B+c)=(A+B)(A+c)
    (AA+AC+AB+BC)(B+C) = (AA+AC+AB+BC)
     (0+AC+AB+BC)(B+C) = (0+AC+AB+BC) [A-A=0]
     ABC + AB. B+BC. B+AC.C+ABC +BC.C = AC+AB+BC
    BC(A+B)+ AB(B+C
    ABC+ AB+BC+AC+ABC+BC = AC+AB+BC
      ABCI+O)+ BC(I+A)+AC = AC+, AB+BC
        AB+ BC +AC = AC+AB+BC
  Boolean function (or) Switching function:
-> Boolean Equations are constructed by connecting the boolean
 Constants and Variables with the boolean operation.
-> This Boolean Expressions are known as boolean formulas we like
  boolean Expression to describe bodean functions.
for Example: If Boolean Expression (A+B) C is used to describe
 the function of f, then boolean function is written as f(A,B,c)=(A+B)=

F = C-A+B >
 -> Let us consider the whole four Variable boolean function.
  Product-terms: product
                                         O181109.
         PCAIB, CID) = A+(BC)+ ACD)
```

| Sum-Terms: Sum terms                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |             |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------|
| CAR (102 = CR) = 2.(A) = 1(2(A+C)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | はいる         |
| F(A,B,c,D) = (B+D)·(A+B+c)(A+c)  Atems  Atems                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |             |
| Rterals                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | 1           |
| The literals and terms are arranged in the forms to in tour                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |             |
| 1. Sum of product (sop)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |             |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |             |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |             |
| 1. Sum of products:  The sum of product 9s also called Distunctional mound form corp                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | )           |
| The sum of product 4s also carried                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |             |
| 10 11 COVIII PIQ TOVIII PIQ                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |             |
| 1 C C TO A CHARLES CHARLES                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |             |
| The words sum & production (+ and *)  The words sum & production (+ and *)  The words sum & production (+ and *)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |             |
| Sepresentation "OR" and "AND" function ( ) sum terms  Ez=1 f(A1B,C) = AFBC + ABC                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |             |
| products products                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | Till Albert |
| pailorement pailor | 4           |
| ex = f(P,Q,R) = PQ + QR + RS  Products  Products                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |             |
| Products products                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | 10          |
| his vooler of the large bedeen the large                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | T           |
| a. Froduct of anni-                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | od          |
| The product of sum 9s also called confunctional mormal                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | 15          |
| form (0x) Confunctional normal format of sum terms ANDED                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | 3           |
| The product of sum 9s also called conjunctional mormal formula.  form (08) conjunctional normal formula.  A product of sum 9s any group of sum terms ANDed 1.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | _           |
| to gether. Product of sum is any                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |             |
| Contract of the second of the  | 0           |
| EXT (CAIBIC) = (AHBHO) (AHBHC)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | 1           |
| Sum                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |             |

M.JOSHNA 6788 +(P, P, T) = (P+4). (x+p) . 9 Sum Liproduct 1 10 11 10 11 Standard sop and pos forms) > Basically Canonical forms are two typs 1. Standard Sop Coro monterm canonical form Q. Standard POS Cor) maxterm canonical form 1. standard sop: (menterm canonical form) The stop 98 given by f(AIB, C) = (ABO)+ (ABC)+ (ABC) leterals en Ether completed form or uncompleted form Each product term 98 consistent all 2. standard pos: (maxterm) (A+B+c); (A+B+c). (A+B+c) Each sum term consists of all literals an the completed \* Converting - Expressions In standard sop (or pos form steps to Convert sop to standard sop form: 1. find the massing literal in each product term if any 2. "AND" Each product term having missing literals which term 1 formed by OR line literals and its complement 3. Expand the terms by applying Distributive law and seconded 4. Reduce the Expression omitting (or) remove repeated product the literals on the product terms theme is any. because (A+A) = A = B(A+A)A

```
Ex: convert the given Expression In standard 80p form
     fcaB,c) = AC+AB+BC
Step: 1 f(A,B,c) = find the missing literal an Each product
   f(AB,C) = AC+AB+BC
                            _ A literal is missing
                            - cliteral missing
                            - Bliteral missing
step: 2 AND product terms with (missing literal + complement)
   f (A,B,C) = AC(DBAS) + AB(1)+BCCA+A)
                                              A+A = 1
       - ACCB+B)+ ABCC+T)+BCCA+A)
Step 3: Expand the term and recorded the terms
    Expand:
   f(AB)() = ACB + ACB+ ABC+ ABC+ BCA+BCA
 Kecorded:
 f(A,B,C) = ABC+ABC+ABC+ABC+ABC+ABC
 Step 4: omitting the repeted product term
      + (A1B,C) = ABC+ABC+ABC+ABC+ABC+ABC
     f(A(B)() = ABC+ABC+ABC+ABC
Ex: Convert the given Expresion In standard sop form
    fcA(B(C) = A+ABC
                               2. And Early product - tom
Step1: find
     &CAIB, C) = VA + ABC
                          Bateral missing
                              literal missing
 Step 2: AND product term with (missing literal + complement)
     PCAIBIC) = ACDUDTABC
              = A(B+B) (C+E) +ABC) scuped - 100 1
```

| Function | Description                               | Expression. |
|----------|-------------------------------------------|-------------|
| 13:      | NOTA OR B                                 | A+B         |
| 14:      | NOT OR (NOR)                              | -A+B -AB+AB |
| 15. 117  | Exclusive OR (X-OR) Exclusive NOR (X-NOR) |             |
| 16:      | - JAON SVIOUXSKS                          |             |

# Theorems:

$$\begin{array}{ccc} \text{(1)} & \text{(2)} & \text{(3)} & \text{(4)} & \text{(4)} & \text{(5)} & \text{(5)} & \text{(5)} & \text{(5)} & \text{(6)} & \text{($$

$$3$$
 a)  $\overline{A} = A$ 

$$A+A = CA+A) \cdot (1)$$

$$= CA+A) \cdot (A+A)$$

$$= AA+AA+AA+AA$$

$$= AA+AA$$

$$= ACA+A)$$

$$= ACA+A$$

$$A \cdot A = A \cdot A + (0)$$

$$= A \cdot A + A \cdot \overline{A}$$

$$= A (A + \overline{A})$$

a) 
$$A+AB=A$$
b)  $A(A+B)=A$ 

(5) a)  $A+\overline{A}B=A+B$ 

B) A (A + B) = AB

$$A = A + A$$

$$\sum_{i=1+1}^{n} (1 + i) = 1$$

$$\sum_{i=1+1}^{n} (1 + i) = 1$$

$$\begin{cases}
A = 0 = 0.0 = 0 \\
A = 0 = 0.0 = 0
\end{cases}$$

$$\begin{bmatrix}
A = 0 = 0.0 = 0 \\
A = 0
\end{bmatrix}$$

Recorded: -FCAIBIC) = [ (A+B+C) CA+B+E)]. [(A+B+C). CA+B+O]. [A+B+O] CA+B+() Step4: omitting the repeated product term - P(A1B10) = (A+B+C)(A+B+T)(A+B+C)(A+B+C) Ez: Convert the given expression on standard pos forms P(A18, 1)= (A). (A+B+() Step 1: find the missing literal an Each product f(A1B1C) = A · (A+B+C) B, c literals missing step 2: OR product term with (missing literal + complement) f(AIB,C) = [A+CB·B)+Cc.c). CA+B+C) Step 3: Expand the term and recorded the terms Expand: [A+(B+c)+(B+c)+(B+c)+(B+c)].(A+B+c) = (A+B+c)· (A+B+c)· (A+B+c)· (A+B+c)· (A+B+c) Step 4: omitting the repeated product term f(A|B(e)=(A+B+C). (A+B+C). (A+B+C). (A+B+C) Treamount [ [ Comment ] Chang

at the hadroner bin must all banges

| M.JOSHNA                                                                                         | 4                                 |                                        |                                         |  |
|--------------------------------------------------------------------------------------------------|-----------------------------------|----------------------------------------|-----------------------------------------|--|
| M- Notations                                                                                     | : (minterms and                   | maxterms).                             | A. Can                                  |  |
| Dearnal NO                                                                                       | Brary Numbers                     | Manterm (m)                            | (POS)                                   |  |
|                                                                                                  | 0 (421)                           | ABC (mo)                               | A+B+( (M6)                              |  |
| 0                                                                                                | 000                               | · ABC (mi)                             | A+B+C (M)                               |  |
| 1                                                                                                | 001                               | ABC (ma)                               | NETC (NO)                               |  |
| 2                                                                                                | 010                               | 1 4 1 1 1 4                            | 1 1 E + 7 (H2)                          |  |
| 3                                                                                                | 011                               | ABC (m3)                               |                                         |  |
| 4                                                                                                | 100                               | ABC (m4)                               | _ /                                     |  |
| 5                                                                                                | 1010                              | ABC (m <sub>6</sub> )                  |                                         |  |
| 6                                                                                                | 110                               | 1000 Miles                             | A+B+7 (H)                               |  |
| +                                                                                                | 111                               | A BC (MIT)                             | THE PLEASE                              |  |
| Example: $f(A_1B_1c) = \overline{ABC} + \overline{ABC} + \overline{ABC}$                         |                                   |                                        |                                         |  |
|                                                                                                  | = mo-                             | + m, + m3+m6                           |                                         |  |
|                                                                                                  | = 5 m                             | (0,113,6)                              | 4                                       |  |
| 8x÷ f (A)1                                                                                       | B,() = (A+B+T)                    | · (A+B+C) (A+B                         | +()                                     |  |
|                                                                                                  | = M1 · M3 ·                       | M6 ;                                   |                                         |  |
| exito And                                                                                        | = TT M(1/3/6<br>Sum of Product fo | m to the given table                   | e a a a a a a a a a a a a a a a a a a a |  |
|                                                                                                  | ABC )                             | \( \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ | n-Put Variables                         |  |
| Exito find sum of product form to ABC 98 a  ABC Y  GnPut Yariables  O 0 0 1  O 10  I Sop k9 1'87 |                                   |                                        |                                         |  |
|                                                                                                  | 0 10 1                            | (C(E)MIT                               | 1                                       |  |

| A.BC | 1 year | (m) partition ) ? | Volatori |
|------|--------|-------------------|----------|
| 101  | 6      | (156 A) Grant 1   | ON John  |
| 1 10 | 1 4    | (10)              | o        |
| 1 11 | D      | 5.0               | D        |

$$= m_2 + m_3 + m_6$$

= \le m(213,6) TO find the product of sum from the given table [ Product of

Sum Ks Zero's]

| 10) 21 4 + V | A+B+C | , y   |  |  |
|--------------|-------|-------|--|--|
|              | 000   | 1     |  |  |
|              | 0.01  | 1     |  |  |
|              | 010   | 0 <   |  |  |
|              | 011   | 14 14 |  |  |
|              | 100   | 1.    |  |  |
| (1)          | 1,01  | 0     |  |  |
|              |       |       |  |  |

Algebraic Emplisications:

$$\overline{A} + \overline{B} + \overline{A} + AB$$

$$=$$
  $(A+\overline{A})+\overline{B}$ 

$$\left[ \overline{AB} = \overline{A} + \overline{B} \right]$$

$$= \overline{A + AB + BB}$$

- ABCD+ABD = ABD
- =) ABD(1+C) .
- => ABD
- =) ABCD+ABD = ABD
- 3 A(A+B)=A
  - =) A-A + A-B
  - =) A+ AB
  - => A (1+B)
  - => A

- [1910](A18) (ASTALIS)
- (4) AB+ ABC +AB(D+E) = AB
  - =) AB+ABC+AB(D+BEF)
  - =) AB(I+c) + ABD+ABE
    - => AB + ABCO+E)
    - =) AB (1+ D+E)
    - AB((1+D)+ F)
    - => AB[€18+E]
    - ABCID
      - AB
- 5. xy+ xyz+xyz +xyz = y(x+z)

1= >1 AH (7+2) + XYZ

- => 7y(1)+xyz+xyz+xyz
- > xy(z+2)+xyz+xyz+xyz
- > メリマトスリモ + ユリマト ガラト えりて
- ないなりと りていれて)

- フィー・フィー・フィー・フィー・フィー・スタイプ
- 29 49(242) 4: (4: 427 + 4245
  - 4(2+5)
  - ユリナスリエナダエナダエ
- (or)
  - yextz) 11

0-1107

$$\Rightarrow$$
  $A(B+B)\cdot(T+B)$ 

(F) ABC+ ABC+ABC

ABCHABCHABC

AC(B+B)+ABC AC+ ABT

A((C+B)·(C+T))

A(+B)·(C+T)

(1-din). (8) A (C+B) ) (3/11/A) (5/11/A) ACHABINET ACHABINET ACHABINET ACHARINE

[(z)+(xz) - (z+x)+(xx)

(AIA) (AIA)

[BTHB = (8+8)-(2+8)

EC(B+B) + ABC

= ) A(T+B) = AC+ABC - RictBC).

- A ((C+B). (C+1) - ACLAB

ABC+ ABC+ ABC

AC(BIB) LABE

ACC+BT) P ((C+B), (C+E))

(Fig. 11) ACC++B) ACC++B)

M.JOSHNA +AB+AB = A+B

A+AB+AB

A (1+B)+AB

A(1)+AB

A+AB

=) (A+A)·(A+B)

=) (1). (A+B)

=) A+B/

Existinglify the following three variable Expression by using Bodean

algebra Y = Sm(1,3,5,7)

Given Data Y = Em (113,5,7)

= ABC + ABC + ABC + ABC

Ac(B+B) + Ac(B+B)

ACCD+ACCI)

CCA+A)

Ex: 89mplify the following three variable Expression by using Boolean algebra y = TTM (1,3,5,7)

Given that Y=TTM (1,3,5,7)

= M1, M3. M5. M7

= (A+B+Z)(A+B+Z). (A+B+Z). (A+B+

(A E) (A.T) L (BA)+(BB)+(B·T)+(CA)+(CB)+(C·E)

ATEBLAS : A+B

(P.I.A.).(A+B)

Un. CAYB)

- ABC + ABC+

AC(B+B)+ACCH

EACTAC

```
M.JOSHNA
 = ABT+ABT+ATT+ABT+ ABT+ABT+ABT+ABT+ABT+ABT+ABT+ABT+ABT+
     BELABELBELABELABELACLABELABELABELABELABELABE
      +BT+AT+ABT+AT+AB+BT+AT+BT+T
 = ABC+ ABC+ AC+ ABC+ ABC+ BC+ ABC+ BC+AC+AB+C
 = BE(A+A)+BE(I+A)+T(A+A)+AB(I+C)+T(I+B)+ABT
 = BC+BC+ T+AB+T+ABC
= AB+ABC+C//
Explanplify the following three Variable Expresion. Convert the Expresion
anto minterm complementary property and &
        y = TTH (113,517)
 Jol: The given Expression 98 marterm Y = TTM (1131517).
   To convert given sequences to mintern Complementary form
     mintern = Complementary of Maxtern
  The minterm 93 given by = Em(0,2,4,6)
    y = Em (0,2,4,6) = ABE+ ABE+ ABE+ ABE
                = mot met my + mg = \mathcal{D}((B \setminus \overline{B}) + \mathcal{D}(B \setminus \overline{B})
     = ABC+ABC = AT + ABC+ABC = AT + AC
                                              = Z(A+N)
                    = AC(B+B) + AF(B+B) =c
                    = AC + AC
                    = T (A+A) = T1)
   transform Each of the following Canonical Expression anto
 Its other Canonical form Its decemal notations.
 (9) f(x1 y , 7) = 5m(1,315)
                                     (0x14/2)= TM (C,2,11/67)
 (91) f(w, t, y, z) = TTH (0, 2, 5, 6, 7, 8, 9, 11, 12)
 (9) f(x,y,z)= Em(1,3,5)
              = Mo · Ma · M4 · M6 · M4 (x+y+z) · (x+y+z) · (x+y+z) ·
      (+1917/810) MIT = (21/8/x)
                                                (I+y+W). (£+y+)
```

=(ス+xy+スマナスソナリマ+スマナダントマン(ス+スリナスマナスリナスマナスリナスアナスリナスナスリナスナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナステナスリナストスリー

= キャマナスダマナ メマナズダマナダマナズマナス

= xz(y+g)+ yz(x+1)+ yz(1+x)+z(x+x)+z

=  $\chi z + y z + y z + z = 1 \times (x + y) + z (x + y)$ =  $\chi z + x y + z //$ 

(\$1,11,p,8,7,6,5,6,7,8,9,11) = (5,6,7,8,9,11,12)

f(w,z,y,z)= TM(0,8,5,6,7,8,9,11,12)

= 2m(1,3,4,10,14,15)

= m1+m3+ m4+m10+m13+m14+m15

= wxyz+ wxyz+ wxyz+ wxyz+ wxyz+ wxyz+ + wxyz+

= wxz(y+y)+ wxz(y+y)+wyz(x+x)+wxyz

= WXZ+WXZ+WYZ+WXYZ

|   | M.JOSHNA                                 |                                       |                  |                                                  |
|---|------------------------------------------|---------------------------------------|------------------|--------------------------------------------------|
| 1 | Deamal NO                                | Bloary number                         | minterm          | Maxtern                                          |
| f |                                          | 1000                                  | A BOOD           | (M) A+B+C+C                                      |
|   | 8                                        | 1001                                  | ABCD(mg)         | A+B+c+D                                          |
|   | 9                                        | ( rotter)                             | ABCD Cm          | A+B+C+D                                          |
|   | ro                                       | 1010                                  | ARCD             | 1740                                             |
|   | 1                                        | 1011                                  | ABCD             | A+B+T+D                                          |
|   | 11                                       | 1011                                  |                  | A+B+C+D                                          |
| 1 | 1 1 1 10 10 10 10 10 10 10 10 10 10 10 1 | 1100                                  | ABCD             |                                                  |
| 1 | 12                                       | ASSTORE Last                          | Post.            | A+B+C+D                                          |
|   | (a Lea ) 1 .                             | 1101                                  | ABCD             | a Larlan tra                                     |
|   | 13                                       | is more to be                         | ABCD             | $\overline{A} + \overline{B} + \overline{c} + D$ |
|   | 14                                       | 1110                                  |                  | 5 5 5 5                                          |
|   | 17                                       | (1,)                                  | ABCD             | A+B+C+D                                          |
|   | 15                                       | 1111                                  | N 9 -            | 4                                                |
| _ | · (11/5) (11/5)                          | · · · · · · · · · · · · · · · · · · · | To defend on the | 11.2.10                                          |
|   |                                          |                                       | 1.7.2.22.        | ACTOLOGY - PULC                                  |
|   |                                          |                                       |                  |                                                  |

Mass & B = Warren A allo problem to a lideling CC

allo 8 . iv. adolent 8

Allow at = 00 = oldpicety pl

1 Vortable Markov D

:-2904

The map method gives us a systamatic approach for simplyfying a boolean expression. The map method first proposed by veitch and modified by Karnaugh, Hence H9s known as the witch diagram or the kamaugh map (k-map)

One-Variable, two variable, three varible and four variable Maps:

care 8 = 8 cells 310 - 23 = 8 cent white 24 = 18 and

1 Variable K-Map:

1- Variable Map (2cell8)

2-Variable K-Map:



| 9            | A AT     | 3 2 A  | В3.     | - ! Y | • |
|--------------|----------|--------|---------|-------|---|
| 3- Variable: | BC BT    | BC     | BC BC   | BC    |   |
| o A          | ABC -ABC | ABC 5. | ABC / 3 | ABC 6 | ! |





) / 1

1131 A

| A | В | C | У   |
|---|---|---|-----|
| 0 | 0 | 0 | 0   |
| 0 | 0 | ) | (1) |
| 0 | 1 | 0 | 0   |
| 0 | 1 | 1 | 0   |
| 1 | 0 | 0 | 0   |
| 1 | 0 | 1 | 0   |
| 1 | 1 | 0 |     |

| BC  | 00  | 01  | 11 | 10 |
|-----|-----|-----|----|----|
| K   | 0   | 1   | 0  | 0  |
| 0   | 0   | (-1 | 3  | 2  |
| 5 , | 0   | 0   | 1  | 1  |
|     | 4 4 | 5   | 1  | 6  |
|     |     |     |    |    |

Ez: plot Boolean Expresion y = ABCD + ABCD +

| 000  | D '  | 9 9   | : 0.1  |      | ,      |
|------|------|-------|--------|------|--------|
| AB / | 00   | 01    | 11     | 1 10 |        |
| 00   | 0000 | 0001  | 0011   | 1 3  | ] .    |
| 01   | 0001 | 0101  | 0111   | 0110 | > ABCD |
| ABCD | 1100 | 5     | 1111 - | 1110 |        |
| 11   | 12   | 1) 13 | 15     | 14   |        |
| 10   | 8/   | 1001  | 11 0   | 010  |        |
| 4    | A-Bi | D     | ABCD   |      | ABID   |

Representation of standard pos on k-map

Three Variable k-map:

| i   | m. R | 0000 |     | 1 3 4 9 9    |
|-----|------|------|-----|--------------|
|     | A    | B    | ) c | Maxtern (M)  |
| , ; | 0,   | 0    | 6   | A+B+C (Mo)   |
|     | O    | 0    | į)  | A+B+C (M1)   |
| -   | 0    | 1    | 0   | A+B+c (Ma)   |
|     | 0    | N1   | 1 - | A+B+C (M3)   |
|     | . 1  | 0    | 0   | + A+B+c (M4) |
|     | 1    | 0    | 1   | A+B+C (M5)   |
|     | 1    | 1    | 0   | A+B+c: (M6)  |
| 4   | 1    | 1    | ,   | A+B+C (N7)   |

foure Variable kimapin and my - 1 notaring which

| 1 |                                         |           |             |                                          |                                                                                                                                                                                                                                                                                               |
|---|-----------------------------------------|-----------|-------------|------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|   | A                                       | В         | C           | J. D                                     |                                                                                                                                                                                                                                                                                               |
|   | 000000001111111111111111111111111111111 | 000001111 | 00110011001 | 0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, | A+B+C+D (Mo) A+B+C+D (Ma) |
| 1 |                                         | ١         |             | 1                                        |                                                                                                                                                                                                                                                                                               |

Ext plot Boolean Expresion Y= (A+B+c)(A+B+c)(A+B+c)(A+B+c) Y = M2 . M3 . M6 . M1

10 7 A+B+C BC 11 01 00 7 A+B+C 010 01/ 100 000. 10 3 2 0 0 7 A+B+C 110 111 101 100 5 A+B+C







y = AD

(2)

| M.JOSHNA         |            |                                         |         | ~,            |
|------------------|------------|-----------------------------------------|---------|---------------|
| Ex+6 CD          | KD BY      | o ab                                    | CD      |               |
| AB               | 00 01      | 11                                      | 1.0     | un as not in  |
| AB 00            | 7), 0      | - 0                                     | 11      | Bō 1. 1       |
| AB 01            | 0 0        | 0::                                     | 0       | NB (PI        |
| AB 11            | 0 0        | 0                                       | 0       | CD OU TO      |
| A.B. 10          | 1; 0       | 0                                       | [1]     | 0000          |
| Ans:             | J=BD       | 4                                       |         | Q[, 5+B[,     |
| .0.              |            |                                         | _       | · 35(6+c)     |
| Ezite AB CI      | 00 01      |                                         | GD      | 1 0 3 10      |
| AB 00            | 0 0        |                                         | 0       |               |
| AB OI            | 0 0        | 0 0                                     | 0       |               |
| Group 7 AB AB 11 | TIT        | - 12                                    | 1       | Group 3 -> AC |
| A B 10           | 0 1        | 11                                      | 4       |               |
|                  |            | 1                                       | ,       |               |
|                  | K Broup &  | a → AD                                  |         | •             |
| 7 (              | (A) y = AP | 54 AD+AC                                |         |               |
| Grouping Eig     | ht Adjacen | of ones (                               | Poctet) | h solo se     |
| traction         | က က က      |                                         | 3:      | ED CD CD CD   |
| (1) 00 (         | CD CD CI   |                                         | AB AB   | 00 01 .11 10  |
| HB YTT           |            | AB                                      | DAB O   | 0 0 2 1 0     |
| AB OU            |            | . 014                                   |         | 10111111      |
| AB 01 51 3       |            | 1 0 1                                   | 4       |               |
| B 11 11          | 1 1 1      | 110                                     | AB !    | 10/6          |
| AR 10 10         | 0 0 0      | 11/1/1/2                                | AB.     | 10 10 10 1    |
| AB 00 0 C        |            | 000000000000000000000000000000000000000 | AB O    | 11/1/0        |

( L) ...



Simplifications of sop expressions

-> from the above discussion we can outline generalized procedure to simplify Boolean Expressions as to llows:

1. Plot the K-map and place 1.8 in those cells corresponding to the 18 in the truth table or sum of product expression place 0.8 in other cells.

2. check the k-map for adjacent 18 and encorcle those I's which are not adjacent to any other 18 these are called

3. check for those 1's which are adjacent to only one other 1 and Encircle such pairs. 4. Check for quads and octets of adjacent 18 even of 9+ contains some 18 that have already bee encircled. While doing this make Sure that there are minimum no of groups. 5. Combaine any pairs necessary to anclude any 18 that have not Yet been grouped. Simplified Expression by summing product terms of all groups. Exis minimire the Expression 1 = ABC+ ABC+ ABC+ ABC A= 101+001+011+100+000 = m6+m1+m3+m4+m0 = 2m(5,3,4,0) Step 1: The k-map for three Variables and 9496 ploted according to the BC given expression. 10 010 (1) Step 2: There are no Isolated 18 100 101 step 3: 1 90 the cell 3 adjacent only to 1 90 the cell 4. This pair 95 combained and referred to as group 1 Step 4: There 98 no octet, but there 98 a BC BCB 00 01 10 quad all 8 0,1,4 and 5 from a quard. 0 1 11 0 This quad as combained and reffered to 1 0 a group

altedy been

Steps: All to have



Est Manize the Expression  $y = \overline{ABCD} + \overline{ABCD} + \overline{ABCD} + \overline{ABCD} + \overline{ABCD} + \overline{ABCD}$ 

= 0100+0101+1100+1101+1001+0010

= m4+ m5+ m12+ m13+m9+ m2

= 2m(4,5,18,13,9,2)

|         |          |         | T BCD     | 1-1-11     | : Last  |
|---------|----------|---------|-----------|------------|---------|
| CD      | CD CD    | CD C    | D Group 3 |            | 1       |
| AB AB   | 00 01    | 11 17   | 7 G1      | 612        | Giz     |
| AB 00   | 0 1      | 3 2     | ABC       | D ABIL     | ABCD    |
|         | 4 5      | 7 6     | 0 1 9     | 9 11/01    | o ook   |
| AB OI   | 1 1      | 1- 11   | 1916      | 1 001      | ĀĒCD    |
| AB 11   | (a) (a)  | 15 14   | 110       | A CD       | 47.10   |
| · AB 10 | 8 / 9    | 11 10   | BC        | not led    | or had  |
| Grapa G | 1        | - 168)i | hal plan  |            |         |
| BC.     | rolipa 1 | iony    |           | no at male | . Agrid |

med photo

mal of MA. again

\* Complify the logic function specified by the truth table by using the kanaugh map method. Y 9s the output variable and Alband c are the Input Nariables

| · A | B  | C    | У   |
|-----|----|------|-----|
| 0   | 0  | Ó    | 1   |
| 0   | 0  | 1    | 0   |
| 0   | 1  | 0    | 0   |
| 0   | 1  | 1    | 1   |
| 1   | 0, | 0, - | 1.  |
| 1   | 0  | 1    | 0   |
| 1   | 1  | 0.   | 2   |
| 1   | 1  | 1    | (1) |

= mo+ m3+ m4+ m7

= Em(0,3,4,7)

| 0  |      | 11  | 01  | 00       | AB |
|----|------|-----|-----|----------|----|
| રી | Ži ° | (1  | , 1 | (4)      | 0  |
| 6  | 7 (  | 17. | 5   | 1 4      | 1  |
|    | Gray | 17. | 5   | Girou R. | 1  |

| Part    | Group.2 |
|---------|---------|
| Group 1 | ABC     |
| 0/00    | 0/11    |
| 00      | 1 1     |
| BC      | BC      |

Ext Reduce the following function using k-map technique. f (A,B,c,D) = &m (0,1,4,8,9,10)

= mo+m1+m4+m8+m9+m10 = ABCD+ ABCD+ ABCD+ ABCD+

| M.JOSHNA                                                      |                  |        |                                                                                   |
|---------------------------------------------------------------|------------------|--------|-----------------------------------------------------------------------------------|
| AB 00 01  00 01  01 4 5  01 4 5  01 20  01 4 5  01 20  01 4 5 | 3 2 F            | Ollock | Group 2  A B CD  O O O O  1 O O O  1 O O O  1 O O O  TBC  Group 3  AB CD  O I O O |
| Delint of Sun                                                 | a simplification | no÷    | ABED                                                                              |

Froduct of sum simplification:

1. Plot the k-map and place 08 90 those cells corresponding to the as in the truth table or maxterns in the product of sum expression

2. check the k-map for adjacent as and Enarche those as which are anot adjacent to any other os. These are Called Asolated Os.

3. Check for those as which are adjacent to only one other and Encircle such pairs.

4. Check for quads and octets of adjacent 08 Even 9f 9t contains some is that have already been encarcled. while doing this make sure that there are minimum no of groups.

5. Combaine any poins necessary to Include any 08 that have not yet been grouped.

6. From the simplified pos Expression for F by taking product of sum terms of all the groups

To get familiar with these steps we will solve some Examples.

Ext minimize the Expression Y= (A+B+E) (A+B+E) (A+B+E) (A+B+C) (A+B+C)

= (A+B+T) = M1, (A+B+T) = M3, (A+B+T) = M7 (NIB+C)=MA

| Step 1: (a) shows the k-map for three Variable A 00 01 11 10 and 9t 98 plotted according to given maxterns 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0                                                                                                                                                                                                                                                                                                                                                                                          |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Step 3: O 90 the cell 4 98 adjacent only to 0 BC 00 01 11 10  90 the cell oand 0 90 cell 7 98 adjacent only  40 0 90 the cell 3. These two parrs are com-  to 0 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com-  to o 90 the cell 3. These two parrs are com- |
| bained and referred to as group 1 and group  a respectively.  Step 4: There are no quads and octets A BC 00.01 11 10  slep 5: The 0 90 the cell 1 can be 0 101 6 701                                                                                                                                                                                                                                                                                                                                                                      |

Step 5: The 0 9n the cell 1 can be 0 0 1 3 2 Combalned with 0 9n the cell 3 to be Combalned with 0 9n the cell 3 to be from a pair. This pager 9s referred to as groups Graup 1 Grap 2 Graup Step 6: In group 1 and 9n group 2, A 9s ABC ABC ABC aliminated, where an graup 3 Variable B 0 0 0 0 111 011 98 Eliminated and we get BC AC

$$\overline{y} = \overline{BC} + BC + \overline{AC}$$

$$\overline{y} = \overline{y} = \overline{BC} + BC + \overline{AC}$$

$$y = \overline{BC} + BC + \overline{AC}$$

$$y = \overline{BC} + BC + \overline{AC}$$

$$y = \overline{BC} + BC + \overline{AC}$$

$$= (\overline{BC}) \cdot (\overline{BC}) \cdot (\overline{AC})$$

$$= (\overline{B} + \overline{C}) \cdot (\overline{B} + \overline{C}) \cdot (\overline{A} + \overline{C})$$

$$= (\overline{B} + \overline{C}) \cdot (\overline{B} + \overline{C}) \cdot (\overline{A} + \overline{C})$$

Y = (B+C)(B+E)(A+E)

0001

150

Ext: Mammize the following Expression in the pos form  $\gamma = (\overline{A} + \overline{B} + c + D)(\overline{A} + \overline{B} + \overline{c} + D)$ (A+B+C+O)(A+B+C+D)(A+B+C+D) ON: (A+B+C+D) = MIR, (A+B+C+D)= MIA, (A+B+C+D)=MIB (A+B+C+D) = M8, (A+B+T+D)= M6, (A+B+T+D)=M7 (A+B+C+D) = Mo and (A+B+C+D) = M13 C+D C+D C+D 00 AB Step 1: Shows the k-map for four A+B 00 Variable and 9+ plotted according to 0 4 5 6 A+B 01 given max terms 0 13 15 12 14 Steps: There are no 9 solated 1's A+B 10 0 0 O 0 Step3: 0 90 the cell 0 98 adjacent A+B 10 Only to In the cell this pair is combained and refferd to as group, step4: There are two quads cells 12, 13, 14 and 15 form a quad I and cells 6,7,14,15 forms a quade These two quads are group & and groups, respectively reffered to as 00 ... steps: All os have already been grouped 2 Step6: In group 1, Variable A is Eliminated 00 6 5 Engrapz, variable c and D are Elimenated and OI 0 in group 3 variable A and Dare Diminated. 15 (0) 01 i we get simplified pos Expression Group 2 10 (0) Group2 GIVOUP 3 Groups AB CD CD 13 Group3 ON 00 0 00 BCD

$$\overline{y} = \overline{B}\overline{D} + AB + BC$$

$$y = \overline{y} = \overline{B}\overline{D} + AB + BC$$

$$y = \overline{B}\overline{D} + AB + BC$$

$$y = \overline{B}\overline{D} + AB + BC$$

$$y = \overline{B}\overline{D} + AB + BC$$

Incompletely specified functions (Don't Care-terms correconditions)

|                                 | est.     | · [ ] | B-() | CI   | ¥   |
|---------------------------------|----------|-------|------|------|-----|
|                                 | 2        |       | 174  | 1 1  |     |
|                                 | 1:11     | 0     | 0 .  | 0    |     |
|                                 |          | 101   | 0    | 11 4 | 0   |
|                                 | f - 1    |       | 1    | 0    | 0   |
| 1 0 0                           |          | 1     | \ '  | 1    |     |
| 11 3 3 X 3 3 13 1 1 1 1 1 1 1 1 | v. 181   | +84 1 | 101  | 0    | 0   |
|                                 | in a a v | \ 1   | 0    | 1    | 1,0 |
|                                 | 1        |       | 1    | 0    | X   |
|                                 |          | 1 1   | )    | 1    | X   |

|          |     | 4  |    |                |       |      | 1 1       |
|----------|-----|----|----|----------------|-------|------|-----------|
| Sol:     | BC  | 00 | 01 | .11            | 10    |      |           |
| <u>.</u> | 700 | ,0 | 1  | 1              | 2     |      | 1         |
| 0        | 10  | 4  | 5  | <del>+</del> X | X 6   |      |           |
|          | 9   | 1  | 3  | 1              | Doult | Care | Condition |
|          | 100 |    |    | 43             |       |      |           |

| BC  | 00 | 01   | 11    | 10    |
|-----|----|------|-------|-------|
| A T | 0  | 71+  | -,3   | 2     |
| 0 \ |    | 15   | +     | -6    |
| , 1 | 4  | 1:1  | 100   | 0     |
| 1)  |    | 12-1 | -     | 2.01  |
| ſ   | 3  |      | 7 01  | oup 1 |
| 1   | 4  | [.   | 1 = 0 | 1     |
| - / |    | 11   | 4     | 1     |

# Describing Incomplète Boolean Function

| B | C  |        |    |    |
|---|----|--------|----|----|
| A | 00 | 01     | 11 | 10 |
| 0 |    | 1<br>X | 3  | 2  |
| 1 | 4  | 5<br>X | 7  | 6  |

| 1     | -      |
|-------|--------|
| Gu,   | Gra    |
| 511   |        |
| A BC  | AN B C |
| 0/00  | 0 0    |
| 100   | 0/0/1  |
| - 5 T |        |
| BC    | ty R   |

| 1   | Bi Bi | E BC | BC  | BL | <b>B</b> |
|-----|-------|------|-----|----|----------|
| b / | 00    | 01   | 11  | 10 | Groups   |
| A O |       | D    | (   | 0  | 7        |
| A I | 1     | 4    | - 1 |    |          |
|     | Group | p1   |     | 1  |          |

| ./.    | 0  | =   | AB | 1 BC | 11   |
|--------|----|-----|----|------|------|
|        | A  |     |    |      | BIBT |
|        | 0  | 0   | 0  |      |      |
|        | 0  | . 0 | 1  |      |      |
| 1(1,3) | l, | 0   | 0  |      |      |

| (99) | FCAIB  | (2) = TH  | (2,5 | 1774 | 4(1,3) |
|------|--------|-----------|------|------|--------|
| 171/ | TCHILD | ( ) - ( ) | ()   |      |        |

| A BC | 00 | 01 | 11 | 10 |
|------|----|----|----|----|
| -    | 0  | 1  | 3  | 2  |
| 0    |    | X  | X  | O  |
| -    | 4  | 5  | 7  | 6  |
| 1    |    | 0  | 0  |    |

| Gi    | City, G | 13     |
|-------|---------|--------|
| ABC   | A BC    | A B (  |
| 0/21  | ĺ√þ 1   | 0 1 01 |
| 1/00/ | 1/1     | ABC    |
| BR    | AC      | ) .    |
|       |         |        |

## pon't care conditions en logic Design

|                                     |          |                    |              |                  | /         |
|-------------------------------------|----------|--------------------|--------------|------------------|-----------|
|                                     | A        | B                  | С            | D                | P         |
| 0                                   | 9        | 0                  | 0            | 0                | 0         |
| 1                                   | 0        | 0                  | 0            | 23               | 1         |
| 2                                   | 0        | 0                  | 1            | 0                | 1         |
| 3                                   | D        | 0 1                | 1            | 1                | 0         |
| 4                                   | 0        | 1                  | 0            | 0)               | 11(1)     |
| _                                   | 90000000 | 1.                 | 0            | 1                | 到。日日日日日11 |
| 6                                   | 0        | 1                  | 1            | 0                | 0         |
| 7                                   | 0        | 1                  | 1            | 1                | (1)       |
| 8                                   | 01       | 0                  | 0            | 0                | 1         |
| 9                                   | 1 6 1    | 0                  | 0            | 1                | 0         |
| 10                                  | 161      | 0                  | 1            | 0                | -         |
| 1)                                  | 1 1 1    | 0                  | 1            | 1                | - 1       |
| 12                                  | 01       | . 1                | 0            | 0                | -         |
| 13                                  | 1        | 1                  | 0            | 1                | -         |
| 14                                  | 1        | 000011111000011111 | 001100110011 | 0404040404040404 | -         |
| 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 |          | 1                  | 11           | 1                | Ti        |
| -1                                  |          | 1                  | 1            |                  | 1         |

P = \( \Sin \left( 1,2,4,7,8 \right) + d(10,11,12,13,14,15 \right) \)

Ex: find the reduced Sop form of the following function.

\[
\text{f(A1Br(1D)} = \( \Sin \text{(1,3,7,11,15)} + \( \text{d(0,2,4)} \)

\[
\text{f(A1B,(1D)} = \( \Sin \text{(1,3,7,11,15)} + \( \text{d(0,2,4)} \)
\]

| į. |                   |               |                                         |                           |
|----|-------------------|---------------|-----------------------------------------|---------------------------|
| 00 | 01                | 11            | 17                                      | )                         |
| X  | (1)               | (1)3          | X                                       |                           |
| ч× | 5                 | 7             | 6                                       | ,                         |
| 18 | 13                | (1)           | 4                                       |                           |
| 8  | 9                 | (1)           | 10                                      |                           |
|    | X<br>X<br>X<br>I8 | X 1 5 X 18 13 | X D D T T T T T T T T T T T T T T T T T | X D D X X Y S F 6 X D S M |

|       |    |      | (-    | nrou42 |          |               |
|-------|----|------|-------|--------|----------|---------------|
| AB CD | 00 | 01   | 11    | 10     | GIL      | G12<br>A B (0 |
| 00    | 1  | 1_   | 1     | 1)     | AB (D    | 0000          |
| - 01  | 0  | •    | 1     | 13     | 0.11     | 0010          |
| []    |    |      | 111   |        | (0       | ĀB            |
| 10    |    | 40   | 12    | I= A   | <u>.</u> |               |
|       | A  | 16 G | Group | 1      |          |               |

.. Y = AB+(D







| 100      |    | 1.        | _ |     |     |    | 1      |
|----------|----|-----------|---|-----|-----|----|--------|
| <u>G</u> | N. |           |   | G12 |     |    |        |
| A        | В  | C         | D | A   | В   | C  | Ď      |
| 0        | 1  | 0         | 0 | 0   | 1   | .1 | O      |
| 0        | 1  | 0         | ) | C   | 1 ( | 1  | 1      |
| 1        | 8  | 0         | 0 |     | B   | 1  | 0      |
| -1       | *  | 0         | ) |     | 1 0 | 1_ | 23/0-2 |
|          | В  | $\bar{c}$ |   | _   | В   | c  | _      |
|          |    |           |   |     | -   |    | _      |

Ex: Reduce the following function by using k-map technique

F(A,B,C,D) = ATH(0,3,4,7,8,10,12,14)+ and (2,6)

| ABED | 00      | DI | 11  | 10   |
|------|---------|----|-----|------|
| 00   | 0       | 1  | 0 3 | Xa   |
| 01   | . 0     | 5  | 0   | ×    |
| li   | D<br>12 | 13 | 15  | 0 14 |
| 10   | 0 8     | 9  | 1   | 0 0  |

|             |    |    | Gra  | mbs Que | 71         |
|-------------|----|----|------|---------|------------|
| _           | 00 | 01 | 11   | 10/     | ABCD       |
| 00          | 0  |    | 0    | 1       | 0 0 10     |
| 01          | 0  |    | 0    | 0       | 1 010      |
| <u>-</u> 11 | 0  | 01 | 17   | 0       | 1 980 B CO |
| 10          | 0  | 1  | 1 1- | 0_      | 0 0 10     |
| 1           |    | B  |      | ú       | 0 110      |

$$\overline{Y} = \overline{D} + \overline{A} C$$

$$\overline{Y} = \overline{D} + \overline{A} C$$







COMP - CHAM (PR)



Extraplementation the following Boolean Expression with NAND-NAND logic. F(A,B,C) = Em(0,1,3,5)



FCA(B(C) = AB+AC+BC

## NAND - NAND



Ex: DAind the reduced pos form of the following Equation F(A1B161D)= Sm(11317,11115)+d(01215) Implement and using NAND LOGEC ?

| AB CD      | 00  | 01   | 11   | 10   |
|------------|-----|------|------|------|
| 00         | X   | 1,   | 13   | X    |
| 01         | 0 4 | × 5  | 1,   | 06   |
| <b>f</b> r | 0 2 | 0 13 | 1 15 | 0 14 |
| 10         | 08  | 0 9  | 1,11 | 010  |

| 100  |     |    |    |        |
|------|-----|----|----|--------|
| AB   | 60  | 01 | 11 | 10     |
| 00   | 0;  |    |    | 10     |
| 01   | 0   | X  |    | O,     |
| 11.  | -0, | O) |    | 0      |
| 10   | 10- | 0  |    | 0-     |
|      | Gra | P2 | 6  | in out |
| 11.3 |     |    |    |        |

| Gii     | Giz  |
|---------|------|
| A B (D) | ABCD |
| 0100    | 1000 |
| 1.000   | 1001 |
| 0110    | AC   |
| 1010    |      |

In Whitele - Club

$$\vec{y} = \vec{D} + \vec{A}\vec{C}$$

$$\vec{y} = \vec{D} + \vec{A}\vec{C}$$

$$\vec{y} = \vec{D} \cdot (\vec{A}\vec{C})$$

using k-map, Determine the minimal sop Expression and resize the simplified Expresion using NAND logic? f(N,x,y,7)=TIM(0,2,3,7,8;9,10)

E1:

X



$$Y = (\overline{A} \overline{c} \cdot D)$$

$$= (\overline{A} + \overline{c}) \cdot D$$

$$Y = (\overline{A} + C) \cdot D / C$$

E1: To amplement X-OR gate by using NAND-NAND logge gate

Y = AB+ AB
The x-or gate function is two inputs and one output. The ilp variables are AandB
The x-or gate function is two inputs and one output. The ilp variables are AandB

( S) MARIO - CHARLES



$$\sqrt{=\overline{AB}+\overline{\overline{AB}}}$$





NOR-NOR Implementation: M.JOSHNA (F) OR-AND (9) NOR-Bubbled AND (971) NOR-NOR Ext Y= (A+B+c), (D+E). F 3) OR-AND A+B+C B D+E (9) NOR-Bubbled AND (A+B+() (A+B+C), (D+E), F =(A+B+C) (D+F).F (O+E) NOR-NOR 999) (A+B+C) Y= (A+B+C) + (D+E) + F D+E (A+B+C).(D+E).F [: A+B = A.B]

[A= =

M.JOSHNA Ex - Implement the following Bookan function with NOR-NOR logic Y = AC +BC+ ABYD The given Boolean Expression Y = AC+BC+AB+D A Samption, By using Duality theorem AC+BC+ AB+D = (A) · (BC) · (AB) D = (±+5). (£+5). D (9) OR-AND (T+T) (B+c) (F1B) D NOR- Bubbled-AND 9 A+C B+T (A+C) (B+C).(A+B).D A+B (999) Ā+Ē B+C A+C+ B+C + A+B +D A+B Y= (A+c). (B+c). (A+B).D D

5



Implement the X-

P-TIMU

$$y = (\overline{A+B}) \cdot (\overline{B+A})$$

. VA 0 318

Tas of the second

alo B - aller - chord

Lite in Mibles - Be- Har

mond phrong in

Combinational circuit is a circuit in which we combine the different gates in the circuit, for example encoder, decoder, multiplexer and demultiplexer. Some of the characteristics of combinational circuits are following –

- 1. The output of combinational circuit at any instant of time, depends only on the levels present at input terminals.
- 2. The combinational circuit do not use any memory. The previous state of input does not have any effect on the present state of the circuit.
- 3. A combinational circuit can have an n number of inputs and m number of outputs.



#### Half Adder

Half adder is a combinational logic circuit with two inputs and two outputs. The half adder circuit is designed to add two single bit binary number A and B. It is the basic building block for addition of two **single** bit numbers. This circuit has two outputs





## Half adder using NAND gates



## Half Adder using NOR gates



#### Full Adder

Full adder is developed to overcome the drawback of Half Adder circuit. It can add two one-bit numbers A and B, and carry c. The full adder is a three input and two output combinational circuit.

Block diagram Truth Table Circuit Diagram

#### M.JOSHNA



|   | Inputs | Output |      |
|---|--------|--------|------|
| А | В      | Cin    | S Co |
| 0 | 0      | 0      | 0 0  |
| 0 | 0      | 1      | 1 0  |
| 0 | 1      | 0      | 1 0  |
| 0 | 1      | 1      | 0 1  |
| 1 | 0      | 0      | 1 0  |
| 1 | 0      | 1      | 0 1  |
| 1 | 1      | 0      | 0 1  |
| 1 | 1      | 1      | 1 1  |



## Full Adder using NAND gate

A NAND gate is one kind of universal gate, used to execute any kind of logic design. The FA circuit with the NAND gates diagram is shown below.



Full Adder using NOR gate



#### N-Bit Parallel Adder

The Full Adder is capable of adding only two single digit binary number along with a carry input. But in practical we need to add binary numbers which are much longer than just one bit. To add two n-bit binary numbers we need to use the n-bit parallel adder. It uses a number of full adders in cascade. The carry output of the previous full adder is connected to carry input of the next full adder. 4 Bit Parallel Adder In the block diagram, Ao and Bo represent the LSB of the four bit words A and B. Hence Full Adder-0 is the lowest stage. Hence its Cin has been permanently made 0. The rest of the connections are exactly same as those of n-bit parallel adder is shown in fig. The four bit parallel adder is a very common logic circuit.

#### Block diagram



**Combinational circuits** consist of Logic gates. These circuits operate with binary values.

## **Binary Adder**

The most basic arithmetic operation is addition. The circuit, which performs the addition of two binary numbers is known as **Binary adder**. First, let us implement an adder, which performs the addition of two bits.

#### Half Adder

Half adder is a combinational circuit, which performs the addition of two binary numbers A and B are of **single bit**. It produces two outputs sum, S & carry, C.

The **Truth table** of Half adder is shown below.

| Inp | outs | Outputs |   |  |
|-----|------|---------|---|--|
| А   | В    | С       | S |  |
| 0   | 0    | 0       | 0 |  |
| 0   | 1    | 0       | 1 |  |
| 1   | 0    | 0       | 1 |  |
| 1   | 1    | 1       | 0 |  |

When we do the addition of two bits, the resultant sum can have the values ranging from 0 to 2 in decimal. We can represent the decimal digits 0 and 1 with single bit in binary. But, we can't represent decimal digit 2 with single bit in binary. So, we require two bits for representing it in binary.

Let, sum, S is the Least significant bit and carry, C is the Most significant bit of the resultant sum. For first three combinations of inputs, carry, C is zero and the value of S will be either zero or one based on the **number of ones** present at the inputs. But, for last combination of inputs, carry, C is one and sum, S is zero, since the resultant sum is two.

From Truth table, we can directly write the Boolean functions for each output as

$$S=A \oplus BS=A \oplus B$$

#### C=ABC=AB

We can implement the above functions with 2-input Ex-OR gate & 2-input AND gate. The **circuit diagram** of Half adder is shown in the following figure.



In the above circuit, a two input Ex-OR gate & two input AND gate produces sum, S & carry, C respectively. Therefore, Half-adder performs the addition of two bits. Full Adder

Full adder is a combinational circuit, which performs the **addition of three bits** A, B and C<sub>in</sub>. Where, A & B are the two parallel significant bits and C<sub>in</sub> is the carry bit, which is generated from previous stage. This Full adder also produces two outputs sum, S & carry, C<sub>out</sub>, which are similar to Half adder.

| The  | Truth | table | of Full  | adder | is shown    | helow  |
|------|-------|-------|----------|-------|-------------|--------|
| 1110 | HUULI | Labie | OI I UII | auuei | 13 31100011 | DEIDW. |

|   | Inputs |                 | Outpu | S |
|---|--------|-----------------|-------|---|
| Α | В      | C <sub>in</sub> | Cout  | S |
| 0 | 0      | 0               | 0     | 0 |
| 0 | 0      | 1               | 0     | 1 |
| 0 | 1      | 0               | 0     | 1 |
| 0 | 1      | 1               | 1     | 0 |
| 1 | 0      | 0               | 0     | 1 |
| 1 | 0      | 1               | 1     | 0 |
| 1 | 1      | 0               | 1     | 0 |
| 1 | 1      | 1               | 1     | 1 |

When we do the addition of three bits, the resultant sum can have the values ranging from 0 to 3 in decimal. We can represent the decimal digits 0 and 1 with single bit in binary. But, we can't represent the decimal digits 2 and 3 with single bit in binary. So, we require two bits for representing those two decimal digits in binary.

Let, sum, S is the Least significant bit and carry,  $C_{\text{out}}$  is the Most significant bit of resultant sum. It is easy to fill the values of outputs for all combinations of inputs in the truth table. Just count the **number of ones** present at the inputs and write the equivalent binary number at outputs. If  $C_{\text{in}}$  is equal to zero, then Full adder truth table is same as that of Half adder truth table.

We will get the following **Boolean functions** for each output after simplification.

$$S=A\bigoplus B\bigoplus C_{in}S=A\bigoplus B\bigoplus C_{in}$$

 $c_{out} = AB + (A \oplus B)c_{in}cout = AB + (A \oplus B)c_{in}$ 

The sum, S is equal to one, when odd number of ones present at the inputs. We know that Ex-OR gate produces an output, which is an odd function. So, we can use either

two 2input Ex-OR gates or one 3-input Ex-OR gate in order to produce sum, S. We can implement carry,  $C_{\text{out}}$  using two 2-input AND gates & one OR gate. The **circuit diagram** of Full adder is shown in the following figure.



This adder is called as **Full adder** because for implementing one Full adder, we require two Half adders and one OR gate. If C<sub>in</sub> is zero, then Full adder becomes Half adder. We can verify it easily from the above circuit diagram or from the Boolean functions of outputs of Full adder.

## 4-bit Binary Adder

The 4-bit binary adder performs the **addition of two 4-bit numbers**. Let the 4-bit binary numbers,  $A=A_3A_2A_1A_0A=A_3A_2A_1A_0$  and  $B=B_3B_2B_1B_0B=B_3B_2B_1B_0$ . We can implement 4-bit binary adder in one of the two following ways.

- 1. Use one Half adder for doing the addition of two Least significant bits and three Full adders for doing the addition of three higher significant bits.
- 2. Use four Full adders for uniformity. Since, initial carry C<sub>in</sub> is zero, the Full adder which is used for adding the least significant bits becomes Half adder.

For the time being, we considered second approach. The **block diagram** of 4-bit binary adder is shown in the following figure.



Here, the 4 Full adders are cascaded. Each Full adder is getting the respective bits of two parallel inputs A & B. The carry output of one Full adder will be the carry input of subsequent higher order Full adder. This 4-bit binary adder produces the resultant sum having at most 5 bits. So, carry out of last stage Full adder will be the MSB.

In this way, we can implement any higher order binary adder just by cascading the required number of Full adders. This binary adder is also called as **ripple carry** binarybinary **adder** because the carry propagates ripplesripples from one stage to the next stage.

# **Binary Subtractor**

The circuit, which performs the subtraction of two binary numbers is known as **Binary subtractor**. We can implement Binary subtractor in following two methods.

- 1. Cascade Full subtractors
- 2. 2's complement method

In first method, we will get an n-bit binary subtractor by cascading 'n' Full subtractors. So, first you can implement Half subtractor and Full subtractor, similar to Half adder & Full adder. Then, you can implement an n-bit binary subtractor, by cascading 'n' Full subtractors. So, we will be having two separate circuits for binary addition and subtraction of two binary numbers.

In second method, we can use same binary adder for subtracting two binary numbers just by doing some modifications in the second input. So, internally binary addition operation takes place but, the output is resultant subtraction.

We know that the subtraction of two binary numbers A & B can be written as,

$$A-B=A+(2 \cdot scompliment of B)A-B=A+(2 \cdot scompliment of B)$$

 $\Rightarrow$ A-B=A+(1'scomplimentofB)+1 $\Rightarrow$ A-B=A+(1'scomplimentofB)+1

# 4-bit Binary Subtractor

The 4-bit binary subtractor produces the **subtraction of two 4-bit numbers**. Let the 4bit binary numbers,  $A=A_3A_2A_1A_0A=A_3A_2A_1A_0$  and  $B=B_3B_2B_1B_0B=B_3B_2B_1B_0$ . Internally, the operation of 4-bit Binary subtractor is similar to that of 4-bit Binary adder. If the normal bits of binary number A, complemented bits of binary number B and initial carry borrowborrow,  $C_{in}$  as one are applied to 4-bit Binary adder, then it becomes 4-bit Binary subtractor. The **block diagram** of 4-bit binary subtractor is shown in the following figure.



This 4-bit binary subtractor produces an output, which is having at most 5 bits. If Binary number A is greater than Binary number B, then MSB of the output is zero and the remaining bits hold the magnitude of A-B. If Binary number A is less than Binary number B, then MSB of the output is one. So, take the 2's complement of output in order to get the magnitude of A-B.

In this way, we can implement any higher order binary subtractor just by cascading the required number of Full adders with necessary modifications.

# Binary Adder / Subtractor

The circuit, which can be used to perform either addition or subtraction of two binary numbers at any time is known as **Binary Adder / subtractor**. Both, Binary adder and Binary subtractor contain a set of Full adders, which are cascaded. The input bits of binary number A are directly applied in both Binary adder and Binary subtractor.

There are two differences in the inputs of Full adders that are present in Binary adder and Binary subtractor.

- 1. The input bits of binary number B are directly applied to Full adders in Binary adder, whereas the complemented bits of binary number B are applied to Full adders in Binary subtractor.
- 2. The initial carry,  $C_0 = 0$  is applied in 4-bit Binary adder, whereas the initial carry borrowborrow,  $C_0 = 1$  is applied in 4-bit Binary subtractor.

We know that a **2-input Ex-OR gate** produces an output, which is same as that of first input when other input is zero. Similarly, it produces an output, which is complement of first input when other input is one.

Therefore, we can apply the input bits of binary number B, to 2-input Ex-OR gates. The other input to all these Ex-OR gates is  $C_0$ . So, based on the value of  $C_0$ , the Ex-OR gates produce either the normal or complemented bits of binary number B.

# 4-bit Binary Adder / Subtractor

The 4-bit binary adder / subtractor produces either the addition or the subtraction of two 4-bit numbers based on the value of initial carry or borrow,  $C_0$ . Let the 4-bit binary

numbers,  $A=A_3A_2A_1A_0A=A_3A_2A_1A_0$  and  $B=B_3B_2B_1B_0B=B_3B_2B_1B_0$ . The operation of 4-bit Binary adder / subtractor is similar to that of 4-bit Binary adder and 4-bit Binary subtractor.

Apply the normal bits of binary numbers A and B & initial carry or borrow, C<sub>0</sub> from externally to a 4-bit binary adder. The **block diagram** of 4-bit binary adder / subtractor is shown in the following figure.



If initial carry,  $C_0$  is zero, then each full adder gets the normal bits of binary numbers A & B. So, the 4-bit binary adder / subtractor produces an output, which is the addition of two binary numbers A & B.

If initial borrow,  $C_0$  is one, then each full adder gets the normal bits of binary number A & complemented bits of binary number B. So, the 4-bit binary adder / subtractor produces an output, which is the **subtraction of two binary numbers** A & B.

Therefore, with the help of additional Ex-OR gates, the same circuit can be used for both addition and subtraction of two binary numbers.

## N-Bit Parallel Subtractor

The subtraction can be carried out by taking the 1's or 2's complement of the number to be subtracted. For example we can perform the subtraction (A-B) by adding either 1's or 2's complement of B to A. That means we can use a binary adder to perform the binary subtraction.

#### 4 Bit Parallel Subtractor

The number to be subtracted (B) is first passed through inverters to obtain its 1's complement. The 4-bit adder then adds A and 2's complement of B to produce the subtraction.  $S_3$   $S_2$   $S_1$   $S_0$  represents the result of binary subtraction (A-B) and carry output  $C_{out}$  represents the polarity of the result. If A > B then Cout = 0 and the result of binary form (A-B) then  $C_{out}$  = 1 and the result is in the 2's complement form.

## Block diagram



## Half Subtractors

Half subtractor is a combination circuit with two inputs and two outputs (difference and borrow). It produces the difference between the two binary bits at the input and also produces an output (Borrow) to indicate if a 1 has been borrowed. In the subtraction (AB), A is called as Minuend bit and B is called as Subtrahend bit.

#### **Truth Table**



| Inpu | its | Output  |        |  |  |
|------|-----|---------|--------|--|--|
| Α    | В   | (A - B) | Borrow |  |  |
| 0    | 0   | 0       | 0      |  |  |
| 0    | 1   | 1       | 1      |  |  |
| 1    | 0   | 1       | 0      |  |  |
| 1    | 1   | 0       | 0      |  |  |



# **Full Subtractors**

The disadvantage of a half subtractor is overcome by full subtractor. The full subtractor is a combinational circuit with three inputs A,B,C and two output D and C'. A is the 'minuend', B is 'subtrahend', C is the 'borrow' produced by the previous stage, D is the difference output and C' is the borrow output.

#### **Truth Table**

| 4 | Inputs | 5 | Output  |    |
|---|--------|---|---------|----|
| A | В      | С | (A-B-C) | C, |
| О | 0      | 0 | 0       | 0  |
| ) | 0      | 1 | 1       | 1  |
| ) | 1      | 0 | 1       | 1  |
| ) | 1      | 1 | 0       | 1  |
|   | 0      | 0 | 1       | 0  |
| L | 0      | 1 | 0       | 0  |
| 1 | 1      | 0 | 0       | 0  |
| L | 1      | 1 | 1       | 1  |

## Circuit Diagram



**Decoder** is a combinational circuit that has 'n' input lines and maximum of 2<sup>n</sup> output lines. One of these outputs will be active High based on the combination of inputs present, when the decoder is enabled. That means decoder detects a particular code. The outputs of the decoder are nothing but the **min terms** of 'n' input variables lineslines, when it is enabled.

# Decoder

A decoder is a combinational circuit. It has n input and to a maximum m = 2n outputs. Decoder is identical to a demultiplexer without any data input. It performs operations which are exactly opposite to those of an encoder.

## Block diagram



Examples of Decoders are following.

- 1. Code converters
- 2. BCD to seven segment decoders

- 3. Nixie tube decoders
- 4. Relay actuator

## 2 to 4 Line Decoder

The 2-to-4 line binary decoder **consists of an array of four AND gates. The 2 binary inputs labelled A and B are decoded into one of 4 outputs**, hence the description of 2-to-4 binary decoder. Each output represents one of the minterms of the 2 input variables,

The block diagram of 2 to 4 line decoder is shown in the fig. A and B are the two inputs where D through D are the four outputs. Truth table explains the operations of a decoder. It shows that each output is 1 for only a specific combination of inputs.



### 3to8 line Decoder



How do you implement a half adder using a 2to4 line decoder?

By connecting an OR gate with output line 1 & 2 of 2X4 Decoder. Half Adder can be implemented with 2X4 decoder. Similarly by connecting two Half Adders, we can form a Full Adder by using 2, 2X4 Decoder

Full Adder implementation using 2to8 decoder



Fig1: Full Adder Implementation using 3:8 decoder

Full adder using decoder and NAND gates



## 2 to 4 Decoder

Let 2 to 4 Decoder has two inputs  $A_1$  &  $A_0$  and four outputs  $Y_3$ ,  $Y_2$ ,  $Y_1$  &  $Y_0$ . The **block diagram** of 2 to 4 decoder is shown in the following figure.



One of these four outputs will be '1' for each combination of inputs when enable, E is '1'. The **Truth table** of 2 to 4 decoder is shown below.

| Enable | Inputs                |    | Outputs               |                       |                       |    |
|--------|-----------------------|----|-----------------------|-----------------------|-----------------------|----|
| Е      | <b>A</b> <sub>1</sub> | Ao | <b>Y</b> <sub>3</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | Yo |
| 0      | х                     | х  | 0                     | 0                     | 0                     | 0  |
| 1      | 0                     | 0  | 0                     | 0                     | 0                     | 1  |
| 1      | 0                     | 1  | 0                     | 0                     | 1                     | 0  |
| 1      | 1                     | 0  | 0                     | 1                     | 0                     | 0  |
| 1      | 1                     | 1  | 1                     | 0                     | 0                     | 0  |

From Truth table, we can write the **Boolean functions** for each output as

$$Y_3=E.A_1.A_0Y_3=E.A_1.A_0$$
 $Y_2=E.A_1.A_0'Y_2=E.A_1.A_0'$ 
 $Y_1=E.A_1'.A_0Y_1=E.A_1'.A_0$ 
 $Y_0=E.A_1'.A_0'Y_0=E.A_1'.A_0'$ 

Each output is having one product term. So, there are four product terms in total. We can implement these four product terms by using four AND gates having three inputs

each & two inverters. The **circuit diagram** of 2 to 4 decoder is shown in the following figure.



Therefore, the outputs of 2 to 4 decoder are nothing but the **min terms** of two input variables  $A_1$  &  $A_0$ , when enable, E is equal to one. If enable, E is zero, then all the outputs of decoder will be equal to zero.

Similarly, 3 to 8 decoder produces eight min terms of three input variables  $A_2$ ,  $A_1$  &  $A_0$  and 4 to 16 decoder produces sixteen min terms of four input variables  $A_3$ ,  $A_2$ ,  $A_1$  &  $A_0$ .

# Implementation of Higher-order Decoders

Now, let us implement the following two higher-order decoders using lower-order decoders.

- 1. 3 to 8 decoder
- 2. 4 to 16 decoder

#### 3 to 8 Decoder

In this section, let us implement **3 to 8 decoder using 2 to 4 decoders**. We know that 2 to 4 Decoder has two inputs,  $A_1$  &  $A_0$  and four outputs,  $Y_3$  to  $Y_0$ . Whereas, 3 to 8 Decoder has three inputs  $A_2$ ,  $A_1$  &  $A_0$  and eight outputs,  $Y_7$  to  $Y_0$ .

We can find the number of lower order decoders required for implementing higher order decoder using the following formula.

 $Required number of lower order decoders = m_2 m_1 \\ Required number of lower order decoder \\ s = m_2 \\ m_3 \\ Required number of lower order decoder \\ s = m_3 \\ m_4 \\ Required number of lower order decoder \\ s = m_3 \\ m_4 \\ Required number of lower order decoder \\ s = m_3 \\ m_4 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number order \\ s = m_3 \\ Required number order \\ s = m_3 \\ Requir$ 

2m1

Where,

 $m_1m_1$  is the number of outputs of lower order decoder.  $m_2m_2$  is the number of outputs of higher order decoder. Here,  $m_1m_1 = 4$  and  $m_2m_2 = 8$ . Substitute, these two values in the above formula.

Requirednumber of 2 to 4 decoders = 84 = 2 Required number of 2 to 4 decoders = 84 = 2

Therefore, we require two 2 to 4 decoders for implementing one 3 to 8 decoder. The **block diagram** of 3 to 8 decoder using 2 to 4 decoders is shown in the following figure.



The parallel inputs  $A_1$  &  $A_0$  are applied to each 2 to 4 decoder. The complement of input  $A_2$  is connected to Enable, E of lower 2 to 4 decoder in order to get the outputs,  $Y_3$  to  $Y_0$ . These are the **lower four min terms**. The input,  $A_2$  is directly connected to Enable, E of upper 2 to 4 decoder in order to get the outputs,  $Y_7$  to  $Y_4$ . These are the **higher four min terms**.

#### 4 to 16 Decoder

In this section, let us implement **4 to 16 decoder using 3 to 8 decoders**. We know that 3 to 8 Decoder has three inputs  $A_2$ ,  $A_1$  &  $A_0$  and eight outputs,  $Y_7$  to  $Y_0$ . Whereas, 4 to 16 Decoder has four inputs  $A_3$ ,  $A_2$ ,  $A_1$  &  $A_0$  and sixteen outputs,  $Y_{15}$  to  $Y_0$ 

We know the following formula for finding the number of lower order decoders required.

 $Required number of lower order decoders = m_2 m_1 \\ Required number of lower order decoder \\ s = m_2 \\ m_3 \\ Required number of lower order decoder \\ s = m_3 \\ m_4 \\ Required number of lower order decoder \\ s = m_3 \\ m_4 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order decoder \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number of lower order \\ s = m_3 \\ Required number order \\ s = m_3 \\ Required number order \\ s = m_3 \\ Re$ 

2m1

Substitute,  $m_1m_1 = 8$  and  $m_2m_2 = 16$  in the above formula.

Requirednumber of 3 to 8 decoders = 168 = 2 Required number of 3 to 8 decoders = 168 = 2

Therefore, we require two 3 to 8 decoders for implementing one 4 to 16 decoder. The **block diagram** of 4 to 16 decoder using 3 to 8 decoders is shown in the following figure.



The parallel inputs  $A_2$ ,  $A_1$  &  $A_0$  are applied to each 3 to 8 decoder. The complement of input, A3 is connected to Enable, E of lower 3 to 8 decoder in order to get the outputs,  $Y_7$  to  $Y_0$ . These are the **lower eight min terms**. The input,  $A_3$  is directly connected to Enable, E of upper 3 to 8 decoder in order to get the outputs,  $Y_{15}$  to  $Y_8$ . These are the **higher eight min terms**.

## **Encoder**

Encoder is a combinational circuit which is designed to perform the inverse operation of the decoder. An encoder has n number of input lines and m number of output lines. An encoder produces an m bit binary code corresponding to the digital input number. The encoder accepts an n input digital word and converts it into an m bit another digital word.



Examples of Encoders are following.

1. Priority encoders

- 2. Decimal to BCD encoder
- 3. Octal to binary encoder
- 4. Hexadecimal to binary encoder



8 to 3 Line Encoder Using OR Gate

# **Priority Encoder**

This is a special type of encoder. Priority is given to the input lines. If two or more input line are 1 at the same time, then the input line with highest priority will be considered. There are four input  $D_0$ ,  $D_1$ ,  $D_2$ ,  $D_3$  and two output  $Y_0$ ,  $Y_1$ . Out of the four input  $D_3$  has the highest priority and  $D_0$  has the lowest priority. That means if  $D_3 = 1$  then  $Y_1$   $Y_1 = 11$  irrespective of the other inputs. Similarly if  $D_3 = 0$  and  $D_2 = 1$  then  $Y_1$   $Y_0 = 10$  irrespective of the other inputs.

Block diagram Truth Table Logic Circuit







An **Encoder** is a combinational circuit that performs the reverse operation of Decoder. It has maximum of 2<sup>n</sup> input lines and 'n' output lines. It will produce a binary code equivalent to the input, which is active High. Therefore, the encoder encodes 2<sup>n</sup> input lines with 'n' bits. It is optional to represent the enable signal in encoders.

## 4 to 2 Encoder

Let 4 to 2 Encoder has four inputs  $Y_3$ ,  $Y_2$ ,  $Y_1$  &  $Y_0$  and two outputs  $A_1$  &  $A_0$ . The **block diagram** of 4 to 2 Encoder is shown in the following figure.



At any time, only one of these 4 inputs can be '1' in order to get the respective binary code at the output. The **Truth table** of 4 to 2 encoder is shown below.

|                       | Inp                   | Outputs               |                |                       |    |
|-----------------------|-----------------------|-----------------------|----------------|-----------------------|----|
| <b>Y</b> <sub>3</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | Y <sub>0</sub> | <b>A</b> <sub>1</sub> | Ao |
| 0                     | 0                     | 0                     | 1              | 0                     | 0  |

| 0 | 0 | 1 | 0 | 0 | 1 |
|---|---|---|---|---|---|
| 0 | 1 | 0 | 0 | 1 | 0 |
| 1 | 0 | 0 | 0 | 1 | 1 |

From Truth table, we can write the **Boolean functions** for each output as

$$A_1=Y_3+Y_2A_1=Y_3+Y_2$$
  
 $A_0=Y_3+Y_1A_0=Y_3+Y_1$ 

We can implement the above two Boolean functions by using two input OR gates. The **circuit diagram** of 4 to 2 encoder is shown in the following figure.



The above circuit diagram contains two OR gates. These OR gates encode the four inputs with two bits

# Octal to Binary Encoder

Octal to binary Encoder has eight inputs,  $Y_7$  to  $Y_0$  and three outputs  $A_2$ ,  $A_1$  &  $A_0$ . Octal to binary encoder is nothing but 8 to 3 encoder. The **block diagram** of octal to binary Encoder is shown in the following figure.



At any time, only one of these eight inputs can be '1' in order to get the respective binary code. The **Truth table** of octal to binary encoder is shown below.

| _                     | Inputs                |                       |                       |                       |                       |                       |                |                | Outputs               |                |
|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|----------------|----------------|-----------------------|----------------|
| <b>Y</b> <sub>7</sub> | <b>Y</b> <sub>6</sub> | <b>Y</b> <sub>5</sub> | <b>Y</b> <sub>4</sub> | <b>Y</b> <sub>3</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | Y <sub>0</sub> | $\mathbf{A}_2$ | <b>A</b> <sub>1</sub> | A <sub>0</sub> |
| 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 1              | 0              | 0                     | 0              |
| 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 1                     | 0              | 0              | 0                     | 1              |
| 0                     | 0                     | 0                     | 0                     | 0                     | 1                     | 0                     | 0              | 0              | 1                     | 0              |
| 0                     | 0                     | 0                     | 0                     | 1                     | 0                     | 0                     | 0              | 0              | 1                     | 1              |
| 0                     | 0                     | 0                     | 1                     | 0                     | 0                     | 0                     | 0              | 1              | 0                     | 0              |
| 0                     | 0                     | 1                     | 0                     | 0                     | 0                     | 0                     | 0              | 1              | 0                     | 1              |
| 0                     | 1                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0              | 1              | 1                     | 0              |
| 1                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0              | 1              | 1                     | 1              |

From Truth table, we can write the **Boolean functions** for each output as

$$A_2=Y_7+Y_6+Y_5+Y_4A_2=Y_7+Y_6+Y_5+Y_4$$
 $A_1=Y_7+Y_6+Y_3+Y_2A_1=Y_7+Y_6+Y_3+Y_2$ 
 $A_0=Y_7+Y_5+Y_3+Y_1A_0=Y_7+Y_5+Y_3+Y_1$ 

We can implement the above Boolean functions by using four input OR gates. The **circuit diagram** of octal to binary encoder is shown in the following figure.



The above circuit diagram contains three 4-input OR gates. These OR gates encode the eight inputs with three bits.

#### Drawbacks of Encoder

Following are the drawbacks of normal encoder.

- 1. There is an ambiguity, when all outputs of encoder are equal to zero. Because, it could be the code corresponding to the inputs, when only least significant input is one or when all inputs are zero.
- 2. If more than one input is active High, then the encoder produces an output, which may not be the correct code. For **example**, if both  $Y_3$  and  $Y_6$  are '1', then the encoder produces 111 at the output. This is neither equivalent code corresponding to  $Y_3$ , when it is '1' nor the equivalent code corresponding to  $Y_6$ , when it is '1'.

So, to overcome these difficulties, we should assign priorities to each input of encoder. Then, the output of encoder will be the binarybinary code corresponding to the active High inputss, which has higher priority. This encoder is called as **priority encoder**.

# **Priority Encoder**

A 4 to 2 priority encoder has four inputs  $Y_3$ ,  $Y_2$ ,  $Y_1$  &  $Y_0$  and two outputs  $A_1$  &  $A_0$ . Here, the input,  $Y_3$  has the highest priority, whereas the input,  $Y_0$  has the lowest priority. In this case, even if more than one input is '1' at the same time, the output will be the binarybinary code corresponding to the input, which is having **higher priority**.

We considered one more **output**, **V** in order to know, whether the code available at outputs is valid or not.

- 1. If at least one input of the encoder is '1', then the code available at outputs is a valid one. In this case, the output, V will be equal to 1.
- 2. If all the inputs of encoder are '0', then the code available at outputs is not a valid one. In this case, the output, V will be equal to 0.

The **Truth table** of 4 to 2 priority encoder is shown below.

|                       | Inp                   | uts                   | Outputs        |                       |                |   |
|-----------------------|-----------------------|-----------------------|----------------|-----------------------|----------------|---|
| <b>Y</b> <sub>3</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | Y <sub>0</sub> | <b>A</b> <sub>1</sub> | A <sub>0</sub> | V |
| 0                     | 0                     | 0                     | 0              | 0                     | 0              | 0 |
| 0                     | 0                     | 0                     | 1              | 0                     | 0              | 1 |
| 0                     | 0                     | 1                     | Х              | 0                     | 1              | 1 |
| 0                     | 1                     | Х                     | X              | 1                     | 0              | 1 |



Use 4 variable K-maps for getting simplified expressions for each output.



The simplified Boolean functions are

$$A_1 = Y_3 + Y_2 A_1 = Y_3 + Y_2$$

 $A_0 = Y_3 + Y_2 Y_1 A_0 = Y_3 + Y_2 Y_1$ 

Similarly, we will get the Boolean function of output, V as

$$V=Y_3+Y_2+Y_1+Y_0V=Y_3+Y_2+Y_1+Y_0$$

We can implement the above Boolean functions using logic gates. The **circuit diagram** of 4 to 2 priority encoder is shown in the following figure.



The above circuit diagram contains two 2-input OR gates, one 4-input OR gate, one 2input AND gate & an inverter. Here AND gate & inverter combination are used for

producing a valid code at the outputs, even when multiple inputs are equal to '1' at the same time. Hence, this circuit encodes the four inputs with two bits based on the **priority** assigned to each input.

# Multiplexers

Multiplexer is a special type of combinational circuit. There are n-data inputs, one output and m select inputs with 2m = n. It is a digital circuit which selects one of the n data inputs and routes it to the output. The selection of one of the n inputs is done by the selected inputs. Depending on the digital code applied at the selected inputs, one out of n data sources is selected and transmitted to the single output Y. E is called the strobe or enable input which is useful for the cascading. It is generally an active low terminal that means it will perform the required operation when it is low.

## Block diagram



Multiplexers come in multiple variations

1. 2:1 multiplexer

2. 4:1 multiplexer

3. 16:1 multiplexer

4. 32:1 multiplexer

# **Block Diagram**



## Truth Table

| Enable | Select | Output         |
|--------|--------|----------------|
| E      | S      | Y              |
| 0      | x      | 0              |
| 1      | 0      | Do             |
| 1      | 1      | D <sub>1</sub> |

# **Demultiplexers**

A demultiplexer performs the reverse operation of a multiplexer i.e. it receives one input and distributes it over several outputs. It has only one input, n outputs, m select input. At a time only one output line is selected by the select lines and the input is transmitted to the selected output line. A de-multiplexer is equivalent to a single pole multiple way switch as shown in fig.

Demultiplexers comes in multiple variations.

1. 1:2 demultiplexer

2. 1:4 demultiplexer

3. 1:16 demultiplexer

4. 1:32 demultiplexer

## Block diagram



**Truth Table** 

| Enable | Select | Output |  |
|--------|--------|--------|--|
| E      | S      | Y0 Y1  |  |
| 0      | x      | 0 0    |  |
| 1      | 0      | 0 Din  |  |
| 1      | 1      | Din 0  |  |

**Multiplexer** is a combinational circuit that has maximum of 2<sup>n</sup> data inputs, 'n' selection lines and single output line. One of these data inputs will be connected to the output based on the values of selection lines.

Since there are 'n' selection lines, there will be  $2^n$  possible combinations of zeros and ones. So, each combination will select only one data input. Multiplexer is also called as **Mux**.

# 4x1 Multiplexer

4x1 Multiplexer has four data inputs  $I_3$ ,  $I_2$ ,  $I_1$  &  $I_0$ , two selection lines  $s_1$  &  $s_0$  and one output Y. The **block diagram** of 4x1 Multiplexer is shown in the following figure.



One of these 4 inputs will be connected to the output based on the combination of inputs present at these two selection lines. **Truth table** of 4x1 Multiplexer is shown below.

| Selecti n      | Output |                |
|----------------|--------|----------------|
| S <sub>1</sub> | S₀     | Y              |
| 0              | 0      | I <sub>o</sub> |
| 0              | 1      | I <sub>1</sub> |
| 1              | 0      |                |
| 1              | 1      | l <sub>3</sub> |

From Truth table, we can directly write the **Boolean function** for output, Y as

$$Y = S_1 ' S_0 ' I_0 + S_1 ' S_0 I_1 + S_1 S_0 ' I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 ' I_0 + S_1 ' S_0 I_1 + S_1 S_0 ' I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 ' I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 ' S_0 I_1 + S_1 S_0 I_2 + S_1 S_0 I_3 Y = S_1 ' S_0 I_0 + S_1 S_0 I_1 + S_1 S_0 I_$$

We can implement this Boolean function using Inverters, AND gates & OR gate. The **circuit diagram** of 4x1 multiplexer is shown in the following figure.



We can easily understand the operation of the above circuit. Similarly, you can implement 8x1 Multiplexer and 16x1 multiplexer by following the same procedure.

# Implementation of Higher-order Multiplexers.

Now, let us implement the following two higher-order Multiplexers using lower-order Multiplexers.

• 8x1

Multiplexer •

16x1

Multiplexer

## 8x1 Multiplexer

In this section, let us implement 8x1 Multiplexer using 4x1 Multiplexers and 2x1 Multiplexer. We know that 4x1 Multiplexer has 4 data inputs, 2 selection lines and one output. Whereas, 8x1 Multiplexer has 8 data inputs, 3 selection lines and one output.

So, we require two **4x1 Multiplexers** in first stage in order to get the 8 data inputs. Since, each 4x1 Multiplexer produces one output, we require a **2x1 Multiplexer** in second stage by considering the outputs of first stage as inputs and to produce the final output.

Let the 8x1 Multiplexer has eight data inputs  $I_7$  to  $I_0$ , three selection lines  $s_2$ ,  $s_1$  & s0 and one output Y. The **Truth table** of 8x1 Multiplexer is shown below.

|    | Selection Inputs | Output |                       |
|----|------------------|--------|-----------------------|
| S₂ | S <sub>1</sub>   | S₀     | Y                     |
| 0  | 0                | 0      | I <sub>0</sub>        |
| 0  | 0                | 1      | I <sub>1</sub>        |
| 0  | 1                | 0      | I <sub>2</sub>        |
| 0  | 1                | 1      | <b>I</b> <sub>3</sub> |
| 1  | 0                | 0      | I <sub>4</sub>        |
| 1  | 0                | 1      | I <sub>5</sub>        |
| 1  | 1                | 0      | I <sub>6</sub>        |
| 1  | 1                | 1      | I <sub>7</sub>        |

We can implement 8x1 Multiplexer using lower order Multiplexers easily by considering the above Truth table. The **block diagram** of 8x1 Multiplexer is shown in the following figure.



The same **selection lines**,  $s_1$  &  $s_0$  are applied to both 4x1 Multiplexers. The data inputs of upper 4x1 Multiplexer are  $I_7$  to  $I_4$  and the data inputs of lower 4x1 Multiplexer are  $I_3$  to  $I_0$ . Therefore, each 4x1 Multiplexer produces an output based on the values of selection lines,  $s_1$  &  $s_0$ .

The outputs of first stage 4x1 Multiplexers are applied as inputs of 2x1 Multiplexer that is present in second stage. The other **selection line**,  $s_2$  is applied to 2x1 Multiplexer.

- 1. If  $s_2$  is zero, then the output of 2x1 Multiplexer will be one of the 4 inputs  $I_3$  to  $I_0$  based on the values of selection lines  $s_1$  &  $s_0$ .
- 2. If  $s_2$  is one, then the output of 2x1 Multiplexer will be one of the 4 inputs  $I_7$  to  $I_4$  based on the values of selection lines  $s_1$  &  $s_0$ .

Therefore, the overall combination of two 4x1 Multiplexers and one 2x1 Multiplexer performs as one 8x1 Multiplexer. 16x1 Multiplexer

In this section, let us implement 16x1 Multiplexer using 8x1 Multiplexers and 2x1 Multiplexer. We know that 8x1 Multiplexer has 8 data inputs, 3 selection lines and one output. Whereas, 16x1 Multiplexer has 16 data inputs, 4 selection lines and one output.

So, we require two **8x1 Multiplexers** in first stage in order to get the 16 data inputs. Since, each 8x1 Multiplexer produces one output, we require a 2x1 Multiplexer in second stage by considering the outputs of first stage as inputs and to produce the final output.

Let the 16x1 Multiplexer has sixteen data inputs  $I_{15}$  to  $I_0$ , four selection lines  $s_3$  to  $s_0$  and one output Y. The **Truth table** of 16x1 Multiplexer is shown below.

| Selection Inputs | Output |  |
|------------------|--------|--|
|                  |        |  |

| S₃ | <b>S</b> <sub>2</sub> | S₁ | S₀ | Υ                     |
|----|-----------------------|----|----|-----------------------|
| 0  | 0                     | 0  | 0  | I <sub>o</sub>        |
| 0  | 0                     | 0  | 1  | I <sub>1</sub>        |
| 0  | 0                     | 1  | 0  |                       |
| 0  | 0                     | 1  | 1  | l <sub>3</sub>        |
| 0  | 1                     | 0  | 0  | l <sub>4</sub>        |
| 0  | 1                     | 0  | 1  | I <sub>5</sub>        |
| 0  | 1                     | 1  | 0  | <b>I</b> <sub>6</sub> |
| 0  | 1                     | 1  | 1  | l <sub>7</sub>        |
| 1  | 0                     | 0  | 0  | I <sub>8</sub>        |
| 1  | 0                     | 0  | 1  | l <sub>9</sub>        |
| 1  | 0                     | 1  | 0  | I <sub>10</sub>       |
| 1  | 0                     | 1  | 1  | I <sub>11</sub>       |
| 1  | 1                     | 0  | 0  | I <sub>12</sub>       |
| 1  | 1                     | 0  | 1  | I <sub>13</sub>       |
| 1  | 1                     | 1  | 0  | I <sub>14</sub>       |
| 1  | 1                     | 1  | 1  | I <sub>15</sub>       |

We can implement 16x1 Multiplexer using lower order Multiplexers easily by considering the above Truth table. The **block diagram** of 16x1 Multiplexer is shown in the following figure.



The **same selection lines**,  $s_2$ ,  $s_1$  &  $s_0$  are applied to both 8x1 Multiplexers. The data inputs of upper 8x1 Multiplexer are  $I_{15}$  to  $I_8$  and the data inputs of lower 8x1 Multiplexer are  $I_7$  to  $I_0$ . Therefore, each 8x1 Multiplexer produces an output based on the values of selection lines,  $s_2$ ,  $s_1$  &  $s_0$ .

The outputs of first stage 8x1 Multiplexers are applied as inputs of 2x1 Multiplexer that is present in second stage. The other **selection line**,  $s_3$  is applied to 2x1 Multiplexer.

- 1. If  $s_3$  is zero, then the output of 2x1 Multiplexer will be one of the 8 inputs  $l_5$  to  $l_0$  based on the values of selection lines  $s_2$ ,  $s_1$  &  $s_0$ .
- 2. If  $s_3$  is one, then the output of 2x1 Multiplexer will be one of the 8 inputs  $I_{15}$  to  $I_8$  based on the values of selection lines  $s_2$ ,  $s_1$  &  $s_0$ .

Therefore, the overall combination of two 8x1 Multiplexers and one 2x1 Multiplexer performs as one 16x1 Multiplexer.

Multiplexer. It has single input, 'n' selection lines and maximum of 2<sup>n</sup> outputs. The input will be connected to one of these outputs based on the values of selection lines.

Since there are 'n' selection lines, there will be 2<sup>n</sup> possible combinations of zeros and ones. So, each combination can select only one output. De-Multiplexer is also called as **De-Mux**.

# 1x4 De-Multiplexer

1x4 De-Multiplexer has one input I, two selection lines,  $s_1$  &  $s_0$  and four outputs  $Y_3$ ,  $Y_2$ ,  $Y_1$  & $Y_0$ . The **block diagram** of 1x4 De-Multiplexer is shown in the following figure.



The single input 'I' will be connected to one of the four outputs,  $Y_3$  to  $Y_0$  based on the values of selection lines  $s_1$  &  $s_2$ 0. The **Truth table** of 1x4 De-Multiplexer is shown below.

| Selection | O ıtputs |                       |                  |                       |                |  |
|-----------|----------|-----------------------|------------------|-----------------------|----------------|--|
| S₁        | S₀       | <b>Y</b> <sub>3</sub> | $\mathbf{Y}_{2}$ | <b>Y</b> <sub>1</sub> | Y <sub>0</sub> |  |
| 0         | 0        | 0                     | 0                | 0                     | I              |  |
| 0         | 1        | 0                     | 0                | I                     | 0              |  |
| 1         | 0        | 0                     | I                | 0                     | 0              |  |
| 1         | 1        | I                     | 0                | 0                     | 0              |  |

From the above Truth table, we can directly write the **Boolean functions** for each output as

We can implement these Boolean functions using Inverters & 3-input AND gates. The **circuit diagram** of 1x4 De-Multiplexer is shown in the following figure.



We can easily understand the operation of the above circuit. Similarly, you can implement 1x8 De-Multiplexer and 1x16 De-Multiplexer by following the same procedure.

# Implementation of Higher-order De-Multiplexers

Now, let us implement the following two higher-order De-Multiplexers using lower-order De-Multiplexers.

- 1. 1x8 De-Multiplexer
- 2. 1x16 De-Multiplexer

## 1x8 De-Multiplexer

In this section, let us implement 1x8 De-Multiplexer using 1x4 De-Multiplexers and 1x2 De-Multiplexer. We know that 1x4 De-Multiplexer has single input, two selection lines and four outputs. Whereas, 1x8 De-Multiplexer has single input, three selection lines and eight outputs.

So, we require two **1x4 De-Multiplexers** in second stage in order to get the final eight outputs. Since, the number of inputs in second stage is two, we require **1x2 DeMultiplexer** in first stage so that the outputs of first stage will be the inputs of second stage. Input of this 1x2 De-Multiplexer will be the overall input of 1x8 De-Multiplexer.

Let the 1x8 De-Multiplexer has one input I, three selection lines  $s_2$ ,  $s_1$  &  $s_0$  and outputs  $Y_7$  to  $Y_0$ . The **Truth table** of 1x8 De-Multiplexer is shown below.

| Selection Inputs      |                       |                | Outputs               |                       |                       |                       |                       |                       |                       |    |
|-----------------------|-----------------------|----------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|----|
| <b>S</b> <sub>2</sub> | <b>S</b> <sub>1</sub> | S <sub>0</sub> | <b>Y</b> <sub>7</sub> | <b>Y</b> <sub>6</sub> | <b>Y</b> <sub>5</sub> | <b>Y</b> <sub>4</sub> | <b>Y</b> <sub>3</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | Yo |
| 0                     | 0                     | 0              | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | I  |
| 0                     | 0                     | 1              | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | I                     | 0  |
| 0                     | 1                     | 0              | 0                     | 0                     | 0                     | 0                     | 0                     | I                     | 0                     | 0  |
| 0                     | 1                     | 1              | 0                     | 0                     | 0                     | 0                     | I                     | 0                     | 0                     | 0  |
| 1                     | 0                     | 0              | 0                     | 0                     | 0                     | I                     | 0                     | 0                     | 0                     | 0  |
| 1                     | 0                     | 1              | 0                     | 0                     | I                     | 0                     | 0                     | 0                     | 0                     | 0  |
| 1                     | 1                     | 0              | 0                     | I                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0  |
| 1                     | 1                     | 1              | I                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0                     | 0  |

We can implement 1x8 De-Multiplexer using lower order Multiplexers easily by considering the above Truth table. The **block diagram** of 1x8 De-Multiplexer is shown in the following figure.



The common **selection lines**,  $s_1$  &  $s_0$  are applied to both 1x4 De-Multiplexers. The outputs of upper 1x4 De-Multiplexer are  $Y_7$  to  $Y_4$  and the outputs of lower 1x4 DeMultiplexer are  $Y_3$  to  $Y_0$ .

The other **selection line**,  $s_2$  is applied to 1x2 De-Multiplexer. If  $s_2$  is zero, then one of the four outputs of lower 1x4 De-Multiplexer will be equal to input, I based on the values of selection lines  $s_1$  &  $s_0$ . Similarly, if  $s_2$  is one, then one of the four outputs of upper 1x4 DeMultiplexer will be equal to input, I based on the values of selection lines  $s_1$  &  $s_0$ .

# 1x16 De-Multiplexer

In this section, let us implement 1x16 De-Multiplexer using 1x8 De-Multiplexers and 1x2 De-Multiplexer. We know that 1x8 De-Multiplexer has single input, three selection lines and eight outputs. Whereas, 1x16 De-Multiplexer has single input, four selection lines and sixteen outputs.

So, we require two **1x8 De-Multiplexers** in second stage in order to get the final sixteen outputs. Since, the number of inputs in second stage is two, we require **1x2 DeMultiplexer** in first stage so that the outputs of first stage will be the inputs of second stage. Input of this 1x2 De-Multiplexer will be the overall input of 1x16 De-Multiplexer.

Let the 1x16 De-Multiplexer has one input I, four selection lines  $s_3$ ,  $s_2$ ,  $s_1$  &  $s_0$  and outputs  $Y_{15}$  to  $Y_0$ . The **block diagram** of 1x16 De-Multiplexer using lower order Multiplexers is shown in the following figure.



The common **selection lines s<sub>2</sub>, s<sub>1</sub> & s<sub>0</sub>** are applied to both 1x8 De-Multiplexers. The outputs of upper 1x8 De-Multiplexer are  $Y_{15}$  to  $Y_{8}$  and the outputs of lower 1x8 DeMultiplexer are  $Y_{7}$  to  $Y_{0}$ .

The other **selection line**,  $s_3$  is applied to 1x2 De-Multiplexer. If  $s_3$  is zero, then one of the eight outputs of lower 1x8 De-Multiplexer will be equal to input, I based on the values of selection lines  $s_2$ ,  $s_1$  &  $s_0$ . Similarly, if  $s_3$  is one, then one of the 8 outputs of upper 1x8 De-Multiplexer will be equal to input, I based on the values of selection lines  $s_2$ ,  $s_1$  &  $s_0$ .

# **UNIT-3: Computer Arithmetic and Process Organization**

Computer Arithmetic: Algorithms for fixed point and floating point addition, subtraction, multiplication and division operations.

**Processor Organization:** Introduction to CPU, Execution of a Complete Instruction, Multiple-Bus Organization, Hardwired Control and Multi programmed Control.

## **Computer Arithmetic:**

Data is manipulated by using the arithmetic instructions in digital computers. Data is manipulated to produce results necessary to give solution for the computation problems. The Addition, subtraction, multiplication and division are the four basic arithmetic operations. If we want then we can derive other operations by using these four operations. To execute arithmetic operations there is a separate section called arithmetic processing unit in central processing unit. The arithmetic instructions are performed generally on binary or decimal data. Fixed-point numbers are used to represent integers or fractions. We can have signed or unsigned negative numbers. Fixed-point addition is the simplest arithmetic operation. If we want to solve a problem then we use a sequence of well-defined steps. These steps are collectively called algorithm. To solve various problems we give algorithms. In order to solve the computational problems, arithmetic instructions are used in digital computers that manipulate data. These instructions perform arithmetic calculations. And these instructions perform a great activity in processing data in a digital computer. As we already stated that with the four basic arithmetic operations addition, subtraction, multiplication and division, it is possible to derive other arithmetic operations and solve scientific problems by means of numerical analysis methods. A processor has an arithmetic processor(as a sub part of it) that executes arithmetic operations. The data type, assumed to reside in processor, registers during the execution of an arithmetic instruction. Negative numbers may be in a signed magnitude or signed complement representation. There are three ways of representing negative fixed point binary numbers signed magnitude, signed 1's complement or signed 2's complement. Most computers use the signed magnitude representation for the mantissa.

# Algorithms for fixed point and floating point addition, subtraction, multiplication and division operations:

In a <u>computer</u>, the basic arithmetic operations are Addition and Subtraction. Multiplication and Division can always be managed with successive addition or subtraction respectively. However, hardware algorithms are implemented for Multiplication and Division.

It is to be recollected that <u>computers</u> deal with binary numbers unless special hardware is implemented for dealing with other number systems. Although instructions may be available for treating signed and unsigned operations, the programmer must deal with the numbers and handling of the result. The hardware assists the programmer by way of appropriate instructions and flags.

#### Addition

Adding two numbers is an addition. We may add signed or unsigned numbers. When we add two numbers, say 8 and 5, the result is 13 i.e. while adding two single-digit numbers, we may get a two-digit number in the result. A similar possibility exists in the binary system too. Thumb rule of binary addition is:

$$0 + 0 = 0$$

$$0 + 1 = 1$$

$$1 + 0 = 1$$

$$1 + 1 = 10$$

Examples (a –e) of unsigned binary addition are given in figure 8.1.



Figure 8.1 Examples of binary Addition

#### Adder

The hardware circuit which executes this addition is called Adder. There are two types of adders namely Half adder and Full adder. Basic adder circuit does 1-bit addition and is extended for n-bit addition. The adder circuit characteristics are detailed by a circuit, a truth

table, Formula and a block symbol. The adder circuits are constructed from logic gates which satisfy the formula as per truth table. These are also called combinational logic. A Combinational logic output reflects the input without clocking.



Figure 8.2 Half adder

The Half Adder (HA) has two inputs (A, B) and two outputs (Sum and Carry). The Sum is XOR of input while the Carry is AND of the input. The Half Adder is detailed in figure 8.2.

A Full Adder (FA) also performs 1-bit addition but taking 3 inputs (A, B and  $C_i$ ) and produces two outputs (Sum and Carry). Like HA, FA generates result consisting of Sum (S) and Carry out ( $C_{out}$ ).  $C_{out}$  is used as  $C_{i+1}$  while cascading for multiple bits of a word. Full Adder is detailed in figure 8.3. A full adder can also be constructed using half adder blocks as in figure 8.4.



Figure 8.4 Full Adder using Half Adder blocks

#### Subtraction

Subtraction is finding the difference of B from A i.e A-B. Basis of binary subtraction is:

$$0 - 0 = 0$$

$$0 - 1 = -1$$

$$1 - 0 = 1$$

$$1 - 1 = 0$$

Of course, the usual borrow logic from the adjacent digit is applied as in the case of decimal numbers. Examples of signed binary Subtraction is as below:



Examples of signed binary subtraction

You may note that the above examples are in sign-magnitude representation. In sign-magnitude form, MSB is reserved for sign representation. This is only for basic understanding. Computers internally use 2's complement representation.

Recall: In 2's complement representation MSB is sign bit, (n-1) bits of the represent magnitude of the number. Conversion sample for 8-bit word is shown in figure 8.5.



Figure 8.5 1's and 2's Complement Representation

2's Complement for Subtraction

"1's complement + 1 = 2's complement"

Generating this 2's complement is very simple using an XOR circuit. The XOR circuit will generate 1's complement. A control signal called SUBTRACT is used as add value of 1. This way, an adder executes subtraction. See the example below, where case (b), case (c) and case (e) are worked out as 2's complement representation; and A-B becomes A + (2's complement(B)). The result is obtained in 2's complement form discarding the carry. Observe that this method works for all kind of data.



Interpreting 2's complement numbers

- Observe the sign bit (MSB)
- If '0', the number is positive; the (n-1) bits mean the absolute value of the number in binary
- If '1', the number is negative; (n-1) bits mean the 2's complement value of the number in binary; Invert the (n-1) bits and add 1 to get the absolute value of this negative number.

### Error Detection and Status Flags

No one does maths perfectly, but computers can do, provided your data is right! There is a probability that your data may not be rightly defined or may be out of range. For this reason, the CPU detects certain errors like OVERFLOW(O), UNDERFLOW(U) and CARRY(C). It also detects SIGN(S) and ZERO(Z) status. The acronym is ZSOC (Zero, sign, Overflow and Carry) as many <u>processors</u> may treat overflow and underflow together as out of range. The detection is done by the Arithmetic and Logic Unit (ALU) of the CPU. Upon detection corresponding flag is set to ON status. These flags have bit positions allotted in the Processor Status Register and most famously known as Processor Status Word (PSW). ZSOC flags are collectively known as Condition Codes. The purpose of these Condition Codes status flags is to facilitate the programmer to catch data dependant errors and act accordingly.

Overflow: To put in simple English, when a result obtained exceeds the maximum number possible to be represented, Overflow is said to occur. In other words, the addition of two numbers with sign bit '0' resulting in value with sign bit '1' is said to be an OVERFLOW.

For example : An 8 bit word can maximum represent +127 decimal, 011111111 in binary. If we add, 120 + 10 -> 130;

1000 0010 -> in sign magnitude form, MSB (Mos significant bit) '1'

Max is +127, hence this is an overflow scenario

In Overflow scenario, the result is wrong and this needs to be communicated to the programmer/user that there is an error encountered. This situation is detected by the CPU hardware and sets a status bit called "OVERFLOW". The user, if interested, can catch this error by reading this OVERFLOW status bit and take necessary action over the data handling.

Underflow: While Overflow is related to positive magnitude, Underflow is related to negative magnitude for the same reasons. As an example, when you add two negative numbers like - 120 and -10, the result expected is -130 which is beyond the representable range in an 8-bit signed word definition. This is a scenario of UNDERFLOW. In other words, the addition of two numbers with sign bits '1' resulting in a number with sign bit '0' is said to be UNDERFLOW. The CPU hardware detects and sets a status bit called UNDERFLOW to this effect. Again, this status bit is accessible to the programmer/user to take necessary action over the data handling.

Carry: CARRY is another status detected and set by CPU while executing arithmetic instructions. CARRY flag is relevant to Unsigned arithmetic operations while OVERFLOW and UNDERFLOW are relevant to signed operations.

The CARRY Flag is set by the CPU at the end of arithmetic operations if there is a Carry (C<sub>out</sub>) out of the most significant bit of the word. The Carry is set at the end of an execution cycle of the addition or subtraction instructions. Many CPUs do not differentiate between signed and unsigned operations, in which case the CARRY and OVERFLOW may both be set by the CPU. However, there are CPUs which have different instruction codes and instructions for signed and unsigned integer operations and in this case, the CPU appropriately sets the CARRY or OVERFLOW flag.

Never forget that it is the programmer who decides whether he is operating with signed or unsigned numbers. So the programmer has to decide whether he should catch OVERFLOW, UNDERFLOW or CARRY flag for error detection and corrective action.

ZERO: At the end of an instruction execution cycle, if the accumulator value is zero, this status bit is set by CPU. This could be a possibility at the end of arithmetic or logical instructions or load instructions.

SIGN: Sign bit reflects the MSB of the accumulator. This is also set at the end of Instruction execution cycle.

#### n-Bit Adder Formation

A 4-bitFull Adder is integrated by cascading four numbers of 1-bit adders as in figure 8.6. When cascaded the C<sub>out</sub> of ith goes as C<sub>in</sub> of i+1th position and hence the carry is said to be propagated. S is the Sum bits, C<sub>out</sub> is the final Carry out of the adder. A and B are the input

numbers. C<sub>i</sub> is carry-in if available. Such cascading can be extended to any number of bits using 1-bit FA or n-bit FA blocks.



Figure 8.6 4-bit Ripple Carry Full Adder

This method of Adder expansion is known as Spatial Expansion as the output of all the n-bits are available at the same time as 1-bit operation, probably in one clock cycle. Spatial expansion is also known as Parallel Adder. The other name for this method is Ripple Carry adder as the carry is propagated internally. However, for large n-value, the carry propagation delay for clean and settled output proportionately increases. This is a disadvantage of Ripple Carry Adder which is solved by Carry-Look-Ahead Adder Technique.

#### Carry Look Ahead Adder

This is also a spatial expansion and ripple carry type. The Carry Look Ahead Adder (CLA) uses specialized logic called Carry Look ahead Logic to compute carries in parallel and hence is faster than Ripple Carry Adder.



Carry Look Ahead Full Adder

M.JOSHNA

CLA Adder generates two other signals namely Propagate Carry and Generate Carry which can be used by the next stage for Carry Calculation.

Propagate Carry Pi = Ai + Bi

i.e when either of the number has '1' in the bit position, the carry is likely depending on the Ci

Generate Carry Gi = AiBi

i.e. when both the numbers have '1' in their bit position in which case carry is sure to be generated.

We already have defined the formula for Sum and Carry as

$$S = A \oplus B \oplus C_i$$

$$C_{out} = AB + C_i(A+B)$$

The carry formula can be rewritten in terms of Propagate and Generate carry as  $C_{out} = P_i + C_iG_i$ .

In a Carry look-ahead adder, the carries are computed in parallel using carry look-ahead logic, in one gate delay as compared to 2-gate delays per bit in the case of Ripple carry adder.

# Implementation of Multiplication and Division.

### Multiplication

Just recall with micro details as to how do we do multiplication using pen and paper. Then it is easier to visualize that it is possible to implement a hardware algorithm.

-----

Product P =  $132 \ 10000100$ 

As you see, we start with LSB of the Multiplier Q, multiply the Multiplicand, the partial product is jotted down. Then we used the next higher digit of Q to multiply the multiplicand. This time while jotting the partial product, we shift the jotting to the left corresponding to the Q-digit position. This is repeated until all the digits of Q are used up and then we sum the partial products. By multiplying 12x11, we got 132. You may realize that we used binary values and the product also in binary. Binary multiplication was much simpler than decimal multiplication. Essentially this is done by a sequence of shifting and addition of multiplicand when the multiplier consists only of 1's and 0's. This is true and the same, in the case of Binary multiplication. Binary multiplication is simple because the multiplier would be either a 0 or 1 and hence the step would be equivalent to adding the multiplicand in proper shifted position or adding 0's.

It is to be observed that when we multiplied two 4-bit binary numbers, the product obtained is 8-bits. Hence the product register (P) is double the size of the M and Q register. The sign of the product is determined from the signs of multiplicand and multiplier. If they are alike, the sign of the product is positive. If they are unlike, the sign of the product is negative.

### **Unsigned Multiplication**

When multiplication is implemented in a digital <u>computer</u>, it is convenient to change the process slightly. It is required to provide an adder for the summation of only two binary numbers and successively accumulate the partial products in a register. The registers, Shift Counter and the ALU width is decided by the word size of the CPU. For simplicity of understanding, we will take 4-bit word length i.e the Multiplier (Q) and Multiplicand (M) are both 4-bits sized. The logic is extrapolated to the word size requirement.

We need registers to store the Multiplicand (M) and Multiplier (Q) and each 4-bits. However, we use 8-bit register which is standard and minimum and hence the register to collect Product (P) is 16-bits. Refer to figure 9.1. The Shift counter keeps track of the number of times the addition is to be done, which is equal to the number of bits in Q. The shifting of the contents of the registers is taken care of by shift register logic. The ALU takes care of addition and hence partial product and product are obtained here and stored in P register. The control unit controls the cycles for micro-steps. The product register holds the partial results. The final result is also available in P when the shift counter reaches the threshold value.



Figure 9.1 Data path for typical Multiplication

The flowchart for the unsigned multiplication is shown in figure 9.2 and table 9.1 explains the work out with an example of 12 x 11 values. The flowchart is self-explanatory of the unsigned multiplication algorithm. In an unsigned multiplication, the carry bit is used as an extension of the P register. Since the Q value is a 4-bit number, the algorithm stops when the shift counter reaches the value of 4. At this point, P holds the result of the multiplication.



Figure 9.2 Flowchart for

Unsigned Multiplication algorithm

Table 9.1 Workout for unsigned multiplication ( $12 \times 11 = 132$ )

|                                                      | Shift | Counter Multiplicand | Multiplier |                  |
|------------------------------------------------------|-------|----------------------|------------|------------------|
| Operation Step                                       | Value | M                    | Q          | <b>Product P</b> |
| Initial Values for multiplication of 12x11           | 0     | 1100                 | 1011       | 0000 0000        |
| $Q_0 = 1$ , So, Left half of P <- Left half of P $M$ | +0    | 1100                 | 1011       | 1100 0000        |
| Shift Right P, Shift Right Q                         | 0     | 1100                 | 0101       | 0110 0000        |
| SC <- SC + 1                                         | 1     | 1100                 | 0101       | 0110 0000        |
| $Q_0 = 1$ , So, Left half of P <- Left half of P $M$ | +1    | 1100                 | 0101       | 10010<br>0000    |

Table 9.1 Workout for unsigned multiplication ( $12 \times 11 = 132$ )

|                                                  | Shift | Counter Multiplicand | Multiplier |                  |
|--------------------------------------------------|-------|----------------------|------------|------------------|
| Operation Step                                   | Value | M                    | Q          | <b>Product P</b> |
| Shift Right P, Shift Right Q                     | 1     | 1100                 | 0010       | 1001 0000        |
| SC <- SC + 1                                     | 2     | 1100                 | 0010       | 1001 0000        |
| $Q_0 = 0$ , do nothing                           | 2     | 1100                 | 0010       | 1001 0000        |
| Shift Right P, Shift Right Q                     | 2     | 1100                 | 0001       | 0100 1000        |
| SC <- SC + 1                                     | 3     | 1100                 | 0001       | 0100 1000        |
| $Q_0 = 1$ , So, Left half of P <- Left half of M | FP+3  | 1100                 | 0001       | 10000<br>1000    |
| Shift Right P, Shift Right Q                     | 3     | 1100                 | 0000       | 1000 0100        |
| SC <- SC + 1                                     | 4     | 1100                 | 0000       | 1000 0100        |

### **Signed Multiplication**

Signed numbers are always better handled in 2's complement format. Further, the earlier signed algorithm takes n steps for n digit number. The multiplication process although implemented in hardware 1-step per digit is costly in terms of execution time. Booths algorithm addresses both signed multiplication and efficiency of operation.

### **Booth's Algorithm**

Booth observed that multiplication can also be done with mixed additions and subtractions, instead of only additions. And it deals with signed multiplication as well.

The motivation for Booth's Algorithm is that ALU with add or subtract can get the same result in more than one way .i.e. the multiplier 6 can be dealt as:

$$6 = -2 + 8$$

Booth's Algorithm categorises the multiplier as the run of 1's and further as begin, middle and end of runs. The run is identified as below for a number 01110.

### Run of 1's



Run of 1's

Based on the run status, the operation to be performed in the multiplication process is defined as in table 9.2. The values of the current bit (Q0) and the outgoing bit (Qe) of the multiplier decide the operation to be performed. By this, the multiplication is achieved in less number of cycles based on the multiplier. A multiplier may have many combinations of runs based on its value. This algorithm is sensitive to bit patterns of Multiplier. A pattern like 01010101 may be the worse as it has many begin and end runs necessitating as many additions and subtractions and may not save cycle time. But by and large Booth's algorithm saves cycles.

Table 9.2 Booth Encoding for Multiplication – Operation regarding the run

| Current | Bit Bit to the | right                                                                           |
|---------|----------------|---------------------------------------------------------------------------------|
| $(Q_0)$ | $(Q_e)$        | <b>Explaination Example Operation</b>                                           |
| 1       | 0              | Begins run of 1s 00011 <b>110</b> 00 Subtract multiplicand from partial product |
| 1       | 1              | Middle of run of 0001111000 No arithmetic operation 1s                          |
| 0       | 1              | End of run of 1s 0001111000 Add multiplicand to partial product                 |
| 0       | 0              | Middle of run of <b>00</b> 01111000 No arithmetic operation 0s                  |

Booth's algorithm uses Arithmetic Shift Right for collecting partial product. Arithmetic Shift right is a sign-extended shift; i.e if the sign bit is 0, then 0 is extended while shifting; if the sign bit is 1, then 1 is extended while shifting. For this reason, n+1 is the register size. You may observe this in our work out in table 9.3. The work out is for (-12x -11). This example is taken to demonstrate the outcome of signed multiplication with Booth's algorithm. Both multiplicand (M) and Multiplier (Q) use 5-bits as against 4-digit binary number.

The partial product and Product is collected in P and Q register. The Q register initially holds the Multiplier; as it gets shifted out with every digit multiplication, the space in Q register is occupied by partial product. Qe is a 1-bit register holding the outgoing bit. Together PQQe is treated as one entity during the arithmetic shift, whereas only P is considered for addition or subtraction of multiplicand. The multiplicand is loaded in M. Both Multiplicand and Multiplier

are loaded in the simple binary form if these are positive numbers and in 2's complement form if these are negative numbers. The shift counter stops the operation once it reaches the digit count of Q, in this case, 4. Q0Qe are evaluated at every step to decide the operation to be carried out on M and P.

| Comments                                   | SC | SC Multiplicand |       | Product |                |
|--------------------------------------------|----|-----------------|-------|---------|----------------|
| Contract the second of contract the second |    | М               | Р     | Q       | Q <sub>e</sub> |
| Initial (-12 x -11)                        | 0  | 10100           | 00000 | 10101   | 0              |
| $Q_0Q_e = 10$ ; $P \leftarrow P - M$       | 0  | 10100           | 01100 | 10101   | 0              |
| Ar.sh.r . PQ Q <sub>e</sub>                | 0  | 10100           | 00110 | 01010   | 1              |
| SC ← SC+1                                  | 1  | 10100           | 00110 | 01010   | 1              |
| $Q_0Q_e = 01$ ; $P \leftarrow P + M$       | 1  | 10100           | 11010 | 01010   | 1              |
| Ar.sh.r . PQ Q <sub>e</sub>                | 1  | 10100           | 11101 | 00101   | 0              |
| SC ← SC+1                                  | 2  | 10100           | 11101 | 00101   | 0              |
| $Q_0Q_e = 10$ ; $P \leftarrow P-M$         | 2  | 10100           | 01001 | 00101   | 0              |
| Ar.sh.r . PQ Q <sub>e</sub>                | 2  | 10100           | 00100 | 10010   | 1              |
| SC ← SC+1                                  | 3  | 10100           | 00100 | 10010   | 1              |
| $Q_0Q_e = 01;$                             | 3  | 10100           | 11000 | 10010   | 1              |
| Ar.sh.r . PQ Q <sub>e</sub>                | 3  | 10100           | 11100 | 01001   | 0              |
| SC ← SC+1                                  | 4  | 10100           | 11100 | 01001   | 0              |
| Q <sub>0</sub> Q <sub>e</sub> = 10; P←P-M  | 4  | 10100           | 01000 | 01001   | 0              |
| Ar.sh.r . PQ Q <sub>e</sub>                | 4  | 10100           | 00100 | 00100   | 1              |
| SC ← SC+1                                  | 5  | 10100           | 00100 | 00100   | 1              |

Booth's Algorithm Work out Example (-12 x -11 =132)



Table 9.3 Booth's Signed Multiplication

It is seen that the resulting Product of multiplying two negative numbers is a positive number which is correct. One need not handle the signs separately. It is handled as part of the algorithm. The flow chart and the datapath may be drawn by an interested reader as an exercise or the reader may contact the author.

There is a category of Multipliers called Array multiplier which avoids this sequential operation and produces the result at once. These require a large number of gates for implementation. However, with the advancement in VLSI, it is a reality. Different CPUs have different implementations.

### **DIVISION:**

Performing division is a difficult task as we have seen in case of fixed point arithmetic also. Divider architectures are complex to implement. Floating point division is nothing but a fixed point division with some extra hardwares to take care for the exponents. This extra hardwares make the divider circuit more complex. A floating point division where a number divides another number can be expressed as

Thus it can be said that in a floating point division, mantissas are divided and exponents are subtracted.

The major steps for a floating point division are

- Extract the sign of the result from the two sign bits.
- Find the magnitude of the difference between two exponents ( ) Add to the bias if or subtract from the bias if .
- Divide mantissa of ( ) by mantissa of ( ) considering the hidden bits.
- If there is a leading zero then normalize the result by shifting it left.
- Due to the normalization, the exponent is to be decremented according to the number of left shifts.

Floating point division can be more clearer with an example. Lets discuss a division operation between two numbers and . The result of the division operation is .

**Example: Floating Point Division** 

- Representation: The input operands are represented as and
- Sign extraction: As one of the number is negative then sign of the output will be negative. Thus

- Exponent subtraction: and . Thus magnitude of their difference is . As thus the resulted exponent is
- Mantissa division: Divide the mantissas by any division algorithm used in the fixed point arithmetic. Considering the hidden bits, the division operation is restricted to 12-bits. The result of the division is
- There is a leading zero in the result thus a left shift can be applied to normalize the result. Thus the new result is . The final value of the mantissa ( ) is excluding the hidden bit.
- The action of normalization step must reflect on exponent correction. The value of the exponent is corrected by a decrement corresponding to a left shift. The new value of the exponent ( ) is .
- The final result is . The decimal value of this is .

A simple architecture for floating point division is shown below in Figure 1. There are three 4-bit subtractors used in the divider architecture, two for exponent subtraction and one for correction of exponents. The major hardware block is the divider block. The divider used here is a 12-bit unsigned divider and that can be any divider circuit as discussed in the blog for division. If the result of the divider contains any leading zero then normalizing step is executed. But here in this case, as the hidden bit is also considered thus the result can not go below . Thus there will be maximum of one leading zero present in the result. This why only the MSB of the result ( ) is considered and left shift block shifts only by one bit. Pipeline registers are also must be inserted according to the pipe lining stages of the divider.



Figure 1: A Basic Scheme for Floating Point Division

### **Introduction to CPU**

# **Introduction to CPU (Central Processing Unit)**

The CPU (Central Processing Unit) is the brain of the computer responsible for executing instructions of a program. It interprets, controls, and processes the data stored in memory using arithmetic and logical operations.

In Digital Logic Design & Computer Organization (DLD&CO), the CPU is studied as the control and execution unit of the computer system.

### 1. Functional Role of CPU

- Fetches instructions from memory.
- Decodes the instructions to understand what operation to perform.
- Executes the instructions using ALU and registers.
- Stores results back into registers or memory.

This cycle is called the Instruction Cycle (Fetch  $\rightarrow$  Decode  $\rightarrow$  Execute  $\rightarrow$  Store).

# 2. Major Components of CPU

# 1. Arithmetic Logic Unit (ALU):

- Performs arithmetic operations (addition, subtraction, multiplication, division).
- Performs logical operations (AND, OR, NOT, XOR, comparison).

# 2. Control Unit (CU):

- Directs the flow of data between memory, ALU, and I/O devices.
- o Generates control signals for execution of instructions.
- Ensures proper sequencing of instruction cycle.

# 3. Registers:

- Small, high-speed storage units inside CPU.
- Examples:
  - Program Counter (PC): Holds address of next instruction.
  - Instruction Register (IR): Holds current instruction.
  - Accumulator (ACC): Stores intermediate results.
  - General Purpose Registers (R0, R1, ...): Temporary storage.
  - Memory Address Register (MAR) & Memory Data Register (MDR): For memory communication.

# 3. CPU Organization

- Single Bus Organization: All units share a single communication path.
- Multiple Bus Organization: Separate buses for instructions, data, and control (faster).

# 4. Types of CPU Operations

- Data transfer (move, load, store).
- Arithmetic operations  $(+, -, \times, \div)$ .
- Logical operations (AND, OR, NOT, shift).
- Control operations (branch, jump, halt).

### 5. CPU Performance Factors

- Clock Speed: Number of instructions executed per second.
- Word Length: Number of bits processed at a time.
- Instruction Set Architecture (ISA): Defines operations CPU can perform (RISC, CISC).
- Number of Cores: Multiple processing units improve performance.

### **Execution of a Complete Instruction**

n the blog post on the <u>von Neumann Architecture</u>, we established that the CPU consists of a control unit for processing the instructions sent to the CPU, the arithmetic logic unit for performing the operations specified in the instructions, and registers for storing instructions and data that are immediately required by the CPU.

The fetch-decode-execute cycle makes use of these components in addition to the memory unit.

In the fetch step of the cycle, the instructions are retrieved from the memory unit (RAM) and stored in the registers on the CPU. Next, the control unit decodes the instructions, which are then executed by the arithmetic and logic unit. The results of the instruction execution are sent back to RAM for storage, and the next instruction cycle begins.

# What is CPU Clock Speed?

The number of instruction cycles a CPU can execute is stated as clock speed and measured in Hertz. If a CPU has a clock speed of 2 700 000 000 Hertz or 2.7 GHz, it executes 2.7 billion instruction cycles per second.

# The Fetch Decode Execute Cycle Step By Step

In the following section, we will walk through the operations performed during the instruction cycle. Recall that a CPU has several different registers

- o Program counter
- Memory address register
- Memory data register
- Current instruction register
- Accumulator

For an explanation of what these registers do, check out my post on von Neumann architecture.

### What is a Fetch Cycle?

The instruction cycle begins with the fetch operation. The program counter keeps track of the next instruction to be processed. A fetch operation starts by loading the memory address of the next instruction into the program counter. In the next step, the processor transfers the address from the program counter to the memory address register and subsequently loads the data stored at that memory location into the memory data register. The program counter is automatically incremented to the next memory location unless the current instruction explicitly points to a different memory location for the next instruction.

Let's see how that works in practice using a concrete example:

- 1. The program counter initially points to the memory address 001
- 2. The memory address 001 is loaded into the memory address register by the processor
- 3. The processor next retrieves the instruction stored at memory address 001 and loads it into the memory data register.

- 4. Since the data contains the instruction "Get 203", it is forwarded to the instruction register.
- 5. The program counter is incremented by 1, pointing to 002
- 6. The instruction in the instruction register is forwarded to the control unit.

# What is an Execute Cycle?

After the fetch operation, the instruction cycle continues with the decode and execute portions. During the fetch stage, the control unit has been supplied with the instruction. It now needs to decode the instruction so that the processor can understand what to do next. In our example, I've supplied the instruction in plain English, such as "Get 203" which tells the processor to get the piece of data stored at memory location 203. In memory, the instruction is supplied in binary. For example, in a 16-bit memory, the first 4 bits may encode the operation to be performed, while the remaining bits specify the address from which to load the data.

Lastly, the processor will execute the instruction supplied. So if the instruction is to get some other piece of data, the "execute" action will consist of retrieving the data from the supplied memory address and storing it in the appropriate register. If the instruction specifies a calculation such as adding two numbers, the execution of the calculation will be handed off to the arithmetic and logic unit (ALU)

### Let's continue with our concrete example:

- 1. The Control Unit decodes the instruction and tells the processor to go to memory address 203 and fetch the piece of information stored there.
- 2. The address 203, is stored in the memory address register.

- 3. The data, the number 4, is stored in the memory data register.
- 4. The
- 5. Since the data is a number that will be necessary for a future operation and not another instruction, the number is stored in the accumulator.

This concludes the first fetch-decode-execute cycle. The processor starts the next cycle by fetching the next instruction stored in the program counter.

The fetching process is the same as in the previous cycle. This time, the instructions tell the processor to add the number stored at memory location 204 to the number currently stored in the accumulator.

After fetching the instruction, the processor retrieves the number stored at memory address 204 and places it in the accumulator while the previously stored number is forwarded to the arithmetic and logic unit (ALU).

Then, the number 3 is also forwarded to the arithmetic and logic unit, where the addition specified in the instruction is performed. Finally, the result is returned to the accumulator, where it will sit until the next instruction is executed.

In this example, we've used two instruction cycles to perform the addition. But modern processors may also load several pieces of data and perform calculations in one cycle.

# What is an Interrupt in Computer Organization and Architecture?

As the term implies, an interrupt is a mechanism by which the normal course of actions of the processor is interrupted. This may be necessary for a variety of reasons, such as hardware failure or waiting for an I/O operation to complete.

Interrupts are part of a broader class of events known as exceptions. Exceptions essentially handle cases when the CPU encounters conditions that interfere with normal processing.

The main utility of interrupts lies in their ability to improve efficiency. Performing I/O operations is usually orders of magnitude slower than normal processing. If the computer had to communicate with an external device attached via USB, such as a flash drive, without the use of interrupts, the processor would have to wait until the i/O operation completes. The processor would spend thousands of instruction cycles just polling the peripheral device, asking if it was done processing without doing any useful work.

To make processing more efficient, the processor can receive an interrupt signal from the I/O device enabling it to work on something that is unrelated to the I/O operation while that operation is in progress.

Once the I/O device is done with its operations and requires communication with the processor, it sends an interrupt request signal to the processor. The processor then interrupts the execution of its current program and services the I/O device. This is achieved via a special device known as the interrupt handler. When the processor is finished with the I/O processing, it returns to the original process.

# What happens during an Interrupt?

Once an Interrupt signal arrives, the processor has to perform a series of steps to handle the interrupt and continue processing:

- The CPU needs to save the current context as it exists in the registers to memory. Some processor architectures push the context onto a stack and then pop it off the stack. That way, they can restore the previous context in reverse order.
- Secondly, the CPU needs to retrieve the instructions of the interrupt handler from memory. The interrupt handler is basically a set of instructions stored in memory. Each type of interrupt has its own associated set of instructions.
- The CPU executes the instructions specified by the interrupt handler
- After concluding the operations specified by the interrupt handler, the CPU needs to restore the context of its previous operations by loading the associated instructions and data into the registers
- o The CPU continues the previous flow of operations.

To include the handling of interrupts into the instruction cycle, an additional interrupt cycle is included.

Source: "Computer Organization and Architecture" 10th Edition by William Stallings

# **Multiple Interrupts**

If multiple interrupts occur, there are essentially two options.

If an interrupt is currently being handled and a second interrupt occurs, the processor can push the second and all subsequent interrupts onto a stack and execute them in reverse sequential order. This has the disadvantage that we cannot prioritize interrupts. If an

I/O device that causes a notoriously long interrupt, like a printer, is currently executing, all other interrupts would have to wait.

Alternatively, interrupts can be associated with priorities. If an interrupt with a higher priority were to occur while a lower priority interrupt is being handled, the lower priority interrupt would itself be interrupted. The processor then would handle the higher-priority interrupt first before turning back to the lower-priority one. Naturally, the second approach engenders more complexity but is usually more efficient.

### **Types of Interrupts**

So far, we have focused on interrupts as caused by I/O devices. In fact, there are several reasons for a processor to interrupt its course of action leading to different types of interrupt handlers.

### Failure of Hardware

A computer relies on electricity. If there is a power outage or something overheats, the processor needs to be able to handle that case when the underlying hardware fails. The hardware failure handler also kicks in when there is an inconsistency in <a href="memory access">memory access</a>. For example, if a piece of data is different in memory when it is accessed from when it was stored, it may cause system crashes.

# **Scheduled Interrupt**

Interrupts may be generated by the processor on a regular basis to perform updates or other functions that may be necessary.

# **Program Condition**

If an error occurs during the execution of a program, the program itself can trigger an interrupt. If you are a programmer, you probably have run into buffer overflow errors or other errors generated when executing your program. These errors are triggered when your

program attempts to do something that the processor cannot or will not handle. In that case, the processor will generate an interrupt or an exception. In fact, good programmers anticipate potential modes of failure and handle these through exceptions in their code.

### I/O Devices

The controller of an I/O device can trigger interrupts as described previously. They either start a new interrupt by requesting service from the processor, signal normal completion of an I/O process, or indicate an error condition.

### **MULTIPLE BUS ORGANIZATION:**

Rout, R=B, IRin

R4outA, R5outB, SelectA, Add, R6in, End

• Instruction execution proceeds as follows:

Step 1--> Contents of PC are passed through ALU using R=B control-signal and loaded into

MAR to start a memory Read operation. At the same time, PC is incremented by 4.

Step2--> Processor waits for MFC signal from memory.

Step3--> Processor loads requested-data into MDR, and then transfers them to IR.

Step4--> The instruction is decoded and add operation take place in a single step.

### Note:

To execute instructions, the processor must have some means of generating the control signals needed in the

proper sequence. There are two approaches for this purpose:

1) Hardwired control and 2) Microprogrammed control.

### Hardwired Control and Multi programmed Control:-

### Introduction:

In computer architecture, the control unit is responsible for directing the flow of data and instructions within the CPU. There are two main approaches to implementing a control unit: hardwired and microprogrammed.

A hardwired control unit is a control unit that uses a fixed set of logic gates and circuits to execute instructions. The control signals for each instruction are hardwired into the control unit, so the control unit has a dedicated circuit for each possible instruction. Hardwired control units are simple and fast, but they can be inflexible and difficult to modify.

On the other hand, a micro-programmed control unit is a control unit that uses a microcode to execute instructions. The microcode is a set of instructions that can be modified or updated, allowing for greater flexibility and ease of modification. The control signals for each instruction are generated by a microprogram that is stored in memory, rather than being hardwired into the control unit.

The control unit is the brain of the CPU, and it can be implemented in two ways: hardwired or micro-programmed. Understanding the differences between these implementations is essential for those studying computer organization.

If we talk about Micro-programmed control units they are generally slower than hardwired control units because they require an extra step of decoding the microcode to generate control signals, but they are more flexible and easier to modify. They are commonly used in modern CPUs because they allow for easier implementation of complex instruction sets and better support for instruction set extensions.

To execute an instruction, the control unit of the CPU must generate the required control signal in the proper sequence. There are two approaches used for generating the control signals in proper sequence as Hardwired Control unit and the Micro-programmed control unit.

Hardwired Control Unit: The control hardware can be viewed as a state machine that changes from one state to another in every clock cycle, depending on the contents of the instruction register, the condition codes, and the external inputs. The outputs of the state machine are the control signals. The sequence of the operation carried out by this machine is determined by the wiring of the logic elements and hence named "hardwired".

- Fixed logic circuits that correspond directly to the Boolean expressions are used to generate the control signals.
- Hardwired control is faster than micro-programmed control.
- A controller that uses this approach can operate at high speed.
- RISC architecture is based on the hardwired control unit



### Micro-programmed Control Unit -

- The control signals associated with operations are stored in special memory units inaccessible by the programmer as Control Words.
- Control signals are generated by a program that is similar to machine language programs.
- The micro-programmed control unit is slower in speed because of the time it takes to fetch microinstructions from the control memory.

# Some Important Terms

- 1. Control Word: A control word is a word whose individual bits represent various control signals.
- 2. Micro-routine: A sequence of control words corresponding to the control sequence of a machine instruction constitutes the micro-routine for that instruction.
- 3. Micro-instruction: Individual control words in this micro-routine are referred to as microinstructions.

- 4. Micro-program: A sequence of micro-instructions is called a micro-program, which is stored in a ROM or RAM called a Control Memory (CM).
- 5. Control Store: the micro-routines for all instructions in the instruction set of a computer are stored in a special memory called the Control Store.



The differences between hardwired and micro-programmed control units:

|                | Hardwired Control Unit                | Micro-programmed Control Unit |
|----------------|---------------------------------------|-------------------------------|
| Implementation | Fixed set of logic gates and circuits | Microcode stored in memory    |

|                       | Hardwired Control Unit             | Micro-programmed Control Unit               |
|-----------------------|------------------------------------|---------------------------------------------|
| Flexibility           | Less flexible, difficult to modify | More flexible, easier to modify             |
| Instruction Set       | Supports limited instruction sets  | Supports complex instruction sets           |
| Complexity of Design  | Simple design, easy to implement   | Complex design, more difficult to implement |
| Speed                 | Fast operation                     | Slower operation due to microcode decoding  |
| Debugging and Testing | Difficult to debug and test        | Easier to debug and test                    |
| Size and Cost         | Smaller size,<br>lower cost        | Larger size, higher cost                    |

|                               | Hardwired Control Unit            | Micro-programmed Control Unit  |
|-------------------------------|-----------------------------------|--------------------------------|
| Maintenance and Upgradability | Difficult to upgrade and maintain | Easier to upgrade and maintain |

Types of Micro-programmed Control Unit - Based on the type of Control Word stored in the Control Memory (CM), it is classified into two types :

# 1. Horizontal Micro-programmed Control Unit:

The control signals are represented in the decoded binary format that is 1 bit/CS. Example: If 53 Control signals are present in the processor then 53 bits are required. More than 1 control signal can be enabled at a time.

- It supports longer control words.
- It is used in parallel processing applications.
- It allows a higher degree of parallelism. If degree is n, n CS is enabled at a time.
- It requires no additional hardware(decoders). It means it is faster than Vertical Microprogrammed.
- It is more flexible than vertical microprogrammed

# 2. Vertical Micro-programmed Control Unit:

The control signals are represented in the encoded binary format. For N control signals- Log2(N) bits are required.

• It supports shorter control words.

- It supports easy implementation of new control signals therefore it is more flexible.
- It allows a low degree of parallelism i.e., the degree of parallelism is either 0 or 1.
- Requires additional hardware (decoders) to generate control signals, it implies it is slower than horizontal microprogrammed.
- It is less flexible than horizontal but more flexible than that of a hardwired control unit.

Note: Types of Control Unit in descending order of speed:

Hardwired control unit > Horizontal microprogrammed CU > Vertical microprogrammed CU

#### **UNIT-IV: THE MEMORY MANAGEMENT SYSTEM**

Basic Concepts, Semiconductor RAM, Types of Read-only Memory (ROM), Cache Memory, Performance Considerations, Virtual Memory, Secondary Storage.

#### 4.1 Basic Concepts:

The maximum size of the memory that can be used in any computer is determined by the addressing scheme.

| Address | Memory Locations             |
|---------|------------------------------|
| 16 Bit  | $2^{16} = 64 \text{ K}$      |
| 32 Bit  | $2^{32} = 4G \text{ (Giga)}$ |
| 40 Bit  | $2^{40} = IT (Tera)$         |

Fig: Connection of Memory to Processor:



If MAR is k bits long and MDR is n bits long, then the memory may contain upto 2K addressable locations and the n-bits of data are transferred between the memory and processor.

This transfer takes place over the processor bus.

The processor bus has,

Address Line

Data Line

② Control Line (R/W, MFC − Memory Function Completed)

The control line is used for co-ordinating data transfer.

The processor reads the data from the memory by loading the address of the required memory location into MAR and setting the R/W line to 1.

The memory responds by placing the data from the addressed location onto the data lines and confirms this action by asserting MFC signal.

Upon receipt of MFC signal, the processor loads the data onto the data lines into MDR register.

The processor writes the data into the memory location by loading the address of this location into MAR and loading the data into MDR sets the R/W line to 0.

Measures for the speed of a memory:

? memory access time.

It is the time that elapses between the initiation of an Operation and the completion of that operation.

I memory cycle time.

It is the minimum time delay that required between the initiation of the two successive memory operations.

### **RAM (Random Access Memory):**

In RAM, if any location that can be accessed for a Read/Write operation in fixed amount of time, it is independent of the location's address.

### **Cache Memory:**

It is a small, fast memory that is inserted between the larger slower main memory and the processor.

It holds the currently active segments of a program and their data.

#### Virtual memory:

The address generated by the processor does not directly specify the physical locations in the memory.

The address generated by the processor is referred to as a virtual / logical address.

The virtual address space is mapped onto the physical memory where data are actually stored.

The mapping function is implemented by a special memory control circuit is often called the memory management unit.

Only the active portion of the address space is mapped into locations in the physical memory.

The remaining virtual addresses are mapped onto the bulk storage devices used, which are usually magnetic disk.

As the active portion of the virtual address space changes during program execution, the memory management unit changes the mapping function and transfers the data between disk and memory.

Thus, during every memory cycle, an address processing mechanism determines whether the addressed in function is in the physical memory unit.

If it is, then the proper word is accessed and execution proceeds. If it is not, a page of words containing the desired word is transferred from disk to memory.

This page displaces some page in the memory that is currently inactive.

#### **Semiconductor RAM**

Semi-Conductor memories are available is a wide range of speeds. Their cycle time ranges from 100ns to 10ns.

### **INTERNAL ORGANIZATION OF MEMORY CHIPS:**

Memory cells are usually organized in the form of array, in which each cell is capable of storing one bit of information.

Each row of cells constitute a memory word and all cells of a row are connected to a common line called as word line.

The cells in each column are connected to Sense / Write circuit by two bit lines.

The Sense / Write circuits are connected to data input or output lines of the chip. During a write operation, the sense / write circuit receive input information and store it in the cells of the selected word.



Figure 5.2 Organization of bit cells in a memory chip.

The data input and data output of each senses / write ckt are connected to a single bidirectional data line that can be connected to a data bus of the cptr.

R / W 2 Specifies the required operation.

CS 
Chip Select input selects a given chip in the multi-chip memory system

### **Static Memories:**

Memories that consist of circuits capable of retaining their state as long as power is applied are known as static memory.

### Fig: Static RAM cell

?



- Two inverters are cross connected to form a batch.
- The batch is connected to two bit lines by transistors T1 and T2.
- These transistors act as switches that can be opened / closed under the control of the word line.
- ② When the word line is at ground level, the transistors are turned off and the latch retain its state.

### **Read Operation:**

In order to read the state of the SRAM cell, the word line is activated to close switches T1 and T2.

If the cell is in state 1, the signal on bit line b is high and the signal on the bit line b is low.
Thus b and b are complements of each other.

② Sense / write circuit at the end of the bit line monitors the state of b and b' and set the output accordingly.

### **Write Operation:**

1 The state of the cell is set by placing the appropriate value on bit line b and its complement on b and then activating the word line. This forces the cell into the corresponding state.

The required signal on the bit lines are generated by Sense / Write circuit.

# Fig:CMOS cell (Complementary Metal oxide Semi Conductor):

Figure 5.4 A static RAM cell.



Figure 5.5 An example of a CMOS memory cell.

Transistor pairs (T3, T5) and (T4, T6) form the inverters in the latch.

In state 1, the voltage at point X is high by having T5, T6 on and T4, T5 are OFF.

Thus T1 and T2 returned ON (Closed), bit line b and b will have high and low signals respectively.

The CMOS requires 5V (in older version) or 3.3.V (in new version) of power supply voltage.

The continuous power is needed for the cell to retain its state

#### Merit:

It has low power consumption because the current flows in the cell only when the cell is being activated accessed.

Static RAM's can be accessed quickly. It access time is few Nano seconds.

## **Demerit:**

SRAM's are said to be volatile memories because their contents are lost when the power is interrupted.

## **Asynchronous DRAMS:-**

① Less ex pensive RAMs can be implemented if simplex call s are used such c ell s cannot retain their state indefinitely. Hence they are called Dynamic RAM's (DRAM).

The information stored in a dynamic memory cell in the form of a charge on a capacitor and this charge can be maintained only for tens of Milliseconds.

The contents must be periodically refreshed by restoring by restoring this capacitor charge to its full value.

In order to store information in the cell, the transistor T is turned on & the appropriate voltage is applied to the bit line, which charges the capacitor.

② After the transistor is turned off, the capacitor begins to discharge which is caused by the capacitor's own leakage resistance.

Hence the information stored in the cell can be retrieved correctly before the threshold value of the capacitor drops down.

Fig:A single transistor dynamic Memory cell



Figure 5.6 A single-transistor dynamic memory cell.

During are ad operation, the transistor is turned "o n" & a sense amplifier connected to the bit line detects whether the charge on the capacitor is above the threshold value.

If charge on capacitor > threshold value -> Bit line will have logic value 1.

If charge on capacitor < threshold value -> Bit line will set to logic value 0.

# Fig:Internal organization of a 2M X 8 dynamic Memory chip



Figure 5.7 Internal organization of a 2M x 8 dynamic memory chip.

#### **DESCRIPTION:**

- The 4 bit cells in each row are divided into 512 groups of 8.
- 21 bit address is needed to access a byte in the memory(12 bit→To select a row,9 bit→Specify the group of 8 bits in the selected row).

 $A_{8-0} \rightarrow \text{Row address of a byte.}$  $A_{20-9} \rightarrow \text{Column address of a byte.}$ 

- During Read/ Write operation, the row address is applied first. It is loaded into the
  row address latch in response to a signal pulse on Row Address Strobe(RAS)
  input of the chip.
- When a Read operation is initiated, all cells on the selected row are read and refreshed.
- Shortly after the row address is loaded, the column address is applied to the address pins & loaded into Column Address Strobe(CAS).
- The information in this latch is decoded and the appropriate group of 8 Sense/Write circuits are selected.
- R/W =1(read operation)→The output values of the selected circuits are transferred to the data lines D0 D7.
- R/W =0(write operation)→The information on D0 D7 are transferred to the selected circuits.
- RAS and CAS are active low so that they cause the latching of address when they
  change from high to low. This is because they are indicated by RAS & CAS.
- To ensure that the contents of a DRAM 's are maintained, each row of cells must be accessed periodically.
- Refresh operation usually perform this function automatically.
- A specialized memory controller circuit provides the necessary control signals RAS & CAS, that govern the timing.
- The processor must take into account the delay in the response of the memory. Such memories are referred to as **Asynchronous DRAM's.**

# Fast Page Mode:

Transferring the bytes in sequential order is achieved by applying the consecutive sequence of column address under the control of successive CAS signals.

This scheme allows transferring a block of data at a faster rate. The block of transfer capability is called as Fast Page Mode.

## **Synchronous DRAM:**

Here the operations are directly synchronized with clock signal.

The address and data connections are buffered by means of registers.

The output of each sense amplifier is connected to a latch.

② A Read operation causes the contents of all cells in the selected row to be loaded in these latches.

# Fig: Synchronous DRAM



Figure 5.8 Synchronous DRAM.

② Data held in the latches that correspond to the selected columns are transferred into the data output register, thus becoming available on the data output pins.

Fig: Timing Diagram 2 Burst Read of Length 4 in an SDRAM



Fig:Timing Diagram →Burst Read of Length 4 in an SDRAM

- Tirst, the row address is latched under control of RAS signal.
- The memory typically takes 2 or 3 clock cycles to activate the selected row.
- Then the column address is latched under the control of CAS signal.
- ② After a delay of one clock cycle, the first set of data bits is placed on the data lines.
- The SDRAM automatically increments the column address to access the next 3 sets of bits in the selected row, which are placed on the data lines in the next 3 clock cycles.

#### Latency & Bandwidth:

A good indication of performance is given by two parameters. They are,

- Latency
- Bandwidth

## Latency:

- 1. It refers to the amount of time it takes to transfer a word of data to or from the memory.
- Por a transfer of single word, the latency provides the complete indication of memory performance.
- 1 For a block transfer, the latency denotes the time it takes to transfer the first word of data.

#### Bandwidth:

It is defined as the number of bits or bytes that can be transferred in one second.

Bandwidth mainly depends upon the speed of access to the stored data & on the number of bits that can be accessed in parallel.

#### **Double Data Rate SDRAM (DDR-SDRAM):**

- The standard SDRAM performs all actions on the rising edge of the clock signal.
- The double data rate SDRAM transfer data on both the edges (loading edge, trailing edge).
- The Bandwidth of DDR-SDRAM is doubled for long burst transfer.
- ① To make it possible to access the data at high rate, the cell array is organized into two banks.
- Each bank can be accessed separately.
- Consecutive words of a given block are stored in different banks.
- ② Such interleaving of words allows simultaneous access to two words that are transferred on successive edge of the clock.

# **Larger Memories:**

#### **Dynamic Memory System:**

The physical implementation is done in the form of Memory Modules.

If a large memory is built by placing DRAM chips directly on the main system printed circuit board that contains the processor, often referred to as Motherboard; it will occupy large amount of space on the board.

☑ These packaging considerations have led to the development of larger memory unit s known as SIMMs & DIMMs.

o SIMM-Single Inline memory Module

o DIMM-Dual Inline memory Module

SIMM & DIMM consists of several memory chips on a separate small board that plugs vertically into single socket on the motherboard.

MEMORY SYSTEM CONSIDERATION:

To reduce the number of pins, the dynamic memory chips use multiplexed

Address inputs. The address is divided into two parts. They are,

High Order Address Bit (Select a row in cell array & it is provided first and latched into memory chips under the control of RAS signal).

② Low Order Address Bit(Selects a column and they are provided on same Address pins and latched using CAS signals).

The Multiplexing of address bit is usually done by Memory Controller Circuit.

Fig:Use of Memory Controller



The Controller accepts a complete address & R/W signal from the processor, under the control of a Request signal which indicates that a memory access operation is needed.

The Controller then forwards the row & column portions of the address to the memory and generates RAS &CAS signals.

It also sends R/W &CS signals to the memory. The CS signal is usually active low, hence it is shown as CS.

?

#### **Refresh Overhead:**

All dynamic memories have to be refreshed. In DRAM ,the period for refreshing all rows is 16ms whereas 64ms in SDRAM.

Eg:Given a cell array of 8K(8192).

Clock cycle=4

Clock Rate=133MHZ

No of cycles to refresh all rows =8192\*4 =32,768

Time needed to refresh all rows=32768/133\*10=246\*10-6 sec=0.246sec

Refresh Overhead=0.246/64

Refresh Overhead =0.0038

## **Rambus Memory:**

The usage of wide bus is expensive.

Rambus developed the implementation of narrow bus

Rambus technology is a fast signaling method used to transfer information between chips.

Instead of using signals that have voltage levels of either 0 or Vsupply to represent the logical values, the signals consists of much smaller voltage swings around a reference voltage Vref.

The reference Voltage is about 2V and the two logical values are represented by 0.3V swings above and below Vref

This type of signaling is generally is known as Differential Signalling.

Rambus provides a complete specification for the design of communication links (Special Interface circuits) called as Rambus Channel.

② Rambus memory has a clock frequency of 400MHZ. The data are transmitted on both the edges of the clock so that the effective data transfer rate is 800MHZ.

The circuitry needed to interface to the Rambus channel is included on the chip. Such chips are known as Rambus DRAMs (RD RAM).

Rambus channel has,

② 9 Data lines(1-8② Transfer the data,9th line②parity checking).

Control line

Power line

A two channel rambus has 18 data lines which has no separate address lines. It is

also called as Direct RD RAM's. Communication between processor or some other device that can serves as a master and RDRAM modules are serves as slaves, is carried out by means of packets transmitted on the data lines.

There are 3 types of packets. They are,

- Request
- Acknowledge
- Data

## Types of Read-only Memory (ROM)

Both SRAM and DRAM chips are volatile, which means that they lose the stored information if power is turned off.

Many application requires Non-volatile memory (which retain the stored

information if power is turned off).

2 Eg: Operating System software has to be loaded from disk to memory which

2 requires the program that boots the Operating System ie. It requires non-volatile memory.

☑ Non-volatile memory is used in embedded system.

② Since the normal operation involves only reading of stored data ,a memory of this type is called ROM.



At Logic value '0' Transistor (T) is connected to the ground point(P). Transistor switch is closed & voltage on bitline nearly drops to zero.

At Logic value '1' Transistor switch is open.

#### M.JOSHNA

The bitline remains at high voltage. To read the state of the cell, the word line is activated. A Sense circuit at the end of the bitline generates the proper output value. Types of ROM: Different types of non-volatile memory are, PROM PEPROM EEPROM Flash Memory PROM:-Programmable ROM: PROM allows the data to be loaded by the user. Programm ability is achieved by inserting a fuse at point P in a ROM cell. Before it is programmed, the memory contains all 0"s The user can insert 1s at the required location by burning out t he fuse at these locations using high-current pulse. This process is irreversible. Merit: It provides flexibility. It is faster. It is less expensive because they can be programmed directly by the user. **EPROM:-Erasable reprogrammable ROM:** 

PEPROM allows the stored data to be erased and new data to be loaded.

In an EPROM cell, a connection to ground is always made at P and a special transistor is used, which has the ability to function either as a normal transistor or as a disabled transistor that is always turned off.

This transistor can be programmed to behave as a permanently open switch, by injecting charge into it that becomes trapped inside.

Erasure requires dissipating the charges trapped in the transistor of memory cells. This can be done by exposing the chip to ultra-violet light, so that EPROM chips are mounted in packages that have transparent windows.

#### Merits:

It provides flexibility during the development phase of digital system.

1 It is capable of retaining the stored information for a long time.

#### **Demerits:**

The chip must be physically removed from the circuit for reprogramming and its entire contents are erased by UV light.

#### Electrically erasable programmable read-only memory (EEPROM)

Electrically erasable programmable read-only memory (EEPROM) chips that can be electrically programmed and erased. EEPROMs are typically changed 1 byte at time. Erasing EEPROM takes typically quite long. The drawback of EEPROM is their speed. EEPROM chips are too slow to use in many products that make quick changes to the data stored on the chip. Typically EEPROMs are found in electronics devices for storing the small amounts of nonvolatile data in applications where speed is not the most important. Small EEPROMs with serial interfaces are commonly found in many electronics devices.

## Flash Memory:

In EEPROM, it is possible to read & write the contents of a single cell.

1 In Flash device, it is possible to read the contents of a single cell but it is only possible to write the entire contents of a block.

Prior to writing, the previous contents of the block are erased. Eg. In MP3 player, the flash memory stores the data that represents sound.

Single flash chips cannot provide sufficient storage capacity for embedded system application.

There are 2 methods for implementing larger memory modules consisting of number of chips. They are,

o Flash Cards

o Flash Drives.

#### Merits:

- Plash drives have greater density which leads to higher capacity & low cost per bit.
- 1 It requires single power supply voltage & consumes less power in their operation.

#### Flash Cards:

- ② One way of constructing larger module is to mount flash chips on a small card.
- 2 Such flash card have standard interface.
- The card is simply plugged into a conveniently accessible slot.
- Eg:A minute of music can be stored in 1MB of memory. Hence 64MB flash cards

can store an hour of music.

#### Flash Drives:

- Larger flash memory module can be developed by replacing the hard disk drive.
- The flash drives are designed to fully emulate the hard disk.
- The flash drives are solid state electronic devices that have no movable parts.

#### Merits:

- They have shorter seek and access time which results in faster response.
- They have low power consumption which makes them attractive for battery driven application.
- They are insensitive to vibration.

#### Demerits:

- The capacity of flash drive (<1GB) is less than hard disk(>1GB).
- It leads to higher cost per bit.
- Flash memory will deteriorate after it has been written a number of times(typically at least 1 million times.)

# SPEED, SIZE COST:

| Characteristics | SRAM      | DRAM           | Magnetis Disk         |  |
|-----------------|-----------|----------------|-----------------------|--|
| Speed           | Very Fast | Slower         | Much slower than DRAM |  |
| Size            | Large     | Small          | Small                 |  |
| Cost            | Expensive | Less Expensive | Low price             |  |

#### Magnetic Disk:

 A huge amount of cost effective storage can be provided by magnetic disk; The main memory can be built with DRAM which leaves SRAM's to be used in smaller units where speed is of essence.

| Memory              | Speed                         | Size      | Cost       |
|---------------------|-------------------------------|-----------|------------|
| Registers           | Very high                     | Lower     | Very Lower |
| Primary cache       | High                          | Lower     | Low        |
| Secondary cache     | Low                           | Low       | Low        |
| Main memory         | Lower than<br>Seconadry cache | High      | High       |
| Secondary<br>Memory | Very low                      | Very High | Very High  |

#### 4.4 Cache Memory

Ideally, computer memory should be fast, large and inexpensive. Unfortunately, it is impossible to meet all the three requirements simultaneously. Increased speed and size are achieved at increased cost. Very fast memory systems can be achieved if SRAM chips are used. These chips are expensive and for the cost reason it is impracticable to build a large main memory using SRAM chips. The alternative used to use DRAM chips for large main memories. The processor fetches the code and data from the main memory to execute the program. The DRAMs which form the main memory are slower devices. So it is necessary to insert wait states in memory read/write cycles. This reduces the speed of execution. The solution for this problem is in the memory system small section of SRAM is added along with the main memory, referred to as cache memory. The program which is to be executed is loaded in the main memory, but the part of the program and data accessed from the cache memory. The cache controller looks after this swapping between main memory and cache memory with the help of DMA controller, Such cache memory is called **secondary cache**. Recent processors have the built in cache memory called **primary cache**. The size of the memory is still small compared to the demands of the large programs with the voluminous data. A solution is provided by using secondary storage, mainly magnetic disks and magnetic tapes to implement large memory spaces, which is available at reasonable prices. To make efficient computer system it is not possible to rely on a single memory component, but to employ a memory hierarchy which uses all different types of memory units that gives efficient computer system. A typical memory hierarchy is illustrated below in the figure:

# Fig:Memory Hierarchy



- Fastest access is to the data held in processor registers. Registers are at the top of the memory hierarchy.
- Relatively small amount of memory that can be implemented on the processor chip. This is processor cache.
- Two levels of cache. Level 1 (L1) cache is on the processor chip. Level 2 (L2) cache is in between main memory and processor.
- Next level is main memory, implemented as SIMMs. Much larger, but much slower than cache memory.
- Next level is magnetic disks. Huge amount of inexpensive storage.
- Speed of memory access is critical, the idea is to bring instructions and data that will be used in the near future as close to the processor as possible.

The effectiveness of cache mechanism is based on the property of "Locality of reference'.

## **Locality of Reference:**

Many instructions in the localized areas of the program are executed repeatedly during some time period and remainder of the program is accessed relatively infrequently.

It manifests itself in 2 ways. They are,

- ☑ Temporal (The recently executed instruction are likely to be executed again very soon.)
- **Spatial**(The instructions in close proximity to recently executed instruction are also likely to be executed soon.) If the active segment of the program is placed in cache memory, then the total execution time can be reduced significantly.

The term Block refers to the set of contiguous address locations of some size.

The cache line is used to refer to the cache block.

# Fig: Use of Cache Memory



- The Cache memory stores a reasonable number of blocks at a given time but this number is small compared to the total number of blocks available in Main Memory.
- The correspondence between main memory block and the block in cache memory is specified by a mapping function.
- The Cache control hardware decide that which block should be removed to create space for the new block that contains the referenced word.
- The collection of rule for making this decision is called the replacement algorithm.
- The cache control circuit determines whether the requested word currently exists in the cache.
- If it exists, then Read/Write operation will take place on appropriate cache location. In this case Read/Write hit will occur.

- In a Read operation, the memory will not involve.
- The write operation is proceeding in 2 ways. They are,
- o Write-through protocol
- o Write-back protocol

## Write-through protocol:

Here the cache location and the main memory locations are updated simultaneously.

## Write-back protocol:

- This technique is to update only the cache location and to mark it as with associated flag bit called dirty/modified bit
- The word in the main memory will be updated later, when the block containing this marked word is to be removed from the cache to make room for a new block.
- If the requested word currently not exists in the cache during read operation, then read miss will occur.
- ☑ To overcome the read miss Load through / Early restart protocol is used.

#### **Read Miss:**

The block of words that contains the requested word is copied from the main memory into cache.

## Load - through:

- ② After the entire block is loaded into cache, the particular word requested is forwarded to the processor.
- If the requested word not exists in the cache during write operation, then Write Miss will occur.
- If Write through protocol is used, the information is written directly into main memory.
- If Write back protocol is used then block containing the addressed word is first brought into the cache and then the desired word in the cache is over-written with the new information.

#### **Cache Memories – Mapping Functions**

First generation processors, those designed with vacuum tubes in 1950 or those designed with integrated circuits in 1965 or those designed as microprocessors in 1980 were generally about the same speed as main memory. On such processors, this naive model was perfectly

reasonable. By 1970, however, transistorized supercomputers were being built where the central processor was significantly faster than the main memory, and by 1980, the difference had increased, although it took several decades for the performance difference to reach today's extreme.

Solution to this problem is to use what is called a *cache memory* between the central processor and the main memory. Cache memory takes advantage of the fact that, with any of the memory technologies available for the past half century, we have had a choice between building large but slow memories or small but fast memories. This was known as far back as 1946, when Berks, Goldstone and Von Neumann proposed the use of a memory hierarchy, with a few fast registers in the central processor at the top of the hierarchy, a large main memory in the middle, and a library of archival data, stored off-line, at the very bottom.

A cache memory sits between the central processor and the main memory. During any particular memory cycle, the cache checks the memory address being issued by the processor. If this address matches the address of one of the few memory locations held in the cache, the cache handles the memory cycle very quickly; this is called a **cache hit.** If the address does not, then the memory cycle must be satisfied far more slowly by the main memory; this is called a **cache miss.** 



Figure 7: Adding a cache to the naive view

The correspondence between the main memory and cache is specified by a Mapping function. When the cache is full and a memory word that is not in the cache is referenced, the cache control hardware must decide which block should be removed to create space for the new block that constitutes the Replacement algorithm.

#### **Mapping Functions**

There are three main mapping techniques which decides the cache organization:

- 1. Direct-mapping technique
- 2. Associative mapping Technique
- 3. Set associative mapping technique

To discuss possible methods for specifying where memory blocks are placed in the cache, we use a specific small example, a cache consisting of 128 blocks of 16 word each, for a total of

2048(2k) word, and assuming that the main memory is addressable by a 16-bit address. The main memory has 64k word, which will be viewed as 4K blocks of 16 word each, the consecutive addresses refer to consecutive word.

## **Direct Mapping Technique**

The cache systems are divided into three categories, to implement cache system. As shown in figure, the lower order 4-bits from 16 words in a block constitute a **word field**. The second field is known as **block field** used to distinguish a block from other blocks. Its length is 7-bits, when a new block enters the cache; the 7-bit cache block field determines the cache position in which this block must be stored. The third field is a **Tag field**, used to store higher order 5-bits of the memory address of the block, and to identify which of the 32blocks are mapped into the cache.



Figure 8:Main Memory Address

It is the simplest mapping technique, in which each block from the main memory has only one possible location in the cache organization. For example, the block I of the main memory maps on to block i module 128 of the cache. Therefore, whenever one of the main memory blocks 0, 128, 256, ...... Is loaded in the cache, it is stored in the block 0. Block 1, 129, 257,.... are stored in block 1 of the cache and so on.



Figure 9:Direct Mapped Cache

block is present. This is called associative-mapping technique. It gives the complete freedom in choosing the cache location in which to place the memory block.



Figure 10: Associative mapped cache

## **Set-Associative Mapping**

It is a combination of the direct and associative-mapping techniques can be used. Blocks of the cache are grouped into sets and the mapping allows a block of main memory to reside in any block of the specific set. In this case memory blocks 0, 64,128......4032 mapped into cache set 0, and they can occupy either of the two block positions within this set. The cache might contain the desired block. The tag field of the address must then be associatively compared to the tags of the two blocks of the set to check if the desired block is present this two associative search is simple to implement.



Figure 11: Set-Associative Mapped Cache

## **Replacement Algorithms**

In a direct-mapped cache, the position of each block is fixed, hence no replacement strategy exists. In associative and set-associative caches, when a new block is to be brought into the cache and all the Positions that it may occupy are full, the cache controller must decide which of the old blocks to overwrite. This is important issue because the decision can be factor in system performance.

The objective is to keep blocks in the cache that are likely to be referenced in the near future. Its not easy to determine which blocks are about to be referenced. The property of locality of reference gives a clue to a reasonable strategy. When a block is to be over written, it is sensible to overwrite the one that has gone the longest time without being referenced. This block is called the least recently used(LRU) block, and technique is called the LRU Replacement algorithm.

The LRU algorithm has been used extensively for many access patterns, but it can lead to poor performance in some cases. For example, it produces disappointing results when accesses are made to sequential elements of an array that is slightly too large to fit into the cache. Performance of LRU algorithm can be improved by introducing a small amount of randomness in deciding which block to replace.

## **Example of mapping techniques**

We now consider a detailed example to illustrate the effects of different cache mapping techniques. Assume that a processor has separate instruction and data caches. To keep the example simple, assume the data cache has space for only eight blocks of data. Also assume that each block consists of only one 16-bit word of data and the memory is word-addressable with 16-bit addresses. (These parameters are not realistic for actual computers, but they allow us to illustrate mapping techniques clearly.) Finally, assume the LRU replacement algorithm is used for block replacement in the cache.

Let us examine changes in the data cache entries caused by running the following application: A  $4 \times 10$  array of numbers, each occupying one word, is stored in main memory locations 7A00 through 7A27 (hex). The elements of this array, A, are stored in column order, as shown in Figure 5.18. The figure also indicates how tags for different cache mapping techniques are derived from the memory address. Note that no bits are needed to identify a word within a block, as was done in Figures 5.15 through 5.17, because we have assumed that each block contains only one word. The application normalizes the elements of the first row of A with respect to the average value of the elements in the row. Hence, we need to compute the average of the elements in the row



Figure 5.18 An array stored in the main memory.

$$SUM := 0$$
for j:= 0 to 9 do
$$SUM := SUM + A(0,j)$$
end
$$AVE := SUM / 10$$
for i:= 9 downto 0 do
$$A(0,i) := A(0,i) / AVE$$
end

Figure 5.19 Task for example in Section 5.5.3.

|                   | Contents of data cache after pass: |        |        |        |                                         |        |        |        |        |
|-------------------|------------------------------------|--------|--------|--------|-----------------------------------------|--------|--------|--------|--------|
| Block<br>position | j = 1                              | j = 3  | j = 5  | j = 7  | j = 9                                   | i = 6  | i = 4  | i = 2  | i = 0  |
| 0<br>1<br>2       | A(0,0)                             | A(0,2) | A(0,4) | A(0,6) | A(0,8)                                  | A(0,6) | A(0,4) | A(0,2) | A(0,0) |
| 3 4               | A(0,1)                             | A(0,3) | A(0,5) | A(0,7) | A(0,9)                                  | A(0,7) | A(0,5) | A(0,3) | A(0,1) |
| 6<br>7            |                                    |        |        |        | *************************************** |        |        |        |        |

Figure 5.20 Contents of a direct-mapped data cache.

and divide each element by that average. The required task can be expressed as

$$A(0,i) \leftarrow \frac{A(0,i)}{\left(\sum_{j=0}^{9} A(0,j)\right)/10}$$
 for  $i = 0, 1, ..., 9$ 

Figure 5.19 gives the structure of a program that corresponds to this task. In a machine language implementation of this program, the array elements will be addressed as memory locations. We use the variables SUM and AVE to hold the sum and average values, respectively. These variables, as well as index variables i and j, will be held in processor registers during the computation.

#### **Direct mapped Cache:**

In a direct-mapped data cache, the contents of the cache change as shown in Figure 5.20. The columns in the table indicate the cache contents after various passes through the two program loops in Figure 5.19 are completed. For example, after the second pass through the first loop (j = 1), the cache holds the elements A(0, 0) and

A(0, 1). These elements are in block positions 0 and 4, as determined by the three least-significant bits of the address. During the next pass, the A(0, 0) element is replaced by A(0, 2), which maps into the same block position. Note that the desired elements map into only two positions in the cache, thus leaving the contents of the other six positions unchanged from whatever they were before the normalization task was executed.

After the tenth pass through the first loop (j = 9), the elements A(0, 8) and A(0, 9) are found in the cache. Since the second loop reverses the order in which the elements are handled, the first two passes through this loop (i = 9, 8) will find the required data in the cache. When i = 7, the element A(0, 9) is replaced with A(0, 7). When i = 6, element A(0, 8) is replaced with A(0, 6), and so on. Thus, eight elements are replaced while the second loop is executed.

# Associate mapped cache:

Figure 5.21 presents the changes if the cache is associative-mapped. During the first eight passes through the first loop, the elements are brought into consecutive block positions, assuming that the cache was initially empty. During the ninth pass (j = 8), the LRU algorithm chooses A(0,0) to be overwritten by A(0,8). The next and last pass through the j loop sees A(0,1) replaced by A(0,9). Now, for the first eight passes through the second loop (i = 9, 8, ..., 2) all required elements are found in the cache. When i = 1, the element needed is A(0,1), so it replaces the least recently used element, A(0,9). During the last pass, A(0,0) replaces A(0,8).

In this case, when the second loop is executed, only two elements are not found in the cache. In the direct-mapped case, eight of the elements had to be reloaded during the second loop. Obviously, the associative-mapped cache benefits from the complete freedom in mapping a memory block into any position in the cache. Good utilization

|                   | Contents of data cache after pass: |        |        |        |              |  |  |
|-------------------|------------------------------------|--------|--------|--------|--------------|--|--|
| Block<br>position | j = 7                              | j = 8  | j = 9  | i = 1  | <i>i</i> = 0 |  |  |
| 0                 | A(0,0)                             | A(0,8) | A(0,8) | A(0,8) | A(0,0)       |  |  |
| 1                 | A(0,1)                             | A(0,1) | A(0,9) | A(0,1) | A(0,1)       |  |  |
| 2                 | A(0,2)                             | A(0,2) | A(0,2) | A(0,2) | A(0,2)       |  |  |
| 3                 | A(0,3)                             | A(0,3) | A(0,3) | A(0,3) | A(0,3)       |  |  |
| 4                 | A(0,4)                             | A(0,4) | A(0,4) | A(0,4) | A(0,4)       |  |  |
| 5                 | A(0,5)                             | A(0,5) | A(0,5) | A(0,5) | A(0,5)       |  |  |
| 6                 | A(0,6)                             | A(0,6) | A(0,6) | A(0,6) | A(0,6)       |  |  |
| 7                 | A(0,7)                             | A(0,7) | A(0,7) | A(0,7) | A(0,7)       |  |  |

Figure 5.21 Contents of an associative-mapped data cache.

|       | j = 3  | j = 7  | j = 9   | i = 4  | i = 2  | i = 0  |  |  |  |
|-------|--------|--------|---------|--------|--------|--------|--|--|--|
| Set 0 | A(0,0) | A(0,4) | A(0,8)  | A(0,4) | A(0,4) | A(0,0) |  |  |  |
|       | A(0,1) | A(0,5) | A(0,9)  | A(0,5) | A(0,5) | A(0,1) |  |  |  |
|       | A(0,2) | A(0,6) | A(0,6)  | A(0,6) | A(0,2) | A(0,2) |  |  |  |
| l     | A(0,3) | A(0,7) | A(0,7)  | A(0,7) | A(0,3) | A(0,3) |  |  |  |
| Set 1 |        |        | <u></u> |        |        |        |  |  |  |

Figure 5.22 Contents of a set-associative-mapped data cache.

of this cache also occurred because we chose to reverse the order in which the elements are handled in the second loop of the program. It is interesting to consider what would happen if the second loop dealt with the elements in the same order as in the first loop (see Problem 5.12). Using the LRU algorithm, all elements would be overwritten before they are used in the second loop. This degradation in performance would not occur if a random replacement algorithm were used.

#### **Set Associative mapped cache:**

For this example, we assume that a set-associative data cache is organized into two sets, each capable of holding four blocks. Thus, the least-significant bit of an address determines which set the corresponding memory block maps into. The high-order 15 bits constitute the tag.

Changes in the cache contents are depicted in Figure 5.22. Since all the desired blocks have even addresses, they map into set 0. Note that, in this case, six elements must be reloaded during execution of the second loop.

Even though this is a simplified example, it illustrates that in general, associative mapping performs best, set-associative mapping is next best, and direct mapping is the worst. However, fully associative mapping is expensive to implement, so set-associative mapping is a good practical compromise.

#### 4.5 Performance Considerations:

Two Key factors in the commercial success are the performance & cost ie the best possible performance at low cost.

② A common measure of success is called the Price/ Performance ratio. Performance depends on how fast the machine instruction are brought to the processor and how fast they are executed.

☑ To achieve parallelism(ie. Both the slow and fast units are accessed in the same manner), interleaving is used.

## Interleaving:

If the main memory system is divided into a number of memory modules. Each module has its own address buffer register (ABR) and data buffer register (DBR).

① Memory access operations may proceed in more than one module at the same time. Thus the aggregate rate of transmission of words to and from the main memory system can be increased.

- Two methods of address layout are indicated they are
- Consecutive words in a module
- Consecutive words in a consecutive module

#### Consecutive words in a module



(a) Consecutive words in a module

Consecutive words are placed in a module.

- ☑ High-order k bits of a memory address determine the module.
- ② Low-order m bits of a memory address determine the word within a module.

① When a block of words is transferred from main memory to cache, only one module is busy at a time.

#### Consecutive words in a consecutive module

?



(b) Consecutive words in consecutive modules

Consecutive words are located in consecutive modules.

Consecutive addresses can be located in consecutive modules.

① While transferring a block of data, several memory modules can be kept busy at the same time.

# This is called interleaving

① When requests for memory access involve consecutive addresses, the access will be to different modules.

② Since parallel access to these modules is possible, the average rate of fetching words from the Main Memory can be increased.

## **Example:**

The effect of interleaving is substantial. Consider the time needed to transfer a block of data from the main memory to the cache when a read miss occurs. Suppose that a cache with 8-word blocks is used, similar to our examples in Section 5.5. On a read miss, the block that contains the desired word must be copied from the memory into the cache. Assume that the hardware has the following properties. It takes one clock cycle to send an address to the main memory. The memory is built with relatively slow DRAM chips that allow the first word to be accessed in 8 cycles, but subsequent words of the block are accessed in 4 clock cycles per word. (Recall from Section 5.2.3 that, when consecutive locations in a DRAM are read from a given row of cells, the row address is decoded only once. Addresses of consecutive columns of the array are then applied to access the desired words, which takes only half the time per access.) Also, one clock cycle is needed to send one word to the cache.

If a single memory module is used, then the time needed to load the desired block into the cache is

$$1 + 8 + (7 \times 4) + 1 = 38$$
 cycles

Suppose now that the memory is constructed as four interleaved modules, using the scheme in Figure 5.25b. When the starting address of the block arrives at the memory, all four modules begin accessing the required data, using the high-order bits of the address. After 8 clock cycles, each module has one word of data in its DBR. These words are transferred to the cache, one word at a time, during the next 4 clock cycles. During this time, the next word in each module is accessed. Then it takes another 4 cycles to transfer these words to the cache. Therefore, the total time needed to load the

block from the interleaved memory is

$$1 + 8 + 4 + 4 = 17$$
 cycles

Thus, interleaving reduces the block transfer time by more than a factor of 2.

#### Hit Rate and Miss Penalty

An excellent indicator of the effectiveness of a particular implementation of the memory hierarchy is the success rate in accessing information at various level of the hierarchy. A successful access to data in a cache is called a hit.

The number of hits stated as fraction of all attempted access is called the hit rate, and the miss rate is the number of misses stated as a fraction of attempted accesses.

12 Hit rate can be improved by increasing block size, while keeping cache size constant.

Block sizes that are neither very small nor very large give best results.

Miss penalty can be reduced if load-through approach is used when loading new blocks into cache.

#### **Example:**

Consider now the impact of the cache on the overall performance of the computer. Let h be the hit rate, M the miss penalty, that is, the time to access information in the main memory, and C the time to access information in the cache. The average access time experienced by the processor is

$$t_{ave} = hC + (1 - h)M$$

We use the same parameters as in Example 5.1. If the computer has no cache, then, using a fast processor and a typical DRAM main memory, it takes 10 clock cycles for each memory read access. Suppose the computer has a cache that holds 8-word blocks and an interleaved main memory. Then, as we showed in Section 5.6.1, 17 cycles are

needed to load a block into the cache. Assume that 30 percent of the instructions in a typical program perform a read or a write operation, which means that there are 130 memory accesses for every 100 instructions executed. Assume that the hit rates in the cache are 0.95 for instructions and 0.9 for data. Let us further assume that the miss penalty is the same for both read and write accesses. Then, a rough estimate of the improvement in performance that results from using the cache can be obtained as follows:

$$\frac{\textit{Time without cache}}{\textit{Time with cache}} = \frac{130 \times 10}{100(0.95 \times 1 + 0.05 \times 17) + 30(0.9 \times 1 + 0.1 \times 17)} = 5.04$$

This result suggests that the computer with the cache performs five times better.

It is also interesting to consider how effective this cache is compared to an ideal cache that has a hit rate of 100 percent (in which case, all memory references take one cycle). Our rough estimate of relative performance for these caches is

$$\frac{100(0.95 \times 1 + 0.05 \times 17) + 30(0.9 \times 1 + 0.1 \times 17)}{130} = 1.98$$

This means that the actual cache provides an environment in which the processor effectively works with a large DRAM-based main memory that appears to be only two times slower than the circuits in the cache.

In this example, we made a simplifying assumption that the same clock is used to access the on-chip cache and the main memory via the system bus. A high-performance processor is likely to operate under the control of a clock that is much faster than the system bus clock, perhaps up to ten times faster. Let us consider the impact of a cache in a system of this type.

#### Example 2:

Suppose that there is a single cache that is implemented on the processor chip and that the main memory is realized using SDRAM chips. Assume that the system bus clock is four times slower than the processor clock. As in Example 5.2, assume that a cache block contains 8 words, and that the hit rates in the cache are 0.95 for instructions and 0.9 for data. The SDRAM timing diagram is similar to Figure 5.9. The only difference is that there is a burst of 8 data words rather than four. Thus, according to Figure 5.9, it will take 14 clock cycles from when the RAS signal is asserted to transfer a block of data between the main memory and the cache. Since the RAS and CAS signals are generated by the memory controller, as indicated in Figure 5.11, one more cycle is needed during which the processor sends the address of the first word in a block to the memory controller. Therefore, a total of 15 cycles is needed to transfer a block. The cycles shown in Figure 5.9 are the system bus clock cycles. If the processor clock is four times faster, then it takes 60 processor cycles to transfer an 8-word block to or from the main memory. Note also that Figure 5.9 indicates that the processor can read or write a single word in the main memory in 9 bus clock cycles, consisting of the 8 cycles indicated in Figure 5.9 plus one cycle needed to send an address to the memory controller. Hence, 36 processor cycles are needed to access a single word in the main memory. Yet, the processor accesses a word in the cache in one processor cycle!

Repeating the calculation in Example 5.2 gives:

$$\frac{\textit{Time without cache}}{\textit{Time with cache}} = \frac{130 \times 36}{100(0.95 \times 1 + 0.05 \times 60) + 30(0.9 \times 1 + 0.1 \times 60)} = 7.77$$

Thus, accounting for the differences between processor and system bus clock speeds shows that the cache has an even greater positive effect on the performance.

Caches on processor chip:

In high-performance processors two levels of caches are normally used. The L1 cache(s) is on the processor chip. The L2 cache, which is much larger, may be implemented externally using SRAM chips. But, a somewhat smaller L2 cache may also be implemented on the processor chip,

If both L1 and L2 caches are used, the L1 cache should be designed to allow very fast access by the processor because its access time will have a large effect on the clock rate of the processor. A cache cannot be accessed at the same speed as a register file because the cache is much bigger and, hence, more complex. A practical way to speed up access to the cache is to access more than one word simultaneously and then let the processor use them one at a time. This technique is used in many commercial processors.

The L2 cache can be slower, but it should be much larger to ensure a high hit rate. Its speed is less critical because it only affects the miss penalty of the L1 cache. A workstation computer may include an L1 cache with the capacity of tens of kilobytes and an L2 cache of several megabytes.

Including an L2 cache further reduces the impact of the main memory speed on the performance of a computer. The average access time experienced by the processor in a system with two levels of caches is

$$t_{ave} = h_1 C_1 + (1 - h_1) h_2 C_2 + (1 - h_1) (1 - h_2) M$$

where

h<sub>1</sub> is the hit rate in the L1 cache.

h2 is the hit rate in the L2 cache.

C<sub>1</sub> is the time to access information in the L1 cache.

C2 is the time to access information in the L2 cache.

M is the time to access information in the main memory.

The number of misses in the L2 cache, given by the term  $(1 - h_1)(1 - h_2)$ , should be low. If both  $h_1$  and  $h_2$  are in the 90 percent range, then the number of misses will be less than 1 percent of the processor's memory accesses. Thus, the miss penalty M will be less critical from a performance point of view.

#### Other enhancements:

Write buffer

Write-through:

- Each write operation involves writing to the main memory.
- If the processor has to wait for the write operation to be complete, it slows down the processor.
- Processor does not depend on the results of the write operation.
- Write buffer can be included for temporary storage of write requests.

- Processor places each write request into the buffer and continues execution.
- If a subsequent Read request references data which is still in the write buffer, then this data is referenced in the write buffer.

#### Write-back:

- Block is written back to the main memory when it is replaced.
- If the processor waits for this write to complete, before reading the new block, it is slowed down.
- Fast write buffer can hold the block to be written, and the new block can be read first.

## Prefetching

- New data are brought into the processor when they are first needed.
- Processor has to wait before the data transfer is complete.
- Prefetch the data into the cache before they are actually needed, or a before a Read miss occurs.
- Prefetching can be accomplished through software by including a special instruction in the machine language of the processor.

Inclusion of prefetch instructions increases the length of the programs.

• Prefetching can also be accomplished using hardware:

② Circuitry that attempts to discover patterns in memory references and then prefetches according to this pattern.

#### Lockup-Free Cache

- Prefetching scheme does not work if it stops other accesses to the cache until the prefetch is completed.
- A cache of this type is said to be "locked" while it services a miss.
- Cache structure which supports multiple outstanding misses is called a lockup free cache.
- Since only one miss can be serviced at a time, a lockup free cache must include circuits that keep track of all the outstanding misses.
- Special registers may hold the necessary information about these misses.

#### **4.6 VIRTUAL MEMORY:**

- Techniques that automatically move program and data blocks into the physical main memory when they are required for execution is called the **Virtual Memory**.
- The binary address that the processor issues either for instruction or data are called the virtual / Logical address.
- The virtual address is translated into physical address by a combination of hardware and software components. This kind of address translation is done by MMU (Memory Management Unit).

When the desired data are in the main memory, these data are fetched /accessed immediately.

If the data are not in the main memory, the MMU causes the Operating system to bring the data into memory from the disk.

Transfer of data between disk and main memory is performed using DMA scheme.

## Fig: Virtual Memory Organization



Figure 5.26 Virtual memory organization.

Memory management unit (MMU) translates virtual addresses into physical addresses.

- If the desired data or instructions are in the main memory they are fetched as described previously.
- If the desired data or instructions are not in the main memory, they must be transferred from secondary storage to the main memory.
- MMU causes the operating system to bring the data from the secondary storage into the main memory.

#### Address Translation:

In address translation, all programs and data are composed of fixed length units called Pages.

The Page consists of a block of words that occupy contiguous locations in the main memory.

The pages are commonly range from 2K to 16K bytes in length. The cache bridge speed up the gap between main memory and secondary storage and it is implemented in software techniques.

Each virtual address generated by the processor contains virtual Page number (Low order bit) and offset(High order bit) Virtual Page number+ Offset® Specifies the location of a particular byte (or word) within a page.

#### Page Table:

It contains the information about the main memory address where the page is stored & the current status of the page.

#### Page Frame:

An area in the main memory that holds one page is called the page frame.

#### Page Table Base Register:

It contains the starting address of the page table.

Virtual Page Number+Page Table Base register Gives the address of the corresponding entry in the page table.ie) it gives the starting address of the page if that page currently resides in memory.

## **Control Bits in Page Table:**

The Control bit specifies the status of the page while it is in main memory. Function:

The control bit indicates the validity of the page ie) it checks whether the page is actually loaded in the main memory.

It also indicates that whether the page has been modified during its residency in the memory; this information is needed to determine whether the page should be written back to the disk before it is removed from the main memory to make room for another page.





The Page table information is used by MMU for every read & write access.

① The Page table is placed in the main memory but a copy of the small portion of the page table is located within MMU.

This small portion or small cache is called Translation Look Aside Buffer (TLB).

This portion consists of the page table entries that corresponds to the most recently accessed pages and also contains the virtual address of the entry.



Figure 5.28 Use of an associative-mapped TLB.

When the operating system changes the contents of page table, the control bit in TLB will invalidate the corresponding entry in the TLB. Given a virtual address, the MMU looks in TLB for the referenced page.

If the page table entry for this page is found in TLB, the physical address is obtained immediately. If there is a miss in TLB, then the required entry is obtained from the page table in the main memory & TLB is updated.

When a program generates an access request to a page that is not in the main memory, then Page Fault will occur.

The whole page must be brought from disk into memory before an access can proceed. When it detects a page fault, the MMU asks the operating system to generate an interrupt.

The operating System suspend the execution of the task that caused the page fault and begin execution of another task whose pages are in main memory because the long delay occurs while page transfer takes place.

① When the task resumes, either the interrupted instruction must continue from the point of interruption or the instruction must be restarted.

If a new page is brought from the disk when the main memory is full, it must replace one of the resident pages. In that case, it uses LRU algorithm which removes the least referenced Page.

② A modified page has to be written back to the disk before it is removed from the main memory. In that case, write – through protocol is used.

## **MEMORY MANAGEMENT REQUIREMENTS:**

Management routines are part of the Operating system. Assembling the OS routine into virtual address sp ace is called "System Space". The virtual space in which the user application programs reside is called the "User Space". Each user space has a separate page table. The MMU uses the page table to determine the address of the table to be used in the translation process. Hence by changing the contents of this register, the OS can switch from one space to another. The process has two stages. They are,

User State

Supervisor state.

**User State**: In this state, the processor executes the user program.

**Supervisor State:** When the processor executes the operating system routines, the processor will be in supervisor state. Privileged Instruction:

In user state, the machine instructions cannot be executed. Hence a user program is prevented from accessing the page table of other user spaces or system spaces.

The control bits in each entry can be set to control the access privileges granted to each program. ie) One program may be allowed to read/write a given page, while the other programs may be given only red access.

### **4.7 SECONDARY STORAGE:**

The Semi-conductor memories do not provide all the storage capability.

The Secondary storage devices provide larger storage requirements. Some of the Secondary Storage devices are,

Magnetic Disk

Optical Disk

Magnetic Tapes.

## Magnetic Disk:

- Magnetic Disk system consists o one or more disk mounted on a common spindle.
- A thin magnetic film is deposited on each disk, usually on both sides.
- The disks are placed in a rotary drive so that the magnetized surfaces move in close proximity to read /write heads.
- Each head consists of magnetic yoke & magnetizing coil.

Digital information can be stored on the magnetic film by applying the current pulse of suitable polarity to the magnetizing coil.

- ① Only changes in the magnetic field under the head can be sensed during the Read operation.
- ☑ Therefore if the binary states 0 & 1 are represented by two opposite states of magnetization, a voltage is induced in the head only at 0-1 and at 1-0 transition in the bit stream.
- ② A consecutive (long string) of 0"s & 1"s are determined by using the clock which is mainly used for synchronization.
- Phase Encoding or Manchester Encoding is the technique to combine the clocking information with data.
- The Manchester Encoding describes that how the self-clocking scheme is implemented.

?

Fig: Mechanical Structure



The Read/Write heads must be maintained at a very small distance from the moving disk surfaces in order to achieve high bit densities.

① When the disks are moving at their steady state, the air pressure develops between the disk surfaces & the head & it forces the head away from the surface.

The flexible spring connection between head and its arm mounting permits the head to fly at the desired distance away from the surface.

## Wanchester Technology:

Property Read/Write heads are placed in a sealed, air – filtered enclosure called the Wanchester Technology.

In such units, the read/write heads can operate closure to magnetic track surfaces because the dust particles which are a problem in unsealed assemblies are absent.

### Merits

It have a larger capacity for a given physical size. The data intensity is high because the storage medium is not exposed to contaminating elements.

- 1 The read/write heads of a disk system are movable. The disk system has 3 parts. They are,
- o Disk Platter(Usually called Disk)
- o Disk Drive(spins the disk & moves Read/write heads)
- o Disk Controller(controls the operation of the system.)

## Fig:Organizing & Accessing the data on disk



Each surface is divided into concentric tracks.

Each track is divided into sectors. The set of corresponding tracks on all surfaces of a stack of disk form a logical **cylinder**.

The data are accessed by specifying the surface number, track number and the

### sector number.

The Read/Write operation start at sector boundaries. Data bits are stored serially on each track.

Each sector usually contains 512 bytes.

**Sector header** -> contains identification information.

It helps to find the desired sector on the selected track.

**ECC** (Error checking code)- used to detect and correct errors.

An unformatted disk has no information on its tracks.

The formatting process divides the disk physically into tracks and sectors and this process may discover some defective sectors on all tracks.

The disk controller keeps a record of such defects.

The disk is divided into logical partitions. They are,

Primary partition

Secondary partition

In the diag, Each track has same number of sectors.

So all tracks have same storage capacity.

Each surface is divided into concentric tracks.

Each track is divided into sectors. The set of corresponding tracks on all surfaces of a stack of disk form a logical **cylinder**.

The data are accessed by specifying the surface number, track number and the

### sector number.

The Read/Write operation start at sector boundaries. Data bits are stored serially on each track.

Each sector usually contains 512 bytes.

**Sector header** -> contains identification information.

It helps to find the desired sector on the selected track.

**ECC** (Error checking code)- used to detect and correct errors.

An unformatted disk has no information on its tracks.

The formatting process divides the disk physically into tracks and sectors and this process may discover some defective sectors on all tracks.

The disk controller keeps a record of such defects.

The disk is divided into logical partitions. They are,

Primary partition

Secondary partition

In the diag, Each track has same number of sectors.

So all tracks have same storage capacity.

Each surface is divided into concentric tracks.

Each track is divided into sectors. The set of corresponding tracks on all surfaces of a stack of disk form a logical **cylinder**.

The data are accessed by specifying the surface number, track number and the

#### sector number.

The Read/Write operation start at sector boundaries. Data bits are stored serially on each track.

Each sector usually contains 512 bytes.

**Sector header** -> contains identification information.

It helps to find the desired sector on the selected track.

**ECC** (Error checking code)- used to detect and correct errors.

An unformatted disk has no information on its tracks.

The formatting process divides the disk physically into tracks and sectors and this process may discover some defective sectors on all tracks.

The disk controller keeps a record of such defects.

The disk is divided into logical partitions. They are,

Primary partition

Secondary partition

In the diag, Each track has same number of sectors.

So all tracks have same storage capacity.

Thus the stored information is packed more densely on inner track than on outer track.

### Access time

There are 2 components involved in the time delay between receiving an address and the beginning of the actual data transfer. They are,

Seek time

? Rotational delay / Latency

**Seek time** – Time required to move the read/write head to the proper track. **Latency** – The amount of time that elapses after the head is positioned over the correct track until the starting position of the addressed sector passes under the read/write head.

Seek time + Latency = Disk access time

## Typical disk

One inch disk- weight=1 ounce,

size -> comparable to match book

Capacity -> 1GB

Inch disk has the following parameter

Recording surface=20

Tracks=15000 tracks/surface

Sectors=400.

Each sector stores 512 bytes of data

Capacity of formatted disk=20x15000x400x512=60x109 =60GB

Seek time=3ms

Platter rotation=10000 rev/min

Latency=3ms

Internet transfer rate=34MB/s

### Data Buffer / cache

A disk drive that incorporates the required SCSI circuit is referred as SCSI drive. The SCSI can transfer data at higher rate than the disk tracks.

② An efficient method to deal with the possible difference in transfer rate between disk and SCSI bus is accomplished by including a data buffer.

This buffer is a semiconductor memory.

The data buffer can also provide cache mechanism for the disk (ie) when a read request arrives at the disk, then controller first check if the data is available in the cache (buffer).

If the data is available in the cache, it can be accessed and placed on SCSI bus.

If it is not available then the data will be retrieved from the disk.

### **Disk Controller:**

The disk controller acts as interface between disk drive and system bus.

The disk controller uses DMA scheme to transfer data between disk and main memory.

① When the OS initiates the transfer by issuing Read/Write request, the controllers register will load the following information. They are,

Main memory address (address of first main memory location of the block of words involved in the transfer)

① Disk address(The location of the sector containing the beginning of the desired block of words) (number of words in the block to be transferred).

## **Diskette or Floppy Disk**

- spinning platter of special material
- Information stored by magnetically
- read/write head positioned by mechanical arm
- Storage capacity is at a few MBs
- Random access
- seek time from 10 to 40 milliseconds
- Easily portable

## **Optical Disks**

- CD-ROM read only (books, software releases)
- WORM write once, read many (archival storage)
- Laser encoding, not magnetic
- 30-50 ms seek times
- 640MB 17GB storage capacity
- Cheaper than hard disks per MB of storage capacity, but slower
- portable
- Jukeboxes of optical disks are becoming popular for storing really, really large collections of data. The Mercury-20 jukebox (no I'm not selling these, just using it as a typical example) provides access to up to 150 CD-ROMs, or in other words 94GBs of storage capacity. The Mercury jukebox takes a maximum of four seconds to exchange and load a disc into a drive, 2.5 seconds to spin up and access the data and 10 seconds to transfer a 6.0 MB file to the computer or server.

# **Input/Output Interface:**

**INPUT-OUTPUT INTERFACES**: An interface is a data path between two separate devices in a computer system. Interface to buses can be classified based on the number of bits that are transmitted at a given time to serial versus parallel ports. In a serial port, only 1 bit of data is transferred at a time. Mice and modems are usually connected to serial ports. A parallel port allows more than 1 bit of data to be processed at once. Printers are the most common peripheral devices connected to parallel ports. Table 8.4 shows a summary of the variety of buses and interfaces used in personal computers.

TABLE 8.4 Descriptions of Buses and Interfaces Used in Personal Computers

| Bus/Interface                                                              | Description                                                                                                                                                                                                                                                                                                                                                                              |  |  |
|----------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| PS/2                                                                       | A type of port (or interface) that can be used to connect mice and<br>keyboards to the computer. The PS/2 port is sometimes called the<br>mouse port.                                                                                                                                                                                                                                    |  |  |
| Industry standard<br>architecture (ISA)                                    | ISA was originally an 8-bit bus and later expanded to a 16-bit bus i<br>1984. In 1993, Intel and Microsoft introduced a plug and play<br>ISA bus that allowed the computer to automatically detect and so<br>up computer ISA peripherals such as a modem or sound card.                                                                                                                  |  |  |
| Extended industry<br>standard<br>architecture<br>(EISA)                    | EISA is an enhanced form of ISA, which allows for 32-bit data<br>transfers, while maintaining support for 8- and 16-bit expansion<br>boards. However, its bus speed, like ISA, is only 8 MHz. EISA is<br>not widely used, due to its high cost and complicated nature.                                                                                                                   |  |  |
| Micro channel<br>architecture<br>(MCA)                                     | MCA was introduced by IBM in 1987. It offered several additions features over the ISA such as a 32-bit bus, automatically configured cards and bus mastering for greater efficiency. It is slightly superior to EISA, but not many expansion boards were ever made to fit MCA specifications.                                                                                            |  |  |
| VESA (Video<br>electronics<br>standards<br>association) local<br>bus (VLB) | The VESA, a nonprofit organization founded by NEC, released the VLB in 1992. It is a 32-bit bus that had direct access to the system memory at the speed of the processor, commonly the 486 CPU (33/40 MHz). VLB 2.0 was later released in 1994 and had a 64-bit bus and a bus speed of 50 MHz.                                                                                          |  |  |
| Peripheral<br>component<br>interconnect (PCI)                              | PCI was introduced by Intel in 1992, revised in 1993 to version 2.0 and later revised in 1995 to PCI 2.1. It is a 32-bit bus that is also available as a 64-bit bus today. Many modern expansion boards are connected to PCI slots.                                                                                                                                                      |  |  |
| Advanced graphic<br>port (AGP)                                             | AGP was introduced by Intel in 1997. AGP is a 32-bit bus designed for<br>the high demands of 3D graphics. AGP has a direct line to memory,<br>which allows 3D elements to be stored in the system memory<br>instead of the video memory. AGP is geared towards data-intensive<br>graphics cards, such as 3D accelerators; its design allows for data<br>throughput at rates of 266 MB/s. |  |  |

TABLE 8.4 Continued

| Bus/Interface                                      | Description                                                                                                                                                                                                                                                                                                                  |  |
|----------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| Universal serial bus<br>(USB)                      | USB is an external bus developed by Intel, Compaq, DEC, IBM, Microsoft, NEC and Northern Telcom. It was released in 1996 with the Intel 430HX Triton II Mother Board. USB has the capability of transferring 12 Mbps, supporting up to 127 devices. Many devices can be connected to USB ports, which support plug and play. |  |
| FireWire (IEEE<br>1394)                            | FireWire is a type of external bus, which supports very fast transf<br>rates: 400 Mbps. Because of this, FireWire is suitable for<br>connecting video devices, such as VCRs, to the computer.                                                                                                                                |  |
| Small computer<br>system interface<br>(SCSI)       | SCSI is a type of parallel interface that is commonly used for mas<br>storage devices. SCSI can transfer data at rates of 4 MB/s; in<br>addition, there are several varieties of SCSI that support higher<br>speeds: Fast SCSI (10 MB/s), Ultra SCSI and Fast Wide SCSI<br>(20 MB/s), as well as Ultra Wide SCSI (40 MB/s).  |  |
| Integrated drive<br>electronics (IDE)              | IDE is a commonly used interface for hard disk drives and<br>CD-ROM drives. It is less expensive than SCSI, but offers<br>slightly less in terms of performance.                                                                                                                                                             |  |
| Enhanced integrated<br>drive electronics<br>(EIDE) | EIDE is an improved version of IDE, which offers better<br>performance than standard SCSI. It offers transfer rates between<br>4 and 16.6 MB/s.                                                                                                                                                                              |  |
| PCI-X                                              | PCI-X is a high performance bus that is designed to meet the<br>increased I/O demands of technologies such as Fibre Channel,<br>Gigabit Ethernet, and Ultra3 SCSI.                                                                                                                                                           |  |
| Communication and<br>network riser<br>(CNR)        | CNR was introduced by Intel in 2000. It is a specification that<br>supports audio, modem USB and local area networking interfaces<br>of core logic chipsets.                                                                                                                                                                 |  |

# **Accessing I/O Devices:**

Accessing I/O Devices.: In computing, input/output, or I/O, refers to the communication between an information processing system (computer), and the outside world. Inputs are the signals or data received by the system, and outputs are the signals or data sent from it. I/O devices are used by a person other system) to communicate with computer. Some of the input devices are keyboard, mouse, track ball, joy stick, touch screen, digital camera, webcam, image scanner, fingerprint scanner, barcode reader, microphone and so on. Some of the output devices are speakers, headphones, monitors and printers. Devices for communication between computers, such as modems and network cards, typically serve for both input and output. I/O devices can be connected to a computer through a single bus which enables the exchange of information. The bus consists of three sets of lines used to carry address, data, and control signals. Each I/O device is assigned a unique set of addresses. When the processor places a particular address on the address lines, the device that recognizes this address responds to the commands issued on the control lines. The processor requests either a read or a write operation, and the

requested data are transferred over the data lines. Figure 5.1 shows the simple arrangement of I/O devices to processor and memory with single bus.



Figure 5.1 A Single bus structure

Memory-mapped I/O: The arrangement of I/O devices and the memory share the same address space is called memory-mapped I/O. With memory-mapped I/O, any machine instruction that can access memory can be used to transfer data to or from an I/O device. For example, if DATAIN is the address of the input buffer associated with the keyboard, the instruction

### Move DATAIN,R0

reads the data from DATAIN and stores them into processor register RO. Similarly, the instruction

### Move R0.DATAOUT

sends the contents of register R0 to location DATAOUT, which may be the output data buffer of a display unit or a printer. Most computer systems use memory-mapped I/O. Some processors have special In and Out instructions to perform I/O transfers.

Figure 5.2 illustrates the hardware required to connect an I/O device to the bus. The address decoder enables the device to recognize its address when this address appears on the address lines. The data register holds the data being transferred to or from the processor. The status register contains information relevant to the operation of the I/O device. Both the data and status registers are connected to the data bus and assigned unique addresses. The address decoder, the data and status registers, and the control circuitry required to coordinate I/O transfers constitute the device's interface circuit.



Figure 5.2 I/O interface for an input device

I/O devices operate at speeds that are vastly different from that of the processor. When a human operator is entering characters at a keyboard, the processor is capable of executing millions of instructions between successive character entries. An instruction that reads a character from the keyboard should be executed only when a character is available in the input buffer of the keyboard interface. An input character is read only once.

For an input device such as a keyboard, a status flag, SIN, is included in the interface circuit as part of the status register. This flag is set to 1 when a character is entered at the keyboard and cleared to 0 once this character is read by the processor. Hence, by checking the SIN flag, the software can ensure that it is always reading valid data. This is often accomplished in a program loop that repeatedly reads the status register and checks the state of SIN. When SIN becomes equal to 1, the program reads the input data register. A similar procedure can be used to control output operations using an output status flag, SOUT.

## **Example:**

**Program-controlled I/O**: Consider a simple example of I/O operations involving a keyboard and a display device in a computer system. The four registers shown in Figure 5.3 are used in the data transfer operations. Register STATUS contains two control flags, SIN and SOUT, which provide status information for the keyboard and the display unit, respectively. The two flags KIRQ and DIRQ in this register are used in conjunction with interrupts. They, and the KEN and DEN bits in register CONTROL, Data from the keyboard are made available in the DATAIN register, and data sent to the display are stored in the DATAOUT register.



Figure 5.3 Registers in keyboard and display interfaces.

The program in Figure 5.4 reads a line of characters from the keyboard and stores it in a memory buffer starting at location LINE. Then, it calls a subroutine PROCESS to process the input line. As each character is read, it is echoed back to the display. Register R0 is used as a pointer to the memory buffer area. The contents of R0 are updated using the Autoincrement addressing mode so that successive characters are stored in successive memory locations. Each character is checked to see if it is the Carriage Return (CR) character, which has the ASCII code 0D (hex). If it is, a Line Feed character (ASCII code 0A) is sent to move the cursor one line down on the display and subroutine PROCESS is called. Otherwise, the program loops back to wait for another character from the keyboard.

In program-controlled I/O the processor repeatedly checks a status flag to achieve the required synchronization between the processor and an input or output device. The processor polls the device. There are two other commonly used mechanisms for implementing I/O operations: interrupts and direct memory access. In the case of interrupts, synchronization is achieved by having the I/O device send a special signal over the bus whenever it is ready for a data transfer operation. Direct memory access is a technique used for high-speed I/O devices. It involves having the device interface transfer data directly to or from the memory, without continuous involvement by the processor.

| 588 688 | Move      | #LINE,R0      | Initialize memory pointer.                   |  |
|---------|-----------|---------------|----------------------------------------------|--|
| WAITK   | TestBit   | #0,STATUS     | Test SIN.                                    |  |
|         | Branch=0  | WAITK         | Wait for character to be entered.            |  |
|         | Move      | DATAIN,RI     | Read character.                              |  |
| WAITD   | TestBit   | #1,STATUS     | Test SOUT.                                   |  |
|         | Branch=0  | WAITD         | Wait for display to become ready.            |  |
|         | Move      | RI,DATAOUT    | Send character to display.                   |  |
|         | Move      | R1,(RO)+      | Store character and advance pointer.         |  |
|         | Compare   | #\$0D,R1      | Check if Carriage Return.                    |  |
|         | Branch!=0 | WAITK         | If not, get another character.               |  |
|         | Move      | #\$0A,DATAOUT | Otherwise, send Line Feed.                   |  |
|         | Call      | PROCESS       | Call a subroutine to process the input line. |  |

Figure 5.4 A program that reads one line from the keyboard, stores it in memory buffer, and echoes it back to the display.

# **Interrupts:**

**Interrupts**: Interrupt is a hardware signal to the processor from I/O devices through one of the control line called interrupt-request line. The routine executed in response to an interrupt request is called the interrupt-service routine, Interrupts bear considerable resemblance to subroutine calls. Assume that an interrupt request arrives during execution of instruction i in Figure 4.5. The processor first completes execution of instruction i. Then, it loads the program counter with the address of the first instruction of the interrupt-service routine. For the time being, let us assume that this address is hardwired in the processor. After execution of the interrupt-service routine, the processor has to come back to instruction i 1. Therefore, when an interrupt occurs, the current contents of the PC, which point to instruction i 1, must be put in temporary storage in a known location. A Return from-interrupt instruction at the end of the interrupt-service routine reloads the PC from that temporary storage location, causing execution to resume at instruction i 1. In many processors, the return address is saved on the processor stack. Alternatively, it may be saved in a special location, such as a register provided for this purpose.



Figure 5.5 Transfer of control through the use of interrupts

The processor must inform the device that its request h been recognized so that it may remove its interrupt-request signal. This may be accomplished by means of a special control signal on the bus called an interrupt-acknowledge signal. The execution of an instruction in the interrupt - service routine that accesses a status or data register in the device interface implicitly informs the device that its interrupt request has been recognized.



## **Interrupt Hardware:**

**Interrupt Hardware**: In the above discussion, we have assumed that the processor has recognized the occurrence of an interrupt before proceeding to serve it. Computers are provided with interrupt hardware capability in the form of specialized interrupt lines to the processor. These lines are used to send interrupt signals to the processor. In the case of I/O, there exists more than one I/O device. The processor should be provided with a mechanism that enables it to handle simultaneous interrupt requests and to recognize the interrupting device. Two basic schemes can be implemented to achieve this task. The first scheme is called daisy chain bus arbitration (DCBA) and the second is called independent source bus arbitration (ISBA).

According to the DCBA (see Fig. 8.6a), I/O devices present their interrupt requests to the interrupt request line INR (similar to the polling arrangement). Upon recognizing the arrival of an interrupt request, the processor, through a daisy chained grant line (GL), sends its grant to the requesting device to start communication with the processor. The GL goes through all devices starting from the first device nearer to the processor and going to the next device and so on until it reaches the last device (Device #N). If Device #1 has put a request, then it will hold the grant signal and start communication with the processor. If, on the other hand, Device #1 has no interrupt request, it will pass the grant signal to device #2, which will repeat the same procedure, and so on. In the case of multiple requests, the DCBA arrangement gives highest priority to the device physically nearer to the processor. The furthest device from the processor has the lowest priority.

According to the ISBA (see Fig. 8.6b), each I/O device has its own interrupt request line, through which it can send its interrupt request, independent of the other devices. Similarly, each I/O device has its own grant line, through which it receives the grant signal for its request such that it can start communicating with the processor. I/O device priority in the ISBA does not depend on the device location. A priority arbitration circuitry is needed in order to deal with simultaneous interrupt requests. 8.3.2.

**Interrupt in Operating Systems** When an interrupt occurs, the operating system gains control. The operating system saves the state of the interrupted process, analyzes the interrupt, and passes control to the appropriate routine to handle the interrupt. There are several



Figure 8.6 Interrupt hardware schemes. (a) Daisy chain interrupt arrangement (b) Independent interrupt arrangement

types of interrupts, including I/O interrupts. An I/O interrupt notifies the operating system that an I/O device has completed or suspended its operation and needs some service from the CPU. To

process an interrupt, the context of the current process must be saved and the interrupt handling routine must be invoked. This process is called context switching. A process context has two parts: processor context and memory context. The processor context is the state of the CPU's registers including program counter (PC), program status words (PSWs), and other registers. The memory context is the state of the program's memory including the program and data. The interrupt handler is a routine that processes each different type of interrupt.

The operating system must provide programs with save area for their contexts. It also must provide an organized way for allocating and deallocating memory for the interrupted process. When the interrupt handling routine finishes processing the interrupt, the CPU is dispatched to either the interrupted process, or to the highest priority ready process. This will depend on whether the interrupted process is preemptive or nonpreemptive. If the process is nonpreemptive, it gets the CPU again. First the context must be restored, then control is returned to the interrupts process.



Figure 8.7 Layered I/O software

Figure 8.7 shows the layers of software involved in I/O operations. First, the program issues an I/O request via an I/O call. The request is passed through to the I/O device. When the device completes the I/O, an interrupt is sent and the interrupt handler is invoked. Eventually, control is relinquished back to the process that initiated the I/O.

### Examples of interrupt I/O

**Example 1: 80386 Interrupt Architecture** The 8086 processors have just two hardware interrupt pins. These are labeled **INTR** andNMI.NMI nonmaskable interrupt, which means it cannot be blocked and the processor must respond to it. The NMI input is usually reserved for critical system functions. The INTR input is a maskable interrupt request line between the CPU and the programmable interrupt controller (8259A PIC). Interrupts on INTR can be enabled and disabled using the instructions STI (set interrupt flag) and CLI (clear interrupt flag), respectively. Interrupt handlers are called interrupt service routines (ISR). The address of each interrupt service routine is stored in four consecutive memory locations in the interrupt vector table (IVT). The IVT stores pointers to ISR for each type of interrupt. When an interrupt occurs, an 8-bit type number is supplied to the processor, which identifies the appropriate entry in this table.

When an interrupt is generated by a device, it goes to the PIC. Multiple interrupts may be generated simultaneously. However, they are all buffered by the PIC. The PIC decides which one of these interrupts should be forwarded to the CPU. To inform the CPU that an outstanding interrupt is waiting to be processed, the PIC sends an interrupt request (INTR) to the CPU, which then, at the appropriate time, responds with an interrupt acknowledgment (INTA). At this time, PIC will put an 8-bit interrupt type number associated with the device on the bus so that the CPU can identify which interrupt handler to invoke. In the case when several interrupts are pending, PIC will send next interrupt request to the CPU only after it receives an end of interrupt command from the current ISR. Figure 8.8 shows the simple protocol that is used to determine which ISR is to be invoked. In the computer designs that used a single PIC (PC and XT), eight different interrupt requests are allowed (IRQ0–IRQ7). Table 8.1 shows a list of standard interrupt type numbers for typical devices. When AT was designed, a second PIC was added,



Figure 8.8 Interrupt handling in 80×86

increasing the number of interrupt inputs to 15. Figure 8.9 shows two PICS wired in cascade. One PIC is designated as master and the other becomes the slave. As shown in the figure, all slave interrupts are input via IRQ1 of the master. In general, eight different slaves can be accommodated by a single PIC.

**Example 2: ARM Interrupt Architecture** ARM stands for Advanced RISC Machines. ARM is a 16/32-bit architecture that is used for portable devices because of its low power consumption and reasonable performance. Interrupt requests to the ARM core are collected and controlled by the interrupt controller, which is called ATIC. The interrupt controller provides an interface to the core and can collect up to 64 interrupt requests. The usual sequence of events for interrupts is as follows. Interrupts would be enabled at the source (such as a peripheral), then enabled in the interrupt controller, and finally, enabled to the core. When an interrupt occurs at the source, its signal is routed to the interrupt controller then to the ARM core. In the interrupt controller, the interrupt can be enabled or disabled to the core and can be assigned a priority

TABLE 8.1 Standard IBM-PC Interrupt Type Numbers for Typical Devices

| Device                           | IRQ<br>no. | Interrupt<br>type number |
|----------------------------------|------------|--------------------------|
| Programmable interval timer      | 0          | 08H                      |
| Keyboard                         | 1          | 09H                      |
| Cascading to the second PICs     | 2          | Reserved                 |
| Serial communication port (COM2) | 3          | 0BH                      |
| Serial communication port (COM1) | 4          | 0CH                      |
| Fixed disk controller            | 5          | 0DH                      |
| Floppy disk controller           | 6          | 0EH                      |
| Parallel printer controller      | 7          | 0FH                      |



Figure 8.9 Fifteen different interrupts are supported by two PICs wired in cascade

level. Once the interrupt request reaches the core, it will halt the core from its normal processing routines to allow the interrupt request to be serviced. Among the different interrupt requests that the ARM core can handle are IRQ and FIQ requests. The IRQ (normal interrupt request) is used for general-purpose interrupt handling. It has a lower priority than an FIQ (fast interrupt request) and is masked out when an FIQ sequence is entered. The FIQ is used to support highspeed data transfer or channel processes.

TABLE 8.2 Interrupt Vector Table

| Exception type            | Mode       | Address              |
|---------------------------|------------|----------------------|
| Reset                     | Supervisor | 0×00000000           |
| Undefined instructions    | Undefined  | $0 \times 000000004$ |
| Software interrupts (SWI) | Supervisor | $0 \times 000000008$ |
| Prefetch abort            | Abort      | 0×0000000C           |
| Data abort                | Abort      | $0 \times 00000010$  |
| IRQ (Normal interrupt)    | IRQ        | $0 \times 00000018$  |
| FIQ (Fast interrupt)      | FIQ        | $0 \times 0000001C$  |

Similar to the 8086, the addresses of the interrupt handlers are stored in a vector table, which is shown in Table 8.2. For example, when an IRQ is detected by the core, it accesses address 018 of the vector table and executes the instruction loaded in that address. Normally, the instruction found at 018 of the vector table is of the form: LDR PC, IRQ\_Handler (load the address of the IRQ interrupt handler in the PC). When an FIQ is detected by the core, it accesses address 01C of the vector table and executes the instruction loaded in that address. Normally, the instruction found at 01C of the vector table is of the form: LDR PC, FIQ Handler.

## **Enabling and Disabling Interrupts:**

**ENABLING AND DISABLING INTERRUPTS**: There are many situations in which the processor should ignore interrupt requests. For example, in the case of the Compute-Print program of Figure 5.5, an interrupt request from the printer should be accepted only if there are output lines to be printed. After printing the last line of a set of n lines, interrupts should be disabled until another set becomes available for printing.

A simple way is to provide machine instructions, such as Interrupt-enable and Interruptdisable. The processor hardware ignores the interrupt-request line until the execution of the first instruction of the interrupt-service routine has been completed. Then, by using an Interrupt-disable instruction as the first instruction in the interrupt-service routine, the programmer can ensure that no further interruptions will occur until an Interrupt-enable instruction is executed. Typically, the Interrupt-enable instruction will be the last instruction in the interrupt-service routine before the Return-from-interrupt instruction. The processor must guarantee that execution of the Return-interrupt instruction is completed before ¬further interruption can occur.

The second option, which is suitable for a simple processor with only one interrupt- request line, is to have the processor automatically disable interrupts before starting the execution of the interrupt-service routine. After saving the contents of the PC and the processor status register (PS) on the stack, the processor performs the equivalent of executing an Interrupt -disable instruction. It is often the case that one bit in the PS register, called Interrupt-enable, indicates whether

interrupts are enabled. An interrupt request received while this bit is equal to 1 will be accepted. After saving the contents of the PS on the stack, with the Interrupt-enable bit equal to 1, the processor clears the Interrupt-enable bit in its PS register, thus disabling further interrupts. When a Return-frominterrupt instruction is executed, the contents of the PS are restored from the stack, setting the Interrupt-enable bit back to 1. Hence, interrupts are again enabled.

In the third option, the processor has a special interrupt-request line for which the interrupthandling circuit responds only to the leading edge of the signal. Such a line is said to be edgetriggered. In this case, the processor will receive only one request, regardless of how long the line is activated. Hence, there is no danger of multiple interruptions and no need to explicitly disable interrupt requests from this line. Before proceeding to study more complex aspects of interrupts, let us summarize the sequence of events involved in handling an interrupt request from a single device.

## Assuming that interrupts are enabled, the following is a typical scenario:

- 1. The device raises an interrupt request.
- 2. The processor interrupts the program currently being executed.
- 3. Interrupts are disabled by changing the control bits in the PS (except in the case of edge triggered interrupts).
- 4. The device is informed that its request has been recognized, and in response, it deactivates the interrupt-request signal.
- 5. The action requested by the interrupt is performed by the interrupt-service routine.
- 6. Interrupts are enabled and execution of the interrupted program is resumed.

## **Handling Multiple Devices:**

**HANDLING MULTIPLE DEVICES**: Let us now consider the situation where a number of devices capable of initiating interrupts are connected to the processor. Because these devices are operationally independent, there is no definite order in which they will generate interrupts. For example, device X may request an interrupt while an interrupt caused by device Y is being serviced, or several devices may request interrupts at exactly the same time. This gives rise to a number of questions:

- 1. How can the processor recognize the device requesting an interrupt?
- 2. Given that different devices are likely to require different interrupt -service routines, how can the processor obtain the starting address of the appropriate routine in each case?
- 3. Should a device be allowed to interrupt the processor while another interrupt is being serviced?
- 4. How should two or more simultaneous interrupt requests be handled? If two devices have activated the line at the same time, it must be possible to break the tie and elect one of the two requests for service. When the interrupt-service routine for the selected device has been completed, the second request can be serviced.

**Vectored interrupts**: To reduce the time involved in the polling process, a device requesting an interrupt may identify itself directly to the processor. Then, the processor can immediately start executing the corresponding interrupt-service routine. The term vectored interrupts refers to all interrupt - handling schemes based on this approach.

A device requesting an interrupt can identify itself by sending a special code to the processor over the bus. This enables the processor to identify individual devices even if they share a single interrupt-request line. The code supplied by the device may represent the starting address of the interrupt-service routine for that device. The code length is typically in the range of 4 to 8 bits. The remainder of the address is supplied by the processor based on the area in its memory where the addresses for interrupt-service routines are located.

This arrangement implies that the interrupt-service routine for a given device must always start at the same location. The programmer can gain some flexibility by storing in this location an instruction that causes a branch to the appropriate routine. In many computers, this is done automatically by the interrupt-handling mechanism. The location pointed to by the interrupting device is used to store the starting address of the interrupt-service routine. The processor reads this address, called the interrupt vector, and loads it into the PC. The interrupt vector may also include a new value for the processor status register.

In most computers, I/O devices send the interrupt-vector code over the data bus, using the bus control signals to ensure that devices do not interfere with each other. When a device sends an interrupt request, the processor may not be ready to receive the interrupt-vector code immediately. For example, it must first complete the execution of the current instruction, which may require the use of the bus. There may be further delays if interrupts happen to be disabled at the time the request is raised. The interrupting device must wait to put data on the bus only when the processor is ready to receive it. When the processor is ready to receive the interrupt-vector code, it activates the interrupt-acknowledge line, INTA. The I/O device responds by sending its interrupt- vector code and turning off the INTR signal.

**Interrupt nesting**: I/O devices should be organized in a priority structure. An interrupt request from a highpriority device should be accepted while the processor is servicing another request from a lowerpriority device. A multiple-level priority organization means that during execution of an interruptservice routine, interrupt requests will be accepted from some devices but not from others, depending upon the device's priority. To implement this scheme, priority level can be assigned processor that can be changed bv the program. The processor's priority is usually encoded in a few bits of the processor status word. It can be changed by program instructions that write into the PS. These are privileged instructions, which can be executed only while the processor is running in the supervisor mode. The processor is in the supervisor mode only when executing operating system routines. It switches to the user mode before beginning to execute application programs. Thus, a user program cannot accidentally, or intentionally, change the priority of the processor and disrupt the system's operation. An attempt to execute a privileged instruction while in the user mode leads to a special type of interrupt called a privilege exception.

A multiple-priority scheme can be implemented easily by using separate interrupt- request and interrupt-acknowledge lines for each device, as shown in Figure 5.7. Each of the interrupt request lines is assigned a different priority level. Interrupt requests received over these lines are sent to a

priority arbitration circuit in the processor. A request is accepted only if it has a higher priority level than that currently assigned to the processor.



Figure 5.7 Implementation of interrupt priority using individual interrupt-request and acknowledge lines.

**Simultaneous request**: When simultaneous interrupt requests are arrived from two or more I/O devices to the processor, the processor must have some means of deciding which request to service first.

Using a priority scheme such as that of Figure 5.7, the solution is straightforward. The processor simply accepts the request having the highest priority. If several devices share one interrupt-request line, as in Figure 5.6, some other mechanism is needed. Polling the status registers of the I/O devices is the simplest such mechanism. In this case, priority is determined by the order in which the devices are polled. When vectored interrupts are used, we must ensure that only one device is selected to send its interrupt vector code. A widely used scheme is to connect the devices to form a daisy chain, as shown in Figure 5.8. The interruptrequest line INTR is common to all devices. The interrupt-acknowledge line, INTA, is connected in a daisy-chain fashion, such that the INTA signal propagates serially through the devices. When several devices raise an interrupt request and the INTR line is activated, the processor responds by setting the INTA line to 1. This signal is received by device 1. Device 1 passes the signal on to device 2 only if it does not require any service. If device 1 has a pending request for interrupt, it blocks the INTA signal and proceeds to put its identifying code on the data lines. Therefore, in the daisy-chain arrangement, the device that is electrically closest to the processor has the highest priority. The second device along the chain has second highest priority, and so on.

## **Controlling Device Requests:**

CONTROLLING DEVICE REQUESTS: The control needed is usually provided in the form of an interrupt-enable bit in the device's interface circuit. The keyboard interrupt-enable, KEN, and display interrupt- enable, DEN, flags in register CONTROL in Figure 5.3 perform this function. If either of these flags is set, the interface circuit generates an interrupt request whenever the corresponding status flag in register STATUS is set. At the same time, the interface circuit sets bit KIRQ or DIRQ to indicate that the keyboard or display unit, respectively, is requesting an interrupt. If an interrupt-enable bit is equal to 0, the interface circuit will not generate an interrupt request, regardless of the state of the status flag. There are two independent mechanisms for controlling interrupt requests. At the device end, an interrupt-enable bit in a control register determines whether the device is allowed to generate an

interrupt request. At the processor end, either an interrupt enable bit in the PS register or a priority structure determines whether a given interrupt request will be accepted.

**EXCEPTIONS:** The term exception is often used to refer to any event that causes an interruption. Hence, I/O interrupts are one example of an exception.

Recovery from errors: Computers use a variety of techniques to ensure that all hardware components are operating properly. For example, many computers include an error-checking code in the main memory, which allows detection of errors in the stored data. If an error occurs, the control hardware detects it and informs the processor by raising an interrupt. The processor may also interrupt a program if it detects an error or an unusual condition while executing the instructions of this program. For example, the OP-code field of an instruction may not correspond to any legal instruction, or an arithmetic instruction may attempt a division by zero. When exception processing is initiated as a result of such errors, the processor proceeds in exactly the same manner as in the case of an I/O interrupt request. It suspends the program being executed and starts an exception-service routine. This routine takes appropriate action to recover from the error, if possible, or to inform the user about it. Recall that in the case of an I/O interrupt, the processor completes execution of the instruction in progress before accepting the interrupt. However, when an interrupt is caused by an error, execution of the interrupted instruction cannot usually be completed, and the processor begins exception processing immediately.

**Debugging**: Another important type of exception is used as an aid in debugging programs. System software usually includes a program called a debugger, which helps the programmer find errors in a program. The debugger uses exceptions to provide two important facilities called trace and breakpoints.

When a processor is operating in the trace mode, an exception occurs after execution of every instruction, using the debugging program as the exception-service routine. The debugging program enables the user to examine the contents of registers, memory locations, and so on. On return from the debugging program, the next instruction in the program being debugged is executed, then the debugging program is activated again. The trace exception is disabled during the execution of the debugging program. Breakpoints provide a similar facility, except that the program being debugged is interrupted only at specific points selected by the user. An instruction called Trap or Software interrupt is usually provided for this purpose. Execution of this instruction results in exactly the same actions as when a hardware interrupt request is received. While debugging a program, the user may wish to interrupt program execution after instruction i. The debugging routine saves instruction i 1 and replaces it with a software interrupt instruction. When the program is executed and reaches that point, it is interrupted and the debugging routine is activated. This gives the user a chance to examine memory and register contents. When the user is ready to continue executing the program being debugged, the debugging routine restores the saved instruction that was at location i 1 and executes a Return-from-interrupt instruction.

**Privilege exception**: To protect the operating system of a computer from being corrupted by user programs, certain instructions can be executed only while the processor is in the supervisor mode. These are called privileged instructions. For example, when the processor is running in the user mode, it will not execute an instruction that changes the priority level of the processor or that enables a user program to access areas in the computer memory that have been allocated to other

users. An attempt to execute such an instruction will produce a privilege exception, causing the processor to switch to the supervisor mode and begin executing an appropriate routine in the operating system.

## **Use of Interrupts in Operating Systems:**

USE OF INTERRUPTS IN OPERATING SYSTEMS: The operating system (OS) is responsible for coordinating all activities within a computer. It makes extensive use of interrupts to perform I/O operations and communicate with and control the execution of user programs. The operating system incorporates the interrupt-service routines (ISR) for all devices connected to a computer. Application programs do not perform I/O operations themselves. An operating system provides a variety of services to application programs. To facilitate the implementation of these services, most processors have several different software interrupt instructions, each with its own interrupt vector. They can be used to call different parts of the OS.

- In a computer that has both a supervisor and a user mode, the processor switches its operation to supervisor mode at the time it accepts an interrupt request. It does so by setting a bit in the processor status register after saving the old contents of that register on the stack. Thus, when an application program calls the as by a software interrupt instruction, the processor automatically switches to supervisor mode, giving the as complete access to the computer's resources. When the as executes a Return-from-interrupt instruction, the processor status word belonging to the application program is restored from the stack. As a result, the processor switches back to the user mode.
- Multitasking is a mode of operation in which a processor executes several user programs at the same time. A common as technique that makes this possible is called time slicing. when operating system is started, an initialization routine OSINIT called for initialization. OSINIT loads the starting address of a routine called SCHEDULER in the interrupt vector corresponding to the timer interrupt. Hence, at the end of each time slice, the timer interrupt causes this routine to be executed.
- A program, together with any information that describes its current state of execution, is regarded by the as an entity called a process. A process can be in one of three states: Running, Runnable, or Blocked. The Running state means that the program is currently being executed. Assume that program A is in the Running state during a given time slice. At the end of that time slice, the timer interrupts the execution of this program and starts the execution of SCHEDULER. This is an operating system routine whose function is to determine which user program should run in the next time slice. It starts by saving all the information that will be needed later when execution of program A is resumed. The information saved, which is called

- the program state, includes register contents, the program counter, and the processor status word.
- SCHEDULER selects for execution some other program, B, that was suspended earlier and is in the Runnable state. It restores all information saved at the time program R was suspended, including the contents of PS and PC, and executes a Retum-from-interrupt instruction. As a result, program B resumes execution for T seconds, at the end of which the timer clock raises an interrupt again, and a context switch to another runnable process takes place. Suppose that program A needs to read an input line from the keyboard. Instead of performing the operation itself, it requests I/O service from the operating system. It uses the stack or the processor registers to pass information to the OS describing the required operation, the I/O device, and the address of a buffer in the program data area where the line should be placed. Then it executes a software interrupt instruction. The interrupt vector for s instruction points to the OS SERVICES. This routine examines the information on the stack and initiates the requested operation by calling an appropriate OS routine. In our example, it calls, which is a routine responsible for starting I/O operations.
- While an I/O operation is in progress, the program that requested it cannot continue execution. Hence, the 10INIT routine sets the process associated with program A into the Blocked state, indicating to the scheduler that the program cannot resume execution at this time. The 10INIT routine carries out any preparations needed for the I/O operation, such as initializing address pointers and byte count, then calls a routine that performs the I/O transfers. It is common practice in operating system design to encapsulate all software pertaining to a particular device into a elf-contained module called the device driver. Such a module can be easily added to or deleted from the OS. We have assumed that the device driver for the keyboard consists of two routines, KBDINIT and KBDDATA, as shown n Figure 4.1 Oc. The 10INIT routine calls KBDINIT, which performs any initialization operations needed by the device or its interface circuit. KBDINIT also enables interrupts in the interface circuit by setting the appropriate bit in its control register, d then it returns to IOINIT, which returns to OSSERVICES. The keyboard is now ready to participate in a data transfer operation. It will generate an interrupt request whenever a key is pressed.
- Following the return to OSSERVICES, the SCHEDULER routine selects another user program to run. Of course, the scheduler will not select program A, because that program is now in the Blocked state. The Return-from-interrupt instruction that causes the selected user program to begin execution will also enable interrupts in the processor by loading new contents into the

processor status register. Thus, an interrupt request generated by the keyboard's interface will be accepted. The interrupt vector for this interrupt points to an OS routine called 10DATA. Because there could be several devices connected to the same interrupt request line, 10DATA begins by polling these devices to determine the one requesting service. Then, it calls the appropriate device driver to service the request. In our example, the driver called will be KBDDATA, which will transfer one character of data. If the character is a Carriage Return, it will also set to 1 a flag called END, to inform 10DATA that the requested I/O operation has been completed. At this point, the 10DATA routine changes the state of process A from Blocked to Runnable, so that the scheduler may select it for execution in some future time slice.

## **Handling Multiple Devices:**

**HANDLING MULTIPLE DEVICES**: Let us now consider the situation where a number of devices capable of initiating interrupts are connected to the processor. Because these devices are operationally independent, there is no definite order in which they will generate interrupts. For example, device X may request an interrupt while an interrupt caused by device Y is being serviced, or several devices may request interrupts at exactly the same time. This gives rise to a number of questions:

- 1. How can the processor recognize the device requesting an interrupt?

  2. Given that different devices are likely to require different interrupt -service routines, how can the processor obtain the starting address of the appropriate routine in each case?

  3. Should a device be allowed to interrupt the processor while another interrupt is being serviced?
- 3. Should a device be allowed to interrupt the processor while another interrupt is being serviced?
- 4. How should two or more simultaneous interrupt requests be handled? If two devices have activated the line at the same time, it must be possible to break the tie and elect one of the two requests for service. When the interrupt-service routine for the selected device has been completed, the second request can be serviced.

**Vectored interrupts**: To reduce the time involved in the polling process, a device requesting an interrupt may identify itself directly to the processor. Then, the processor can immediately start executing the corresponding interrupt-service routine. The term vectored interrupts refers to all interrupt - handling schemes based on this approach.

A device requesting an interrupt can identify itself by sending a special code to the processor over the bus. This enables the processor to identify individual devices even if they share a single interrupt-request line. The code supplied by the device may represent the starting address of the interrupt-service routine for that device. The code length is typically in the range of 4 to 8 bits. The remainder of the address is supplied by the processor based on the area in its memory where the addresses for interrupt-service routines are located.

This arrangement implies that the interrupt-service routine for a given device must always start at the same location. The programmer can gain some flexibility by storing in this location an instruction that causes a branch to the appropriate routine. In many computers, this is done automatically by the interrupt-handling mechanism. The location pointed to by the interrupting device is used to store the starting address of the interrupt-service routine. The processor reads this address, called the interrupt vector, and loads it into the PC. The interrupt vector may also include a new value for the processor status register.

In most computers, I/O devices send the interrupt-vector code over the data bus, using the bus control signals to ensure that devices do not interfere with each other. When a device sends an interrupt request, the processor may not be ready to receive the interrupt-vector code immediately. For example, it must first complete the execution of the current instruction, which may require the use of the bus. There may be further delays if interrupts happen to be disabled at the time the request is raised. The interrupting device must wait to put data on the bus only when the processor is ready to receive it. When the processor is ready to receive the interrupt-vector code, it activates the interrupt-acknowledge line, INTA. The I/O device responds by sending its interrupt-vector code and turning off the INTR signal.

Interrupt nesting: I/O devices should be organized in a priority structure. An interrupt request from a high priority device should be accepted while the processor is servicing another request from a lower priority device. A multiple-level priority organization means that during execution of an interrupt service routine, interrupt requests will be accepted from some devices but not from others, depending upon the device's priority. To implement this scheme, priority level can be assigned processor to the that can be changed by the The processor's priority is usually encoded in a few bits of the processor status word. It can be changed by program instructions that write into the PS. These are privileged instructions, which can be executed only while the processor is running in the supervisor mode. The processor is in the supervisor mode only when executing operating system routines. It switches to the user mode before beginning to execute application programs. Thus, a user program cannot accidentally, or intentionally, change the priority of the processor and disrupt the system's operation. An attempt to execute a privileged instruction while in the user mode leads to a special type of interrupt called privilege

A multiple-priority scheme can be implemented easily by using separate interrupt- request and interrupt-acknowledge lines for each device, as shown in Figure 5.7. Each of the interrupt request lines is assigned a different priority level. Interrupt requests received over these lines are sent to a priority arbitration circuit in the processor. A request is accepted only if it has a higher priority level than that currently assigned to the processor.



Figure 5.7 Implementation of interrupt priority using individual interrupt-request and acknowledge lines.

**Simultaneous request**: When simultaneous interrupt requests are arrived from two or more I/O devices to the processor, the processor must have some means of deciding which request to service first.

Using a priority scheme such as that of Figure 5.7, the solution is straightforward. The processor simply accepts the request having the highest priority. If several devices share one interrupt-request line, as in Figure 5.6, some other mechanism is needed. Polling the status registers of the I/O devices is the simplest such mechanism. In this case, priority is determined by the order in which the devices are polled. When vectored interrupts are used, we must ensure that only one device is selected to send its interrupt vector code. A widely used scheme is to connect the devices to form a daisy chain, as shown in Figure 5.8. The interruptrequest line INTR is common to all devices. The interrupt-acknowledge line, INTA, is connected in a daisy-chain fashion, such that the INTA signal propagates serially through the devices. When several devices raise an interrupt request and the INTR line is activated, the processor responds by setting the INTA line to 1. This signal is received by device 1. Device 1 passes the signal on to device 2 only if it does not require any service. If device 1 has a pending request for interrupt, it blocks the INTA signal and proceeds to put its identifying code on the data lines. Therefore, in the daisy-chain arrangement, the device that is electrically closest to the processor has the highest priority. The second device along the chain has second highest priority, and so on.

## **Controlling Device Requests:**

CONTROLLING DEVICE REQUESTS: The control needed is usually provided in the form of an interrupt-enable bit in the device's interface circuit. The keyboard interrupt-enable, KEN, and display interrupt- enable, DEN, flags in register CONTROL in Figure 5.3 perform this function. If either of these flags is set, the interface circuit generates an interrupt request whenever the corresponding status flag in register STATUS is set. At the same time, the interface circuit sets bit KIRQ or DIRQ to indicate that the keyboard or display unit, respectively, is requesting an interrupt. If an interrupt-enable bit is equal to 0, the interface circuit will not generate an interrupt request, regardless of the state of the status flag. There are two independent mechanisms for controlling interrupt requests. At the device end, an interrupt-enable bit in a control register determines whether the device is allowed to generate an interrupt request. At the processor end, either an interrupt enable bit in the PS register or a priority structure determines whether a given interrupt request will be accepted.

**EXCEPTIONS:** The term exception is often used to refer to any event that causes an interruption. Hence, I/O interrupts are one example of an exception.

**Recovery from errors**: Computers use a variety of techniques to ensure that all hardware components are operating properly. For example, many computers include an error-checking code in the main memory, which allows detection of errors in the stored data. If an error occurs, the control hardware detects it and informs the processor by raising an interrupt. The processor may

also interrupt a program if it detects an error or an unusual condition while executing the instructions of this program. For example, the OP-code field of an instruction may not correspond to any legal instruction, or an arithmetic instruction may attempt a division by zero. When exception processing is initiated as a result of such errors, the processor proceeds in exactly the same manner as in the case of an I/O interrupt request. It suspends the program being executed and starts an exception-service routine. This routine takes appropriate action to recover from the error, if possible, or to inform the user about it. Recall that in the case of an I/O interrupt, the processor completes execution of the instruction in progress before accepting the interrupt. However, when an interrupt is caused by an error, execution of the interrupted instruction cannot usually be completed, and the processor begins exception processing immediately.

**Debugging**: Another important type of exception is used as an aid in debugging programs. System software usually includes a program called a debugger, which helps the programmer find errors in a program. The debugger uses exceptions to provide two important facilities called trace and breakpoints.

When a processor is operating in the trace mode, an exception occurs after execution of every instruction, using the debugging program as the exception-service routine. The debugging program enables the user to examine the contents of registers, memory locations, and so on. On return from the debugging program, the next instruction in the program being debugged is executed, then the debugging program is activated again. The trace exception is disabled during the execution of the debugging program. Breakpoints provide a similar facility, except that the program being debugged is interrupted only at specific points selected by the user. An instruction called Trap or Software interrupt is usually provided for this purpose. Execution of this instruction results in exactly the same actions as when a hardware interrupt request is received. While debugging a program, the user may wish to interrupt program execution after instruction i. The debugging routine saves instruction i 1 and replaces it with a software interrupt instruction. When the program is executed and reaches that point, it is interrupted and the debugging routine is activated. This gives the user a chance to examine memory and register contents. When the user is ready to continue executing the program being debugged, the debugging routine restores the saved instruction that was at location i 1 and executes a Return-from-interrupt instruction.

**Privilege exception**: To protect the operating system of a computer from being corrupted by user programs, certain instructions can be executed only while the processor is in the supervisor mode. These are called privileged instructions. For example, when the processor is running in the user mode, it will not execute an instruction that changes the priority level of the processor or that enables a user program to access areas in the computer memory that have been allocated to other users. An attempt to execute such an instruction will produce a privilege exception, causing the processor to switch to the supervisor mode and begin executing an appropriate routine in the operating system.

## **Use of Interrupts in Operating System:**

**USE OF INTERRUPTS IN OPERATING SYSTEMS**: The operating system (OS) is responsible for coordinating all activities within a computer. It makes extensive use of interrupts to perform I/O operations and communicate with and control the execution of user programs. The operating system incorporates the interrupt-service routines (ISR) for all devices connected to a

computer. Application programs do not perform I/O operations themselves. An operating system provides a variety of services to application programs. To facilitate the implementation of these services, most processors have several different software interrupt instructions, each with its own interrupt vector. They can be used to call different parts of the OS.

- In a computer that has both a supervisor and a user mode, the processor switches its operation to supervisor mode at the time it accepts an interrupt request. It does so by setting a bit in the processor status register after saving the old contents of that register on the stack. Thus, when an application program calls the as by a software interrupt instruction, the processor automatically switches to supervisor mode, giving the as complete access to the computer's resources. When the as executes a Return-from-interrupt instruction, the processor status word belonging to the application program is restored from the stack. As a result, the processor switches back to the user mode.
- Multitasking is a mode of operation in which a processor executes several user programs at the same time. A common as technique that makes this possible is called time slicing. when operating system is started, an initialization routine OSINIT called for initialization. OSINIT loads the starting address of a routine called SCHEDULER in the interrupt vector corresponding to the timer interrupt. Hence, at the end of each time slice, the timer interrupt causes this routine to be executed.
- A program, together with any information that describes its current state of execution, is regarded by the as an entity called a process. A process can be in one of three states: Running, Runnable, or Blocked. The Running state means that the program is currently being executed. Assume that program A is in the Running state during a given time slice. At the end of that time slice, the timer interrupts the execution of this program and starts the execution of SCHEDULER. This is an operating system routine whose function is to determine which user program should run in the next time slice. It starts by saving all the information that will be needed later when execution of program A is resumed. The information saved, which is called the program state, includes register contents, the program counter, and the processor status word.
- SCHEDULER selects for execution some other program, B, that was suspended earlier and is in the Runnable state. It restores all information saved at the time program R was suspended, including the contents of PS and PC, and executes a Retum-from-interrupt instruction. As a result, program B resumes execution for T seconds, at the end of which the timer clock raises an interrupt again, and a context switch to another runnable process takes place. Suppose that program A needs to read an input line from the keyboard. Instead of performing the

operation itself, it requests I/O service from the operating system. It uses the stack or the processor registers to pass information to the OS describing the required operation, the I/O device, and the address of a buffer in the program data area where the line should be placed. Then it executes a software interrupt instruction. The interrupt vector for s instruction points to the OS SERVICES. This routine examines the information on the stack and initiates the requested operation by calling an appropriate OS routine. In our example, it calls, which is a routine responsible for starting I/O operations.

- While an I/O operation is in progress, the program that requested it cannot continue execution. Hence, the 10INIT routine sets the process associated with program A into the Blocked state, indicating to the scheduler that the program cannot resume execution at this time. The 10INIT routine carries out any preparations needed for the I/O operation, such as initializing address pointers and byte count, then calls a routine that performs the I/O transfers. It is common practice in operating system design to encapsulate all software pertaining to a particular device into a elf-contained module called the device driver. Such a module can be easily added to or deleted from the OS. We have assumed that the device driver for the keyboard consists of two routines, KBDINIT and KBDDATA, as shown n Figure 4.1 Oc. The 10INIT routine calls KBDINIT, which performs any initialization operations needed by the device or its interface circuit. KBDINIT also enables interrupts in the interface circuit by setting the appropriate bit in its control register, d then it returns to IOINIT, which returns to OSSERVICES. The keyboard is now ready to participate in a data transfer operation. It will generate an interrupt request whenever a key is pressed.
- Following the return to OSSERVICES, the SCHEDULER routine selects another user program to run. Of course, the scheduler will not select program A, because that program is now in the Blocked state. The Return-from-interrupt instruction that causes the selected user program to begin execution will also enable interrupts in the processor by loading new contents into the processor status register. Thus, an interrupt request generated by the keyboard's interface will be accepted. The interrupt vector for this interrupt points to an OS routine called 10DATA. Because there could be several devices connected to the same interrupt request line, 10DATA begins by polling these devices to determine the one requesting service. Then, it calls the appropriate device driver to service the request. In our example, the driver called will be KBDDATA, which will transfer one character of data. If the character is a Carriage Return, it will also set to 1 a flag called END, to inform 10DATA that the requested I/O operation has been completed. At this

point, the 10DATA routine changes the state of process A from Blocked to Runnable, so that the scheduler may select it for execution in some future time slice.

# **Direct Memory Access:**

**DIRECT MEMORY ACCESS (DMA):** The main idea of direct memory access (DMA) is to enable peripheral devices to cut out the "middle man" role of the CPU in data transfer. It allows peripheral devices to transfer data directly from and to memory without the intervention of the CPU. Having peripheral devices access memory directly would allow the CPU to do other work, which would lead to improved performance, especially in the cases of large transfers. The DMA controller is a piece of hardware that controls one or more peripheral devices. It allows devices to transfer data to or from the system's memory without the help of the processor. In a typical DMA transfer, some event notifies the DMA controller that data needs to be transferred to or from memory. Both the DMA and CPU use memory bus and only one or the other can use the memory at the same time. The DMA controller then sends a request to the CPU asking its permission to use the bus. The CPU returns an acknowledgment to the DMA controller granting it bus access. The DMA can now take control of the bus to independently conduct memory transfer. When the transfer is complete the DMA relinquishes its control of the bus to the CPU. Processors that support DMA provide one or more input signals that the bus requester can assert to gain control of the bus and one or more output signals that the CPU asserts to indicate it has relinquished the bus. Figure 8.10 shows how the DMA controller shares the CPU's memory bus.



Figure 8.10 DMA controller shares the CPU's memory bus

Direct memory access controllers require initialization by the CPU. Typical setup parameters include the address of the source area, the address of the destination area, the length of the block, and whether the DMA controller should generate a processor interrupt once the block transfer is complete. A DMA controller has an address register, a word count register, and a control register. The address register contains an address that specifies the memory location of the data to be transferred. It is typically possible to have the DMA controller automatically increment the address register after each word transfer, so that the next transfer will be from the next memory location.

The word count register holds the number of words to be transferred. The word count is decremented by one after each word transfer. The control register specifies the transfer mode.

Direct memory access data transfer can be performed in burst mode or singlecycle mode. In burst mode, the DMA controller keeps control of the bus until all the data has been transferred to (from) memory from (to) the peripheral device. This mode of transfer is needed for fast devices where data transfer cannot be stopped until the entire transfer is done. In single-cycle mode (cycle stealing), the DMA controller relinquishes the bus after each transfer of one data word. This minimizes the amount of time that the DMA controller keeps the CPU from controlling the bus, but it requires that the bus request/acknowledge sequence be performed for every single transfer. This overhead can result in a degradation of the performance. The single-cycle mode is preferred if the system cannot tolerate more than a few cycles of added interrupt latency or if the peripheral devices can buffer very large amounts of data, causing the DMA controller to tie up the bus for an excessive amount of time.

## The following steps summarize the DMA operations:

- 1. DMA controller initiates data transfer.
- 2. Data is moved (increasing the address in memory, and reducing the count of words to be moved).
- 3. When word count reaches zero, the DMA informs the CPU of the termination by means of an interrupt.
- 4. The CPU regains access to the memory bus.

A DMA controller may have multiple channels. Each channel has associated with it an address register and a count register. To initiate a data transfer the device driver sets up the DMA channel's address and count registers together with the direction of the data transfer, read or write. While the transfer is taking place, the CPU is free to do other things. When the transfer is complete, the CPU is interrupted. Direct memory access channels cannot be shared between device drivers. A device driver must be able to determine which DMA channel to use. Some devices have a fixed DMA channel, while others are more flexible, where the device driver can simply pick a free DMA channel to use.

Linux tracks the usage of the DMA channels using a vector of dma\_chan data structures (one per DMA channel). The dma\_chan data structure contains just two fields, a pointer to a string describing the owner of the DMA channel and a flag indicating if the DMA channel is allocated or not.

# **Buses:**

**BUSES**: A bus in computer terminology represents a physical connection used to carry a signal from one point to another. The signal carried by a bus may represent address, data, control signal, or power. Typically, a bus consists of a number of connections running together. Each connection is called a bus line. A bus line is normally identified by a number. Related groups of bus lines are usually identified by a name. For example, the group of bus lines 1 to 16 in a given computer system may be used to carry the address of memory locations, and therefore are identified as address lines.

Depending on the signal carried, there exist at least four types of buses: address, data, control, and power buses. Data buses carry data, control buses carry control signals, and power buses carry the power-supply/ground voltage. The size (number of lines) of the address, data, and control bus varies from one system to another. Consider, for example, the bus connecting a CPU and memory in a given system, called the CPU bus. The size of the memory in that system is 512Mword and each word is 32 bits. In such system, the size of the address bus should be  $log2(5122^{20}) = 29 lines$ , the size of the data bus should be 32 lines, and at least one control line (RR=W) should exist in that system. In addition to carrying control signals, a control bus can carry timing signals. These are signals used to determine the exact timing for data transfer to and from a bus; that is, they determine when a given computer system component, such as the processor, memory, or I/O devices, can place data on the bus and when they can receive data from the bus. A bus can be synchronous if data transfer over the bus is controlled by a bus clock. The clock acts as the timing reference for all bus signals. A bus is asynchronous if data transfer over the bus is based on the availability of the data and not on a clock signal. Data is transferred over an asynchronous bus using a technique called handshaking. The operations of synchronous and asynchronous buses are explained below.

To understand the difference between synchronous and asynchronous, let us consider the case when a master such as a CPU or DMA is the source of data to be transferred to a slave such as an I/O device. The following is a sequence of events involving the master and slave:

- 1. Master: send request to use the bus
- 2. Master: request is granted and bus is allocated to master
- 3. Master: place address/data on bus
- 4. Slave: slave is selected5. Master: signal data transfer
- 6. Slave: take data
  7. Master: free the bus

## **Synchronous and Asynchronous Buses:**

**Bus Protocols**: A bus is a communication channel shared by many devices and hence rules need to be established in order for the communication to happen correctly. These rules are called bus protocols. Design of a bus architecture involves several tradeoffs related to the width of the data bus, data transfer size, bus protocols, clocking, etc. Depending on whether the bus transactions are controlled by a clock or not, buses are classified into synchronous and asynchronous buses. Depending on whether the data bits are sent on parallel wires or multiplexed onto one single wire, there are parallel and serial buses. Control of the bus communication in the presence of multiple devices necessitates defined procedures called arbitration schemes. In this section, different kinds of buses and arbitration schemes are described.

Synchronous Buses: In synchronous buses, the steps of data transfer take place at fixed clock cycles. Everything is synchronized to bus clock and clock signals are made available to both master and slave. The bus clock is a square wave signal. A cycle starts at one rising edge of the clock and ends at the next rising edge, which is the beginning of the next cycle. A transfer may take multiple bus cycles depending on the speed parameters of the bus and the two ends of the transfer. One scenario would be that on the first clock cycle, the master puts an address on the address bus, puts data on the data bus, and asserts the appropriate control lines. Slave recognizes its address on the address bus on the first cycle and reads the new value from the bus in the second cycle. Synchronous buses are simple and easily implemented. However, when connecting devices with varying speeds to a synchronous bus, the slowest device will determine the speed of the bus. Also, the synchronous bus length could be limited to avoid clock-skewing problems.



Figure 6. Read Operations on an Asynchronous Bus

A memory read transaction on the synchronous bus typically proceeds as illustrated in Fig. 5. During the first clock cycle the CPU places the address of the location it wants to read, on the address lines of the bus. Later during the same clock cycle, once the address lines have stabilized,

the READ request is asserted by the CPU. Many times, some of these control signals are active low and asserting the signal means that they are pulled low. A few clock cycles are needed for the memory to perform accessing of the requested location. In a simple non-pipelined bus, these appear as wait states and the data is placed on the bus by the memory after the tow or three wait cycles. The CPU then releases the bus by deasserting the READ control signal. The write transaction is similar except that the processor is the data source and the WRITE signal is the one that is asserted. Different bus architectures synchronize bus operations with respect to the rising edge or falling edge or level of the clock signal.

**Asynchronous Buses**: There are no fixed clock cycles in asynchronous buses. Handshaking is used instead. Figure 8.11 shows the handshaking protocol. The master asserts the data-ready line



Figure 8.11 Asynchronous bus timing using handshaking protocol

(point 1 in the figure) until it sees a data-accept signal. When the slave sees a dataready signal, it will assert the data-accept line (point 2 in the figure). The rising of the data-accept line will trigger the falling of the data-ready line and the removal of data from the bus. The falling of the data-ready line (point 3 in the figure) will trigger the falling of the data-accept line (point 4 in the figure). This handshaking, which is called fully interlocked, is repeated until the data is completely transferred. Asynchronous bus is appropriate for different speed devices.

An asynchronous bus has no system clock. Handshaking is done to properly conduct the transmission of data between the sender and the receiver. The process is illustrated in Fig. 6. For example, in an asynchronous read operation, the bus master puts the address and control signals on the bus and then asserts a synchronization signal. The synchronization signal from the master prompts the slave to get synchronized and once it has accessed the data, it asserts its own synchronization signal. The slave's synchronization signal indicates to the processor that there is valid data on the bus, and it reads the data. The master then deasserts its synchronization signal, which indicates to the slave that the master has read the data. The slave then deasserts its synchronization signal. This method of synchronization is referred to as a full handshake. Note

that there is no clock and that starting and ending of the data transfer are indicated by special synchronization signals. An asynchronous communication protocol can be considered as a pair of Finite State machines (FSMs) that operate in such a way that one FSM does not proceed until the other FSM has reached a certain state.

Synchronous buses are typically faster than asynchronous buses because there is no overhead to establish a time reference for each transaction. Another reason that helps the synchronous bus to operate fast is that the bus protocol is predetermined and very little logic is involved in implementing the Finite State machine. However, synchronous buses are affected by clock skew and they cannot be very long. But asynchronous buses work well even when they are long because clock skew problems do not affect them. Thus asynchronous buses can handle longer physical distances and higher number of devices. Processor-memory buses are typically synchronous because the devices connected to the bus are fast, are small in number and are located in close proximity. I/O buses are typically asynchronous because many peripherals need only slow data rates and are physically situated far away.

## **Bus Arbitration:**

**Bus Arbitration**: Bus arbitration is needed to resolve conflicts when two or more devices want to become the bus master at the same time. In short, arbitration is the process of selecting the next bus master from among multiple candidates. Conflicts can be resolved based on fairness or priority in a centralized or distributed mechanisms. Centralized Arbitration In centralized arbitration schemes, a single arbiter is used to select the next master. A simple form of centralized arbitration uses a bus request line, a bus grant line, and a bus busy line. Each of these lines is shared by potential masters, which are daisy-chained in a cascade. Figure 8.12 shows this simple centralized arbitration scheme. In the figure, each of the potential masters can submit a bus request at any time.

A fixed priority is set among the masters from left to right. When a bus request is received at the central bus arbiter, it issues a bus grant by asserting the bus grant line. When the potential master that is closest to the arbiter (potential master 1) sees the bus grant signal, it checks to see if it had made a bus request. If yes, it takes over the bus and stops propagation of the bus grant signal any further. If it has not made a request, it will simple turn the bus grant signal to the next master to the right (potential master 2), and so on. When the transaction is complete, the busy line is deasserted.

Instead of using shared request and grant lines, multiple bus request and bus grant lines can be used. In one scheme, each master will have its own independent request and grant line as shown in Figure 8.13. The central arbiter can employ any prioritybased or fairness-based tiebreaker. Another scheme allows the masters to have multiple priority levels. For each priority level, there is a bus request and a bus grant line. Within each priority level, daisy chain is used. In this scheme, each device is attached to the daisy chain of one priority level. If the arbiter receives multiple



Figure 8.12 Centralized arbiter in a daisy-chain scheme



Figure 8.13 Centralized arbiter with independent request and grant lines

bus requests from different levels, it grants the bus to the level with the highest priority. Daisy chaining is used among the devices of that level. Figure 8.14 shows an example of four devices included in two priority levels. Potential master 1 and potential master 3 are daisy-chained in level 1 and potential master 2 and potential master 4 are daisy-chained in level 2.

**Decentralized Arbitration** In decentralized arbitration schemes, priority-based arbitration is usually used in distributed fashion. Each potential master has unique arbitration number, which is used in resolving conflicts when multiple requests are submitted. For example, a conflict can always be resolved in favor of the device with the highest arbitration number. The question now is how to determine which device has the highest arbitration number? One method is that a requesting device would make its unique arbitration number available to all other devices. Each device compares that number with its own arbitration number. The device with the smaller number is always dismissed. Eventually, the requester with the highest arbitration number will survive and be granted bus access.



Figure 8.14 Centralized arbiter with two priority levels (four devices)

# **Interface Circuits:**

**Interface circuits**: An Input/output (I/O) interface consists of the circuitry required to connect an I/O device to a computer bus. On one side of the interface we have the bus signals for address, data, and control. On the other side we have a data path with its associated controls to transfer data between the interface and the I/O device. This side is called a port, and it can be classified as either a parallel or a serial port. A parallel port transfers data in the form of a number of bits, typically 8 or 16, simultaneously to or from the device. A serial port transmits and receives data one bit at a time. Communication with the bus is the same for both formats; the conversion from the parallel to the serial format, and vice versa, takes place inside the interface circuit. I/O interface does the following:

- 1. Provides a storage buffer for at least one word of data (or one byte, in the case of byte-oriented devices)
- 2. Contains status flags that can be accessed by the processor to determine whether the buffer is full (for input) or empty (for output)
- 3. Contains address-decoding circuitry to determine when it is being addressed by the processor
- 4. Generates the appropriate timing signals required by the bus control scheme
- 5. Performs any format conversion that may be necessary to transfer data between the bus and the I/O device, such as parallel-serial conversion in the case of a serial port

**Parallel port:** Figure 5.20 shows the hardware components needed for connecting a keyboard to a processor. A typical keyboard consists of mechanical switches that are normally open. When a key is pressed, its switch closes and establishes a path for an electrical signal. This signal is

detected by an encoder circuit that generates the ASCII code for the corresponding character. A difficulty with such push-button switches is that the contacts bounce when a key is pressed. Although bouncing may last only one or two milliseconds, this is long enough for the computer to observe a single pressing of a key as several distinct electrical events; this single pressing could be erroneously interpreted as the key being pressed and released rapidly several times. The effect of bouncing must be eliminated. We can do this in two ways: A simple de-bouncing circuit can be included, or a software approach can be used. When debouncing is implemented in software, the I/O routine that reads a character from the keyboard waits long enough to ensure that bouncing has subsided. Figure 5.20 illustrates the hardware approach; debouncing circuits are included as a part of the encoder block.



Figure 5.20 Keyboard to processor connection.

The output of the encoder consists of the bits that represent the encoded character and one control signal called Valid, which indicates that a key is being pressed. This information is sent to the interface circuit, which contains a data register, DATAIN, and a status flag, SIN. When a key is pressed, the valid signal changes from 0 to 1,, causing the ASCII code to be loaded into DATAIN and SIN to be set to 1. The status flag SIN is cleared to 0 when the processor reads the contents of the DATAIN register. The interface circuit is connected to an asynchronous bus on which transfers are controlled using the handshake signals Master-ready and Slave-ready. The third control line, R/W distinguishes read and write transfers.



Figure 5.21 Input interface circuit.

#### input interface.

Figure 5.21 shows a suitable circuit for an

The output lines of the DATAIN register are connected to the data lines of the bus by means of three-state drivers, which are turned on when the processor issues a read instruction with the address that selects this register. The SIN signal is generated by a status flag circuit. This signal is also sent to the bus through a three-state driver. It is connected to bit DO, which means it will appear as bit 0 of the status register. Other bits of this register do not contain valid information. An address decoder is used to select the input interface when the high-order 31 bits of an address correspond to any of the addresses assigned to this interface. Address bit AO determines whether the status or the data registers is to be read when the Master-ready signal is active. The control handshake is accomplished by activating the Slaveready signal when either Read-status or Read-data is equal to 1.



#### Figure 5.22 Circuit for the status Rag block in Figure 5.21.

A possible implementation of the status flag circuit is shown in Figure 5.21. An edgetriggered D flip-flop is set to 1 by a rising edge on the Valid signal line, this event changes the state of the NOR latch such that SIN is set to 1. The state of this latch must not change while SIN is being read by the processor. Hence, the circuit ensures that SIN can be set only while Masterready is equal to 0. Both the flip- flop and the latch are reset to 0 when Read-data is set to 1 to read the DATAIN register.



Figure 5.23 Printer to processor connection.

Let us now consider an output interface that can be used to connect an output device, such as a printer, to a processor, as shown in Figure 5.23. The printer operates under control of the handshake signals Valid and Idle in a manner similar to the handshake used on the bus with the Master-ready and Slave-ready signals. When it is ready to accept a character, the printer asserts its Idle signal. The interface circuit can then place a new character on the data lines and activate the Valid signal. In response, the printer starts printing the new character and negates the Idle signal, which in turn causes the interface to deactivate the Valid signal.

<u>Serial port</u>: A serial port is used to connect the processor to I/O devices that require transmission of data one bit at a time. The key feature of an interface circuit for a serial port is that it is capable of communicating in a bit-serial fashion on the device side and in a bit-parallel fashion on the bus side. The transformation between the parallel and serial formats is achieved with shift registers that have parallel access capability. A block diagram of a typical serial interface is shown in Figure 5.27. It includes the familiar DATAIN and DATAOUT registers. The input shift register accepts bit-serial input from the I/O device. When all 8 bits of data have been received, the contents of this shift register are loaded in parallel into the DATAIN register. Similarly, output data in the DATAOUT register are loaded into the output shift register, from which e bits are shifted out and sent to the I/O device.



Figure 5.27 A serial interface.

The part of the interface that deals with the bus is the same as in the parallel interface described earlier. The status flags SIN and SOUT serve similar functions. The SIN flag is set to 1

- •when new data are loaded in DATAIN; it is cleared to 0 when the processor reads the contents of DATAIN. As soon as the data are transferred from the input shift register into the DATAIN register, the shift register can start accepting the next 8-bit character from the I/O device. The SOUT flag indicates whether the output buffer is available. It is cleared to 0 when the processor writes new data into the DATAOUT register and set to 1 when data are transferred from DATAOUT into the output shift register.
- •The double buffering used in the input and output paths are important. A simpler interface could be implemented by turning DATAIN and DATA OUT into shift registers and eliminating

the shift registers in Figure 5.27. However, this would impose awkward restrictions on the operation of the I/O device; after receiving one character from the serial line, the device cannot start receiving the next character until the processor reads the contents of DATAIN. Thus, a pause would be needed between two characters to allow the processor to read the input data. With the double buffer, the transfer of the second character can begin as soon as the first character is loaded from the shift register into the DATAIN register. Thus, provided the processor reads the contents of DATAIN before the serial transfer of the second character is completed, the interface can receive a continuous stream of serial data. An analogous situation occurs in the output path of the interface.

Because it requires fewer wires, serial transmission is convenient for connecting devices that are physically far away from the computer. The speed of transmission, often given as a bit rate, depends on the nature of the devices connected. To accommodate a range of devices, a serial interface must be able to use a range of clock speeds. The circuit in Figure 5.27 allows separate clock signals to be used for input and output operations for increased flexibility. Because serial interfaces play a vital role in connecting I/O devices, several widely used standards have been developed. A standard circuit that includes the features of our example in Figure 5.27 is known as a Universal Asynchronous Receiver Transmitter (UART). It is intended for use with low-speed serial devices. Data transmission is performed using the asynchronous start-stop format. To facilitate connection to communication links, a popular standard known as RS-232-C was developed.

# **Standard I/O Interfaces:**

Introduction: A number of standards have been developed for I/O Interface. IBM developed a they called ISA (Industry Standard Architecture) for their personal computer known at the time as PC AT. The popularity of that computer led to other manufacturers producing ISA -compatible interfaces for their 110 devices, thus making ISA into a de facto standard. Some standards have been developed through industrial cooperative efforts, even among competing companies driven by their common self-interest in having compatible products. In some cases, organizations such as the IEEE (Institute of Electrical and Electronics Engineers), ANSI (American National Standards Institute), or international bodies such as ISO (International Standards Organization) have blessed them standards and given official these an There are three widely used bus standards, PCI (Peripheral Component Interconnect), SCSI (Small Computer System Interface), and USB (Universal Serial Bus). The way these standards are used in a typical computer system is illustrated in Figure 5.28. The PCI standard defines an expansion bus on the motherboard. SCSI and USB are used for connecting additional devices, both inside and outside the computer box. The SCSI bus is a high-speed parallel bus intended for devices such as disks and video displays. The USB bus uses serial transmission to suit the needs of equipment ranging from keyboards to game controls to internet connections. The figure shows an interface circuit that enables devices compatible with the earlier ISA standard, such as the popular IDE (Integrated Device Electronics) disk, to be connected. It also shows a connection to an Ethernet.

The Ethernet is a widely used local area network, providing a high-speed connection among computers in a building or a university campus. A given computer may use more than one bus standard. A typical Pentium computer has both a PCI bus and an ISA bus, thus providing the user with a wide range of devices to choose from.



Figure 5.28 An example of a computer system using different interface standards.

### 5.7.1 PCI (Peripheral Component Interconnect)

The PCI bus is a good example of a system bus. It supports the functions found on a processor bus but in a standardized format that is independent of any particular processor. Devices connected to the PCI bus appear to the processor as if they were connected directly to the processor bus. They are assigned addresses in the memory address space of the processor. Early PCs used the 8-bit XT bus, whose signals closely mimicked those of Intel's 80x86 processors. Later, the 16-bit bus used on the PC AT computers became known as the ISA bus. Its extended 32- bit version is known as the EISA bus. Other buses developed in the eighties with similar capabilities are the Microchannel used in IBM PCs and the NuBus used in Macintosh computers.

The PCI was developed as a low-cost bus that is truly processor independent. Its design anticipated a rapidly growing demand for bus bandwidth to support high-speed disks and graphic and video devices, as well as the specialized needs of multiprocessor systems. As a result, the PCI is still popular as an industry standard almost a decade after it was first introduced in 1992. An important feature that the PCI pioneered is a plug-and-play capability for connecting I/O devices. To connect a new device, the user simply connects the device interface board to the bus. The software takes care of the rest.

**Data Transfer:** Most memory transfers involve a burst of data rather than just one word. The reason is that modem processors include a cache memory. The PCI is designed primarily to support this mode of operation. A read or a write operation involving a single word is simply treated as a burst of length one. The bus supports three independent address spaces: memory, I/O, and configuration. The first two are self-explanatory. The I/O address space is intended for use with processors, such as Pentium, that have a separate I/O address space. Figure 5.28 shows the main memory of the computer connected directly to the processor bus. An alternative arrangement that is used often with the PCI bus is shown in Figure 5.29. The PCI Bridge provides a separate physical connection for the main memory. For electrical reasons, the bus may be further divided into segments connected via bridges. However, regardless of which bus segment a device is connected to, it may still be mapped into the processor's memory address space.



**Figure 5.29 Use of a PCI bus in a computer system.**: At any given time, one device is the bus master. It has the right to initiate data transfers by issuing read and write commands. A master is called an initiator in PCI terminology. This is either a processor or a DMA controller. The addressed device that responds to read and write commands is called a target. The main bus signals used for transferring data are listed in Table 5.1. Signals whose name ends with the symbol # are asserted when in the low-voltage state. The main difference between the PCI protocol with others is that in addition to a Target-ready signal, PCI also uses an Initiatorready signal, IRDY #. The latter is needed to support burst transfers.

## **SCSI Bus:**

SCSI (Small Computer System Interface): The acronym SCSI stands for Small Computer System Interface. It refers to a standard bus defined by the American National Standards Institute (ANSI) under the designation X3.131. In the original specifications of the standard, devices such as disks are connected to a computer via a 50- wire cable, which can be up to 25 meters in length

and can transfer data at rates up to 5 megabytes/s. The SCSI bus standard has undergone many revisions, and its data transfer capability has increased very rapidly, almost doubling every two SCSI-2 and SCSI-3 have been defined, and each has several options. A SCSI bus may have eight data lines, in which case it is called a narrow bus and transfers data one byte at a time. Because of these various options, the SCSI connector may have 50, 68, or 80 pins. The maximum transfer rate in commercial devices that are currently available varies from 5 megabytes/s to 160 megabytes/so The most recent version of the standard is intended to support transfer rates up to 320 megabytes/s, and 640 megabytes/s is anticipated a little later. Devices connected to the SCSI bus are not part of the address space of the processor in the same way as devices connected to the processor bus. The SCSI bus is connected to the processor bus through a SCSI controller, as shown in Figure 5.28. This controller uses DMA to transfer data packets from the main memory to the device, or vice versa. A packet may contain a block of data, commands from the processor to the device, or status information about the device. A controller connected to a SCSI bus is one of two types - an initiator or a target. An initiator has the ability to select a particular target and to send commands specifying the operations to be performed. The disk controller operates as a target. It carries out the commands it receives from the initiator. The initiator establishes a logical connection with the intended target. Once this connection has been established, it can be suspended and restored as needed to transfer commands and bursts of data. While a particular connection is suspended, other devices can use the bus to transfer information. This ability to overlap data transfer requests is one of the key features of the SCSI bus that leads to its high performance.

Data transfers on the SCSI bus are always controlled by the target controller. To send a command to a target, an initiator requests control of the bus and, after winning arbitration, selects the controller it wants to communicate with and hands control of the bus over to it. Then the controller starts a data transfer operation to receive a command from the initiator. Assume that the processor wishes to read a block of data from a disk drive and that these data are stored in two disk sectors that are not contiguous. The processor sends a command to the SCSI controller, which causes the following sequence of events to take place:

- 1. The SCSI controller, acting as an initiator, contends for control of the bus.
- 2. When the 'initiator wins the arbitration process, it selects the target controller and hands over control of the bus to it.
- 3. The target starts an output operation (from initiator to target); in response to this, the initiator sends a command specifying the required read operation.
- 4. The target, realizing that it first needs to perform a disk seek operation, sends a message to the initiator indicating that it will temporarily suspend the connection between them. Then it releases the bus.
- 5. The target controller sends a command to the disk drive to move the read head to the first sector involved in the requested read operation. Then, it reads the data stored in that sector and stores them in a data buffer. When it is ready to begin transferring data to the initiator,

- the target requests control of the bus. After it wins arbitration, it res elects the initiator controller, thus restoring the suspended connection.
- 6. The target transfers the contents of the data buffer to the initiator and then suspends the connection again. Data are transferred either 8 or 16 bits in parallel, depending on the width of the bus.
- 7. The target controller sends a command to the disk drive to perform another seek operation. Then, it transfers the contents of the second disk sector to the initiator, as before. At the end of this transfer, the logical connection between the two controllers is terminated.
- 8. As the initiator controller receives the data, it stores them into the main memory using the DMA approach.
- 9. The SCSI controller sends an interrupt to the processor to inform it that the requested operation has been completed.

This scenario shows that the messages exchanged over the SCSI bus are at a higher level than those exchanged over the processor bus. The SCSI bus standard defines a wide range of control messages that can be ex- changed between the controllers to handle different types of I/O devices. Messages are also defined to deal with various error or failure conditions that might arise during device operation or data transfer.

Table 5.1 Data transfer si9na on the PCI bus.

| Name         | Function                                                                                                               |
|--------------|------------------------------------------------------------------------------------------------------------------------|
| CLK          | A 33-MHz or 66-MHz clock.                                                                                              |
| FRAME#       | Sent by the initiator to indicate the duration of a transaction.                                                       |
| AD           | 32 address/data lines, which may be OJtionally increased to 64.                                                        |
| C/BE#        | 4 command/byte-enable lines (8 for a 64-bit bus).                                                                      |
| IRDY#, TRDY# | Initiator-ready and Target-ready signals.                                                                              |
| DEVSEL#      | A response from the device indicating that it has recognized its Address and is ready for a data transfer transaction. |
| IDSEL#       | Initialization Device Select,                                                                                          |

Consider a bus transaction in which the processor reads four 32-bit words from the memory. In this case, the initiator is the processor and the target is the memory. A complete transfer operation on the bus, involving an address and a burst of data, is called a transaction. Individual word transfers within a transaction are called phases. The sequence of events on the bus is shown in Figure 5.30. A clock signal provides the timing reference used to coordinate different phases of a transaction. All signal transitions are triggered by the rising edge of the clock. The signals changing later in the clock cycle to indicate the delays they encounter.

**Bus Signal:** The bus signals are summarized in Table 5.2. For simplicity we show the signals for a narrow bus (8 data lines). Note that all signal names are preceded by a minus sign. This indicates that the signals are active, or that a data line is equal to 1, when they are in the low-voltage state. The bus has no address lines. Instead, the data lines are used to identify the bus controllers involved during the selection or reselection process and during bus arbitration. For a

narrow bus, there are eight possible controllers, numbered o through 7, and each i, associated with the data line that has the same number. A wide bus accommodates up to 16 controllers. A controller places its own address or the address of another controller on the bus by activating the corresponding data line. Thus, it is possible to have more than one address on the bus at the same time, as in the arbitration process we describe next. Once a connection is established between two controllers, there is no further need for addressing, and the data lines are used to carry data.

Table 5.2 The SCSI bus signal

| Category              | Name                | Function                                                                                                                                                 |
|-----------------------|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------|
| Data                  | -DB(O) to<br>-DB(7) | Data lines: Carry one byte of information during the information transfer phase and identify device during arbitration, selection and reselection phases |
|                       | -DB(P)              | Parity bit for the data bus                                                                                                                              |
| Phase                 | -BSY                | Busy: Asserted when the bus is not free                                                                                                                  |
|                       | -SEL                | Selection: Asserted during selection and reelection                                                                                                      |
| Information type      | -C/D                | Control/Data: Asserted during transfer of control information (command, status or message)                                                               |
|                       | -MSG ssage:         | ssage: indicates that the information being transferred is a message                                                                                     |
| Handshake             | -REQ                | quest Asserted by a target to request a data transfer cycle                                                                                              |
|                       | -ACK                | Acknowledge: Asserted by the initiator when it has completed a data transfer operation                                                                   |
| Direction of transfer | -I/O                | Input/Output. Asserted to indicate an input operation                                                                                                    |
| Other                 | -ATN                | ention: Asserted by an initiator when it wishes to send a message to a target                                                                            |
|                       | -RST                | set: Causes all Device controls to disconnect from the bus and their start-up state                                                                      |

The main phases involved in the operation of the SCSI bus are arbitration, selection, information transfer, and reselection.

Arbitration: The bus is free when the -BSY signal is in the inactive (high-voltage) state. Any controller can request the use of the bus while it is in this state. Since two or more controllers may generate such a request at the same time, an arbitration scheme must be implemented. A controller requests the bus by asserting the - BSY signal and by asserting its associated data line to identify itself. The SCSI bus uses a simple distributed arbitration scheme. It is illustrated by the example in Figure 5.32, in which controllers 2 and 6 request the use of the bus simultaneously. Each controller on the bus is assigned a fixed priority, with controller 7 having the highest priority. When -BSY becomes active, all controllers that are requesting the bus examine the data lines and determine whether a higher-priority device is requesting the bus at the same time. The controller using the highest-numbered line realizes that it has won the arbitration process. All other controllers disconnect from the bus and wait for -BSY to become inactive again.



Figure 5.32 Arbitration and selection on the SCSI bus. Device6 wins arbitration and selects device2.

In Figure 5.32, we have assumed that controller 6 is an initiator that wishes to establish a connection to controller 5. After winning arbitration, controller 6 proceeds to the selection phase, in which it identifies the target.

**Selection**: Having won arbitration, controller 6 continues to assert -BSY and -DB6 (its address). It indicates that it wishes to select controller 5 by asserting the -SEL and then the - DB5 lines. Any other controller that may have been involved in the arbitration phase, such as controller 2 in the figure, must stop driving the data lines once the -SEL line becomes active, if it has not already done so. After placing the address of the target controller on the bus, the initiator releases the -BSY line. The selected target controller responds by asserting - BSY. This informs the initiator that the connection it is requesting has been established, so that it may remove the address information from the data lines. The selection process is now complete, and the target controller (controller 5) is asserting - BSY. From this point on, controller 5 has control of the bus, as required for the information transfer phase.

**Information Transfer:** The information transferred between two controllers may consist of commands from the initiator to the target, status responses from the target to the initiator, or data being transferred to or from the I/O device. Handshake signaling is used to control information transfers. The - REQ and - ACK signals replace the Master-ready and Slave-ready signals. The target asserts - I/O during an input operation (target to initiator), and it asserts -CID to indicate that the information being transferred is a command or a status response rather than data. High-speed versions of the SCSI bus use a technique known as double-edge clocking or Double Transitions (DT). Each data transfer requires a high-to-low transition followed by a lowto-high transition on the two handshake signals. Double-edge clocking means that data are transferred on both the rising and falling edges of these signals, thus doubling the transfer rate. At the end of the transfer, the target controller releases the - BSY signal, thus freeing the bus for use by other devices.

**Reselection**: When a logical connection is suspended and the target is ready to restore it, the target must first gain control of the bus. It starts an arbitration cycle, and after winning arbitration, it selects the initiator controller in exactly the same manner as described above. But with the roles of the target and initiator reversed, the initiator is now asserting - BSY. Before data transfer can begin, the initiator must hand control over to the target. This is achieved by having the target

controller assert - BSY r selecting the initiator. Meanwhile, the initiator waits for a short period after being selected to make sure that the target has asserted -BS and then it releases the -BSY line. The connection between the two controllers has now been reestablished, with the target in control of the bus as required for data transfer to proceed.

The bus signaling scheme described above provides the mechanisms needed for two controllers to establish a logical connection and exchange messages. The connection may be suspended and reestablished at any time. The SCSI standard defines the structure and contents of various types of packets that the controllers exchange to handle different situations. The initiator uses these packets to send the commands it receives from the processor to the target. The target responds with status information and data transfer operations. The latter are controlled by the target, because it is the target that knows when data are available, when to suspend and reestablish connections, etc.

## **Universal Serial Bus (USB):**

**USB** (Universal Serial Bus).: Universal Serial Bus (USB) is an industry standard developed through a collaborative effort of several computer and communications companies, including Compaq, Hewlett-Packard, Intel, Lucent, Microsoft, Nortel Networks, and Philips. USB is a simple and low cost mechanism to connect the devices such as keyboards, mouse, cameras, speakers, printer and display devices to the computer.

The USB supports two speeds of operation, called low-speed (1.5 megabits/s) and fullspeed (12 megabits/s). The most recent revision of the bus specification (USB 2.0) introduced a third speed of operation, called high-speed (480 megabits/s). The USB is quickly gaining acceptance in the market place, and with the addition of the high-speed capability it may well become the interconnection method of choice for most computer devices. The USB has been designed to meet several key objectives:

- Provide a simple, low-cost, and easy to use interconnection system that overcomes the difficulties due to the limited number of I/O ports available on a computer
- Accommodate a wide range of data transfer characteristics for I/O devices, including telephone and Internet connections
- Enhance user convenience through a "plug-and-play" mode of operation

**Port limitation**: Only a few ports are provided in a typical computer. To add new ports, a user must open the computer box to gain access to the internal expansion bus and install a new interface card. The user may also need to know how to configure the device and the software. An objective of the USB is to make it possible to add many devices to a computer system at any time, without opening the computer box.

**Device Characteristics**: The different kinds of devices may be connected to a computer cover a wide range of functionality. The speed, volume, and timing constraints associated with data transfers to and from such devices vary significantly. In the case of a keyboard, one byte of data

is generated every time a key is pressed, which may happen at any time. These data should be transferred to the computer promptly. Since the event of pressing a key is not synchronized to any other event in -8 computer system, the data generated by the keyboard are called asynchronous. Furthermore, the rate at which the data are generated is quite low. It is limited by the speed of the human operator to about 100 bytes per second, which is less than 1000 bits per second.

Let us consider a different source of data. Many computers have a microphone either externally attached or built in. The sound picked up by the microphone produces an analog electrical signal, which must be converted into a digital form before it can be handled by the computer. This is accomplished by sampling the analog signal periodically. For each sample, an analog-to-digital (A/D) converter generates an n-bit number representing the magnitude of the sample. The number of bits, n, is selected based on, the desired precision with which to represent each sample. Later, when these data are sent to a speaker, a digital-to-analog (D/A) converter is used to restore the original analog signal from the digital format. The sampling process yields a continuous stream of digitized samples that arrive at regular intervals, synchronized with the sampling clock. Such a data stream is called isochronous, meaning that successive events are separated by equal periods of time.

**Plug-and-play**: The plug-and-play feature means that a new device, such as an additional speaker, can be connected at any time while the system is operating. The system should detect the existence of this new device automatically, identify the appropriate device-driver soft- ware and any other facilities needed to service that device, and establish the appropriate addresses and logical connections to enable them to communicate. The plug-and-play requirement has many implications at all levels in the system, from the hardware to the operating system and the applications software. One of the primary objectives of the design of the USB has been to provide a plug-and-play capability.

**USB Architecture**: A serial transmission format has been chosen for the USB because a serial bus satisfies the low-cost and flexibility requirements. Clock and data information are encoded together and transmitted as a single signal. Hence, there are no limitations on clock frequency or distance arising from data skew. Therefore, it is possible to provide a high data transfer bandwidth by using a high clock frequency. As pointed out earlier, the USB offers three bit rates, ranging from 1.5 to 480 megabits/s, to suit the needs of different I/O devices. To accommodate a large number of devices that can be added or removed at any time, the USB has the tree structure shown in Figure 5.33. Each node of the tree has a device called a hub, which acts as an intermediate control point between the host and the I/O devices. At the root of the tree, a root hub connects the entire tree to the host computer. The leaves of the tree are the I/O devices being served (for example, keyboard, speaker, or digital TV), which are called functions in USB terminology.



Figure 5.33 Universal Serial Bus tree structure.

The tree structure enables many devices to be connected while using only simple point -topoint serial links. Each hub has a number of ports where devices may be connected, including other hubs. In normal operation, a hub copies a message that it receives from its upstream connection to all its downstream ports. As a result, a message sent by the host computer is broadcast to ill VO devices, but only the addressed device will respond to that message. A message from an I/O device is sent only upstream towards the root of the tree and is not seen by other devices. Hence, the USB enables the host to communicate with the I/O devices, but it does not enable these devices to communicate with each other.

The USB operates strictly on the basis of polling. A device may send a message only in response to a poll message from the host. Hence, upstream messages do not encounter conflicts or interfere with each other, as no two devices can send messages at the same time. This restriction allows hubs to be simple, Low-cost devices.



Figure 5.34 USB Split bus operations.

The mode of operation described above is observed for all devices operating at either low speed or full speed. However, one exception has been necessitated by the introduction of highpeed operation in USB version 2.0. Consider the situation in Figure 5.34. Hub A is connected to the root hub by a high-speed link. This hub serves one high-speed device, C, and one low-speed device, D. Normally, a message to device D would be sent at low speed from the root hub. At 1.5 megabits/s, even a short message takes several tens of microseconds. For the duration of this message, no other data transfers can take place, thus reducing the effectiveness of the high-speed links and introducing unacceptable delays for high-speed devices. To mitigate this problem, the USB protocol requires that a message transmitted on a high-speed link is always transmitted at high speed, even when the ultimate receiver is a low-speed device. Hence, a message intended for device D is sent at high speed from the root hub to hub A, then forwarded at low speed to device D. The latter transfer will take a long time, during which high-speed traffic to other nodes is allowed to continue. For example, the root hub may exchange several messages with device C while the low-speed message is being sent from hub A to device D. During this period, the bus is said to be split between high-speed and low-speed traffic. The message to device D is preceded and followed by special commands to hub A to start and end the split-traffic mode of operation, respectively.

The USB standard specifies the hardware details of USB interconnections as well as the organization and requirements of the host software. The purpose of the USB software is to provide bidirectional communication links between application software and I/O devices. These links are called pipes. Any data entering at one end of a pipe is delivered at the other end. Issues such as addressing, timing, or error detection and recovery are handled by the USB protocols. The software that transfers data to or from a given IJO device is called the device driver for that device. The device drivers depend on the characteristics of the devices they support. Hence, a more precise description of the USB pipe is that it connects an VO device to its device driver. It is

established when a device is connected and assigned a unique address by the USB software. Once established, data may flow through the pipe at any time.

Addressing: I/O devices are normally identified by assigning them a unique memory address. In fact, a device usually has several addressable locations to enable the software to send and receive control and status information and to transfer data. When a USB is connected to a host computer, its root hub is attached to the processor bus, where it appears as a single device. The host software communicates with individual devices attached to the USB by sending packets of information, which the root hub forwards to the appropriate device in the USB tree. Each device on the USB, whether it is a hub or an IJO device, is assigned a 7 -bit address. This address is local to the USB tree and is not related in any way to the addresses used on the processor bus. A hub may have any number of devices or other hubs connected to it, and addresses are assigned arbitrarily. When a device is first connected to a hub, or when it is powered on, it has the address 0. The hardware of the hub to which this device is connected is capable of detecting that the device has been connected, and it records this fact as part of its own status information. Periodically, the host polls each hub to collect status information and learn about new devices that may have been added or disconnected.

**USB Protocol**: All information transferred over the USB is organized in packets, where a packet consists of one or more bytes of information. There are many types of packets that perform a variety of control functions. The information transferred on the USB can be divided into two broad categories: control and data. Control packets perform such tasks as addressing a device to initiate data transfer, acknowledging that data have been received correctly, or indicating an error. Data packets carry information that is delivered to a device. For example, put and output data are transferred inside data packets.

A packet consists of one or more fields containing different kinds of information. The first field of any packet is called the packet identifier, PID, which identifies the type of that packet. There are four bits of information in this field, but they are transmitted twice. The first time they are sent with their true values, and the second time with each bit complemented, as shown in Figure 5.35(a). This enables the receiving device to verify that the PID byte has been received correctly.



Figure 5.35. USB packet format.

The four PID bits identify one of 16 different packet types. Some control packets, such as ACK (Acknowledge), consist only of the PID byte. Control packets used for controlling data transfer operations are called token packets. They have the format shown in Figure 5.35(b). A token packet starts with the PID field, using one of two PID values to distinguish between an IN packet and an OUT packet, which control input and output transfers, respectively. The PID field is followed by the 7-bit address of a device and the 4-bit endpoint number within that device. The packet ends with 5 bits for error checking, using a method called cyclic redundancy check (CRC). The CRC bits are computed based on the contents of the address and endpoint fields. By performing an inverse computation, the receiving device can determine whether the packet has been received correctly.

Data packets, which carry input and output data, have the format shown in Figure 4.45c. The packet identifier field is followed by up to 8192 bits of data, then 16 error-checking bits. Three different PID patterns are used to identify data packets, so that data packets may be numbered 0, 1, or 2. Note that data packets do not carry a device address or an endpoint number. This information is included in the IN or OUT token packet that initiates the transfer. Consider an output device connected to a USB hub, which in turn is connected to a host computer. An example of an output operation is shown in Figure 5.36. The host computer sends a token packet of type OUT to the hub, followed by a data packet containing the output data. The PID field of the data packet identifies it as data packet number o. The hub verifies that the transmission has been error free by

checking the error control bits, and then sends an acknowledgment packet (ACK) back to the host. The hub forwards the token and data packets downstream. All I/O devices receive this sequence of packets, but only the device that recognizes its address in the token packet accepts the data in the packet that follows. After verifying that transmission has been error free, it sends an ACK packet to the hub.



Figure 5.36 An output transfer

Successive data packets on a full-speed or low-speed pipe carry the numbers 0 and 1, alternately. This simplifies recovery from transmission errors. If a token, data, or acknowledgment packet is lost as a result of a transmission error, the sender resends the entire sequence. By checking the data packet number in the PID field, the receiver can detect and discard duplicate packets. High-speed data packets are sequentially numbered 0, 1, 2, 0, and so on. Input operations follow a similar procedure. The host sends a token packet of type IN containing the device address. In effect, this packet is a poll asking the device to send any input data it may have. The device responds by sending a data packet followed by an ACK. If it has no data ready, it responds by sending a negative acknowledgment (NAK) instead.

**Electrical characteristics**: The cables used for USB connections consist of four wires. Two are used to carry power, 5 V and Ground. Thus, a hub or an I/O device may be powered directly from the bus, or it may have its own external power connection. The other two wires are used to carry data. Different signaling schemes are used for different speeds of transmission. At low speed, 1s and 0s are transmitted by sending a high voltage state (5 V) on one or the other of the two signal wires. For high-speed links, differential transmission is used.