Secure coding

Operands in x86 assembly

Richard Azu
September 19, 2019 by
Richard Azu

Introduction

This article defines an operand as it is used in x86 assembly instructions. It goes on to explain in detail, using simulations, three modes by which the operand of an instruction is specified. 

This article is designed for self-starters, students and professionals who want to gain a detailed understanding of operands and how they’re used. It will help to give you the fine details and a better understanding of x86 operands and how they are used in addressing modes.

Intro to x86 Disassembly

Intro to x86 Disassembly

Build your x86 assembly skills with six courses covering the basics of computer architecture, how to build and debug x86, x86 assembly instructions and more.

Operands in x86

An operand is a subsection of an x86 instruction that specifies data that is being operated on or being manipulated. An x86 instruction can have from zero to three operands in its statement. An operand has a type that can either be a register, a memory location, an immediate value or an address. (Source)

Addressing modes in x86

An x86 addressing mode briefly explains the types of operands and the way they are accessed from various locations within the microprocessor architecture. The term addressing modes refers to the way in which the operand of an instruction is specified. Information contained in the instruction code is the value of the operand or the address of the result/operand. (Source)

Register addressing mode

In this addressing mode, operands are specified using either 8-bit, 16-bit or 32-bit registers.

Register operands in two of the 8-bit general purpose registers

Let’s assume BL=0x2 and BH=0x0.

  1. MOV BH, BL: This instruction copies the content of register BL to BH. The new value of BH is 0x2. The source operand is BL and the destination operand is BH

Figure 1: Register BL preset with initial value of 0x2 before instruction 1 is executed

 

Figure 2: Contents shown for 8-bit registers BL and BH after instruction 1 is executed.

 

Register operands in two of the 16-bit general purpose registers

Let’s assume BX=0x2 and AX=0x0.

  1. MOV AX, BX: This instruction copies the content of register BX to AX. The new value of AX is 0x2. The source operand is BX and the destination operand is AX

Figure 3: Register BX preset with initial value of 0x2 before instruction 2 is executed

 

Figure 4: Contents shown for 16-bit registers BX and AX after instruction 2 is executed

 

Immediate addressing mode

In this mode, the operand represents data that are constant values or results from expressions to be computed. Operands used in this mode are called immediate operands.

Examples of immediate operands in instructions

  1. ADD AL, 0x2: This instruction adds the hexadecimal value 2 to the value in register AL and then stores the sum in AL. In this instruction, AL is the destination operand. Immediate data is never permitted in the destination operand. We also assumed the initial value of AL is 0x0

Figure 5: Contents of register AL after instruction 3 is executed

 

  1. ADD BX, 7*7: This instruction evaluates the expressions involving constants and then the resulting value is subtracted from the value in register BX. Let’s assume register BX is preloaded with the value 0x0

Figure 6: Contents of the BX register after instruction 4 is executed

 

  • SUB AX, 0x63: This subtracts the hexadecimal value 63 from the value stored in the accumulator. We assumed that the accumulator register AX is pre-loaded with the hexadecimal value 99

Figure 7: Register AX preset with initial value of 0x99 before instruction 5 is executed

 

Figure 8: Contents of the AX register after instruction 5 is executed

 

Before we move on to the next type of addressing mode, let’s discuss physical address calculation. The physical address of a memory location is calculated as follows:

  • Physical address = Starting address of Segment + Offset

The starting point of the segment is obtained by appending 0H to the content

Table 1: How segment registers pair with offset registers

Assume the following preset register values:

  • CS = 1147 H
  • SS = 7726 H
  • IP = 1762 H
  • SP = 1188 H
  • DI = 1120 H

The base address of the code segment is 1147 H * 10H = 11470 H. This is required to shift the address from 16 bits to 20 bits — the width of a memory address in x86.

The effective address of memory is given by the sum of starting address of shifted segment and offset.

CS + IP = 11470 H + 1762H = 12BD2 H. It can also be written as 1147:1762.

Direct memory addressing mode/register indirect addressing mode

Direct memory addressing is a mode in which the address of the operand is directly specified in the instruction. This is known as register indirect addressing mode. It is particularly executed on instruction line 12, where the content of register CX is copied into the memory location with the address specified in register BX.

Instructions set 6 to 13:

  • ORG 177h: This is a compiler directive which instructs the compiler that instructions will be loaded at the offset
  • MOV AX, 0B800h: This instruction sets register AX to hexadecimal value of B800h
  • MOV DS, AX: This instruction copies value of AX to DS
  • MOV CL, 'C': Here we set CL to ASCII code of 'C', it is 43h
  • MOV CH, 1101_1101b: And then set CH to binary value (This is DD in hexadecimal)
  • MOV BX, 17Eh: Also, set BX to 17Eh
  • MOV [BX], CX: Now copy contents of CX to memory at B800:017E (This will be verified after execution of the above code)
  • RET: Return control of program to operating system

Figure 9: Copy of instructions 6 to 13 loaded into the Emu8086 simulator 

 

Figure 10: Contents of registers AX, DS, CL, CH, BX before instructions set 6 to 13 is executed

 

Figure 11: Contents of registers AX, DS, CL, CH and BX after instructions set 6 to 13 is executed

 

Figure 12: Verifying the contents of the memory location B800:017E 

Conclusion

This article has explained how the operand of an instruction is specified, using three addressing modes: Register Addressing Mode, Immediate Addressing Mode (which is sometimes known as Register Indirect Addressing Mode) and Direct Addressing Mode. These addressing modes form assembly language statements and show the contents of registers before and after instructions are executed.

Intro to x86 Disassembly

Intro to x86 Disassembly

Build your x86 assembly skills with six courses covering the basics of computer architecture, how to build and debug x86, x86 assembly instructions and more.

 

Sources

  1. Addressing Modes, cs.iit.edu
  2. x86Sim: A Simulation Tool for the Intel x86 Architecture, Jeffrey D. Heid
Richard Azu
Richard Azu

Experienced in the deployment of voice and data over the 3 media; radio, copper and fibre, Richard – a system support technician with First National Bank Ghana Limited is still looking for ways to derive benefit from the WDM technology in Optics. Using Kali as a springboard, he has developed an interest in digital forensics and penetration testing.