CMU Computer Systems: Machine-Level Programming (Data)

WihauShe

已于 2022-03-16 16:32:59 修改

阅读量102

点赞数 1

分类专栏： Others 文章标签： cmu computer system machine-level data

于 2022-03-03 13:57:11 首次发布

本文链接：https://blog.csdn.net/qq_39376697/article/details/123251581

版权

36 篇文章 1 订阅

订阅专栏

Basic Principle
- T A[L]
- Array of data type T and length L
- Contiguously allocated region of L * sizeof (T) bytes in memory
Reference
- &A[L]
- *A

Declaration
- T A[R][C]
- 2D array of data type T
- R rows, C columns
- Type T element requires K bytes
Array Size
- R * C * K bytes
Arrangement
- Row-Major Ordering

Row Vectors
- A[i] is array of C elements
- Each element of type T requires K bytes
- Starting address A + i*(C*K)

Array Elements
- A[i][j] is element of type T, which requires L bytes
- Address A + i * (C* K) + j * K = A + (i * C + j) * K

Computation
- Element access Mem[Mem[univ+8index]+4digit]
- Must do two memory reads
  - First get pointer to row array
  - Then access element within array

Fixed dimensions
- Know value of N at compile time
Variable dimensions, explicit indexing
- Traditional way to implement dynamic arrays
Variable dimensions, implicit indexing
- Now supported by gcc

Structure represented as block of memory
- Big enough to hold all of the fields
Fields ordered according to declaration
- Even if another ordering could yield a more compact representation
Compiler determines overall size + positions of fields
- Machine-level program has no understanding of the structures in the source code

Aligned Data
- Primitive data type requires K bytes
- Address must be multiple of K
- Required on some machines; advised on x86-64
Motivation for Aligning Data
- Memory accessed by (aligned) chunks of 4 or 8 bytes (system dependent)
  - Inefficient to load or store datum that spans quad word boundaries
  - Virtual memory trickier when datum spans 2 pages
Compiler
- Inserts gaps in structure to ensure correct alignment of fields

Within structure:
- Must satisfy each element’s alignment requirement
Overall structure placement
- Each structure has alignment requirement K
  - K = Largest alignment of any element
- Initial address & structure length must be multiples of K

Integer (and pointer) arguments passed in regular registers
FP values passed in XMM registers
Different mov instructions to move between XMM registers, and between memory and XMM registers

Lots of instructions
- Different operations, different formats, …
Floating-point comparisons
- Instructions ucomiss and ucomisd
- Set condition codes CF, ZF, and PF
Using constant values
- Set XMM0 register to 0 with instruction xorpd %xmm0, %xmm0
- Others loaded from memory