ARMv8-A系列学习笔记(2)--2.1 ARMv8-A

The ARMv8-A architecture introduces a number of changes, which enable significantly higher

performance processor implementations to be designed.


Large physical address

This enables the processor to access beyond 4GB of physical memory.

物理地址空间变大了,V7是4GB,V8变成多少了呢?是2^64B吗?


64-bit virtual addressing
This enables virtual memory beyond the 4GB limit. This is important for modern

desktop and server software using memory mapped file I/O or sparse addressing.

虚拟地址空间变大了,V7是4GB, V8是不是变成了2^64B?


Automatic event signaling

This enables power-efficient, high-performance spinlocks.

SPINLOCK变得HPLP, 硬件如何设计,指令上如何体现的呢?


Larger register files
Thirty-one 64-bit general-purpose registers increase performance and reduce

stack use.

31个64BIT寄存器,提高了Performance,减少了STACK的使用, 硬件如何设计,指令上如何体现的呢?


Efficient 64-bit immediate generation

There is less need for literal pools.

不明白?是不是与XZR,WZR有关系?


Large PC-relative addressing range
A +/-4GB addressing range for efficient data addressing within shared libraries

and position-independent executables.

不明白?


Additional 16KB and 64KB translation granules
This reduces Translation Lookaside Buffer (TLB) miss rates and depth of page
walks.
是不是说,TRANSLATION LOOKASIZE BUFFER的粒度变细(小)了,从而使得地址转换结果的HIT率,或者说是重复使用率提高了,也减少了遍历的次数/时间?

New exception model
This reduces OS and hypervisor software complexity.
应该是说有了EL0~EL3几种EXCEPTION LEVEL。

Efficient cache management
User space cache operations improve dynamic code generation efficiency. Fast
Data cache clear using a Data Cache Zero instruction.
用户空间也有了CACHE的操作?
DATA CACHE清零指令,V7没有吗,该指令的使用场景是,设计意图是?

Hardware-accelerated cryptography
Provides 3 × to 10 × better software encryption performance. This is useful for
small granule decryption and encryption too small to offload to a hardware
accelerator efficiently, for example https.
加解密有了硬件加速?

Load-Acquire, Store-Release instructions
Designed for C++11, C11, Java memory models. They improve performance of
thread-safe code by eliminating explicit memory barrier instructions.
新的内存操作指令,显式地不使用内存屏障?

NEON double-precision floating-point advanced SIMD
This enables SIMD vectorization to be applied to a much wider set of algorithms,
for example, scientific computing,
High Performance Computing (HPC) and
supercomputers.
 
SIMD有了进一步发展?

  • 1
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 5
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 5
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值