[Accelerators] DOTA: detect and omit weak attentions for scalable transformer acceleration
[Systems for Machine Learning] RecShard: statistical feature-based memory optimization for industry-scale neural recommendation
[Systems for Machine Learning] AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures
[Systems for Machine Learning] Breaking the computation and communication abstraction barrier in distributed machine learning workloads