推荐项目：FBTT-Embedding——高效稀疏嵌入表压缩方案-CSDN博客

本文链接：https://blog.csdn.net/gitblog_00421/article/details/141669225

推荐项目：FBTT-Embedding——高效稀疏嵌入表压缩方案

FBTT-EmbeddingThis is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.项目地址:https://gitcode.com/gh_mirrors/fb/FBTT-Embedding

在机器学习领域，特别是推荐系统和自然语言处理中，大量应用着密集的特征表示。然而，随着模型规模的扩大，嵌入表的体积也随之膨胀，给内存和计算带来了极大挑战。FBTT-Embedding 库应运而生，它通过高效的张量列火车（Tensor Train，简称TT）分解技术，提供了对这类大规模稀疏嵌入表的压缩解决方案，同时保证了算法效率和模型性能。

项目介绍

FBTT-Embedding是一个专为解决深度学习中嵌入表存储和计算效率问题设计的库。它兼容PyTorch环境，并能够直接替换PyTorch中的EmbeddingBag组件，实现相同的功能，但加入了压缩机制。此外，该库利用软件缓存策略来加速访问频繁的条目，避免重复的压缩和解压缩过程，显著提升了训练和推理阶段的效率。

技术分析

FBTT-Embedding的核心在于其利用了TT分解，这是一种将高维张量转换为一系列低秩矩阵乘积的技术，从而大幅度减少所需的存储空间。与传统方法相比，FBTT-Embedding通过高度优化的前向传播和后向传播函数，维持了良好的计算效率，即便是对于大规模嵌入表也不例外。项目通过允许用户自定义TT核心的秩、分割形状以及其他初始化参数，实现了灵活的压缩程度和性能调优。