8G 显存玩转书生大模型 Demo

Olivia Lam

已于 2024-07-25 14:52:02 修改

阅读量332

点赞数 8

文章标签：人工智能

于 2024-07-23 16:10:37 首次发布

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_45725137/article/details/140634406

版权

环境部署

开发机配置

镜像：Cuda12.2-conda
资源配置：10% A100*1

创建conda环境

studio-conda -t lmdeploy -o pytorch-2.1.2

进入这个状态以后要等待很久很久，可以去干点别的再回来看

安装结束
在这里插入图片描述

切换环境

conda activate lmdeploy

安装lmdeploy

pip install lmdeploy[all]==0.3.0

在这里插入图片描述

安装结束

模型下载

在internStudio中，实际上已经存在模型文件，仅需要建立软链接

ln -s /root/share/new_models/Shanghai_AI_Laboratory/internlm2-chat-1_8b /root/

查看软链

ls -al | grep internlm2

在这里插入图片描述

使用Transformer库运行模型

Transformer库是Huggingface社区推出的用于运行HF模型的官方库。

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("/root/internlm2-chat-1_8b", trust_remote_code=True)

# Set `torch_dtype=torch.float16` to load model in float16, otherwise it will be loaded as float32 and cause OOM Error.
model = AutoModelForCausalLM.from_pretrained("/root/internlm2-chat-1_8b", torch_dtype=torch.float16, trust_remote_code=True).cuda()
model = model.eval()

inp = "hello"
print("[INPUT]", inp)
response, history = model.chat(tokenizer, inp, history=[])
print("[OUTPUT]", response)

inp = "please provide three suggestions about time management"
print("[INPUT]", inp)  
response, history = model.chat(tokenizer, inp, history=history)


print("[OUTPUT]", response)

在这里插入图片描述

LMDeploy部署模型

lmdeploy chat /root/internlm2-chat-1_8b

在这里插入图片描述

关注

8
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

Olivia Lam CSDN认证博客专家 CSDN认证企业博客

码龄5年

61: 原创

31万+: 周排名

4万+: 总排名

3万+: 访问

: 等级

879: 积分

51: 粉丝

75: 获赞

12: 评论

111: 收藏

私信

关注

热门文章

分类专栏

数据结构 20篇
c++ 17篇
c 14篇
作业（c++程序设计） 10篇

最新评论

c++程序设计继承中的析构函数和静态成员
zzssr: [code=cpp] #include <iostream> #include <string> #include <vector> using std::string; using std::vector; using std::endl; using std::cout; using std::cin; class Animal { private: string name{ "Animal" }; protected: static int AliveNumber; static bool enable; public: Animal() { AliveNumber++; cout << name << endl; } ~Animal() { AliveNumber--; if (enable) { cout << AliveNumber<<endl; enable = false; } } }; class Dog : public Animal { private: string name{ "Dog" }; public: Dog() { cout << name << endl << AliveNumber << endl; } ~Dog() { if (! enable ) { enable = true; } } }; int Animal::AliveNumber = 0; bool Animal::enable{ false }; int main() { int N = 0; cin >> N; vector<Animal*> v1{ }; vector<Dog*> v2{ }; for (int i = N; i > 0; i--) { Animal* ap = new Animal{}; Dog* bp = new Dog{}; v1.push_back(ap); v2.push_back(bp); } for (int i = static_cast<int>(v1.size()); i > 0 && !v1.empty(); i-- ) { delete v1[i - 1]; v1.erase(v1.end() - 1 [/code]
c++程序设计继承中的析构函数和静态成员
qq_41049980: [code=cpp] #include <iostream> #include <string> #include <vector> #include <string.h> using namespace std; #define DOG 1 #define ANI 0 static int count = 0; class Dog; class Animal { protected: int e; public: Animal(int flag) { cout << "Animal" << endl; //Animal *a; //cout << typeid(Animal()).name() << "," << typeid(this).name()<<endl; //if (typeid(a).name()==typeid(this).name()) { // ++count; //} this->e = flag; if (flag == 0) { ++count; } } ~Animal() { //Animal *a; if (e == 0) { --count; } } }; class Dog : public Animal { public: Dog() : Animal(DOG) { cout << "Dog" << endl; count++; } ~Dog() { --count; } static int getter() { return count; } }; int main() { int N; cin >> N; vector<Animal *> v1; vector<Dog *> v2; Animal *a; Dog *d; for (int i [/code]
c++程序设计多态和纯虚函数
huangwu139: 还可以这呀子，牛逼
c++程序设计继承中的析构函数和静态成员
else___if: 因为Dog类继承了Animal类，又因为基类Animal的构造函数未被显示调用，所以子类的构造函数每次被调用的时候就会首先自动调用基类中的默认构造函数。这就相当于创建一个Animal对象和一个Dog对象时，Animal中的默认构造函数被调用了两遍，所以Dog的默认函数被调用的次数不必再加减了。
c++程序设计多态和纯虚函数
大呆鹅呆呆呆: 牛逼!感谢!

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。