LLM大模型算法学习资源持续整理(2024-)

spatial_coder

已于 2024-09-17 20:24:35 修改

阅读量339

点赞数 10

分类专栏： LLM 文章标签：学习

于 2024-06-26 19:53:41 首次发布

本文链接：https://blog.csdn.net/spatial_coder/article/details/139997156

版权

LLM 专栏收录该内容

3 篇文章 0 订阅

订阅专栏

整理大模型数据、预训练、对齐、推理加速、微调、应用、工程部署等相关学习资源；

AI社区

huggingface

https://huggingface.co/
https://huggingface.co/learn 抱抱脸学习中心上有很多公开课程；
在这里插入图片描述

魔搭社区

https://www.modelscope.cn/home
在这里插入图片描述

waytoagi

飞书文档写的AGI知识库。
https://www.waytoagi.com/
在这里插入图片描述

datawhale

大模型开源课程
https://github.com/datawhalechina/so-large-lm
开源大模型部署指南：
https://github.com/datawhalechina/self-llm

斯坦福大规模语言模型winter 2022

https://stanford-cs324.github.io/winter2022/

李宏毅生成式AI课程 2024

在这里插入图片描述

https://speech.ee.ntu.edu.tw/~hylee/genai/2024-spring.php

AI agent

AutoGPT: https://github.com/Significant-Gravitas/Auto-GPT
AgentGPT: https://agentgpt.reworkd.ai/
BabyAGI: https://github.com/yoheinakajima/babyagi
Godmode: https://godmode.space/?ref=futuretools.io

transformer

attention

flash attention

《FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness》https://huggingface.co/papers/2205.14135

multi-query attention

group-query attention

https://cyrilzakka.github.io/llm-playbook/nested/gqa.html
在这里插入图片描述

位置编码

RoPE
《RoFormer: Enhanced Transformer with Rotary Position Embedding》中提出
https://paperswithcode.com/paper/roformer-enhanced-transformer-with-rotary

Kaggle model courses

https://www.kaggle.com/models
在这里插入图片描述

LLM101

karpathy更新中的大模型教程：
https://github.com/karpathy/LLM101n
在这里插入图片描述

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

课程地址：https://github.com/mlabonne/llm-course

SJTU dive-into-llms

https://sjtullm.gitbook.io/dive-into-llms
https://github.com/Lordog/dive-into-llms
在这里插入图片描述

llm-cookbook

吴恩达LLM系列课程的笔记
https://github.com/datawhalechina/llm-cookbook?tab=readme-ov-file

刘志远团队大模型公开课

https://www.openbmb.cn/community/course

开源大模型

Meta Llama 3.1

https://ai.meta.com/blog/meta-llama-3/

https://llama.meta.com/

开源大模型微调

https://github.com/hiyouga/LLaMA-Factory

在这里插入图片描述

LLM benchmark测试

https://github.com/EleutherAI/lm-evaluation-harness
在这里插入图片描述

多模态大模型学习资料

ViT

https://paperswithcode.com/method/vision-transformer

CLIP

https://github.com/openai/CLIP
在这里插入图片描述

BLIP

SAM

https://segment-anything.com/
在这里插入图片描述

文生视频

cogvideo

https://github.com/THUDM/CogVideo

opensora

扩散模型

fastai diffusion course

From Deep Learning Foundations to Stable Diffusion,part 2 of Practical Deep Learning for Coders.
https://course.fast.ai/Lessons/part2.html
在这里插入图片描述

huggingface-diffusion-course

https://huggingface.co/learn/diffusion-course/unit0/1
在这里插入图片描述

模型微调

LoRA

在这里插入图片描述

https://www.codewithgpu.com/i/Akegarasu/lora-scripts/lora-train

模型训练

强化学习

openai的spinningup深度强化学习教程；
https://spinningup.openai.com/en/latest/user/introduction.html#what-this-is

deepspeed

https://www.deepspeed.ai/
在这里插入图片描述

streamingLLM

https://www.high-flyer.cn/blog/streamingllm/

本地大模型

Ollama

https://ollama.com/
在这里插入图片描述

训练数据采集

gpt-crawler

爬取网站构建自己的GPT本地数据库；
https://github.com/BuilderIO/gpt-crawler

firecrawl

根据网站链接，采集网页内容并转为markdown或json结构；
https://www.firecrawl.dev/
在这里插入图片描述

LLM全栈工程bootcamp

https://fullstackdeeplearning.com/llm-bootcamp/
在这里插入图片描述

推理加速

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://github.com/vllm-project/vllm

RAG

Langchain-Chatchat : https://github.com/chatchat-space/Langchain-Chatchat
ACL2023 tutorial: https://acl2023-retrieval-lm.github.io/
llamaindex: https://docs.llamaindex.ai/en/stable/optimizing/production_rag/

graphRAG

https://siwei.io/graph-rag/

AI native应用

asksia.ai

私人学习AI导师：https://www.asksia.ai/zh-TW?utm_source=homepage&utm_medium=organic&utm_campaign=homepage

LLM面试题

spatial_coder

关注

10
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

LLM大模型算法学习资源持续整理(2024-)

文章目录

AI社区

huggingface

魔搭社区

waytoagi

datawhale

斯坦福大规模语言模型winter 2022

李宏毅生成式AI课程 2024

AI agent

transformer

attention

flash attention

multi-query attention

group-query attention

位置编码

Kaggle model courses

LLM101

llm-course

SJTU dive-into-llms

llm-cookbook

刘志远团队大模型公开课

开源大模型

Meta Llama 3.1

开源大模型微调

LLM benchmark测试

多模态大模型学习资料

ViT

CLIP

BLIP

SAM

文生视频

cogvideo

opensora

扩散模型

fastai diffusion course

huggingface-diffusion-course

模型微调

LoRA

模型训练

强化学习

deepspeed

streamingLLM

本地大模型

Ollama

训练数据采集

gpt-crawler

firecrawl

LLM全栈工程bootcamp

推理加速

vllm

RAG

graphRAG

相关blog

AI native应用

asksia.ai

LLM面试题