AI人工智能大模型中——数据集就是一切 The dataset is everything

最新推荐文章于 2024-06-27 15:39:44 发布

置顶禅与计算机程序设计艺术

最新推荐文章于 2024-06-27 15:39:44 发布

阅读量215

点赞数

分类专栏： ChatGPT 文章标签：人工智能

本文链接：https://blog.csdn.net/universsky2015/article/details/138174141

版权

ChatGPT 专栏收录该内容

1049 篇文章 225 订阅 ¥59.90 ¥99.00

订阅专栏

超级会员免费看

文章目录

人工智能模型中的“它”是数据集。 The “it” in AI models is the dataset.
2023 年机器学习的现状 The State of ML in 2023
Research Code 研究代码
Learned Structures 学习结构
Compute Multipliers 计算乘数

人工智能模型中的“它”是数据集。 The “it” in AI models is the dataset.

I’ve been at OpenAI for almost a year now. In that time, I’ve trained a lot of generative models. More than anyone really has any right to train. As I’ve spent these hours observing the effects of tweaking various model configurations and hyperparameters, one thing that has struck me is the similarities in between all the training runs.

我在 OpenAI 工作已经快一年了。那段时间，我训练了很多生成模型。比任何人都更有权利接受训练。当我花了几个小时观察调整各种模型配置和超参数的效果时，令我印象深刻的一件事是所有训练运行之间的相似性。

It’s becoming awfully clear to me that these models are truly approximating their datasets to an incredible degree. What that means is not only that they learn what it means to be a dog or a cat, but the interstitial frequencies between distributions that don’t matter, like what photos humans are likely to take or words humans commonly write down.
我越来越

了解本专栏

超级会员免费看

禅与计算机程序设计艺术

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
2
评论
AI人工智能大模型中——数据集就是一切 The dataset is everything

我认为对计算乘数的搜索比任何不严格遵守缩放定律的人想象的要普遍得多：实际上，机器学习领域的每一位不研究现有技术的新应用的科学家都应该执行计算效率扫描以确保他们的发现确实相关。不过，随着训练的进行，这些机制会“上线”：当您需要提高学习更复杂的数据分布层的能力时，它们就会提供有意义的价值。更重要的是，认识到像 GPT-4 或 DALL-E 3 这样的巨大模型仍然存在根本性缺陷，这表明试图从 Llama 2 或 Stable Diffusion 等相对较小的模型中获得真正智能的行为是没有希望的。
复制链接

扫一扫