关于KAN（Kolmogorov-Arnold Networks）的小小调研

最新推荐文章于 2024-09-16 14:49:33 发布

lqzzzzzzzz

最新推荐文章于 2024-09-16 14:49:33 发布

阅读量645

点赞数 26

分类专栏： DL 科研文章标签： python 人工智能深度学习神经网络

本文链接：https://blog.csdn.net/qq_37105055/article/details/142208280

版权

5 篇文章 1 订阅

订阅专栏

1 篇文章 0 订阅

订阅专栏

前言

KAN是MIT2024年5月提出，与传统MLP（多层感知机）并列的全新深度学习架构。截至2024.9.13已有150+次引用。笔者是该方向的入门新手，开此贴记录相关方向的论文调研笔记~（欢迎指正和补充）

（KAN使用B样条拟合每个node的一元非线性函数）

KANs outperforms conventional MLP in a real-world satellite traffic forecasting task, providing more accurate results with considerably fewer number of parameters.
aim to evaluate the practicality of KANs in realworld scenarios, analyzing their efficiency in terms of the
number of trainable parameters and discussing how the additional degrees of freedom might affect forecasting performance.
using realworld satellite traffic data.
单变量时间序列预测
两层KAN，输入层和输出层的节点数分别对应total amount of time steps.
基于卫星数据预测交通状况
实验
- 对比KAN与MLP架构的performance(6 beam area)
- 用一周数据预测一天
- KAN shows a rapid adjustment <> MLP exhibits a lag
- KAN matched the rapid volume <> MLP moderately over/under-predicted
- the robustness of KAN despite the complexity and higher volume <>
- 参数方面：This reduced complexity suggests that KANs can achieve higher or comparable forecasting accuracy with simpler and potentially faster models.（用于资源有限和快速部署的场景）

T-KAN:detect conecpt drift within time series(univariate)
- core：use sliding window (two historical time steps to predict the next time step in the example, different KAN structures & activation functions represent different concepts) to observe the variations in KAN and identify concept drift.
MT-KAN: imrove predictive performance (multivariate time series)