Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
Published in arXiv preprint arXiv:2410.13184, 2024
Use Google Scholar for full citation
Recommended citation: Shwai He, Tao Ge, Guoheng Sun, Bowei Tian, Xiaoyang Wang, Ang Li, Dong Yu, "Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers." arXiv preprint arXiv:2410.13184, 2024.