Commit

update
zyushun committed Jun 25, 2024
1 parent ffc2c45 commit 951363b
Showing 2 changed files with 19 additions and 8 deletions.
11 changes: 7 additions & 4 deletions index.html
@@ -39,14 +39,17 @@ <h1>Yushun Zhang</h1>
<h2>About me</h2>
<p>I'm a Ph.D student in School of Data Science at The Chinese University of Hong Kong, Shenzhen, China. I'm very proud to be advised by <a href="https://scholar.google.com/citations?user=dW3gcXoAAAAJ&amp;hl=en">Prof. Zhi-Quan (Tom) Luo</a>. I’m also very fortunate to work closely with <a href="https://ruoyus.github.io">Prof. Ruoyu Sun</a>.
Previously, I did my undergraduate study in the Department of Mathematics at Southern University of Science and Technology (SUSTech). </p>
<p>My research focuses on optimization and deep learning, and especially, large language models. I aim to solve practical engineering problems in these areas. </p>
<p>My research focuses on optimization, deep learning, and especially, large language models. I aim to solve practical engineering problems in these areas. </p>
<h2>Preprints</h2>
<p><a href="http://arxiv.org/abs/2406.16793">Adam-mini: Use Fewer Learning Rates To Gain More</a> <br />
<b>Yushun Zhang* </b>, Congliang Chen*, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun</p>
<p><a href="https://arxiv.org/abs/2402.16788">Why Transformers Need Adam: A Hessian Perspective</a> <br />
<b>Yushun Zhang</b>, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo</p>
<p><a href="https://arxiv.org/abs/2208.09900">Provable Benefit of Adaptivity in Adam</a> <br />
Bohan Wang, <b>Yushun Zhang</b>, Huishuai Zhang, Qi Meng, Ruoyu Sun, Zhi-Ming Ma, Zhi-Quan Luo, Tie-Yan Liu, Wei Chen <br /></p>
<h2>Publications </h2>
<p>(*: Equal contribution, alphabetically ordered.)</p>
<p>(*: Equal contribution.)</p>
<p><a href="https://arxiv.org/abs/2208.09900">Provable Adaptivity of Adam under Non-Uniform Smoothness</a> <br />
Bohan Wang*, <b>Yushun Zhang* </b>, Huishuai Zhang, Qi Meng, Ruoyu Sun, Zhi-Ming Ma, Zhi-Quan Luo, Tie-Yan Liu, Wei Chen <br />
KDD 2024</p>
<p><a href="https://arxiv.org/abs/2310.10505">ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models</a> <br />
Ziniu Li, Tian Xu, <b>Yushun Zhang</b>, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo <br />
ICML 2024</p>
16 changes: 12 additions & 4 deletions index.jemdoc
@@ -19,22 +19,30 @@ I'm a Ph.D student in School of Data Science at The Chinese University of Hong K
Previously, I did my undergraduate study in the Department of Mathematics at Southern University of Science and Technology (SUSTech).


My research focuses on optimization and deep learning, and especially, large language models. I aim to solve practical engineering problems in these areas.
My research focuses on optimization, deep learning, and especially, large language models. I aim to solve practical engineering problems in these areas.

== Preprints

[http://arxiv.org/abs/2406.16793 Adam-mini: Use Fewer Learning Rates To Gain More] \n
*Yushun Zhang\* *, Congliang Chen\*, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun



[https://arxiv.org/abs/2402.16788 Why Transformers Need Adam: A Hessian Perspective] \n
*Yushun Zhang*, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo


[https://arxiv.org/abs/2208.09900 Provable Benefit of Adaptivity in Adam] \n
Bohan Wang, *Yushun Zhang*, Huishuai Zhang, Qi Meng, Ruoyu Sun, Zhi-Ming Ma, Zhi-Quan Luo, Tie-Yan Liu, Wei Chen \n




== Publications

(*: Equal contribution, alphabetically ordered.)
(*: Equal contribution.)

[https://arxiv.org/abs/2208.09900 Provable Adaptivity of Adam under Non-Uniform Smoothness] \n
Bohan Wang\*, *Yushun Zhang\* *, Huishuai Zhang, Qi Meng, Ruoyu Sun, Zhi-Ming Ma, Zhi-Quan Luo, Tie-Yan Liu, Wei Chen \n
KDD 2024

[https://arxiv.org/abs/2310.10505 ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models] \n
Ziniu Li, Tian Xu, *Yushun Zhang*, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo \n
