Skip to content

Commit

Permalink
couple more papers
Browse files Browse the repository at this point in the history
  • Loading branch information
jxbz committed Jun 5, 2024
1 parent c828940 commit 932b886
Showing 1 changed file with 16 additions and 0 deletions.
16 changes: 16 additions & 0 deletions docs/source/history.rst
Original file line number Diff line number Diff line change
@@ -1,6 +1,16 @@
The science of scale
=====================

.. admonition:: Warning
:class: warning

This page is still under construction.

.. admonition:: Warning
:class: seealso

This page was written by Jeremy and so is likely biased by his view of the world. He is putting this here because he thinks it provides a useful counterpoint to some prevailing narratives. If you'd like to mention some other work here, feel free to either make a pull request or reach out to us by email.

some twists and turns

| 📘 `On the distance between two neural networks and the stability of learning <https://arxiv.org/abs/2002.03432>`_
Expand All @@ -19,6 +29,12 @@ and more text
| Greg Yang, James B. Simon, Jeremy Bernstein
| arXiv 2023
and more

| 📒 `Automatic gradient descent: Deep learning without hyperparameters <https://arxiv.org/abs/2304.05187>`_
| Jeremy Bernstein, Chris Mingard, Kevin Huang, Navid Azizan, Yisong Yue
| arXiv 2023
and even more text

| 📕 `Scalable optimization in the modular norm <https://arxiv.org/abs/2405.14813>`_
Expand Down

0 comments on commit 932b886

Please sign in to comment.