diff --git a/docs/source/faq.rst b/docs/source/faq.rst index 92d5987..b499a75 100644 --- a/docs/source/faq.rst +++ b/docs/source/faq.rst @@ -3,6 +3,9 @@ Frequently asked questions Feel free to reach out or start a `GitHub issue `_ if you have any questions about Modula. We'll post answers to any useful or common questions on this page. +Conceptual questions +^^^^^^^^^^^^^^^^^^^^^ + .. dropdown:: The gradient is a vector: how can a vector have a spectral norm? :icon: question @@ -114,6 +117,9 @@ Feel free to reach out or start a `GitHub issue `_ or `schedule-free optimizer `_. I think this is a great direction for future work. +Modula package +^^^^^^^^^^^^^^^ + .. dropdown:: The modular norm involves a max---why do I not see any maxes in the package? :icon: question @@ -167,7 +176,10 @@ Feel free to reach out or start a `GitHub issue `_ and George Dahl's call for `a healthy dose of skepticism `_ when evaluating claims in the literature. \ No newline at end of file + I don't think so. There are a lot of very technical people working in this field bringing with them some quite advanced tools from math and theoretical physics, and this is great. But in my experience it's usually the simpler and more elementary ideas that actually work in practice. I strongly believe that deep learning theory is still at the stage of model building. And I resonate with both Rahimi and Recht's call for `"simple theorems" and "simple experiments" `_ and George Dahl's call for `a healthy dose of skepticism `_ when evaluating claims in the literature. \ No newline at end of file