Alert button
Picture for Jacques Chen

Jacques Chen

Alert button

Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be

Add code
Bookmark button
Alert button
Apr 27, 2023
Frederik Kunstner, Jacques Chen, Jonathan Wilder Lavington, Mark Schmidt

Figure 1 for Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Figure 2 for Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Figure 3 for Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Figure 4 for Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Viaarxiv icon

Fast Sparse Decision Tree Optimization via Reference Ensembles

Add code
Bookmark button
Alert button
Dec 21, 2021
Hayden McTavish, Chudi Zhong, Reto Achermann, Ilias Karimalis, Jacques Chen, Cynthia Rudin, Margo Seltzer

Figure 1 for Fast Sparse Decision Tree Optimization via Reference Ensembles
Figure 2 for Fast Sparse Decision Tree Optimization via Reference Ensembles
Figure 3 for Fast Sparse Decision Tree Optimization via Reference Ensembles
Figure 4 for Fast Sparse Decision Tree Optimization via Reference Ensembles
Viaarxiv icon

How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization

Add code
Bookmark button
Alert button
Dec 01, 2021
Hayden McTavish, Chudi Zhong, Reto Achermann, Ilias Karimalis, Jacques Chen, Cynthia Rudin, Margo Seltzer

Figure 1 for How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization
Figure 2 for How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization
Figure 3 for How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization
Figure 4 for How Smart Guessing Strategies Can Yield Massive Scalability Improvements for Sparse Decision Tree Optimization
Viaarxiv icon