Picture for Mahdi Soltanolkotabi

Mahdi Soltanolkotabi

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

Add code
Feb 17, 2026
Viaarxiv icon

Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards

Add code
Feb 11, 2026
Viaarxiv icon

Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning

Add code
Feb 02, 2026
Viaarxiv icon

Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs

Add code
Jul 16, 2025
Figure 1 for Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs
Figure 2 for Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs
Figure 3 for Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs
Figure 4 for Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs
Viaarxiv icon

The Rich and the Simple: On the Implicit Bias of Adam and SGD

Add code
May 29, 2025
Figure 1 for The Rich and the Simple: On the Implicit Bias of Adam and SGD
Figure 2 for The Rich and the Simple: On the Implicit Bias of Adam and SGD
Figure 3 for The Rich and the Simple: On the Implicit Bias of Adam and SGD
Figure 4 for The Rich and the Simple: On the Implicit Bias of Adam and SGD
Viaarxiv icon

Emergence and Evolution of Interpretable Concepts in Diffusion Models

Add code
Apr 21, 2025
Viaarxiv icon

Test-Time Training Provably Improves Transformers as In-context Learners

Add code
Mar 14, 2025
Viaarxiv icon

CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction

Add code
Jan 02, 2025
Figure 1 for CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction
Figure 2 for CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction
Figure 3 for CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction
Figure 4 for CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction
Viaarxiv icon

Stability properties of gradient flow dynamics for the symmetric low-rank matrix factorization problem

Add code
Nov 24, 2024
Viaarxiv icon

Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning

Add code
Aug 29, 2024
Viaarxiv icon