Alert button
Picture for Martin Jaggi

Martin Jaggi

Alert button

Personalized Collaborative Fine-Tuning for On-Device Large Language Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Nicolas Wagner, Dongyang Fan, Martin Jaggi

Viaarxiv icon

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Add code
Bookmark button
Alert button
Mar 30, 2024
Saleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci, Bo Li, Martin Jaggi, Dan Alistarh, Torsten Hoefler, James Hensman

Viaarxiv icon

Towards an empirical understanding of MoE design choices

Add code
Bookmark button
Alert button
Feb 20, 2024
Dongyang Fan, Bettina Messmer, Martin Jaggi

Viaarxiv icon

Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

Add code
Bookmark button
Alert button
Feb 06, 2024
Ashok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Martin Jaggi, Hyeji Kim, Michael Gastpar

Viaarxiv icon

InterpretCC: Conditional Computation for Inherently Interpretable Neural Networks

Add code
Bookmark button
Alert button
Feb 05, 2024
Vinitra Swamy, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser

Viaarxiv icon

DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging

Add code
Bookmark button
Alert button
Feb 04, 2024
Matteo Pagliardini, Amirkeivan Mohtashami, Francois Fleuret, Martin Jaggi

Viaarxiv icon

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Add code
Bookmark button
Alert button
Nov 27, 2023
Zeming Chen, Alejandro Hernández Cano, Angelika Romanou, Antoine Bonnet, Kyle Matoba, Francesco Salvi, Matteo Pagliardini, Simin Fan, Andreas Köpf, Amirkeivan Mohtashami, Alexandre Sallinen, Alireza Sakhaeirad, Vinitra Swamy, Igor Krawczuk, Deniz Bayazit, Axel Marmet, Syrielle Montariol, Mary-Anne Hartley, Martin Jaggi, Antoine Bosselut

Viaarxiv icon

Controllable Topic-Focused Abstractive Summarization

Add code
Bookmark button
Alert button
Nov 12, 2023
Seyed Ali Bahrainian, Martin Jaggi, Carsten Eickhoff

Viaarxiv icon

DoGE: Domain Reweighting with Generalization Estimation

Add code
Bookmark button
Alert button
Oct 23, 2023
Simin Fan, Matteo Pagliardini, Martin Jaggi

Viaarxiv icon

Irreducible Curriculum for Language Model Pretraining

Add code
Bookmark button
Alert button
Oct 23, 2023
Simin Fan, Martin Jaggi

Viaarxiv icon