Picture for Antoine Bosselut

Antoine Bosselut

CAVE: Detecting and Explaining Commonsense Anomalies in Visual Environments

Add code
Oct 29, 2025
Viaarxiv icon

Revisiting Multilingual Data Mixtures in Language Model Pretraining

Add code
Oct 29, 2025
Viaarxiv icon

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Add code
Sep 17, 2025
Figure 1 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 2 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 3 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 4 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Viaarxiv icon

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining

Add code
Sep 05, 2025
Figure 1 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 2 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 3 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 4 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Viaarxiv icon

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

Add code
Aug 06, 2025
Figure 1 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 2 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 3 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 4 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Viaarxiv icon

GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration

Add code
Jul 31, 2025
Viaarxiv icon

PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning

Add code
Jul 08, 2025
Viaarxiv icon

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification

Add code
Jun 18, 2025
Figure 1 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 2 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 3 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 4 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Viaarxiv icon

Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization

Add code
Jun 16, 2025
Figure 1 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 2 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 3 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 4 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Viaarxiv icon

AbstRaL: Augmenting LLMs' Reasoning by Reinforcing Abstract Thinking

Add code
Jun 11, 2025
Viaarxiv icon