Picture for Aosong Feng

Aosong Feng

PRISM: A Unified Framework for Post-Training LLMs Without Verifiable Rewards

Add code
Jan 08, 2026
Viaarxiv icon

From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs

Add code
Jan 07, 2026
Viaarxiv icon

Toward Global Large Language Models in Medicine

Add code
Jan 05, 2026
Viaarxiv icon

Diffusion Language Model Inference with Monte Carlo Tree Search

Add code
Dec 13, 2025
Viaarxiv icon

Route Experts by Sequence, not by Token

Add code
Nov 09, 2025
Viaarxiv icon

TRACE: Grounding Time Series in Context for Multimodal Embedding and Retrieval

Add code
Jun 10, 2025
Viaarxiv icon

Learning to Reason without External Rewards

Add code
May 26, 2025
Figure 1 for Learning to Reason without External Rewards
Figure 2 for Learning to Reason without External Rewards
Figure 3 for Learning to Reason without External Rewards
Figure 4 for Learning to Reason without External Rewards
Viaarxiv icon

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

Add code
May 22, 2025
Figure 1 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 2 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 3 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 4 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Viaarxiv icon

CSPLADE: Learned Sparse Retrieval with Causal Language Models

Add code
Apr 15, 2025
Viaarxiv icon

MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering

Add code
Mar 21, 2025
Figure 1 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 2 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 3 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Figure 4 for MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Viaarxiv icon