Weitao Ma

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

Apr 21, 2026

Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

Apr 20, 2026

ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models

Apr 09, 2026

Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Jan 13, 2026

Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management

Jan 13, 2026

Causal Tracing of Object Representations in Large Vision Language Models: Mechanistic Interpretability and Hallucination Mitigation

Nov 19, 2025

LangGPS: Language Separability Guided Data Pre-Selection for Joint Multilingual Instruction Tuning

Nov 13, 2025

Adaptive Backtracking for Privacy Protection in Large Language Models

Aug 08, 2025

From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems

Mar 03, 2025

XTransplant: A Probe into the Upper Bound Performance of Multilingual Capability and Culture Adaptability in LLMs via Mutual Cross-lingual Feed-forward Transplantation

Dec 17, 2024