Picture for Wang Yang

Wang Yang

MOSAIC: Multi-Objective Slice-Aware Iterative Curation for Alignment

Add code
Mar 19, 2026
Viaarxiv icon

Toward Trustworthy Evaluation of Sustainability Rating Methodologies: A Human-AI Collaborative Framework for Benchmark Dataset Construction

Add code
Feb 19, 2026
Viaarxiv icon

When Domains Interact: Asymmetric and Order-Sensitive Cross-Domain Effects in Reinforcement Learning for Reasoning

Add code
Feb 01, 2026
Viaarxiv icon

AJAR: Adaptive Jailbreak Architecture for Red-teaming

Add code
Jan 16, 2026
Viaarxiv icon

Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers

Add code
Jan 11, 2026
Viaarxiv icon

Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?

Add code
Oct 14, 2025
Figure 1 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 2 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 3 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 4 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Viaarxiv icon

100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?

Add code
May 25, 2025
Figure 1 for 100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
Figure 2 for 100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
Figure 3 for 100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
Figure 4 for 100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?
Viaarxiv icon

SELF: Self-Extend the Context Length With Logistic Growth Function

Add code
May 22, 2025
Viaarxiv icon

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning

Add code
May 22, 2025
Viaarxiv icon

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time

Add code
Apr 12, 2025
Figure 1 for Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
Figure 2 for Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
Figure 3 for Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
Figure 4 for Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
Viaarxiv icon