Picture for Haochen Tan

Haochen Tan

OVD: On-policy Verbal Distillation

Add code
Jan 29, 2026
Viaarxiv icon

DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion

Add code
Jan 15, 2026
Viaarxiv icon

A1: Asynchronous Test-Time Scaling via Conformal Prediction

Add code
Sep 18, 2025
Figure 1 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 2 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 3 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 4 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Viaarxiv icon

Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning

Add code
May 30, 2025
Viaarxiv icon

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Add code
May 19, 2025
Viaarxiv icon

MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models

Add code
Jun 20, 2024
Figure 1 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 2 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 3 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 4 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Viaarxiv icon

MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation

Add code
May 19, 2024
Figure 1 for MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Figure 2 for MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Figure 3 for MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Figure 4 for MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Viaarxiv icon

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

Add code
Jan 26, 2024
Viaarxiv icon

VCSUM: A Versatile Chinese Meeting Summarization Dataset

Add code
May 15, 2023
Figure 1 for VCSUM: A Versatile Chinese Meeting Summarization Dataset
Figure 2 for VCSUM: A Versatile Chinese Meeting Summarization Dataset
Figure 3 for VCSUM: A Versatile Chinese Meeting Summarization Dataset
Figure 4 for VCSUM: A Versatile Chinese Meeting Summarization Dataset
Viaarxiv icon

Self-Supervised Sentence Compression for Meeting Summarization

Add code
May 13, 2023
Figure 1 for Self-Supervised Sentence Compression for Meeting Summarization
Figure 2 for Self-Supervised Sentence Compression for Meeting Summarization
Figure 3 for Self-Supervised Sentence Compression for Meeting Summarization
Figure 4 for Self-Supervised Sentence Compression for Meeting Summarization
Viaarxiv icon