Picture for Tao Ji

Tao Ji

AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress

Add code
Nov 11, 2025
Viaarxiv icon

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

Add code
Oct 01, 2025
Viaarxiv icon

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

Add code
Sep 26, 2025
Viaarxiv icon

Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction

Add code
Jun 14, 2025
Figure 1 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 2 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 3 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Figure 4 for Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction
Viaarxiv icon

Protein Design with Dynamic Protein Vocabulary

Add code
May 25, 2025
Figure 1 for Protein Design with Dynamic Protein Vocabulary
Figure 2 for Protein Design with Dynamic Protein Vocabulary
Figure 3 for Protein Design with Dynamic Protein Vocabulary
Figure 4 for Protein Design with Dynamic Protein Vocabulary
Viaarxiv icon

PDFBench: A Benchmark for De novo Protein Design from Function

Add code
May 25, 2025
Viaarxiv icon

Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation

Add code
Apr 26, 2025
Viaarxiv icon

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

Add code
Mar 19, 2025
Viaarxiv icon

The Role of Visual Modality in Multimodal Mathematical Reasoning: Challenges and Insights

Add code
Mar 06, 2025
Viaarxiv icon

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Add code
Feb 20, 2025
Viaarxiv icon