Picture for Zhifei Li

Zhifei Li

CocoaBench: Evaluating Unified Digital Agents in the Wild

Add code
Apr 14, 2026
Viaarxiv icon

MyGram: Modality-aware Graph Transformer with Global Distribution for Multi-modal Entity Alignment

Add code
Jan 17, 2026
Viaarxiv icon

MacVQA: Adaptive Memory Allocation and Global Noise Filtering for Continual Visual Question Answering

Add code
Jan 05, 2026
Viaarxiv icon

Let the Barbarians In: How AI Can Accelerate Systems Performance Research

Add code
Dec 22, 2025
Viaarxiv icon

KeenKT: Knowledge Mastery-State Disambiguation for Knowledge Tracing

Add code
Dec 21, 2025
Viaarxiv icon

FrontierCS: Evolving Challenges for Evolving Intelligence

Add code
Dec 17, 2025
Figure 1 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 2 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 3 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 4 for FrontierCS: Evolving Challenges for Evolving Intelligence
Viaarxiv icon

Analyzing Planner Design Trade-offs for MAPF under Realistic Simulation

Add code
Dec 10, 2025
Viaarxiv icon

LEANN: A Low-Storage Vector Index

Add code
Jun 09, 2025
Viaarxiv icon

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Add code
Mar 03, 2025
Figure 1 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 2 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 3 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 4 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Viaarxiv icon

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Add code
Feb 06, 2025
Viaarxiv icon