Cong Liu

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Mar 24, 2026
Via arXiv

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Mar 04, 2026
Via arXiv

SpikeTrack: A Spike-driven Framework for Efficient Visual Tracking

Feb 27, 2026
Via arXiv

Tele-Omni: a Unified Multimodal Framework for Video Generation and Editing

Feb 10, 2026
Via arXiv

Neurosymbolic LoRA: Why and When to Tune Weights vs. Rewrite Prompts

Jan 19, 2026
Via arXiv

Robust Detection of Underwater Target Against Non-Uniform Noise With Optical Fiber DAS Array

Dec 12, 2025
Via arXiv

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Dec 12, 2025
Via arXiv

Spark-Prover-X1: Formal Theorem Proving Through Diverse Data Training

Nov 18, 2025
Via arXiv

TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction

Nov 16, 2025
Via arXiv

Thinking Before You Speak: A Proactive Test-time Scaling Approach

Aug 27, 2025
Via arXiv