Tong Lin

Peking University

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

Jun 01, 2026

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

May 12, 2026

Long-Text-to-Image Generation via Compositional Prompt Decomposition

Apr 20, 2026

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Mar 24, 2026

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models

Mar 02, 2026

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

Feb 11, 2026

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Feb 03, 2026

GeoPep: A geometry-aware masked language model for protein-peptide binding site prediction

Oct 30, 2025

Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model

Dec 02, 2024

Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization

Jul 12, 2024