Yanming Xu

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025
Via arXiv

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Sep 03, 2024
Via arXiv