Picture for Jia Zheng

Jia Zheng

Serving Large Language Models on Huawei CloudMatrix384

Add code
Jun 15, 2025
Viaarxiv icon

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Add code
Jun 09, 2025
Viaarxiv icon

GLOVER++: Unleashing the Potential of Affordance Learning from Human Behaviors for Robotic Manipulation

Add code
May 17, 2025
Viaarxiv icon

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Add code
Apr 01, 2025
Viaarxiv icon

Large Language Models Often Say One Thing and Do Another

Add code
Mar 10, 2025
Viaarxiv icon

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Add code
Jan 07, 2025
Viaarxiv icon

From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach

Add code
Dec 17, 2024
Figure 1 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 2 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 3 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 4 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Viaarxiv icon

READoc: A Unified Benchmark for Realistic Document Structured Extraction

Add code
Sep 08, 2024
Figure 1 for READoc: A Unified Benchmark for Realistic Document Structured Extraction
Figure 2 for READoc: A Unified Benchmark for Realistic Document Structured Extraction
Figure 3 for READoc: A Unified Benchmark for Realistic Document Structured Extraction
Figure 4 for READoc: A Unified Benchmark for Realistic Document Structured Extraction
Viaarxiv icon

Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation

Add code
Apr 10, 2024
Viaarxiv icon

PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs

Add code
Aug 10, 2023
Figure 1 for PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs
Figure 2 for PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs
Figure 3 for PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs
Figure 4 for PlankAssembly: Robust 3D Reconstruction from Three Orthographic Views with Learnt Shape Programs
Viaarxiv icon