Picture for Jihua Kang

Jihua Kang

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

Add code
May 05, 2026
Viaarxiv icon

MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation

Add code
Jan 13, 2026
Viaarxiv icon

DocFusion: A Unified Framework for Document Parsing Tasks

Add code
Dec 17, 2024
Figure 1 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 2 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 3 for DocFusion: A Unified Framework for Document Parsing Tasks
Figure 4 for DocFusion: A Unified Framework for Document Parsing Tasks
Viaarxiv icon

InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction

Add code
Apr 17, 2023
Figure 1 for InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction
Figure 2 for InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction
Figure 3 for InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction
Figure 4 for InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction
Viaarxiv icon