Picture for Junchi Yan

Junchi Yan

Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives

Add code
Aug 13, 2024
Viaarxiv icon

LinSATNet: The Positive Linear Satisfiability Neural Networks

Add code
Jul 18, 2024
Viaarxiv icon

GeoMix: Towards Geometry-Aware Data Augmentation

Add code
Jul 15, 2024
Viaarxiv icon

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Add code
Jun 22, 2024
Viaarxiv icon

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

Add code
Jun 17, 2024
Figure 1 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 2 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 3 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 4 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Viaarxiv icon

Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach

Add code
Jun 13, 2024
Figure 1 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 2 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 3 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Figure 4 for Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach
Viaarxiv icon

Towards Vision-Language Geo-Foundation Model: A Survey

Add code
Jun 13, 2024
Figure 1 for Towards Vision-Language Geo-Foundation Model: A Survey
Figure 2 for Towards Vision-Language Geo-Foundation Model: A Survey
Figure 3 for Towards Vision-Language Geo-Foundation Model: A Survey
Figure 4 for Towards Vision-Language Geo-Foundation Model: A Survey
Viaarxiv icon

Learning Divergence Fields for Shift-Robust Graph Representations

Add code
Jun 07, 2024
Figure 1 for Learning Divergence Fields for Shift-Robust Graph Representations
Figure 2 for Learning Divergence Fields for Shift-Robust Graph Representations
Figure 3 for Learning Divergence Fields for Shift-Robust Graph Representations
Figure 4 for Learning Divergence Fields for Shift-Robust Graph Representations
Viaarxiv icon

Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving

Add code
Jun 06, 2024
Viaarxiv icon

TerDiT: Ternary Diffusion Models with Transformers

Add code
May 23, 2024
Figure 1 for TerDiT: Ternary Diffusion Models with Transformers
Figure 2 for TerDiT: Ternary Diffusion Models with Transformers
Figure 3 for TerDiT: Ternary Diffusion Models with Transformers
Figure 4 for TerDiT: Ternary Diffusion Models with Transformers
Viaarxiv icon