Picture for Liang Lin

Liang Lin

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection

Add code
Oct 20, 2023
Viaarxiv icon

ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data Fusion

Add code
Oct 11, 2023
Viaarxiv icon

Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation

Add code
Sep 23, 2023
Figure 1 for Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation
Figure 2 for Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation
Figure 3 for Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation
Figure 4 for Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation
Viaarxiv icon

VisualProg Distiller: Learning to Fine-tune Non-differentiable Visual Programming Frameworks

Add code
Sep 18, 2023
Viaarxiv icon

Towards Real-World Burst Image Super-Resolution: Benchmark and Method

Add code
Sep 09, 2023
Figure 1 for Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Figure 2 for Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Figure 3 for Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Figure 4 for Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Viaarxiv icon

Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs

Add code
Sep 04, 2023
Viaarxiv icon

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

Add code
Aug 23, 2023
Figure 1 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 2 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 3 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 4 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Viaarxiv icon

DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment

Add code
Aug 22, 2023
Figure 1 for DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Figure 2 for DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Figure 3 for DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Figure 4 for DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Viaarxiv icon

Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos

Add code
Aug 20, 2023
Figure 1 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 2 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 3 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Figure 4 for Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
Viaarxiv icon

Understanding Self-attention Mechanism via Dynamical System Perspective

Add code
Aug 19, 2023
Viaarxiv icon