Picture for Heng Wang

Heng Wang

Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

Add code
Jun 23, 2024
Viaarxiv icon

Autoregressive Pretraining with Mamba in Vision

Add code
Jun 11, 2024
Viaarxiv icon

Dance Any Beat: Blending Beats with Visuals in Dance Video Generation

Add code
May 15, 2024
Figure 1 for Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Figure 2 for Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Figure 3 for Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Figure 4 for Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Viaarxiv icon

Boosting 3D Neuron Segmentation with 2D Vision Transformer Pre-trained on Natural Images

Add code
May 04, 2024
Viaarxiv icon

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Add code
Apr 15, 2024
Viaarxiv icon

Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications

Add code
Mar 31, 2024
Figure 1 for Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications
Figure 2 for Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications
Figure 3 for Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications
Figure 4 for Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications
Viaarxiv icon

MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts

Add code
Mar 14, 2024
Figure 1 for MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Figure 2 for MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Figure 3 for MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Figure 4 for MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Viaarxiv icon

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Add code
Mar 05, 2024
Figure 1 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 2 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 3 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 4 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Viaarxiv icon

Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models

Add code
Mar 05, 2024
Figure 1 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 2 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 3 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Figure 4 for Hy-DAT: A Tool to Address Hydropower Modeling Gaps Using Interdependency, Efficiency Curves, and Unit Dispatch Models
Viaarxiv icon

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Add code
Feb 16, 2024
Viaarxiv icon