Picture for Dongxu Li

Dongxu Li

PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery

Add code
Jun 16, 2024
Figure 1 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 2 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 3 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 4 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario

Add code
Mar 25, 2024
Figure 1 for Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario
Figure 2 for Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario
Figure 3 for Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario
Figure 4 for Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario
Viaarxiv icon

Resonant Beam Communications: A New Design Paradigm and Challenges

Add code
Mar 25, 2024
Figure 1 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 2 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 3 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 4 for Resonant Beam Communications: A New Design Paradigm and Challenges
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario

Add code
Mar 25, 2024
Figure 1 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 2 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 3 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 4 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Viaarxiv icon

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Add code
Jan 03, 2024
Figure 1 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 2 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 3 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 4 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Viaarxiv icon

Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion

Add code
Jan 02, 2024
Viaarxiv icon

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning

Add code
Nov 30, 2023
Figure 1 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 2 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 3 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 4 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Viaarxiv icon

Linearized Relative Positional Encoding

Add code
Jul 18, 2023
Viaarxiv icon

BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

Add code
May 24, 2023
Viaarxiv icon

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

Add code
May 11, 2023
Figure 1 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 2 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 3 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 4 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Viaarxiv icon