Bin Wang

and Other Contributors

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Aug 28, 2024
Via arXiv

IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities

Aug 23, 2024
Via arXiv

TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning

Aug 20, 2024
Via arXiv

CSI-Free Position Optimization for Movable Antenna Communication Systems: A Black-Box Optimization Approach

Aug 09, 2024
Via arXiv

In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models

Aug 07, 2024
Via arXiv

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Aug 03, 2024
Via arXiv

Image Re-Identification: Where Self-supervision Meets Vision-Language Learning

Jul 30, 2024
Via arXiv

A New Dataset and Framework for Real-World Blurred Images Super-Resolution

Jul 20, 2024
Via arXiv

Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations

Jul 08, 2024
Via arXiv

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Jul 03, 2024
Via arXiv