Shuai Zhao

DPNet: Dynamic Pooling Network for Tiny Object Detection

May 05, 2025
Via arXiv

Exploring Cognitive and Aesthetic Causality for Multimodal Aspect-Based Sentiment Analysis

Apr 22, 2025
Via arXiv

Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation

Apr 17, 2025
Via arXiv

Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Apr 14, 2025
Via arXiv

Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined Behaviors

Mar 04, 2025
Via arXiv

T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting

Feb 28, 2025
Via arXiv

CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base

Feb 18, 2025
Via arXiv

Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education

Feb 09, 2025
Via arXiv

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025
Via arXiv

Enhancing Multimodal Entity Linking with Jaccard Distance-based Conditional Contrastive Learning and Contextual Visual Augmentation

Jan 24, 2025
Via arXiv