Picture for Chen Chen

Chen Chen

Department of Radiology, Zhejiang Cancer Hospital, Hangzhou, 310022, China, Hangzhou Institute of Medicine

Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model

Add code
May 21, 2025
Figure 1 for Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model
Figure 2 for Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model
Figure 3 for Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model
Figure 4 for Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model
Viaarxiv icon

Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts

Add code
May 20, 2025
Figure 1 for Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts
Figure 2 for Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts
Figure 3 for Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts
Figure 4 for Generalizable Multispectral Land Cover Classification via Frequency-Aware Mixture of Low-Rank Token Experts
Viaarxiv icon

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

Add code
May 20, 2025
Figure 1 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 2 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 3 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Figure 4 for ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Viaarxiv icon

GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing

Add code
May 16, 2025
Figure 1 for GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
Figure 2 for GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
Figure 3 for GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
Figure 4 for GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
Viaarxiv icon

Canny2Palm: Realistic and Controllable Palmprint Generation for Large-scale Pre-training

Add code
May 08, 2025
Viaarxiv icon

SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

Add code
May 05, 2025
Viaarxiv icon

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Add code
May 05, 2025
Viaarxiv icon

Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models

Add code
Apr 25, 2025
Figure 1 for Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
Figure 2 for Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
Figure 3 for Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
Figure 4 for Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
Viaarxiv icon

Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond

Add code
Apr 18, 2025
Viaarxiv icon

Towards deployment-centric multimodal AI beyond vision and language

Add code
Apr 04, 2025
Viaarxiv icon