Picture for Wen Wang

Wen Wang

RMTransformer: Accurate Radio Map Construction and Coverage Prediction

Add code
Jan 11, 2025
Figure 1 for RMTransformer: Accurate Radio Map Construction and Coverage Prediction
Figure 2 for RMTransformer: Accurate Radio Map Construction and Coverage Prediction
Figure 3 for RMTransformer: Accurate Radio Map Construction and Coverage Prediction
Viaarxiv icon

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Add code
Jan 10, 2025
Figure 1 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 2 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 3 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 4 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Viaarxiv icon

RadioTransformer: Accurate Radio Map Construction and Coverage Prediction

Add code
Jan 09, 2025
Figure 1 for RadioTransformer: Accurate Radio Map Construction and Coverage Prediction
Figure 2 for RadioTransformer: Accurate Radio Map Construction and Coverage Prediction
Figure 3 for RadioTransformer: Accurate Radio Map Construction and Coverage Prediction
Viaarxiv icon

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Add code
Dec 19, 2024
Figure 1 for LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Figure 2 for LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Figure 3 for LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Figure 4 for LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Viaarxiv icon

AniDoc: Animation Creation Made Easier

Add code
Dec 18, 2024
Viaarxiv icon

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models

Add code
Dec 13, 2024
Figure 1 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 2 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 3 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 4 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Viaarxiv icon

Multi-Functional RIS Integrated Sensing and Communications for 6G Networks

Add code
Dec 02, 2024
Figure 1 for Multi-Functional RIS Integrated Sensing and Communications for 6G Networks
Figure 2 for Multi-Functional RIS Integrated Sensing and Communications for 6G Networks
Figure 3 for Multi-Functional RIS Integrated Sensing and Communications for 6G Networks
Figure 4 for Multi-Functional RIS Integrated Sensing and Communications for 6G Networks
Viaarxiv icon

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

Add code
Nov 22, 2024
Figure 1 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 2 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 3 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Figure 4 for MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Viaarxiv icon

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Add code
Nov 15, 2024
Figure 1 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 2 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 3 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 4 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Viaarxiv icon

MagicQuill: An Intelligent Interactive Image Editing System

Add code
Nov 14, 2024
Viaarxiv icon