Picture for Wenhao Zheng

Wenhao Zheng

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Add code
May 21, 2025
Viaarxiv icon

Anyprefer: An Agentic Framework for Preference Data Synthesis

Add code
Apr 27, 2025
Figure 1 for Anyprefer: An Agentic Framework for Preference Data Synthesis
Figure 2 for Anyprefer: An Agentic Framework for Preference Data Synthesis
Figure 3 for Anyprefer: An Agentic Framework for Preference Data Synthesis
Figure 4 for Anyprefer: An Agentic Framework for Preference Data Synthesis
Viaarxiv icon

Token Level Routing Inference System for Edge Devices

Add code
Apr 10, 2025
Figure 1 for Token Level Routing Inference System for Edge Devices
Figure 2 for Token Level Routing Inference System for Edge Devices
Figure 3 for Token Level Routing Inference System for Edge Devices
Figure 4 for Token Level Routing Inference System for Edge Devices
Viaarxiv icon

Verifiable Format Control for Large Language Model Generations

Add code
Feb 06, 2025
Viaarxiv icon

CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

Add code
Feb 04, 2025
Figure 1 for CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
Figure 2 for CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
Figure 3 for CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
Figure 4 for CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing
Viaarxiv icon

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Add code
Oct 14, 2024
Figure 1 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 2 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 3 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 4 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Viaarxiv icon

VHELM: A Holistic Evaluation of Vision Language Models

Add code
Oct 09, 2024
Figure 1 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 2 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 3 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 4 for VHELM: A Holistic Evaluation of Vision Language Models
Viaarxiv icon

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Add code
Jun 10, 2024
Figure 1 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 2 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 3 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 4 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Viaarxiv icon

Multimodal Clinical Trial Outcome Prediction with Large Language Models

Add code
Feb 18, 2024
Viaarxiv icon

STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation

Add code
Jun 21, 2023
Figure 1 for STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation
Figure 2 for STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation
Figure 3 for STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation
Figure 4 for STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation
Viaarxiv icon