Jian Wu

SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis

Jun 09, 2025
Via arXiv

MT$^{3}$: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning

May 26, 2025
Via arXiv

Towards Reliable Large Audio Language Model

May 25, 2025
Via arXiv

Practical Equivalence Testing and Its Application in Synthetic Pre-Crash Scenario Validation

May 19, 2025
Via arXiv

Dual-level Fuzzy Learning with Patch Guidance for Image Ordinal Regression

May 09, 2025
Via arXiv

Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading

May 01, 2025
Via arXiv

OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding

Apr 20, 2025
Via arXiv

Matrix Factorization with Dynamic Multi-view Clustering for Recommender System

Apr 20, 2025
Via arXiv

From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs

Apr 15, 2025
Via arXiv

ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings

Apr 15, 2025
Via arXiv