Picture for Wenming Yang

Wenming Yang

TACO: Think-Answer Consistency for Optimized Long-Chain Reasoning and Efficient Data Learning via Reinforcement Learning in LVLMs

Add code
May 27, 2025
Viaarxiv icon

PathoSCOPE: Few-Shot Pathology Detection via Self-Supervised Contrastive Learning and Pathology-Informed Synthetic Embeddings

Add code
May 23, 2025
Viaarxiv icon

UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval

Add code
Apr 14, 2025
Viaarxiv icon

VISTA: Unsupervised 2D Temporal Dependency Representations for Time Series Anomaly Detection

Add code
Apr 03, 2025
Viaarxiv icon

DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval

Add code
Mar 06, 2025
Viaarxiv icon

GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning

Add code
Mar 04, 2025
Viaarxiv icon

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Add code
Jan 18, 2025
Figure 1 for Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
Figure 2 for Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
Figure 3 for Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
Figure 4 for Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
Viaarxiv icon

DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration

Add code
Jan 17, 2025
Viaarxiv icon

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Add code
Nov 19, 2024
Figure 1 for CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs
Figure 2 for CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs
Figure 3 for CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs
Figure 4 for CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs
Viaarxiv icon

Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection

Add code
Nov 05, 2024
Figure 1 for Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Figure 2 for Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Figure 3 for Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Figure 4 for Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Viaarxiv icon