Picture for Jing Yang

Jing Yang

Self-Rewarded Multimodal Coherent Reasoning Across Diverse Visual Domains

Add code
Dec 27, 2025
Viaarxiv icon

RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks

Add code
Dec 24, 2025
Viaarxiv icon

FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models

Add code
Dec 23, 2025
Figure 1 for FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
Figure 2 for FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
Figure 3 for FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
Figure 4 for FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
Viaarxiv icon

SirenPose: Dynamic Scene Reconstruction via Geometric Supervision

Add code
Dec 23, 2025
Viaarxiv icon

Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction

Add code
Dec 21, 2025
Figure 1 for Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction
Figure 2 for Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction
Figure 3 for Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction
Figure 4 for Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction
Viaarxiv icon

Large Language Models as Discounted Bayesian Filters

Add code
Dec 20, 2025
Viaarxiv icon

MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models

Add code
Dec 09, 2025
Viaarxiv icon

Insights from the ICLR Peer Review and Rebuttal Process

Add code
Nov 19, 2025
Viaarxiv icon

3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale

Add code
Nov 17, 2025
Viaarxiv icon

Cost-Effective Communication: An Auction-based Method for Language Agent Interaction

Add code
Nov 17, 2025
Viaarxiv icon