Picture for Wenhao Jiang

Wenhao Jiang

Guangdong Laboratory of Artificial Intelligence and Digital Economy

Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding

Add code
Mar 19, 2026
Viaarxiv icon

OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models

Add code
Mar 10, 2026
Viaarxiv icon

LEA: Label Enumeration Attack in Vertical Federated Learning

Add code
Mar 04, 2026
Viaarxiv icon

Enhancing Geometric Perception in VLMs via Translator-Guided Reinforcement Learning

Add code
Feb 26, 2026
Viaarxiv icon

BEAP-Agent: Backtrackable Execution and Adaptive Planning for GUI Agents

Add code
Jan 29, 2026
Viaarxiv icon

RM-Distiller: Exploiting Generative LLM for Reward Model Distillation

Add code
Jan 20, 2026
Viaarxiv icon

SDE-SQL: Enhancing Text-to-SQL Generation in Large Language Models via Self-Driven Exploration with SQL Probes

Add code
Jun 08, 2025
Viaarxiv icon

SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

Add code
May 30, 2025
Viaarxiv icon

LlamaSeg: Image Segmentation via Autoregressive Mask Generation

Add code
May 26, 2025
Viaarxiv icon

ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL

Add code
May 19, 2025
Viaarxiv icon