Jingyi Wang

TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured Table Question Answering

Jun 04, 2025

Blind Spot Navigation: Evolutionary Discovery of Sensitive Semantic Concepts for LVLMs

May 21, 2025

Convergence Rates of Constrained Expected Improvement

May 16, 2025

PRUNE: A Patching Based Repair Framework for Certifiable Unlearning of Neural Networks

May 10, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025

CHARMS: Cognitive Hierarchical Agent with Reasoning and Motion Styles

Apr 03, 2025

VisNumBench: Evaluating Number Sense of Multimodal Large Language Models

Mar 19, 2025

Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities

Jan 17, 2025

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Jan 16, 2025

On the convergence of noisy Bayesian Optimization with Expected Improvement

Jan 16, 2025