Picture for Junyu Gao

Junyu Gao

WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code

Add code
Jun 09, 2025
Viaarxiv icon

LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

Add code
Jun 09, 2025
Viaarxiv icon

Scale Efficient Training for Large Datasets

Add code
Mar 17, 2025
Viaarxiv icon

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Add code
Mar 08, 2025
Viaarxiv icon

A Benchmark for Multi-Lingual Vision-Language Learning in Remote Sensing Image Captioning

Add code
Mar 06, 2025
Viaarxiv icon

FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation

Add code
Jan 03, 2025
Viaarxiv icon

SignEye: Traffic Sign Interpretation from Vehicle First-Person View

Add code
Nov 18, 2024
Viaarxiv icon

Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes

Add code
Nov 05, 2024
Figure 1 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 2 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 3 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 4 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Viaarxiv icon

Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models

Add code
Oct 11, 2024
Viaarxiv icon

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Add code
Oct 01, 2024
Figure 1 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 2 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 3 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 4 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Viaarxiv icon