Chunyi Li

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Mar 02, 2026
Via arXiv

STAR: Bridging Statistical and Agentic Reasoning for Large Model Performance Prediction

Feb 12, 2026
Via arXiv

Free-GVC: Towards Training-Free Extreme Generative Video Compression with Temporal Coherence

Feb 10, 2026
Via arXiv

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Jan 27, 2026
Via arXiv

Embodied Image Compression

Dec 12, 2025
Via arXiv

Using GUI Agent for Electronic Design Automation

Dec 12, 2025
Via arXiv

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Nov 17, 2025
Via arXiv

Data Assessment for Embodied Intelligence

Nov 12, 2025
Via arXiv

AU-IQA: A Benchmark Dataset for Perceptual Quality Assessment of AI-Enhanced User-Generated Content

Aug 07, 2025
Via arXiv

The Ever-Evolving Science Exam

Jul 22, 2025
Via arXiv