Picture for Jiale Zhao

Jiale Zhao

Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Add code
Mar 04, 2026
Viaarxiv icon

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Add code
Mar 03, 2026
Viaarxiv icon

RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation

Add code
Jan 13, 2026
Viaarxiv icon

Evaluating Frontier LLMs on PhD-Level Mathematical Reasoning: A Benchmark on a Textbook in Theoretical Computer Science about Randomized Algorithms

Add code
Dec 16, 2025
Viaarxiv icon

Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment

Add code
Oct 23, 2025
Viaarxiv icon

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Add code
Aug 23, 2025
Viaarxiv icon

One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models

Add code
Jul 10, 2025
Viaarxiv icon

TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Add code
May 22, 2025
Figure 1 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 2 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 3 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Figure 4 for TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Viaarxiv icon

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Add code
May 08, 2025
Viaarxiv icon

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Add code
May 01, 2025
Viaarxiv icon