Picture for Jialun Cao

Jialun Cao

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

Add code
May 29, 2026
Viaarxiv icon

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Add code
May 10, 2026
Viaarxiv icon

From What to How: Bridging User Requirements with Software Development Using Large Language Models

Add code
Feb 14, 2026
Viaarxiv icon

ModelWisdom: An Integrated Toolkit for TLA+ Model Visualization, Digest and Repair

Add code
Feb 12, 2026
Viaarxiv icon

Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval

Add code
Feb 26, 2025
Viaarxiv icon

From Informal to Formal -- Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Add code
Jan 27, 2025
Viaarxiv icon

How Should I Build A Benchmark?

Add code
Jan 18, 2025
Viaarxiv icon

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation

Add code
Dec 24, 2024
Figure 1 for ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Figure 2 for ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Figure 3 for ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Figure 4 for ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Viaarxiv icon

CODECLEANER: Elevating Standards with A Robust Data Contamination Mitigation Toolkit

Add code
Nov 16, 2024
Viaarxiv icon

DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation

Add code
Aug 23, 2024
Figure 1 for DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation
Figure 2 for DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation
Figure 3 for DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation
Figure 4 for DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation
Viaarxiv icon