Picture for Yuchen Tian

Yuchen Tian

CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?

Add code
Aug 20, 2024
Viaarxiv icon

CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

Add code
Apr 30, 2024
Viaarxiv icon

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Add code
Apr 15, 2024
Viaarxiv icon

Token Alignment via Character Matching for Subword Completion

Add code
Mar 13, 2024
Viaarxiv icon

CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation

Add code
Oct 08, 2023
Viaarxiv icon

A Static Evaluation of Code Completion by Large Language Models

Add code
Jun 05, 2023
Viaarxiv icon

Greener yet Powerful: Taming Large Code Generation Models with Quantization

Add code
Mar 09, 2023
Viaarxiv icon

Multi-lingual Evaluation of Code Generation Models

Add code
Oct 26, 2022
Viaarxiv icon