Tong Yang

Michael Pokorny

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Dec 14, 2025
Via arXiv

LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation

Nov 09, 2025
Via arXiv

MergeMoE: Efficient Compression of MoE Models via Expert Output Merging

Oct 16, 2025
Via arXiv

Fast Visuomotor Policy for Robotic Manipulation

Oct 14, 2025
Via arXiv

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Sep 04, 2025
Via arXiv

Adapting Foundation Model for Dental Caries Detection with Dual-View Co-Training

Aug 28, 2025
Via arXiv

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Aug 11, 2025
Via arXiv

Fairy$\pm i$: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$

Aug 07, 2025
Via arXiv

FAF: A Feature-Adaptive Framework for Few-Shot Time Series Forecasting

Jun 24, 2025
Via arXiv

SciDA: Scientific Dynamic Assessor of LLMs

Jun 15, 2025
Via arXiv