Ruoxi Jia

Virginia Tech

Characterizing Model-Native Skills

Apr 19, 2026
Via arXiv

From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents

Mar 19, 2026
Via arXiv

Diagnosing and Repairing Citation Failures in Generative Engine Optimization

Mar 10, 2026
Via arXiv

Understanding and Preserving Safety in Fine-Tuned LLMs

Jan 15, 2026
Via arXiv

A Sustainable AI Economy Needs Data Deals That Work for Generators

Jan 15, 2026
Via arXiv

Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance

Jan 06, 2026
Via arXiv

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Dec 30, 2025
Via arXiv

More Than the Final Answer: Improving Visual Extraction and Logical Consistency in Vision-Language Models

Dec 13, 2025
Via arXiv

Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls

Oct 02, 2025
Via arXiv

Quagmires in SFT-RL Post-Training: When High SFT Scores Mislead and What to Use Instead

Oct 02, 2025
Via arXiv