Picture for Jiayi Shi

Jiayi Shi

Learning More from Less: Unlocking Internal Representations for Benchmark Compression

Add code
Feb 03, 2026
Viaarxiv icon

Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling

Add code
Jan 29, 2026
Viaarxiv icon

A Capsule-Sized Multi-Wavelength Wireless Optical System for Edge-AI-Based Classification of Gastrointestinal Bleeding Flow Rate

Add code
Jan 25, 2026
Viaarxiv icon

Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules

Add code
May 30, 2025
Viaarxiv icon

Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator

Add code
May 27, 2025
Figure 1 for Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
Figure 2 for Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
Figure 3 for Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
Figure 4 for Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
Viaarxiv icon

Speculative Decoding for Multi-Sample Inference

Add code
Mar 07, 2025
Figure 1 for Speculative Decoding for Multi-Sample Inference
Figure 2 for Speculative Decoding for Multi-Sample Inference
Figure 3 for Speculative Decoding for Multi-Sample Inference
Figure 4 for Speculative Decoding for Multi-Sample Inference
Viaarxiv icon

Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation

Add code
Feb 27, 2025
Figure 1 for Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Figure 2 for Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Figure 3 for Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Figure 4 for Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
Viaarxiv icon

From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN

Add code
Feb 19, 2025
Figure 1 for From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
Figure 2 for From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
Figure 3 for From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
Figure 4 for From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
Viaarxiv icon

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation

Add code
Feb 19, 2025
Figure 1 for Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
Figure 2 for Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
Figure 3 for Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
Figure 4 for Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
Viaarxiv icon

InsBank: Evolving Instruction Subset for Ongoing Alignment

Add code
Feb 17, 2025
Figure 1 for InsBank: Evolving Instruction Subset for Ongoing Alignment
Figure 2 for InsBank: Evolving Instruction Subset for Ongoing Alignment
Figure 3 for InsBank: Evolving Instruction Subset for Ongoing Alignment
Figure 4 for InsBank: Evolving Instruction Subset for Ongoing Alignment
Viaarxiv icon