Feifan Song

Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding

Jun 09, 2025
Via arXiv

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

May 19, 2025
Via arXiv

Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation

Mar 11, 2025
Via arXiv

MPO: Boosting LLM Agents with Meta Plan Optimization

Mar 04, 2025
Via arXiv

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Oct 10, 2024
Via arXiv

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Sep 04, 2024
Via arXiv

Learning Spatial Similarity Distribution for Few-shot Object Counting

May 20, 2024
Via arXiv

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Mar 30, 2024
Via arXiv

ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

Feb 14, 2024
Via arXiv

Making Large Language Models Better Reasoners with Alignment

Sep 05, 2023
Via arXiv