Picture for Chenyan Xiong

Chenyan Xiong

Microsoft Research

Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation

Add code
Jul 21, 2024
Viaarxiv icon

In-Context Probing Approximates Influence Function for Data Valuation

Add code
Jul 17, 2024
Viaarxiv icon

ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents

Add code
Jun 13, 2024
Viaarxiv icon

MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

Add code
Jun 10, 2024
Viaarxiv icon

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Add code
May 13, 2024
Viaarxiv icon

Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval

Add code
Apr 05, 2024
Viaarxiv icon

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression

Add code
Feb 25, 2024
Viaarxiv icon

Cleaner Pretraining Corpus Curation with Neural Web Scraping

Add code
Feb 22, 2024
Viaarxiv icon

ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance

Add code
Feb 21, 2024
Viaarxiv icon

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning

Add code
Feb 21, 2024
Viaarxiv icon