Picture for Wenya Xie

Wenya Xie

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior

Add code
May 21, 2025
Viaarxiv icon

Knowledge Boundary of Large Language Models: A Survey

Add code
Dec 17, 2024
Figure 1 for Knowledge Boundary of Large Language Models: A Survey
Figure 2 for Knowledge Boundary of Large Language Models: A Survey
Figure 3 for Knowledge Boundary of Large Language Models: A Survey
Figure 4 for Knowledge Boundary of Large Language Models: A Survey
Viaarxiv icon

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

Add code
Jun 26, 2024
Figure 1 for LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
Figure 2 for LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
Figure 3 for LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
Figure 4 for LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
Viaarxiv icon

LLMs Could Autonomously Learn Without External Supervision

Add code
Jun 02, 2024
Viaarxiv icon

MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V

Add code
Nov 23, 2023
Figure 1 for MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V
Figure 2 for MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V
Figure 3 for MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V
Figure 4 for MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V
Viaarxiv icon

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Add code
Nov 16, 2023
Viaarxiv icon

Coarse-to-fine Hybrid 3D Mapping System with Co-calibrated Omnidirectional Camera and Non-repetitive LiDAR

Add code
Feb 08, 2023
Viaarxiv icon