Tong Niu

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Jun 16, 2025
Via arXiv

JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking

Oct 31, 2024
Via arXiv

Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification

Oct 05, 2024
Via arXiv

Mixture of Prompt Learning for Vision Language Models

Sep 18, 2024
Via arXiv

Solution-oriented Agent-based Models Generation with Verifier-assisted Iterative In-context Learning

Feb 04, 2024
Via arXiv

General Automatic Solution Generation of Social Problems

Jan 25, 2024
Via arXiv

Parameter-Efficient Detoxification with Contrastive Decoding

Jan 13, 2024
Via arXiv

DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text

Oct 31, 2023
Via arXiv

XGen-7B Technical Report

Sep 07, 2023
Via arXiv

Attention-based 3D CNN with Multi-layer Features for Alzheimer's Disease Diagnosis using Brain Images

Aug 10, 2023
Via arXiv