Picture for Xuanliang Zhang

Xuanliang Zhang

How Do Language Models Understand Tables? A Mechanistic Analysis of Cell Location

Add code
Feb 09, 2026
Viaarxiv icon

When Does Context Help? Error Dynamics of Contextual Information in Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains

Add code
Nov 14, 2025
Figure 1 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 2 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 3 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 4 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Viaarxiv icon

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Add code
Sep 16, 2025
Figure 1 for FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
Figure 2 for FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
Figure 3 for FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
Figure 4 for FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning
Viaarxiv icon

A Survey on Latent Reasoning

Add code
Jul 08, 2025
Figure 1 for A Survey on Latent Reasoning
Figure 2 for A Survey on Latent Reasoning
Figure 3 for A Survey on Latent Reasoning
Figure 4 for A Survey on Latent Reasoning
Viaarxiv icon

RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals

Add code
May 21, 2025
Viaarxiv icon

Abacus-SQL: A Text-to-SQL System Empowering Cross-Domain and Open-Domain Database Retrieval

Add code
Apr 14, 2025
Viaarxiv icon

MULTITAT: Benchmarking Multilingual Table-and-Text Question Answering

Add code
Feb 24, 2025
Viaarxiv icon

SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types

Add code
Dec 16, 2024
Figure 1 for SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Figure 2 for SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Figure 3 for SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Figure 4 for SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Viaarxiv icon

DAC: Decomposed Automation Correction for Text-to-SQL

Add code
Aug 16, 2024
Figure 1 for DAC: Decomposed Automation Correction for Text-to-SQL
Figure 2 for DAC: Decomposed Automation Correction for Text-to-SQL
Figure 3 for DAC: Decomposed Automation Correction for Text-to-SQL
Figure 4 for DAC: Decomposed Automation Correction for Text-to-SQL
Viaarxiv icon