Kunli Zhang

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Multi-Domain Reasoning Challenges

May 24, 2025
Via arXiv

JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling

May 20, 2025
Via arXiv

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

Jul 06, 2021
Via arXiv