Zining Zhu

Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim $\rightarrow$ Evidence Reasoning

Jun 09, 2025

KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning

May 24, 2025

Truth Neurons

May 18, 2025

Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce

May 18, 2025

FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Mar 26, 2025

Contrastive Similarity Learning for Market Forecasting: The ContraSim Framework

Feb 22, 2025

Unhackable Temporal Rewarding for Scalable Video MLLMs

Feb 17, 2025

PerPO: Perceptual Preference Optimization via Discriminative Rewarding

Feb 05, 2025

INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent

Dec 24, 2024

Show, Don't Tell: Uncovering Implicit Character Portrayal using LLMs

Dec 05, 2024