Picture for Chao Wang

Chao Wang

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM

Add code
Apr 22, 2025
Viaarxiv icon

Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning

Add code
Apr 21, 2025
Viaarxiv icon

Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching

Add code
Apr 08, 2025
Viaarxiv icon

StarFlow: Generating Structured Workflow Outputs From Sketch Images

Add code
Mar 27, 2025
Viaarxiv icon

Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

Add code
Mar 25, 2025
Viaarxiv icon

RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment

Add code
Mar 18, 2025
Viaarxiv icon

DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models

Add code
Mar 17, 2025
Viaarxiv icon

Can LLMs Formally Reason as Abstract Interpreters for Program Analysis?

Add code
Mar 16, 2025
Viaarxiv icon

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Add code
Mar 07, 2025
Figure 1 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 2 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 3 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 4 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Viaarxiv icon

TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction

Add code
Mar 06, 2025
Viaarxiv icon