Picture for Xiao Wang

Xiao Wang

School of Computer and Information, Hefei University of Technology, China

Enhancing Hepatopathy Clinical Trial Efficiency: A Secure, Large Language Model-Powered Pre-Screening Pipeline

Add code
Feb 25, 2025
Viaarxiv icon

Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric

Add code
Feb 25, 2025
Figure 1 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 2 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 3 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 4 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Viaarxiv icon

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Add code
Feb 20, 2025
Figure 1 for SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Figure 2 for SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Figure 3 for SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Figure 4 for SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Viaarxiv icon

RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision

Add code
Feb 19, 2025
Figure 1 for RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision
Figure 2 for RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision
Figure 3 for RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision
Figure 4 for RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision
Viaarxiv icon

Censor Dependent Variational Inference

Add code
Feb 13, 2025
Viaarxiv icon

EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition

Add code
Feb 13, 2025
Viaarxiv icon

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Add code
Feb 11, 2025
Viaarxiv icon

Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark

Add code
Feb 08, 2025
Figure 1 for Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
Figure 2 for Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
Figure 3 for Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
Figure 4 for Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
Viaarxiv icon

XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion

Add code
Feb 08, 2025
Figure 1 for XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Figure 2 for XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Figure 3 for XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Figure 4 for XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
Viaarxiv icon

Sparse Measurement Medical CT Reconstruction using Multi-Fused Block Matching Denoising Priors

Add code
Feb 03, 2025
Viaarxiv icon