Picture for Xiaokang Zhang

Xiaokang Zhang

Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts

Add code
Jan 15, 2026
Viaarxiv icon

Geospatial Foundation Models to Enable Progress on Sustainable Development Goals

Add code
May 30, 2025
Viaarxiv icon

Semantic-Spatial Feature Fusion with Dynamic Graph Refinement for Remote Sensing Image Captioning

Add code
Mar 30, 2025
Figure 1 for Semantic-Spatial Feature Fusion with Dynamic Graph Refinement for Remote Sensing Image Captioning
Figure 2 for Semantic-Spatial Feature Fusion with Dynamic Graph Refinement for Remote Sensing Image Captioning
Figure 3 for Semantic-Spatial Feature Fusion with Dynamic Graph Refinement for Remote Sensing Image Captioning
Figure 4 for Semantic-Spatial Feature Fusion with Dynamic Graph Refinement for Remote Sensing Image Captioning
Viaarxiv icon

CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation

Add code
Mar 30, 2025
Figure 1 for CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
Figure 2 for CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
Figure 3 for CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
Figure 4 for CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
Viaarxiv icon

OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale

Add code
Mar 04, 2025
Viaarxiv icon

Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL

Add code
Feb 17, 2025
Figure 1 for Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
Figure 2 for Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
Figure 3 for Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
Figure 4 for Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
Viaarxiv icon

Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation

Add code
Jan 13, 2025
Figure 1 for Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation
Figure 2 for Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation
Figure 3 for Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation
Figure 4 for Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation
Viaarxiv icon

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis

Add code
Jan 03, 2025
Figure 1 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 2 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 3 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 4 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Viaarxiv icon

Dynamic Scaling of Unit Tests for Code Reward Modeling

Add code
Jan 02, 2025
Figure 1 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 2 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 3 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 4 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Viaarxiv icon

DeepSeek-V3 Technical Report

Add code
Dec 27, 2024
Figure 1 for DeepSeek-V3 Technical Report
Figure 2 for DeepSeek-V3 Technical Report
Figure 3 for DeepSeek-V3 Technical Report
Figure 4 for DeepSeek-V3 Technical Report
Viaarxiv icon