Picture for Lei Cui

Lei Cui

Geometric-Mean Policy Optimization

Add code
Jul 28, 2025
Viaarxiv icon

WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Add code
Mar 27, 2025
Viaarxiv icon

PEACE: Empowering Geologic Map Holistic Understanding with MLLMs

Add code
Jan 10, 2025
Figure 1 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 2 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 3 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Figure 4 for PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Viaarxiv icon

MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark

Add code
Dec 19, 2024
Viaarxiv icon

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Add code
Dec 04, 2024
Viaarxiv icon

CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization

Add code
Aug 13, 2024
Figure 1 for CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
Figure 2 for CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
Figure 3 for CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
Figure 4 for CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization
Viaarxiv icon

Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions

Add code
Jun 04, 2024
Figure 1 for Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Figure 2 for Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Figure 3 for Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Figure 4 for Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
Viaarxiv icon

Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

Add code
Apr 04, 2024
Viaarxiv icon