Picture for Cunxiang Wang

Cunxiang Wang

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

Unlocking Recursive Thinking of LLMs: Alignment via Refinement

Add code
Jun 06, 2025
Viaarxiv icon

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Add code
Mar 27, 2025
Viaarxiv icon

StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Add code
Mar 13, 2025
Viaarxiv icon

LongSafety: Evaluating Long-Context Safety of Large Language Models

Add code
Feb 24, 2025
Viaarxiv icon

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Add code
Feb 18, 2025
Figure 1 for HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
Figure 2 for HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
Figure 3 for HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
Figure 4 for HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
Viaarxiv icon

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Add code
Dec 16, 2024
Figure 1 for SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Figure 2 for SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Figure 3 for SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Figure 4 for SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Viaarxiv icon

CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search

Add code
Dec 03, 2024
Figure 1 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 2 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 3 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Figure 4 for CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Viaarxiv icon

Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall

Add code
Oct 31, 2024
Viaarxiv icon

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Add code
Aug 15, 2024
Figure 1 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 2 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 3 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Figure 4 for RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Viaarxiv icon