Picture for Kai Gao

Kai Gao

Infinite-Instruct: Synthesizing Scaling Code instruction Data with Bidirectional Synthesis and Static Verification

Add code
May 29, 2025
Viaarxiv icon

MuST: Multi-Head Skill Transformer for Long-Horizon Dexterous Manipulation with Skill Progress

Add code
Feb 04, 2025
Viaarxiv icon

Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play

Add code
Jan 31, 2025
Figure 1 for Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play
Figure 2 for Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play
Figure 3 for Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play
Figure 4 for Imitation Game for Adversarial Disillusion with Multimodal Generative Chain-of-Thought Role-Play
Viaarxiv icon

Tabletop Object Rearrangement: Structure, Complexity, and Efficient Combinatorial Search-Based Solutions

Add code
Dec 19, 2024
Viaarxiv icon

A First Look at License Compliance Capability of LLMs in Code Generation

Add code
Aug 05, 2024
Figure 1 for A First Look at License Compliance Capability of LLMs in Code Generation
Figure 2 for A First Look at License Compliance Capability of LLMs in Code Generation
Figure 3 for A First Look at License Compliance Capability of LLMs in Code Generation
Figure 4 for A First Look at License Compliance Capability of LLMs in Code Generation
Viaarxiv icon

Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances

Add code
May 21, 2024
Figure 1 for Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
Figure 2 for Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
Figure 3 for Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
Figure 4 for Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
Viaarxiv icon

Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement

Add code
Apr 10, 2024
Figure 1 for Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement
Figure 2 for Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement
Figure 3 for Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement
Figure 4 for Toward Holistic Planning and Control Optimization for Dual-Arm Rearrangement
Viaarxiv icon

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

Add code
Mar 20, 2024
Figure 1 for MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Figure 2 for MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Figure 3 for MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Figure 4 for MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Viaarxiv icon

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

Add code
Dec 22, 2023
Figure 1 for Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Figure 2 for Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Figure 3 for Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Figure 4 for Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Viaarxiv icon

LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

Add code
Sep 27, 2023
Figure 1 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 2 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 3 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Figure 4 for LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement
Viaarxiv icon