Jian Zhang

Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay

Oct 16, 2024
Via arXiv

Dual-Teacher Ensemble Models with Double-Copy-Paste for 3D Semi-Supervised Medical Image Segmentation

Oct 15, 2024
Via arXiv

DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation

Oct 14, 2024
Via arXiv

Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering

Oct 12, 2024
Via arXiv

FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models

Oct 03, 2024
Via arXiv

Generative AI Application for Building Industry

Oct 01, 2024
Via arXiv

FastTalker: Jointly Generating Speech and Conversational Gestures from Text

Sep 24, 2024
Via arXiv

EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion

Sep 11, 2024
Via arXiv

MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR

Aug 30, 2024
Via arXiv

GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation

Aug 21, 2024
Via arXiv