Picture for Han Wang

Han Wang

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Add code
Oct 17, 2024
Figure 1 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 2 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 3 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 4 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Viaarxiv icon

Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps

Add code
Oct 14, 2024
Figure 1 for Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Figure 2 for Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Figure 3 for Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Figure 4 for Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps
Viaarxiv icon

REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds

Add code
Oct 12, 2024
Figure 1 for REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Figure 2 for REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Figure 3 for REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Figure 4 for REGNet V2: End-to-End REgion-based Grasp Detection Network for Grippers of Different Sizes in Point Clouds
Viaarxiv icon

Bridging OOD Detection and Generalization: A Graph-Theoretic View

Add code
Sep 26, 2024
Viaarxiv icon

AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge

Add code
Sep 11, 2024
Figure 1 for AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Figure 2 for AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Figure 3 for AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Figure 4 for AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Viaarxiv icon

q-exponential family for policy optimization

Add code
Aug 14, 2024
Viaarxiv icon

MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili

Add code
Jul 28, 2024
Figure 1 for MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Figure 2 for MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Figure 3 for MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Figure 4 for MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Viaarxiv icon

Disentangling Masked Autoencoders for Unsupervised Domain Generalization

Add code
Jul 10, 2024
Figure 1 for Disentangling Masked Autoencoders for Unsupervised Domain Generalization
Figure 2 for Disentangling Masked Autoencoders for Unsupervised Domain Generalization
Figure 3 for Disentangling Masked Autoencoders for Unsupervised Domain Generalization
Figure 4 for Disentangling Masked Autoencoders for Unsupervised Domain Generalization
Viaarxiv icon

Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey

Add code
Jul 05, 2024
Figure 1 for Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey
Figure 2 for Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey
Figure 3 for Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey
Figure 4 for Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey
Viaarxiv icon

A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding

Add code
Jul 02, 2024
Figure 1 for A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Figure 2 for A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Figure 3 for A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Figure 4 for A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Viaarxiv icon