Picture for Wenhao Huang

Wenhao Huang

DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion

Add code
Jul 17, 2024
Viaarxiv icon

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Add code
Jun 26, 2024
Figure 1 for LongIns: A Challenging Long-context Instruction-based Exam for LLMs
Figure 2 for LongIns: A Challenging Long-context Instruction-based Exam for LLMs
Figure 3 for LongIns: A Challenging Long-context Instruction-based Exam for LLMs
Figure 4 for LongIns: A Challenging Long-context Instruction-based Exam for LLMs
Viaarxiv icon

GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

Add code
Jun 24, 2024
Figure 1 for GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models
Figure 2 for GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models
Figure 3 for GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models
Figure 4 for GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models
Viaarxiv icon

Intensity Confusion Matters: An Intensity-Distance Guided Loss for Bronchus Segmentation

Add code
Jun 23, 2024
Viaarxiv icon

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Add code
Jun 20, 2024
Figure 1 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 2 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 3 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Figure 4 for PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents
Viaarxiv icon

MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language

Add code
Jun 19, 2024
Viaarxiv icon

DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?

Add code
Jun 18, 2024
Figure 1 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 2 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 3 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 4 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Viaarxiv icon

Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction

Add code
Jun 17, 2024
Figure 1 for Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Figure 2 for Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Figure 3 for Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Figure 4 for Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction
Viaarxiv icon

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

Add code
Jun 11, 2024
Viaarxiv icon

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Add code
May 29, 2024
Viaarxiv icon