Penglei Sun

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents

Feb 26, 2026
Via arXiv

Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

Feb 23, 2026
Via arXiv

OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation

Feb 09, 2026
Via arXiv

MRD-RAG: Enhancing Medical Diagnosis with Multi-Round Retrieval-Augmented Generation

Apr 10, 2025
Via arXiv

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Feb 18, 2025
Via arXiv

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective

Oct 14, 2024
Via arXiv

3D Question Answering for City Scene Understanding

Jul 24, 2024
Via arXiv

Multi-Task Domain Adaptation for Language Grounding with 3D Objects

Jul 03, 2024
Via arXiv

A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics

Dec 04, 2023
Via arXiv

Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding

Jan 27, 2023
Via arXiv