Wenxuan Huang

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Dec 18, 2025

Interleaving Reasoning for Better Text-to-Image Generation

Sep 09, 2025

Learning Only with Images: Visual Reinforcement Learning with Reasoning, Rendering, and Visual Feedback

Jul 28, 2025

AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need

Jun 18, 2025

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Jun 12, 2025

MT$^{3}$: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning

May 26, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing

May 18, 2025

Large Language Model Enhancers for Graph Neural Networks: An Analysis from the Perspective of Causal Mechanism Identification

May 15, 2025

LLM Enhancers for GNNs: An Analysis from the Perspective of Causal Mechanism Identification

May 13, 2025

ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation

May 08, 2025