Picture for Te Yang

Te Yang

One Ring to Rule Them All: Unifying Group-Based RL via Dynamic Power-Mean Geometry

Add code
Jan 30, 2026
Viaarxiv icon

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Add code
Nov 23, 2024
Figure 1 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 2 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 3 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 4 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Viaarxiv icon

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation

Add code
May 11, 2024
Viaarxiv icon

Knowledge Condensation and Reasoning for Knowledge-based VQA

Add code
Mar 15, 2024
Figure 1 for Knowledge Condensation and Reasoning for Knowledge-based VQA
Figure 2 for Knowledge Condensation and Reasoning for Knowledge-based VQA
Figure 3 for Knowledge Condensation and Reasoning for Knowledge-based VQA
Figure 4 for Knowledge Condensation and Reasoning for Knowledge-based VQA
Viaarxiv icon