Picture for Fan Yang

Fan Yang

refer to the report for detailed contributions

LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs

Add code
Mar 14, 2025
Figure 1 for LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs
Figure 2 for LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs
Figure 3 for LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs
Figure 4 for LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs
Viaarxiv icon

TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs

Add code
Mar 13, 2025
Viaarxiv icon

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video

Add code
Mar 12, 2025
Figure 1 for SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
Figure 2 for SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
Figure 3 for SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
Figure 4 for SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
Viaarxiv icon

Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding

Add code
Mar 12, 2025
Viaarxiv icon

Resource-Efficient Affordance Grounding with Complementary Depth and Semantic Prompts

Add code
Mar 04, 2025
Viaarxiv icon

One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes

Add code
Mar 03, 2025
Viaarxiv icon

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping

Add code
Feb 27, 2025
Figure 1 for Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Figure 2 for Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Figure 3 for Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Figure 4 for Multi-Keypoint Affordance Representation for Functional Dexterous Grasping
Viaarxiv icon

LongRoPE2: Near-Lossless LLM Context Window Scaling

Add code
Feb 27, 2025
Figure 1 for LongRoPE2: Near-Lossless LLM Context Window Scaling
Figure 2 for LongRoPE2: Near-Lossless LLM Context Window Scaling
Figure 3 for LongRoPE2: Near-Lossless LLM Context Window Scaling
Figure 4 for LongRoPE2: Near-Lossless LLM Context Window Scaling
Viaarxiv icon

PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems

Add code
Feb 26, 2025
Figure 1 for PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems
Figure 2 for PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems
Figure 3 for PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems
Figure 4 for PCL: Prompt-based Continual Learning for User Modeling in Recommender Systems
Viaarxiv icon

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

Add code
Feb 26, 2025
Figure 1 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 2 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 3 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 4 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Viaarxiv icon