Wayner Barrios

Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation

Mar 16, 2026
Via arXiv

Native LLM and MLLM Inference at Scale on Apple Silicon

Jan 27, 2026
Via arXiv

Multi-layer Learnable Attention Mask for Multimodal Tasks

Jun 04, 2024
Via arXiv

FT2TF: First-Person Statement Text-To-Talking Face Generation

Dec 09, 2023
Via arXiv

Localizing Moments in Long Video Via Multimodal Guidance

Feb 26, 2023
Via arXiv