Alert button
Picture for Kashu Yamazaki

Kashu Yamazaki

Alert button

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

Add code
Bookmark button
Alert button
Oct 05, 2023
Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

Figure 1 for Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Figure 2 for Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Figure 3 for Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Figure 4 for Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Viaarxiv icon

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

Add code
Bookmark button
Alert button
Jun 12, 2023
Kashu Yamazaki, Taisei Hanyu, Minh Tran, Adrian Garcia, Anh Tran, Roy McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Andrew Molthan, Jackson Cothren, Ngan Le

Figure 1 for AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
Figure 2 for AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
Figure 3 for AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
Figure 4 for AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
Viaarxiv icon

Contextual Explainable Video Representation: Human Perception-based Understanding

Add code
Bookmark button
Alert button
Dec 17, 2022
Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le

Figure 1 for Contextual Explainable Video Representation: Human Perception-based Understanding
Figure 2 for Contextual Explainable Video Representation: Human Perception-based Understanding
Figure 3 for Contextual Explainable Video Representation: Human Perception-based Understanding
Figure 4 for Contextual Explainable Video Representation: Human Perception-based Understanding
Viaarxiv icon

Contextual Explainable Video Representation:\\Human Perception-based Understanding

Add code
Bookmark button
Alert button
Dec 12, 2022
Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le

Figure 1 for Contextual Explainable Video Representation:\\Human Perception-based Understanding
Figure 2 for Contextual Explainable Video Representation:\\Human Perception-based Understanding
Figure 3 for Contextual Explainable Video Representation:\\Human Perception-based Understanding
Figure 4 for Contextual Explainable Video Representation:\\Human Perception-based Understanding
Viaarxiv icon

CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

Add code
Bookmark button
Alert button
Dec 09, 2022
Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le

Figure 1 for CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Figure 2 for CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Figure 3 for CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Figure 4 for CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Viaarxiv icon

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Add code
Bookmark button
Alert button
Nov 28, 2022
Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le

Figure 1 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 2 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 3 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Figure 4 for VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Viaarxiv icon

AISFormer: Amodal Instance Segmentation with Transformer

Add code
Bookmark button
Alert button
Oct 13, 2022
Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

Figure 1 for AISFormer: Amodal Instance Segmentation with Transformer
Figure 2 for AISFormer: Amodal Instance Segmentation with Transformer
Figure 3 for AISFormer: Amodal Instance Segmentation with Transformer
Figure 4 for AISFormer: Amodal Instance Segmentation with Transformer
Viaarxiv icon

AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

Add code
Bookmark button
Alert button
Oct 05, 2022
Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le

Viaarxiv icon

VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Add code
Bookmark button
Alert button
Jun 26, 2022
Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

Figure 1 for VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Figure 2 for VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Figure 3 for VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Figure 4 for VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Viaarxiv icon