Picture for Jey Han Lau

Jey Han Lau

Predicting Sentence Acceptability Judgments in Multimodal Contexts

Add code
Feb 24, 2026
Viaarxiv icon

GPSBench: Do Large Language Models Understand GPS Coordinates?

Add code
Feb 18, 2026
Viaarxiv icon

Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG

Add code
Jan 15, 2026
Viaarxiv icon

Investigating The Functional Roles of Attention Heads in Vision Language Models: Evidence for Reasoning Modules

Add code
Dec 11, 2025
Figure 1 for Investigating The Functional Roles of Attention Heads in Vision Language Models: Evidence for Reasoning Modules
Figure 2 for Investigating The Functional Roles of Attention Heads in Vision Language Models: Evidence for Reasoning Modules
Figure 3 for Investigating The Functional Roles of Attention Heads in Vision Language Models: Evidence for Reasoning Modules
Figure 4 for Investigating The Functional Roles of Attention Heads in Vision Language Models: Evidence for Reasoning Modules
Viaarxiv icon

PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learning

Add code
Nov 13, 2025
Viaarxiv icon

Understanding the Geospatial Reasoning Capabilities of LLMs: A Trajectory Recovery Perspective

Add code
Oct 02, 2025
Viaarxiv icon

Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task

Add code
May 28, 2025
Figure 1 for Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task
Figure 2 for Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task
Figure 3 for Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task
Figure 4 for Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task
Viaarxiv icon

FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation

Add code
Apr 24, 2025
Figure 1 for FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Figure 2 for FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Figure 3 for FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Figure 4 for FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Viaarxiv icon

Moderation Matters:Measuring Conversational Moderation Impact in English as a Second Language Group Discussion

Add code
Feb 24, 2025
Viaarxiv icon

Analysis of Emotion in Rumour Threads on Social Media

Add code
Feb 23, 2025
Viaarxiv icon