Language Modelling


Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments

Jun 16, 2025
Via arXiv

C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation

Jun 16, 2025
Via arXiv

SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models

Jun 15, 2025
Via arXiv

We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems

Jun 16, 2025
Via arXiv

Assessing the Role of Data Quality in Training Bilingual Language Models

Jun 15, 2025
Via arXiv

Domain Specific Benchmarks for Evaluating Multimodal Large Language Models

Jun 15, 2025
Via arXiv

Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning

Jun 16, 2025
Via arXiv

Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context

Jun 15, 2025
Via arXiv

Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics

Jun 16, 2025
Via arXiv

Block-wise Adaptive Caching for Accelerating Diffusion Policy

Jun 16, 2025
Via arXiv