Text


Cross-Domain Human Action Recognition from Multiview Motion and Textual Descriptions

Add code
May 21, 2026
Viaarxiv icon

Seeing the Poem: Image-Semantic Detection of AI-Generated Modern Chinese Poetry with MLLMs

Add code
May 21, 2026
Viaarxiv icon

Boiling the Frog: A Multi-Turn Benchmark for Agentic Safety

Add code
May 21, 2026
Viaarxiv icon

Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability

Add code
May 21, 2026
Viaarxiv icon

ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data

Add code
May 21, 2026
Viaarxiv icon

MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems

Add code
May 21, 2026
Viaarxiv icon

Diversed Model Discovery via Structured Table Discovery

Add code
May 21, 2026
Viaarxiv icon

SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation

Add code
May 21, 2026
Viaarxiv icon

Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators

Add code
May 21, 2026
Viaarxiv icon

AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

Add code
May 21, 2026
Viaarxiv icon