Picture for Kai Chen

Kai Chen

Tony

Enhancing Logical Expressiveness in Graph Neural Networks via Path-Neighbor Aggregation

Add code
Nov 13, 2025
Figure 1 for Enhancing Logical Expressiveness in Graph Neural Networks via Path-Neighbor Aggregation
Figure 2 for Enhancing Logical Expressiveness in Graph Neural Networks via Path-Neighbor Aggregation
Figure 3 for Enhancing Logical Expressiveness in Graph Neural Networks via Path-Neighbor Aggregation
Figure 4 for Enhancing Logical Expressiveness in Graph Neural Networks via Path-Neighbor Aggregation
Viaarxiv icon

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Add code
Nov 11, 2025
Figure 1 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 2 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 3 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Figure 4 for How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
Viaarxiv icon

DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry

Add code
Oct 25, 2025
Figure 1 for DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
Figure 2 for DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
Figure 3 for DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
Figure 4 for DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
Viaarxiv icon

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Add code
Sep 18, 2025
Figure 1 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 2 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 3 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 4 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Viaarxiv icon

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Add code
Aug 27, 2025
Viaarxiv icon

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark

Add code
Aug 26, 2025
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Add code
Aug 12, 2025
Figure 1 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 2 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 3 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Figure 4 for InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling
Viaarxiv icon

Undress to Redress: A Training-Free Framework for Virtual Try-On

Add code
Aug 11, 2025
Viaarxiv icon