Picture for Bo Wu

Bo Wu

Dima

LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

Add code
Mar 23, 2026
Viaarxiv icon

Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance

Add code
Mar 08, 2026
Viaarxiv icon

MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

Add code
Feb 12, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Improved Physics-Driven Neural Network to Solve Inverse Scattering Problems

Add code
Dec 10, 2025
Viaarxiv icon

Investigating Student Interaction Patterns with Large Language Model-Powered Course Assistants in Computer Science Courses

Add code
Sep 10, 2025
Viaarxiv icon

Results of the NeurIPS 2023 Neural MMO Competition on Multi-task Reinforcement Learning

Add code
Aug 17, 2025
Viaarxiv icon

LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Trainin

Add code
May 29, 2025
Viaarxiv icon

R^3-VQA: "Read the Room" by Video Social Reasoning

Add code
May 07, 2025
Figure 1 for R^3-VQA: "Read the Room" by Video Social Reasoning
Figure 2 for R^3-VQA: "Read the Room" by Video Social Reasoning
Figure 3 for R^3-VQA: "Read the Room" by Video Social Reasoning
Figure 4 for R^3-VQA: "Read the Room" by Video Social Reasoning
Viaarxiv icon

WLTCL: Wide Field-of-View 3-D LiDAR Truck Compartment Automatic Localization System

Add code
Apr 26, 2025
Viaarxiv icon