Picture for Jiajun Song

Jiajun Song

VARMA-Enhanced Transformer for Time Series Forecasting

Add code
Sep 05, 2025
Viaarxiv icon

SalientFusion: Context-Aware Compositional Zero-Shot Food Recognition

Add code
Sep 04, 2025
Viaarxiv icon

Mind the Gap: The Divergence Between Human and LLM-Generated Tasks

Add code
Aug 01, 2025
Viaarxiv icon

ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

Add code
Apr 02, 2025
Viaarxiv icon

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Add code
Dec 31, 2024
Figure 1 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 2 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 3 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 4 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Viaarxiv icon

Proposing and solving olympiad geometry with guided tree search

Add code
Dec 14, 2024
Viaarxiv icon

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Add code
Dec 01, 2024
Viaarxiv icon

A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

Add code
Jul 06, 2024
Figure 1 for A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning
Figure 2 for A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning
Figure 3 for A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning
Figure 4 for A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning
Viaarxiv icon

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering

Add code
May 21, 2024
Viaarxiv icon

Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection

Add code
Feb 14, 2024
Viaarxiv icon