Picture for Yitong Zhou

Yitong Zhou

A 28nm 0.22 μJ/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semanticsegmentation

Add code
Dec 19, 2025
Viaarxiv icon

BLADE: A Behavior-Level Data Augmentation Framework with Dual Fusion Modeling for Multi-Behavior Sequential Recommendation

Add code
Dec 15, 2025
Viaarxiv icon

Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching

Add code
Nov 18, 2025
Figure 1 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 2 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 3 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Figure 4 for Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
Viaarxiv icon

Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables

Add code
Jun 13, 2025
Viaarxiv icon

Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs

Add code
Jun 12, 2025
Figure 1 for Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs
Figure 2 for Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs
Figure 3 for Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs
Figure 4 for Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs
Viaarxiv icon

Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner

Add code
Dec 30, 2024
Figure 1 for Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
Figure 2 for Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
Figure 3 for Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
Figure 4 for Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
Viaarxiv icon

3D Hand Mesh Recovery from Monocular RGB in Camera Space

Add code
May 12, 2024
Figure 1 for 3D Hand Mesh Recovery from Monocular RGB in Camera Space
Figure 2 for 3D Hand Mesh Recovery from Monocular RGB in Camera Space
Figure 3 for 3D Hand Mesh Recovery from Monocular RGB in Camera Space
Figure 4 for 3D Hand Mesh Recovery from Monocular RGB in Camera Space
Viaarxiv icon

Every Preference Changes Differently: Neural Multi-Interest Preference Model with Temporal Dynamics for Recommendation

Add code
Jul 21, 2022
Figure 1 for Every Preference Changes Differently: Neural Multi-Interest Preference Model with Temporal Dynamics for Recommendation
Figure 2 for Every Preference Changes Differently: Neural Multi-Interest Preference Model with Temporal Dynamics for Recommendation
Figure 3 for Every Preference Changes Differently: Neural Multi-Interest Preference Model with Temporal Dynamics for Recommendation
Figure 4 for Every Preference Changes Differently: Neural Multi-Interest Preference Model with Temporal Dynamics for Recommendation
Viaarxiv icon

PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest

Add code
Jul 07, 2020
Figure 1 for PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
Figure 2 for PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
Figure 3 for PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
Figure 4 for PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
Viaarxiv icon