Picture for Jue Wang

Jue Wang

CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning

Add code
Apr 24, 2025
Viaarxiv icon

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models

Add code
Apr 24, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation

Add code
Apr 17, 2025
Viaarxiv icon

Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models

Add code
Apr 10, 2025
Viaarxiv icon

Enlightenment Period Improving DNN Performance

Add code
Apr 02, 2025
Viaarxiv icon

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Add code
Feb 19, 2025
Viaarxiv icon

HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting

Add code
Jan 18, 2025
Figure 1 for HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting
Figure 2 for HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting
Figure 3 for HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting
Figure 4 for HOPS: High-order Polynomials with Self-supervised Dimension Reduction for Load Forecasting
Viaarxiv icon

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

Add code
Jan 11, 2025
Figure 1 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 2 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 3 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 4 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Viaarxiv icon

Channel Charting-assisted Non-orthogonal Pilot Allocation for Uplink XL-MIMO Transmission

Add code
Dec 30, 2024
Viaarxiv icon