Jingren Zhou

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Nov 29, 2024

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs

Nov 14, 2024

Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Nov 05, 2024

Language Models can Self-Lengthen to Generate Long Texts

Oct 31, 2024

In-Context LoRA for Diffusion Transformers

Oct 31, 2024

Aligning Large Language Models via Self-Steering Optimization

Oct 22, 2024

Group Diffusion Transformers are Unsupervised Multitask Learners

Oct 19, 2024

AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations

Oct 17, 2024

GenSim: A General Social Simulation Platform with Large Language Model based Agents

Oct 06, 2024

Rotated Runtime Smooth: Training-Free Activation Smoother for accurate INT4 inference

Sep 30, 2024