Picture for Tianyu Liu

Tianyu Liu

Language Models over Canonical Byte-Pair Encodings

Add code
Jun 09, 2025
Viaarxiv icon

Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding

Add code
Jun 09, 2025
Figure 1 for Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
Figure 2 for Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
Figure 3 for Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
Figure 4 for Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
Viaarxiv icon

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Add code
May 19, 2025
Figure 1 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 2 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 3 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 4 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Viaarxiv icon

CellVerse: Do Large Language Models Really Understand Cell Biology?

Add code
May 09, 2025
Figure 1 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 2 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 3 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 4 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Viaarxiv icon

Towards Artificial Intelligence Research Assistant for Expert-Involved Learning

Add code
May 03, 2025
Viaarxiv icon

Deployment Optimization for XL-IRS Assisted Multi-User Communications

Add code
Apr 28, 2025
Figure 1 for Deployment Optimization for XL-IRS Assisted Multi-User Communications
Figure 2 for Deployment Optimization for XL-IRS Assisted Multi-User Communications
Figure 3 for Deployment Optimization for XL-IRS Assisted Multi-User Communications
Figure 4 for Deployment Optimization for XL-IRS Assisted Multi-User Communications
Viaarxiv icon

Physical-Layer Security in Mixed Near-Field and Far-Field Communication Systems

Add code
Apr 28, 2025
Figure 1 for Physical-Layer Security in Mixed Near-Field and Far-Field Communication Systems
Figure 2 for Physical-Layer Security in Mixed Near-Field and Far-Field Communication Systems
Figure 3 for Physical-Layer Security in Mixed Near-Field and Far-Field Communication Systems
Figure 4 for Physical-Layer Security in Mixed Near-Field and Far-Field Communication Systems
Viaarxiv icon

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

Add code
Apr 18, 2025
Figure 1 for Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
Figure 2 for Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
Figure 3 for Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
Figure 4 for Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
Viaarxiv icon

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Add code
Apr 03, 2025
Viaarxiv icon

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Add code
Feb 23, 2025
Figure 1 for CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
Figure 2 for CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
Figure 3 for CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
Figure 4 for CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models
Viaarxiv icon