Yao Zhao

Victor

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization

Mar 04, 2024
Via arXiv

Region-Adaptive Transform with Segmentation Prior for Image Compression

Mar 01, 2024
Via arXiv

Direct Language Model Alignment from Online AI Feedback

Feb 07, 2024
Via arXiv

LiPO: Listwise Preference Optimization through Learning-to-Rank

Feb 02, 2024
Via arXiv

EASRec: Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems

Feb 01, 2024
Via arXiv

One for all: A novel Dual-space Co-training baseline for Large-scale Multi-View Clustering

Jan 28, 2024
Via arXiv

Semi-Supervised Coupled Thin-Plate Spline Model for Rotation Correction and Beyond

Jan 24, 2024
Via arXiv

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
Via arXiv

Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy

Jan 04, 2024
Via arXiv