Alert button
Picture for Yiming Yang

Yiming Yang

Alert button

Aligning Large Multimodal Models with Factually Augmented RLHF

Add code
Bookmark button
Alert button
Sep 25, 2023
Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell

Figure 1 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 2 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 3 for Aligning Large Multimodal Models with Factually Augmented RLHF
Figure 4 for Aligning Large Multimodal Models with Factually Augmented RLHF
Viaarxiv icon

Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation

Add code
Bookmark button
Alert button
Aug 22, 2023
Junwei Huang, Zhiqing Sun, Yiming Yang

Figure 1 for Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
Viaarxiv icon

Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation

Add code
Bookmark button
Alert button
Aug 07, 2023
Renjie Liang, Yiming Yang, Hui Lu, Li Li

Figure 1 for Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation
Figure 2 for Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation
Figure 3 for Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation
Figure 4 for Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation
Viaarxiv icon

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs

Add code
Bookmark button
Alert button
Jul 22, 2023
Qingyang Zhang, Yiming Yang, Jingqing Ruan, Xuantang Xiong, Dengpeng Xing, Bo Xu

Figure 1 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 2 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 3 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Figure 4 for Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs
Viaarxiv icon

PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification

Add code
Bookmark button
Alert button
May 24, 2023
Yau-Shian Wang, Ta-Chung Chi, Ruohong Zhang, Yiming Yang

Figure 1 for PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
Figure 2 for PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
Figure 3 for PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
Figure 4 for PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
Viaarxiv icon

Policy Representation via Diffusion Probability Model for Reinforcement Learning

Add code
Bookmark button
Alert button
May 22, 2023
Long Yang, Zhixiong Huang, Fenghao Lei, Yucun Zhong, Yiming Yang, Cong Fang, Shiting Wen, Binbin Zhou, Zhouchen Lin

Figure 1 for Policy Representation via Diffusion Probability Model for Reinforcement Learning
Figure 2 for Policy Representation via Diffusion Probability Model for Reinforcement Learning
Figure 3 for Policy Representation via Diffusion Probability Model for Reinforcement Learning
Figure 4 for Policy Representation via Diffusion Probability Model for Reinforcement Learning
Viaarxiv icon

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs

Add code
Bookmark button
Alert button
May 19, 2023
Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam

Figure 1 for Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
Figure 2 for Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
Figure 3 for Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
Figure 4 for Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs
Viaarxiv icon

Active Retrieval Augmented Generation

Add code
Bookmark button
Alert button
May 11, 2023
Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig

Figure 1 for Active Retrieval Augmented Generation
Figure 2 for Active Retrieval Augmented Generation
Figure 3 for Active Retrieval Augmented Generation
Figure 4 for Active Retrieval Augmented Generation
Viaarxiv icon

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

Add code
Bookmark button
Alert button
May 04, 2023
Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan

Figure 1 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 2 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 3 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Figure 4 for Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Viaarxiv icon

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT

Add code
Bookmark button
Alert button
Apr 24, 2023
Ruohong Zhang, Yau-Shian Wang, Yiming Yang

Figure 1 for Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT
Figure 2 for Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT
Figure 3 for Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT
Figure 4 for Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-tuned GPT
Viaarxiv icon