Picture for Luo Ji

Luo Ji

Learn-to-learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-Gated LLM

Add code
May 03, 2026
Viaarxiv icon

Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

Add code
Apr 25, 2026
Viaarxiv icon

Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling

Add code
Aug 23, 2025
Figure 1 for Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Figure 2 for Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Figure 3 for Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Figure 4 for Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Viaarxiv icon

Convert Language Model into a Value-based Strategic Planner

Add code
May 11, 2025
Figure 1 for Convert Language Model into a Value-based Strategic Planner
Figure 2 for Convert Language Model into a Value-based Strategic Planner
Figure 3 for Convert Language Model into a Value-based Strategic Planner
Figure 4 for Convert Language Model into a Value-based Strategic Planner
Viaarxiv icon

FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations

Add code
Apr 16, 2025
Figure 1 for FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations
Figure 2 for FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations
Figure 3 for FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations
Figure 4 for FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations
Viaarxiv icon

Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval

Add code
Dec 21, 2024
Figure 1 for Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Figure 2 for Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Figure 3 for Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Figure 4 for Large Language Model Can Be a Foundation for Hidden Rationale-Based Retrieval
Viaarxiv icon

Multi-Party Supervised Fine-tuning of Language Models for Multi-Party Dialogue Generation

Add code
Dec 06, 2024
Viaarxiv icon

Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking

Add code
Sep 18, 2024
Figure 1 for Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking
Figure 2 for Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking
Figure 3 for Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking
Figure 4 for Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking
Viaarxiv icon

Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence

Add code
Sep 11, 2024
Viaarxiv icon

A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio

Add code
Sep 10, 2024
Figure 1 for A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Figure 2 for A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Figure 3 for A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Figure 4 for A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio
Viaarxiv icon