Picture for Jingbo Shang

Jingbo Shang

Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions

Add code
Mar 04, 2025
Viaarxiv icon

Active Learning for Direct Preference Optimization

Add code
Mar 03, 2025
Viaarxiv icon

Orthogonal Calibration for Asynchronous Federated Learning

Add code
Feb 21, 2025
Viaarxiv icon

Self-Taught Agentic Long Context Understanding

Add code
Feb 21, 2025
Viaarxiv icon

Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent

Add code
Feb 17, 2025
Figure 1 for Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Figure 2 for Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Figure 3 for Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Figure 4 for Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
Viaarxiv icon

UltraGen: Extremely Fine-grained Controllable Generation via Attribute Reconstruction and Global Preference Optimization

Add code
Feb 17, 2025
Viaarxiv icon

Linear Correlation in LM's Compositional Generalization and Hallucination

Add code
Feb 06, 2025
Figure 1 for Linear Correlation in LM's Compositional Generalization and Hallucination
Figure 2 for Linear Correlation in LM's Compositional Generalization and Hallucination
Figure 3 for Linear Correlation in LM's Compositional Generalization and Hallucination
Figure 4 for Linear Correlation in LM's Compositional Generalization and Hallucination
Viaarxiv icon

Model-diff: A Tool for Comparative Study of Language Models in the Input Space

Add code
Dec 13, 2024
Figure 1 for Model-diff: A Tool for Comparative Study of Language Models in the Input Space
Figure 2 for Model-diff: A Tool for Comparative Study of Language Models in the Input Space
Figure 3 for Model-diff: A Tool for Comparative Study of Language Models in the Input Space
Figure 4 for Model-diff: A Tool for Comparative Study of Language Models in the Input Space
Viaarxiv icon

OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models

Add code
Oct 31, 2024
Figure 1 for OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Figure 2 for OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Figure 3 for OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Figure 4 for OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Viaarxiv icon

Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation

Add code
Oct 30, 2024
Figure 1 for Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation
Figure 2 for Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation
Figure 3 for Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation
Figure 4 for Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation
Viaarxiv icon