Card Games


Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Add code
May 01, 2025
Viaarxiv icon

Playing Non-Embedded Card-Based Games with Reinforcement Learning

Add code
Apr 07, 2025
Viaarxiv icon

A Generalist Hanabi Agent

Add code
Mar 17, 2025
Viaarxiv icon

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

Add code
Mar 16, 2025
Viaarxiv icon

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Add code
Mar 11, 2025
Viaarxiv icon

GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks

Add code
Mar 09, 2025
Viaarxiv icon

Seeding for Success: Skill and Stochasticity in Tabletop Games

Add code
Mar 04, 2025
Viaarxiv icon

Cardiverse: Harnessing LLMs for Novel Card Game Prototyping

Add code
Feb 10, 2025
Viaarxiv icon

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Add code
Jan 28, 2025
Figure 1 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 2 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 3 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 4 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Viaarxiv icon

CG-MER: A Card Game-based Multimodal dataset for Emotion Recognition

Add code
Jan 14, 2025
Viaarxiv icon