Mastering complex card games like GuanDan using self-play RL and sequence modeling with zero domain knowledge
reinforcement-learning deep-learning q-learning pytorch transformer card-game reinforcement-learning-algorithms representation-learning language-model game-ai doudizhu reinforcement-learning-agent self-play sequence-modeling q-learning-algorithm doudizhu-ai guandan guandan-ai
-
Updated
Apr 3, 2026 - Python