Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess