Abstract: Deep Reinforcement Learning (DRL)-based Random Access (RA) schemes overcome the limitations that conventional RA protocols suffer from due to a lack of coordination among terminals, but they still face ...