Multi-Objective Deep Reinforcement Learning Based Time-Frequency Resource Allocation for Multi-Beam Satellite Communications
Yuanzhi He, Biao Sheng, Hao Yin, Di Yan, Yingchao Zhang
China Communications . 2022, (1): 77 -91 .