1
Off-policy imitation-reinforcement learning for sequential recommendation
异策略模仿-强化学习序列推荐算法
No. 5, 2024 : 1349-1355
doi:10.19734/j.issn.1001-3695.2023.10.0447