Most current network devices have multiple network interfaces, and multipath transport protocols can use multiple network paths (e.g., WiFi and cellular) to improve the performance and reliability of network transmission. The scheduler of a multipath transport protocol determines the path on which each data packet is sent, and is a key module that affects multipath transmission. However, current multipath schedulers do not adapt well to diverse user usage scenarios. In this paper, we propose DRLMS, a deep reinforcement learning based multipath scheduler. DRLMS uses deep reinforcement learning to train neural networks that generate packet scheduling policies. It optimizes the scheduling strategy by feeding back to the neural network a reward computed from the current user usage scenario and QoS. We implemented DRLMS in the MPQUIC protocol and compared it with current multipath schedulers. The results show that DRLMS adapts to user usage scenarios significantly better than the other schedulers.
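To make the idea of a reward-driven scheduler concrete, the following is a minimal sketch and not the DRLMS implementation. It replaces the deep neural network with a small lookup table updated by an epsilon-greedy rule, and it assumes a toy reward that weights throughput against delay differently per usage scenario. The scenario names, QoS weights, path characteristics, and all function and class names are hypothetical and chosen only for illustration.

```python
import random

PATHS = ["wifi", "cellular"]

# Hypothetical per-scenario QoS weights: (throughput weight, delay weight).
QOS_WEIGHTS = {"bulk": (1.0, 0.1), "interactive": (0.2, 1.0)}

# Hypothetical fixed path characteristics: (throughput in Mbps, delay in ms).
PATH_STATS = {"wifi": (50.0, 150.0), "cellular": (10.0, 20.0)}


def reward(scenario, throughput_mbps, delay_ms):
    """Toy reward: reward throughput, penalize delay, weighted by scenario."""
    w_tp, w_delay = QOS_WEIGHTS[scenario]
    return w_tp * throughput_mbps - w_delay * delay_ms / 10.0


class TabularScheduler:
    """Epsilon-greedy value table; a stand-in for a learned neural policy."""

    def __init__(self, epsilon=0.1, lr=0.1, seed=0):
        self.q = {(s, p): 0.0 for s in QOS_WEIGHTS for p in PATHS}
        self.epsilon = epsilon
        self.lr = lr
        self.rng = random.Random(seed)

    def best_path(self, scenario):
        # Greedy choice: the path with the highest estimated reward.
        return max(PATHS, key=lambda p: self.q[(scenario, p)])

    def choose_path(self, scenario):
        # Explore a random path with probability epsilon, else exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(PATHS)
        return self.best_path(scenario)

    def update(self, scenario, path, r):
        # Move the estimate for (scenario, path) toward the observed reward.
        key = (scenario, path)
        self.q[key] += self.lr * (r - self.q[key])


def train(steps=2000):
    """Alternate between scenarios and learn a per-scenario path preference."""
    sched = TabularScheduler()
    for i in range(steps):
        scenario = "bulk" if i % 2 == 0 else "interactive"
        path = sched.choose_path(scenario)
        throughput, delay = PATH_STATS[path]
        sched.update(scenario, path, reward(scenario, throughput, delay))
    return sched


if __name__ == "__main__":
    scheduler = train()
    for name in QOS_WEIGHTS:
        print(name, "->", scheduler.best_path(name))
```

Under these assumed numbers, the scenario-dependent reward is the only thing that distinguishes the two learned preferences, which is the role the abstract assigns to the reward function. A real scheduler would observe per-packet path state rather than fixed statistics.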