Fig. 5. Average end-to-end delay of MCDs [ms].
Fig. 6. Average discounted reward for both DMDQ and Q-learning.
Fig. 1. AI-enabled future wireless network and services.
Fig. 2. Conceptual diagram of Q-learning operation.
Fig. 3. A typical structure of a neural network.
Fig. 4. Architecture of Deep Q-learning (Google DeepMind architecture).