Improved Fuzzy Reinforcement Learning for Self-Optimisation of Heterogeneous Wireless Networks
06 May 2013
This paper presents a novel scheme to improve learning mechanism for future self-organising networks' functionalities using a combination of fuzzy logic and reinforcement learning. Although the two frameworks compliment each other well, an efficient reward distribution mechanism needs to be deployed or otherwise the learning performance may be degraded. This study introduce an improved reward distribution (IRD) scheme in that the action space is abstracted to represent only the actions that are most relevant to the final crisp executed action after defuzzification. As a case study, coverage and capacity optimisation of heterogeneous networks consisting of dense deployment of small cells is considered. Using the proposed method, simulation results confirm considerable algorithm's performance in terms of learning efficiency and convergence time.