Inter-carrier SLA negotiation using Q-Learning
01 February 2013
Inter-domain high-performance services (e.g. telepresence) are not sustainable over the current Internet architecture. The Quality of Service (QoS) guarantees they demand require end-to-end Service Level Agreements (SLAs) to be settled among providers (also known as carriers) and across different networks. This process is critical, since it must provide the greatest benefits while accommodating heterogeneous operators' business interests and confidentiality constraints. In this paper, we propose, within a cooperative organizational model called a federation, a composition technique for inter-carrier SLAs that respects end users' QoS requirements while maximizing network operators' long-term benefits. We formulate the dynamic optimization problem first as a game and then as a Markov Decision Process (MDP). The latter formulation allows an iterative, near-optimal solution to be obtained through reinforcement learning (more precisely, Q-learning). The SLA composition is thus performed taking into account the utilities of both customers and network providers. We also propose a version that includes several negotiation rounds and observe how it affects the results.