SLA negotiation: Experimental Observations on Learning Policies
01 October 2012
The Internet has moved to content broadcasting and one might anticipate future evolutions on the supported applications. Meanwhile, the Internet business model remains the same. While, on the technological side, many discussions assess the ossification of the Internet, the Internet traded good is still reachability. Some authors argue that the technical ossification is a consequence of the economic one. But adopting a clean-slate economic model is as challenging as adopting a clean-slate architecture. The new system must meet requirements on profitability and stability while tackling complex issues. In this paper, we focus on the proposal of enriching Service Level Agreements (SLAs), which are contracts among Network Service Providers (NSPs) with Quality of Service (QoS) information. We propose a game model of the SLA negotiation among NSPs in order to study how some learning algorithms converge to stable conditions, which are mixed Nash Equilibria in this case. Computing a mixed Nash equilibrium is PPAD-complete; the corresponding algorithms are thus quite complex. In previous works, some authors studied the convergence of Reinforcement Learning techniques to pure and mixed Nash Equilibria. Learning mixed Nash Equilibria seems harder. Hence, we rather experimentally observe how such algorithms can, according to different policies, converge to mixed Nash Equilibria, and also how profitable they are for the NSPs.