Retail pricing for stochastic demand with unknown parameters: An online machine learning approach

The problem of dynamically pricing of electricity by a retailer for customers in a demand response program is considered. It is assumed that the retailer obtains electricity in a two-settlement wholesale market consisting of a day ahead market and a real-time market. Under a day ahead dynamic pricing mechanism, the retailer aims to learn the aggregated demand function of its customers while maximizing its retail profit. A piecewise linear stochastic approximation algorithm is proposed. It is shown that the accumulative regret of the proposed algorithm grows with the learning horizon T at the order of O(log T). It is also shown that the achieved growth rate cannot be reduced by any piecewise linear policy.

[1]  Lang Tong,et al.  Day ahead dynamic pricing for demand response in dynamic environments , 2013, 52nd IEEE Conference on Decision and Control.

[2]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[3]  D. Hearn,et al.  Mathematical and Computational Models for Congestion Charging , 2006 .

[4]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 1985 .

[5]  Josef Broder,et al.  Dynamic Pricing Under a General Parametric Choice Model , 2012, Oper. Res..

[6]  M. Carrion,et al.  Forward Contracting and Selling Price Determination for a Retailer , 2007, IEEE Transactions on Power Systems.

[7]  John N. Tsitsiklis,et al.  Linearly Parameterized Bandits , 2008, Math. Oper. Res..

[8]  H. Robbins,et al.  Iterated least squares in multiperiod control , 1982 .

[9]  John Douglas Birdwell,et al.  Residential air conditioner dynamic model for direct load control , 1988 .

[10]  C. Goldman,et al.  Demand Response from Day-Ahead Hourly Pricing for Large Customers , 2006 .

[11]  Zita Vale,et al.  Cost dependent strategy for electricity markets bidding based on adaptive reinforcement learning , 2011, 2011 16th International Conference on Intelligent System Applications to Power Systems.

[12]  T. Lai,et al.  Extended least squares and their applications to adaptive control and prediction in linear systems , 1986 .

[13]  Frank Thomson Leighton,et al.  The value of knowing a demand curve: bounds on regret for online posted-price auctions , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[14]  S. Boyd,et al.  Pricing and learning with uncertain demand , 2003 .

[15]  Peter Auer,et al.  Improved Rates for the Stochastic Continuum-Armed Bandit Problem , 2007, COLT.

[16]  S. Borenstein,et al.  Dynamic Pricing, Advanced Metering, and Demand Response in Electricity Markets , 2002 .

[17]  D. Bertsimas,et al.  Dynamic Pricing ; A Learning Approach , 2001 .

[18]  Robert D. Kleinberg Nearly Tight Bounds for the Continuum-Armed Bandit Problem , 2004, NIPS.

[19]  Eric W. Cope,et al.  Regret and Convergence Bounds for a Class of Continuum-Armed Bandit Problems , 2009, IEEE Transactions on Automatic Control.

[20]  S. Borenstein The Long-Run Efficiency of Real-Time Electricity Pricing , 2005 .

[21]  Geert Deconinck,et al.  Comparison of Two Learning Algorithms in Modelling the Generator's Learning Abilities , 2009, 2009 15th International Conference on Intelligent System Applications to Power Systems.

[22]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[23]  Jhi-Young Joo,et al.  Multi-Layered Optimization Of Demand Resources Using Lagrange Dual Decomposition , 2013, IEEE Transactions on Smart Grid.

[24]  R. Agrawal The Continuum-Armed Bandit Problem , 1995 .

[25]  A. Rahimi-Kian,et al.  Q-learning based supplier-agents for electricity markets , 2005, IEEE Power Engineering Society General Meeting, 2005.

[26]  H. Robbins,et al.  Adaptive Design and Stochastic Approximation , 1979 .

[27]  Lang Tong,et al.  Optimal pricing for residential demand response: A stochastic optimization approach , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[28]  A. Conejo,et al.  Optimal Involvement in Futures Markets of a Power Producer , 2008, IEEE Transactions on Power Systems.