In this paper, a bidding strategy is proposed using reinforcement learning for HVAC systems in a double auction market. The bidding strategy does not require a specific model-based representation of behavior, i.e., a functional form to translate indoor house temperatures into bid prices. The results from reinforcement learning based approach are compared with the HVAC bidding approach used in the AEP gridSMART® smart grid demonstration project and it is shown that the model-free (learning based) approach tracks well the results from the model-based behavior. Successful use of model-free approaches to represent device-level economic behavior may help develop similar approaches to represent behavior of more complex devices or groups of diverse devices, such as in a building. Distributed control requires an understanding of decision making processes of intelligent agents so that appropriate mechanisms may be developed to control and coordinate their responses, and model-free approaches to represent behavior will be extremely useful in that quest.
Revised: October 13, 2015 |
Published: July 1, 2015
Citation
Sun Y., A. Somani, and T.E. Carroll. 2015.Learning Based Bidding Strategy for HVAC Systems in Double Auction Retail Energy Markets. In Proceedings of the American Control Conference (ACC), July 1-3, 2015, Chicago, Illinois, 2912-2917. Piscataway, New Jersey:IEEE.PNNL-SA-103474.doi:10.1109/ACC.2015.7171177