Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems

Prabha, Viswanathan Lakshmi; Monie, Elwin Chandra

doi:10.1155/2007/65478

Research Article
Open access
Published: 04 July 2007

Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems

Viswanathan Lakshmi Prabha¹ &
Elwin Chandra Monie²

EURASIP Journal on Embedded Systems volume 2007, Article number: 065478 (2007) Cite this article

1946 Accesses
7 Citations
Metrics details

Abstract

Dynamic power management (DPM) is a technique to reduce power consumption of electronic systems by selectively shutting down idle components. In this paper, a novel and nontrivial enhancement of conventional reinforcement learning (RL) is adopted to choose the optimal policy out of the existing DPM policies. A hardware architecture evolved from the VHDL model of Temporal Difference RL algorithm is proposed in this paper, which can suggest the winner policy to be adopted for any given workload to achieve power savings. The effectiveness of this approach is also demonstrated by an event-driven simulator, which is designed using JAVA for power-manageable embedded devices. The results show that RL applied to DPM can lead up to 28% power savings.

[1 2 3 4 5 6 7 8 9 10 11]

References

Irani S, Shukala S, Gupta R: Competitive analysis of dynamic power management strategies for systems with multiple power savings states. In Tech. Rep. 01-50. University of Irvine, Irvine, Calif, USA; 2001.
Google Scholar
Benini L, Bogliolo A, Paleologo GA, de Micheli G: Policy optimization for dynamic power management. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 1999,18(6):813-833. 10.1109/43.766730
Article Google Scholar
Lu Y-H, Simunic T, de Micheli G: Software controlled power management. Proceedings of the 7th International Workshop on Hardware/Software Codesign (CODES '99), May 1999, Rome, Italy 157-161.
Google Scholar
Qiu Q, Pedram M: Dynamic power management based on continuous-time Markov decision processes. Proceedings of the 36th Annual Design Automation Conference (DAC '99), June 1999, New Orleans, La, USA 555-561.
Google Scholar
Lu Y-H, de Micheli G: Comparing system-level power management policies. IEEE Design and Test of Computers 2001,18(2):10-19. 10.1109/54.914592
Article Google Scholar
Shukla SK, Gupta RK: A model checking approach to evaluating system level dynamic power management policies for embedded systems. Proceedings of the 6th IEEE International High-Level Design Validation and Test Workshop, September 2001, Monterey, Calif, USA 53-57.
Chapter Google Scholar
Watts C, Ambatipudi R: Dynamic energy management in embedded systems. Computing & Control Engineering 2003,14(5):36-40. 10.1049/cce:20030508
Article Google Scholar
Chung E-Y, Benini L, de Micheli G: Dynamic power management using adaptive learning tree. Proceedings of the IEEE/ACM International Conference on Computer-Aided Design (ICCAD '99), November 1999, San Jose, Calif, USA 274-279.
Google Scholar
Sutton RS, Barto AG: Reinforcement Learning: An Introduction. MIT Press, Cambridge, UK; 1998.
Google Scholar
Ribeiro CHC: A tutorial on reinforcement learning techniques. In Proceedings of International Conference on Neural Networks, July 1999, Washington, DC, USA. INNS Press;
Google Scholar
Johnson RA: Probability and Statistics for Engineers. Prentice-Hall, Englewood Cliffs, NJ, USA; 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, Government College of Technology, Coimbatore, Tamil Nadu, 641-013, India
Viswanathan Lakshmi Prabha
Thanthai Periyar Government Institute of Technology TPGIT, Vellore, Tamil Nadu, 632002, India
Elwin Chandra Monie

Authors

Viswanathan Lakshmi Prabha
View author publications
You can also search for this author in PubMed Google Scholar
Elwin Chandra Monie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Viswanathan Lakshmi Prabha.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Prabha, V.L., Monie, E.C. Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems. J Embedded Systems 2007, 065478 (2007). https://doi.org/10.1155/2007/65478

Download citation

Received: 06 July 2006
Revised: 07 November 2006
Accepted: 28 May 2007
Published: 04 July 2007
DOI: https://doi.org/10.1155/2007/65478

Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords