<?xml version="1.0" encoding="US-ASCII"?>

<!DOCTYPE rfc SYSTEM "rfc2629.dtd" [
<!ENTITY rfc2119 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.2119.xml">
<!ENTITY rfc7575 PUBLIC "" "http://xml.resource.org/public/rfc/bibxml/reference.RFC.7575.xml">
]>

<rfc category="info" docName="draft-kim-nmrg-rl-00" ipr="trust200902">
   
  <?xml-stylesheet type='text/xsl' href='rfc2629.xslt' ?>
  <!-- used by XSLT processors -->
  <!-- For a complete list and description of processing instructions (PIs),
   please see http://xml.resource.org/authoring/README.html. -->
  <!-- Below are generally applicable Processing Instructions (PIs) that most I-Ds might want to use.
   (Here they are set differently than their defaults in xml2rfc v1.32) -->
  <?rfc strict="yes" ?>
  <!-- give errors regarding ID-nits and DTD validation -->
  <!-- control the table of contents (ToC) -->
  <?rfc toc="yes"?>
  <!-- generate a ToC -->
  <?rfc tocdepth="4"?>
  <!-- the number of levels of subsections in ToC. default: 3 -->
  <!-- control references -->
  <?rfc symrefs="yes"?>
  <!-- use symbolic references tags, i.e, [RFC2119] instead of [1] -->
  <?rfc sortrefs="no" ?>
  <!-- sort the reference entries alphabetically -->
  <!-- control vertical white space
   (using these PIs as follows is recommended by the RFC Editor) -->
  <?rfc compact="no" ?>
  <!-- do not start each main section on a new page -->
  <?rfc subcompact="no" ?>
  <!-- keep one blank line between list items -->
  <!-- end of list of popular I-D processing instructions -->

  <!-- ***** FRONT MATTER ***** -->
  <front>
      <!-- The abbreviated title is used in the page header - it is only necessary if the
       full title is longer than 39 characters -->

    <title abbrev="Network RL">Intelligent Management using Collaborative Reinforcement Multi-agent System</title>

    <!-- add 'role="editor"' below for the editors if appropriate -->
    <!-- Another author who claims to be an editor -->
    <author fullname="Min-Suk Kim" initials="M-S" surname="Kim">
        <organization abbrev="ETRI">ETRI</organization>
        <address>
            <postal>
                <street>161 Gajeong-Dong Yuseung-Gu</street>
                <!-- Reorder these if your country does things differently -->
                <city>Daejeon</city>
                <region></region>
                <code>305-700</code>
                <country>Korea</country>
            </postal>
            <phone>+82 42 860 5930</phone>
            <email>mskim16@etri.re.kr</email>
            <!-- uri and facsimile elements may also be added -->
        </address>
    </author>

  	<author fullname="Yong-Geun Hong" initials="Y-G" surname="Hong">
    		<organization abbrev="ETRI">ETRI</organization>
    		<address>
            <postal>
                <street>161 Gajeong-Dong Yuseung-Gu</street>
                <!-- Reorder these if your country does things differently -->
                <city>Daejeon</city>
                <region></region>
                <code>305-700</code>
                <country>Korea</country>
            </postal>
            <phone>+82 42 860 6557</phone>
            <email>yghong@etri.re.kr</email>
            <!-- uri and facsimile elements may also be added -->
        </address>
  	</author>

    <author fullname="Youn-Hee Han" initials="Y-H" surname="Han">
        <organization abbrev="KoreaTec">KoreaTec</organization>

        <address>
            <postal>
                <street>Byeongcheon-myeon Gajeon-ri, Dongnam-gu</street>
                <street></street>
                <street>Choenan-si, Chungcheongnam-do</street>
                <street></street>
                <code>330-708</code>
                <country>Korea</country>
            </postal>
            <phone>+82 41 560 1486</phone>
            <email>yhhan@koreatec.ac.kr</email>
        </address>
    </author>

    <date month="July" year="2017" />
    <!-- If the month and year are both specified and are the current ones, xml2rfc will fill
     in the current day for you. If only the current year is specified, xml2rfc will fill
	 in the current day and month for you. If the year is not the current one, it is
	 necessary to specify at least a month (xml2rfc assumes day="1" if not specified for the
	 purpose of calculating the expiry date).  With drafts it is normally sufficient to
	 specify just the year. -->
    
    <!-- Meta-data Declarations -->
    <area>Internet</area>
    
    <workgroup>Network Management Research Group</workgroup>
    
    <!-- WG name at the upperleft corner of the doc,
     IETF is fine for individual submissions.
	 If this element is not present, the default is "Network Working Group",
     which is used by the RFC Editor as a nod to the history of the IETF. -->
    
    <keyword>Internet Draft</keyword>
    
    <!-- Keywords will be incorporated into HTML output
     files in a meta tag but they have no effect on text or nroff
     output. If you submit your draft to the RFC Editor, the
     keywords will be used for the search engine. -->

    <abstract>
      <t>This document describes an intelligent reinforcement learning agent system to autonomously manage agent path-planning over a communication network. The main centralized node called by the global environment should not only manage all agents workflow in a hybrid peer-to-peer networking architecture and, but transfer and share information in distributed nodes. All agents in distributed nodes are able to be provided with a cumulative reward for each action that a given agent takes with respect to an optimized knowledge based on a to-be-learned policy over the learning process. A reward from the global environment is reflected to the next optimized action for autonomous path management in distributed networking nodes.</t>
    </abstract>
  </front>

  <middle>
    <section title="Introduction">
		<t>In large infrastructures such as transportation, health and energy systems, collaborative monitoring system is needed, where there are special needs for intelligent distributed networking systems with learning schemes.
			 Agent Reinforcement Learning (RL) for autonomous network management, in general, is one of the challengeable methods in a dynamic complex cluttered environment over a network. The goal for autonomous network management using RL is self-management to manage optimized agent work-flow without minimal human dependency by learning process <xref target="RFC7575"></xref>. 
			 The system is needed by the development of computational multi-agents learning process in large distributed networking nodes, where the agents have limited and incomplete knowledge, and they only access local information in distributed networking nodes.</t>

    <t>Reinforcement Learning can become an effective technique to transfer and share information among agents, as it does not require a priori knowledge of the agent behavior or environment to accomplish its tasks [Megherbi]. 
    	 Such a knowledge is usually acquired and learned automatically and autonomously by trial and error.</t>

    <t>Reinforcement Learning is Machine Learning techniques that will be adapted to the various networking environments for automatic networks[I-D.jiang-nmlrg-network-machine-learning].
    	 Thus, this document provides motivation, learning technique, and use case for network machine learning.</t>    
    </section>

    <section title="Conventions and Terminology">
      <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in <xref target="RFC2119"></xref>.</t>
    </section>

  <section title="Motivation">

		<section title="General Motivation for Reinforcement Learning">
	    	<t>Reinforcement Learning is a system capable of autonomous acquirement and incorporation of knowledge. 
	    		It can continuously self-improve learning process with experience and attempts to maximize cumulative reward to manage an optimized learning knowledge by multi-agents-based monitoring systems[Teiralbar]. 
	    		The maximized reward can be increasingly optimizing of learning speed for agent autonomous learning process.
	      </t>
    </section>

		<section title="Reinforcement Learning in networks">
	    	<t>Reinforcement learning is an emerging technology in terms of monitoring and managing network system to achieve fair resource allocation for nodes within the wire or wireless mesh setting. 
	    		Monitoring parameters of the network and adjusts based on the network dynamics can demonstrate to improve fairness in wireless environment Infrastructures and Resources [Nasim]. 
	    		The fundamental goal for Reinforcement Learning is self-management, which is comprised of a couple of properties such as self-healing (adaptive function in the environment and heal problems automatically) and self-optimizing (function for automatically determine ways to optimize their behavior against a set of well-defined goals) <xref target="RFC7575"></xref>.
	    	</t>
   	</section>

		<section title="Motivation in our work">
	    	<t>There are many different networking management issues such as connectivity, traffic management, fast internet without latency and etc. 
	    		We expect that ml-based mechanism such as reinforcement learning will provide solutions of networking issues with multiple cases against human operating capacities even if it is a challengeable area due to a multitude of reasons such as large state space search, complexity in giving reward, difficulty in agent action selection, and difficulty in sharing and merging learned information among the agents in a distributed memory node to be transferred over a communication network [Minsuk].
	    	</t>
    </section>
	</section>

	<section title="Related Works">

    <section title="Autonomous Driving System"> 	
	    	<t>Autonomous vehicle is capable of self-management for automotive driving without human supervision depending on optimized trust region policy by reinforcement learning that enables learning of more complex and special network management environment. 
	    		Such a vehicle provides a comfortable user experience safely and reliably on interactive communication network [April] [Markus].
	    	</t>
    </section>
    
    <section title="Game Theory"> 	
	    	<t>The adaptive multi-agent system, which is combined with complexities from interacting game player, has developed in a field of reinforcement learning. 
	    		In the early game theory, the interdisciplinary work was only focused on competitive games, but Reinforcement Learning has developed into a general framework for analyzing strategic interaction and has been attracted field as diverse as psychology, economics and biology [Ann]. 
	    	</t>
	    	
	    	<t>AlphaGo is also one of the game theories using reinforcement learning, developed by Google DeepMind.  Even though it began as a small learning computational program with some simple actions, it has now trained on a policy and value networks of thirty million actions, states and rewards for optimal management using learning process.
	    	</t>
    </section>
    
    <section title="Wireless Sensor Network (WSN)"> 	
	    	<t>Wireless sensor network (WSN) consists of a large number of sensors and sink nodes for monitoring systems to manage event parameters such as temperature, humidity, air conditioning, etc. 
	    		Reinforcement learning in WSNs has been applied in a wide range of schemes such as cooperative communication, routing and rate control. 
	    		The sensors and sink nodes are able to observe and carry out optimal actions on their respective operating environment for network and application performance enhancements [Kok-Lim].
	    	</t>
    </section>
    
    <section title="Routing Enhancement"> 	
	    	<t>Reinforcement Learning is used to enhance multicast routing protocol in wireless ad hoc networks, where each node has different capability. 
	    		Routers in the multicast routing protocol are determined to discover optimal route with a predicted reward, and then the routers create the optimal path with multicast transmissions to reduce the overhead in Reinforcement Learning[Kok-Lim].
	    	</t>
    </section>  	
  </section>
  
  <section title="Multi-agent Reinforcement Learning Technologies">
		<section title="Reinforcement Learning">
	    	<t>Agent reinforcement Learning is ml-based unsupervised algorithms based on an agent learning process. 
	    		Reinforcement Learning is normally used with a reward from centralized node (the global environment), and capable of autonomous acquirement and incorporation of knowledge. 
	    		It is continuously self-improving and becoming more efficient as the learning process from an agent experience to optimize management performance for autonomous learning process.[Sutton][Madera]
	    	</t>
    </section>
    
    <section title="Policy using Distance and Frequency">
	    	<t>Distance and Frequency algorithm uses the state occurrence frequency in addition to the distance to goal.
	    		 It avoids deadlocks and lets the agent escape the Dead, and it was derived to enhance agent optimal learning speed.
	    		 Distance-and-Frequency is based on more levels of agent visibility to enhance learning algorithm by an additional way that uses the state occurrence frequency.[Al-Dayaa]
	    	</t>
    </section>
    
    <section title="Distributed Computing Node">
	    	<t>Autonomous multi-agent learning process for network management environment is related to transfer optimized knowledge between agents on a given local node or distributed memory nodes over a communication network.
	    	</t>
    </section>
    
    <section title="Agent Sharing Information">
	    	<t>This is a technique how agents can share information for optimal learning process. The quality of agent decision making often depends on the willingness of agents to share a given learning information collected by agent learning process. 
	    		 Sharing Information means that an agent would share and communicate the knowledge learned and acquired with or to other agents using reinforcement learning.
	    	</t>
	    	
	    	<t>Agents normally have limited resources and incomplete knowledge during learning exploration. 
	    		 For that reason, the agents should take actions and transfer the states to the global environment under reinforcement learning, then it would share the information with other agents, where all agents explore to reach their destination via a distributed reinforcement reward-based learning method on the existing local distributed memory nodes.
	    	</t>
	    	
	    	<t>MPI (Message Passing Interface) is used for communication way. 
	    		 Even if the agents do not share the capabilities and resources to monitor an entire given large terrain environment, they are able to share the needed information to manage collaborative learning process for optimized management in distributed networking nodes.[Chowdappa][Minsuk]
	    	</t>
    </section>
    
    <section title="Sub-goal Selection">
	    	<t>A new technical method for agent sub-goal selection in distributed nodes is introduced to reduce the agent initial random exploration with a given selected sub-goal.
	    	</t>
	    	
	    	<t>[TBD]
	    	</t>
    </section>   
  </section>

		<section title="Proposed Architecture for Reinforcement Learning">
			<t>The architecture using Reinforcement Learning describes a collaborative multi-agent-based system in distributed environments as shown in figure 1, where the architecture is combined with a hybrid architecture making use of both a master and slave architecture and a peer-to-peer. 
				 The centralized node(global environment), assigns each slave computing node a portion of the distributed terrain and an initial number of agents.
			</t>
			
			<figure anchor="rl-fig1"
                    title="Hybrid P2P and Master/Slave Architecture Overview">
                    <artwork align="center">


+-------------+                           +-----------------+
|             |&lt;......>| node 1 |&lt;.......>|    terrain 1    |
|             |                           +-----------------+
| Global env. |
|  (node 0)   |                           +-----------------+
|             |&lt;......>| node 2 |&lt;.......>|    terrain 2    |
+-------------+                           +-----------------+
                    </artwork>
                    <postamble></postamble>
            </figure>

			<t>Reinforcement Learning actions involve interacting with a given environment, so the environment provides an agent learning process with the elements as followings:
			</t>
			
			<t>
			<list style="symbols">
				<t>Agent actions, states and cumulative rewards
				</t>
				<t>One or more obstacles, and goals
				</t>
				<t>Initially, random exploration in a given node
				</t>				
				<t>Next, optimal explorations under reinforcement learning
				</t>					
			</list>
		  </t>
			
			<t>Additionally, agent actions with states toward its goal as below:
			</t>
			
			<t>
			<list style="symbols">
				<t>Agent continuously actions to avoid an obstacle based on its policy and move to one or more available positions until it reaches its goal(s)
				</t>
				<t>After an agent reaches its destination, it can use the information collected by initial random learning process to next learning process for optimal management
				</t>
				<t>Agent learning process is optimized in the following phase and exploratory learning trials
				</t>									
			</list>
		  </t>
		  
		  <t>In shown as Figure2, we illustrate the fundamental architecture for relationship of an action, state and reward, and each agent explores to reach its destination(s) under reinforcement learning. 
		  	 The agent does an action that leads to a reward from achieving an optimal path toward its goal. Our works will be extended depending on the architecture.
			</t>
			
			<figure anchor="rl-fig2"
                    title="RL work-flow Overview">
                    <artwork align="center">

                                   +---------------------+
       ....state and reward........+ Global Environment +|&lt;............
       .                           +---------------------+            .
+------+------+                                                       .
| Multi-agent |                                                       .
+------+------+                    +---------------+                  .
       ............action.........>+  Destiantion  +...................
                                   +---------------+
                    </artwork>
                    <postamble></postamble>
            </figure>         
   	</section>

		<section title="Use case of Multi-agent Reinforcement Learning">
			<section title="Distributed Multi-agent Reinforcement Learning: Sharing Information Technique"> 	
	    	<t>In this section, we deal with case of a collaborative distributed multi-agent, where each agent has same or different individual destination in a distributed environment. 
	    		Since sharing information scheme among the agents is problematic one, we need to expand on the work described by solving the challenging cases.
	    	</t>
	    	
	    	<t>Basically, the main proposed algorithm is presented by distributed multi-agent reinforcement learning as below:
	    	</t>
	    	
	    	<texttable anchor="main_algorithm" title="Proposed Algorithm">
    			<ttcol align='left'>Proposed Algorithm</ttcol>
    			<c>(1) Let Ni denote the number of node (i= 1, 2, 3 ...)</c>
    			<c>(2) Let Aj denote the number of agent</c>
    			<c>(3) Let Dk denote the number of destination</c>
    			<c>(4) Place initial number of agents Aj, in random position (Xm, Yn)</c>
    			<c>(5) Every Aj in Ni</c>
    			<c>-----> (a) Do initial exploration (random) to corresponding Dk</c>
    			<c>-----> (b) Do exploration (using RL) for Tx denote the number of trial</c>
    		</texttable>
	    	
	    	<texttable anchor="random_algorithm" title="Random Trial">
    			<ttcol align='left'>Random Trial</ttcol>
    			<c>(1) Let Si denote the the current state</c>
    			<c>(2) Relinquish Si so that the other agent can occupy the position</c>
    			<c>(3) Assign the agent new position</c>
    			<c>(4) Update the current state Si -> Si+1</c>
     		</texttable>
    		
    		<texttable anchor="optimal_algorithm" title="Optimal Trial">
    			<ttcol align='left'>Optimal Trial</ttcol>
    			<c>(1) Let Si denote the the current state</c>
    			<c>(2) Let ACj denote an action</c>
    			<c>(3) Let DRm denote discount reward</c>
    			<c>(4) Choose ACj &lt;- Policy(Si, ACj)</c>
    			<c>(5) Move an available posiion</c>
    			<c>(6) Update learning process in the global environment</c>
    			<c>(7) Update the current state Si &lt; Si+1</c>
    		</texttable>
    		
    		<t>Multi-agent reinforcement learning in distributed nodes can improve the overall system performance to transfer or share information from one node to another node in following cases; expanded complexity in RL technique with various experimental factors and conditions, analyzing multi-agent sharing information for agent learning process.
	    	</t>
	    	</section>
	    	
	    	<section title="Use case of Shortest Path-planning via sub-goal selection">
	    	<t>Sub-goal selection is a scheme of a distributed multi-agent RL technique based on selected intermediary agent sub-goal(s) with the aim of reducing the initial random trial. 
	    		The scheme is to improve the multi-agent system performance with asynchronously triggered exploratory phase(s) with selected agent sub-goal(s) for autonomous network management.
	    	</t>
	    	
	    	<t>[TBD]
	    	</t>
	    	</section>	
    </section>
    	
	  <section anchor="IANA" title="IANA Considerations">
		<t>There are no IANA considerations related to this document.</t>
	  </section>

    <section title="Security Considerations">
    	<t>[TBD]</t>
    </section>

<section anchor="Acknowledgements" title="Acknowledgements">
      <t>David Meyerm, who chief scientist and VP in Brocade, has provided significant comment and feedback for the draft.</t>
      
    </section>
    
</middle>

<!--  *****BACK MATTER ***** -->
<!--  *****BACK MATTER ***** -->
<back>
    <!-- References split into informative and normative -->
    
    <!-- There are 2 ways to insert reference entries from the citation libraries:
     1. define an ENTITY at the top, and use "ampersand character"RFC2629; here (as shown)
     2. simply use a PI "less than character"?rfc include="reference.RFC.2119.xml"?> here
     (for I-Ds: include="reference.I-D.narten-iana-considerations-rfc2434bis.xml")
     
     Both are cited textually in the same manner: by using xref elements.
     If you use the PI option, xml2rfc will, by default, try to find included files in the same
     directory as the including file. You can also define the XML_LIBRARY environment variable
     with a value containing a set of directories to search.  These can be either in the local
     filing system or remote ones accessed by http (http://domain/dir/... ).-->

    <references title="Normative References">

      &rfc2119;
      &rfc7575;
           
    </references>
    
 <references title="Informative References">

  <reference anchor="I-D.jiang-nmlrg-network-machine-learning">
      <front>
        <title>Network Machine Learning</title>

        <author fullname="Sheng Jiang" initials="S." surname="Jiang">
          <organization>Huawei Technologies Co., Ltd</organization>
          </author>
       
        <date month="October" year="2016" />
      </front>

      <seriesInfo name="ID" value="draft-jiang-nmlrg-network-machine-learning-02" />
    </reference> 
	    
	<reference anchor='Megherbi'>
        <front>
            <title>Megherbi, D. B., Kim, Minsuk, Madera, Manual., "A Study of Collaborative Distributed Multi-Goal and Multi-agent based Systems for Large Critical Key Infrastructures and Resources (CKIR) Dynamic Monitoring and Surveillance", IEEE International Conference on Technologies for Homeland Security</title>
            <author></author>
            <date month='' year='2013' />
        </front>        
    </reference>

	<reference anchor='Teiralbar'>
        <front>
            <title>Megherbi, D. B., Teiralbar, A. Boulenouar, J., "A Time-varying Environment Machine Learning Technique for Autonomous Agent Shortest Path Planning.", Proceedings of SPIE International Conference on Signal and Image Processing, Orlando, Florida</title>
            <author></author>
            <date month='' year='2001' />
        </front>        
    </reference>
    
	<reference anchor='Nasim'>
        <front>
            <title>Nasim ArianpooEmail, Victor C.M. Leung, "How network monitoring and reinforcement learning can improve tcp fairness in wireless multi-hop networks", EURASIP Journal on Wireless Communications and Networking</title>
            <author></author>
            <date month='' year='2016' />
        </front>        
    </reference>    

	<reference anchor='Minsuk'>
        <front>
            <title>Dalila B. Megherbi and Minsuk Kim, "A Hybrid P2P and Master-Slave Cooperative Distributed Multi-Agent Reinforcement Learning System with Asynchronously Triggered Exploratory Trials and Clutter-index-based Selected Sub goals", IEEE CIG Conference</title>
            <author></author>
            <date month='' year='2016' />
        </front>        
    </reference>  
    
	<reference anchor='April'>
        <front>
            <title>April Yu, Raphael Palefsky-Smith, Rishi Bedi, "Deep Reinforcement Learning for Simulated Autonomous Vehicle Control", Stanford University</title>
            <author></author>
            <date month='' year='2016' />
        </front>        
    </reference>     
    
	<reference anchor='Markus'>
        <front>
            <title>Markus Kuderer, Shilpa Gulati, Wolfram Burgard, "Learning Driving Styles for Autonomous Vehicles from Demonstration", Robotics and Automation (ICRA)</title>
            <author></author>
            <date month='' year='2015' />
        </front>        
    </reference>         

	<reference anchor='Ann'>
        <front>
            <title>Ann Nowe, Peter Vrancx, Yann De Hauwere, "Game Theory and Multi-agent Reinforcement Learning", In book: Reinforcement Learning: State of the Art, Edition: Adaptation, Learning, and Optimization Volume 12</title>
            <author></author>
            <date month='' year='2012' />
        </front>        
    </reference>         

	<reference anchor='Kok-Lim'>
        <front>
            <title>Kok-Lim Alvin Yau, Hock Guan Goh, David Chieng, Kae Hsiang Kwong, "Application of reinforcement learning to wireless sensor networks: models and algorithms", Published in Journal Computing  archive Volume 97 Issue 11, Pages 1045-1075</title>
            <author></author>
            <date month='November' year='2015' />
        </front>        
    </reference>   

	<reference anchor='Sutton'>
        <front>
            <title>Sutton, R. S., Barto, A. G., "Reinforcement Learning: an Introduction", MIT Press</title>
            <author></author>
            <date month='' year='1998' />
        </front>        
    </reference> 
    
	<reference anchor='Madera'>
        <front>
            <title>Madera, M., Megherbi, D. B., "An Interconnected Dynamical System Composed of Dynamics-based Reinforcement Learning Agents in a Distributed Environment: A Case Study", Proceedings IEEE International Conference on Computational Intelligence for Measurement Systems and Applications, Italy</title>
            <author></author>
            <date month='' year='2012' />
        </front>        
    </reference>     
    
	<reference anchor='Al-Dayaa'>
        <front>
            <title>Al-Dayaa, H. S., Megherbi, D. B., "Towards A Multiple-Lookahead-Levels Reinforcement-Learning Technique and Its Implementation in Integrated Circuits", Journal of Artificial Intelligence, Journal of Supercomputing. Vol. 62, issue 1, pp. 588-61</title>
            <author></author>
            <date month='' year='2012' />
        </front>        
    </reference>     

<reference anchor='Chowdappa'>
        <front>
            <title>Chowdappa, Aswini., Skjellum, Anthony., Doss, Nathan, "Thread-Safe Message Passing with P4 and MPI", Technical Report TR-CS-941025, Computer Science Department and NSF Engineering Research Center, Mississippi State University</title>
            <author></author>
            <date month='' year='1994' />
        </front>        
    </reference>    
<!--
 &id.draft-winter-energy-efficient-internet;
 &id.draft-cheshire-edns0-owner-option;
    <reference anchor='ITU'>
        <front>
            <title>Resolution 73 - Information and communication technologies and climate change</title>
            <author></author>
            <date month='October' year='2008' />
        </front>
        </reference>

    <reference anchor='EPC'>
        <front>
            <title>The Case for Energy-Proportional Computing</title>
            <author initials='L.' surname='Barroso' fullname='Luiz Andre Barroso'></author>
            <author initials='U.' surname='Holzle' fullname='Urs Holzle'></author>
            <date month='December' year='2007'/>
        </front>
        <seriesInfo name='Proc. IEEE International Conference on Network Protocols (ICNP)' value=''/>
    </reference>

	<reference anchor='GreenSurvey'>
        <front>
            <title>A survey of green networking research</title>
            <author initials='A.P.' surname='Bianzino' fullname='Aruna Prem Bianzino'></author>
            <author initials='C.' surname='Chaudet' fullname='Claude Chaudet'></author>
            <author initials='D.' surname='Rossi' fullname='Dario Rossi'></author>
            <author initials='J.-L.' surname='Rougier' fullname='Jean-Louis Rougier'></author>            <date month='' year='2012' />
        </front>
        <seriesInfo name='IEEE Communications Surveys Tutorials' value='' />
    </reference>

    <reference anchor='EEE'>
        <front>
            <title>802.3az-2010</title>
            <author></author>
            <date month='' year='2010' />
        </front>
        <seriesInfo name='IEEE std' value='' />
    </reference>
    
    <reference anchor='PROXZZZY'>
        <front>
            <title>ProxZZZy for sleeping hosts</title>
            <author></author>
            <date month='June' year='2012' />
        </front>
        <seriesInfo name='ECMA International' value='ECMA-393' />
    </reference>
    

    <reference anchor='EEEC'>
        <front>
            <title>Improving the Energy Efficiency of Ethernet-Connected: 
			A Proposal for Proxying</title>
            <author initials='B.' surname='Nordman' fullname='Bbuce Nordman'></author>
            <author initials='K.' surname='Christensen' fullname='Ken Christensen'></author>      
            <date month='September' year='2007' />
        </front>
        <seriesInfo name='Ethernet Alliance' value='' />
    </reference>


    <reference anchor='NCP'>
        <front>
            <title>A Network Connection Proxy to Enable Hosts to Sleep and Save Energy</title>
            <author initials='M.' surname='Jimeno' fullname='M. Jimeno'></author>
            <author initials='K.' surname='Christensen' fullname='K. Christensen'></author>      
	    <author initials='B.' surname='Nordman' fullname='B. Nordman'></author>   
	 <date month='' year='2008' />
        </front>
        <seriesInfo name='Proc. IEEE Internat. Performance Computing and Communications Conf' value='' />
    </reference>

    <reference anchor='SKILL'>
        <front>
            <title>Skilled in the Art of Being Idle: Reducing Energy Waste in Networked Systems</title>
            <author initials='S.' surname='Nedevschi' fullname='S. Nedevschi'></author>
            <author initials='J.' surname='Liu' fullname='J. Liu'></author>      
			<author initials='B.' surname='Nordman' fullname='B. Nordman'></author>
		    <author initials='S.' surname='Ratnasamy' fullname='S. Ratnasamy'></author>
			<author initials='N.' surname='Taft' fullname='N. Taft'></author>
			<date month='' year='2009' />
        </front>
        <seriesInfo name='Proc. USENIX Symposium on Networked Systems Design and Implementation' value='' />
    </reference>
 -->
    </references>

</back>
</rfc>
