<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?rfc toc="yes"?>
<?rfc tocompact="yes"?>
<?rfc tocdepth="3"?>
<?rfc tocindent="yes"?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="yes"?>
<?rfc comments="yes"?>
<?rfc inline="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<rfc category="std" docName="draft-ietf-nvo3-geneve-02" ipr="trust200902"
     submissionType="IETF">
  <front>
    <title abbrev="Geneve Protocol">Geneve: Generic Network Virtualization
    Encapsulation</title>

    <author fullname="Jesse Gross" initials="J." role="editor" surname="Gross">
      <organization abbrev="VMware">VMware, Inc.</organization>

      <address>
        <postal>
          <street>3401 Hillview Ave.</street>

          <!-- Reorder these if your country does things differently -->

          <city>Palo Alto</city>

          <region>CA</region>

          <code>94304</code>

          <country>USA</country>
        </postal>

        <email>jgross@vmware.com</email>

        <!-- uri and facsimile elements may also be added -->
      </address>
    </author>

    <author fullname="Ilango Ganga" initials="I." role="editor"
            surname="Ganga">
      <organization abbrev="Intel">Intel Corporation</organization>

      <address>
        <postal>
          <street>2200 Mission College Blvd.</street>

          <city>Santa Clara</city>

          <region>CA</region>

          <code>95054</code>

          <country>USA</country>
        </postal>

        <phone/>

        <facsimile/>

        <email>ilango.s.ganga@intel.com</email>

        <uri/>
      </address>
    </author>

    <date day="8" month="July" year="2016"/>

    <abstract>
      <t>Network virtualization involves the cooperation of devices with a
      wide variety of capabilities such as software and hardware tunnel
      endpoints, transit fabrics, and centralized control clusters. As a
      result of their role in tying together different elements in the system,
      the requirements on tunnels are influenced by all of these components.
      Flexibility is therefore the most important aspect of a tunnel protocol
      if it is to keep pace with the evolution of the system. This draft
      describes Geneve, a protocol designed to recognize and accommodate these
      changing capabilities and needs.</t>
    </abstract>
  </front>

  <middle>
    <section anchor="intro" title="Introduction">
      <t>Networking has long featured a variety of tunneling, tagging, and
      other encapsulation mechanisms. However, the advent of network
      virtualization has caused a surge of renewed interest and a
      corresponding increase in the introduction of new protocols. The large
      number of protocols in this space, ranging all the way from <xref
      target="IEEE.802.1Q-2014">VLANs</xref> and <xref
      target="RFC3031">MPLS</xref> through the more recent <xref
      target="RFC7348">VXLAN</xref>, <xref target="RFC7637">NVGRE</xref>, and
      <xref target="I-D.davie-stt">STT</xref>, often leads to questions about
      the need for new encapsulation formats and what it is about network
      virtualization in particular that leads to their proliferation.</t>

      <t>While many encapsulation protocols seek to simply partition the
      underlay network or bridge between two domains, network virtualization
      views the transit network as providing connectivity between multiple
      components of a distributed system. In many ways this system is similar
      to a chassis switch with the IP underlay network playing the role of the
      backplane and tunnel endpoints on the edge as line cards. When viewed in
      this light, the requirements placed on the tunnel protocol are
      significantly different in terms of the quantity of metadata necessary
      and the role of transit nodes.</t>

      <t>Current work such as <xref target="VL2"/> and the <xref
      target="I-D.ietf-nvo3-dataplane-requirements">NVO3 working group</xref>
      have described some of the properties that the data plane must have to
      support network virtualization. However, one additional defining
      requirement is the need to carry system state along with the packet
      data. The use of some metadata is certainly not a foreign concept -
      nearly all protocols used for virtualization have at least 24 bits of
      identifier space as a way to partition between tenants. This is often
      described as overcoming the limits of 12-bit VLANs, and when seen in
      that context, or any context where it is a true tenant identifier, 16
      million possible entries is a large number. However, the reality is that
      the metadata is not exclusively used to identify tenants and encoding
      other information quickly starts to crowd the space. In fact, when
      compared to the tags used to exchange metadata between line cards on a
      chassis switch, 24-bit identifiers start to look quite small. There are
      nearly endless uses for this metadata, ranging from storing input ports
      for simple security policies to service based context for interposing
      advanced middleboxes.</t>

      <t>Existing tunnel protocols have each attempted to solve different
      aspects of these new requirements, only to be quickly rendered out of
      date by changing control plane implementations and advancements.
      Furthermore, software and hardware components and controllers all have
      different advantages and rates of evolution - a fact that should be
      viewed as a benefit, not a liability or limitation. This draft describes
      Geneve, a protocol which seeks to avoid these problems by providing a
      framework for tunneling for network virtualization rather than being
      prescriptive about the entire system.</t>

      <section title="Requirements Language">
        <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
        "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
        document are to be interpreted as described in <xref
        target="RFC2119"/>.</t>

        <t>In this document, these words will appear with that interpretation
        only when in ALL CAPS. Lower case uses of these words are not to be
        interpreted as carrying RFC-2119 significance.</t>

        <t/>
      </section>

      <section title="Terminology">
        <t>The <xref target="RFC7365">NVO3 framework</xref> defines many of
        the concepts commonly used in network virtualization. In addition, the
        following terms are specifically meaningful in this document:</t>

        <t>Checksum offload. An optimization implemented by many NICs which
        enables computation and verification of upper layer protocol checksums
        in hardware on transmit and receive, respectively. This typically
        includes IP and TCP/UDP checksums which would otherwise be computed by
        the protocol stack in software.</t>

        <t>Clos network. A technique for composing network fabrics larger than
        a single switch while maintaining non-blocking bandwidth across
        connection points. ECMP is used to divide traffic across the multiple
        links and switches that constitute the fabric. Sometimes termed "leaf
        and spine" or "fat tree" topologies.</t>

        <t>ECMP. Equal Cost Multipath. A routing mechanism for selecting from
        among multiple best next hop paths by hashing packet headers in order
        to better utilize network bandwidth while avoiding reordering a single
        stream.</t>

        <t>Geneve. Generic Network Virtualization Encapsulation. The tunnel
        protocol described in this draft.</t>

        <t>LRO. Large Receive Offload. The receive-side equivalent function of
        LSO, in which multiple protocol segments (primarily TCP) are coalesced
        into larger data units.</t>

        <t>NIC. Network Interface Card. A NIC could be part of a tunnel
        endpoint or transit device and can either process Geneve packets or
        aid in the processing of Geneve packets.</t>

        <t>OAM. Operations, Administration, and Management. A suite of tools
        used to monitor and troubleshoot network problems.</t>

        <t>Transit device. A forwarding element along the path of the tunnel
        making up part of the Underlay Network. A transit device MAY be
        capable of understanding the Geneve packet format but does not
        originate or terminate Geneve packets.</t>

        <t>LSO. Large Segmentation Offload. A function provided by many
        commercial NICs that allows data units larger than the MTU to be
        passed to the NIC to improve performance, the NIC being responsible
        for creating smaller segments of size less than or equal to the MTU
        with correct protocol headers. When referring specifically to TCP/IP,
        this feature is often known as TSO (TCP Segmentation Offload).</t>

        <t>Tunnel endpoint. A component performing encapsulation and
        decapsulation of packets, such as Ethernet frames or IP datagrams, in
        Geneve headers. As the ultimate consumer of any tunnel metadata,
        endpoints have the highest level of requirements for parsing and
        interpreting tunnel headers. Tunnel endpoints may consist of either
        software or hardware implementations or a combination of the two.
        Endpoints are frequently a component of an NVE but may also be found
        in middleboxes or other elements making up an NVO3 Network.</t>

        <t>VM. Virtual Machine.</t>
      </section>
    </section>

    <section title="Design Requirements">
      <t>Geneve is designed to support network virtualization use cases, where
      tunnels are typically established to act as a backplane between the
      virtual switches residing in hypervisors, physical switches, or
      middleboxes or other appliances. An arbitrary IP network can be used as
      an underlay although Clos networks composed using ECMP links are a
      common choice to provide consistent bisectional bandwidth across all
      connection points. <xref target="genevedeployment"/> shows an example of
      a hypervisor, top of rack switch for connectivity to physical servers,
      and a WAN uplink connected using Geneve tunnels over a simplified Clos
      network. These tunnels are used to encapsulate and forward frames from
      the attached components such as VMs or physical links.</t>

      <figure anchor="genevedeployment" title="Sample Geneve Deployment">
        <artwork><![CDATA[  +---------------------+           +-------+  +------+
  | +--+  +-------+---+ |           |Transit|--|Top of|==Physical
  | |VM|--|       |   | | +------+ /|Router |  | Rack |==Servers
  | +--+  |Virtual|NIC|---|Top of|/ +-------+\/+------+
  | +--+  |Switch |   | | | Rack |\ +-------+/\+------+
  | |VM|--|       |   | | +------+ \|Transit|  |Uplink|   WAN
  | +--+  +-------+---+ |           |Router |--|      |=========>
  +---------------------+           +-------+  +------+
         Hypervisor

              ()===================================()
                      Switch-Switch Geneve Tunnels]]></artwork>
      </figure>

      <t>To support the needs of network virtualization, the tunnel protocol
      should be able to take advantage of the differing (and evolving)
      capabilities of each type of device in both the underlay and overlay
      networks. This results in the following requirements being placed on the
      data plane tunneling protocol:<list style="symbols">
          <t>The data plane is generic and extensible enough to support
          current and future control planes.</t>

          <t>Tunnel components are efficiently implementable in both hardware
          and software without restricting capabilities to the lowest common
          denominator.</t>

          <t>High performance over existing IP fabrics.</t>
        </list>These requirements are described further in the following
      subsections.</t>

      <section title="Control Plane Independence">
        <t>Although some protocols for network virtualization have included a
        control plane as part of the tunnel format specification (most
        notably, the original VXLAN spec prescribed a multicast learning-based
        control plane), these specifications have largely been treated as
        describing only the data format. The VXLAN packet format has actually
        seen a wide variety of control planes built on top of it.</t>

        <t>There is a clear advantage in settling on a data format: most of
        the protocols are only superficially different and there is little
        advantage in duplicating effort. However, the same cannot be said of
        control planes, which are diverse in very fundamental ways. The case
        for standardization is also less clear given the wide variety in
        requirements, goals, and deployment scenarios.</t>

        <t>As a result of this reality, Geneve aims to be a pure tunnel format
        specification that is capable of fulfilling the needs of many control
        planes by explicitly not selecting any one of them. This
        simultaneously promotes a shared data format and increases the chances
        that it will not be obsoleted by future control plane
        enhancements.</t>
      </section>

      <section title="Data Plane Extensibility">
        <t>Achieving the level of flexibility needed to support current and
        future control planes effectively requires an options infrastructure
        to allow new metadata types to be defined, deployed, and either
        finalized or retired. Options also allow for differentiation of
        products by encouraging independent development in each vendor's core
        specialty, leading to an overall faster pace of advancement. By far
        the most common mechanism for implementing options is
        Type-Length-Value (TLV) format.</t>

        <t>It should be noted that while options can be used to support
        non-wirespeed control packets, they are equally important on data
        packets as well to segregate and direct forwarding (for instance, the
        examples given before of input port based security policies and
        service interposition both require tags to be placed on data packets).
        Therefore, while it would be desirable to limit the extensibility to
        only control packets for the purposes of simplifying the datapath,
        that would not satisfy the design requirements.</t>

        <section title="Efficient Implementation">
          <t>There is often a conflict between software flexibility and
          hardware performance that is difficult to resolve. For a given set
          of functionality, it is obviously desirable to maximize performance.
          However, that does not mean new features that cannot be run at that
          speed today should be disallowed. Therefore, for a protocol to be
          efficiently implementable means that a set of common capabilities
          can be reasonably handled across platforms along with a graceful
          mechanism to handle more advanced features in the appropriate
          situations.</t>

          <t>The use of a variable length header and options in a protocol
          often raises questions about whether it is truly efficiently
          implementable in hardware. To answer this question in the context of
          Geneve, it is important to first divide "hardware" into two
          categories: tunnel endpoints and transit devices.</t>

          <t>Endpoints must be able to parse the variable header, including
          any options, and take action. Since these devices are actively
          participating in the protocol, they are the most affected by Geneve.
          However, as endpoints are the ultimate consumers of the data,
          transmitters can tailor their output to the capabilities of the
          recipient. As new functionality becomes sufficiently well defined to
          add to endpoints, supporting options can be designed using ordering
          restrictions and other techniques to ease parsing.</t>

          <t>Transit devices MAY be able to interpret the options and
          participate in Geneve packet processing. However, as non-terminating
          devices, they do not originate or terminate the Geneve packet. The
          participation of transit devices in Geneve packet processing is
          OPTIONAL.</t>

          <t>Further, either tunnel endpoints or transit devices MAY use
          offload capabilities of NICs such as checksum offload to improve the
          performance of Geneve packet processing. The presence of a Geneve
          variable length header SHOULD NOT prevent the tunnel endpoints and
          transit devices from using such offload capabilities.</t>
        </section>
      </section>

      <section title="Use of Standard IP Fabrics">
        <t>IP has clearly cemented its place as the dominant transport
        mechanism and many techniques have evolved over time to make it
        robust, efficient, and inexpensive. As a result, it is natural to use
        IP fabrics as a transit network for Geneve. Fortunately, the use of IP
        encapsulation and addressing is enough to achieve the primary goal of
        delivering packets to the correct point in the network through
        standard switching and routing.</t>

        <t>In addition, nearly all underlay fabrics are designed to exploit
        parallelism in traffic to spread load across multiple links without
        introducing reordering in individual flows. These equal cost
        multipathing (ECMP) techniques typically involve parsing and hashing
        the addresses and port numbers from the packet to select an outgoing
        link. However, the use of tunnels often results in poor ECMP
        performance without additional knowledge of the protocol as the
        encapsulated traffic is hidden from the fabric by design and only
        endpoint addresses are available for hashing.</t>

        <t>Since it is desirable for Geneve to perform well on these existing
        fabrics, it is necessary for entropy from encapsulated packets to be
        exposed in the tunnel header. The most common technique for this is to
        use the UDP source port, which is discussed further in <xref
        target="udphdr"/>.</t>
      </section>
    </section>

    <section anchor="format" title="Geneve Encapsulation Details">
      <t>The Geneve packet format consists of a compact tunnel header
      encapsulated in UDP over either IPv4 or IPv6. A small fixed tunnel
      header provides control information plus a base level of functionality
      and interoperability with a focus on simplicity. This header is then
      followed by a set of variable options to allow for future innovation.
      Finally, the payload consists of a protocol data unit of the indicated
      type, such as an Ethernet frame. <xref target="geneve4"/> and <xref
      target="geneve6"/> illustrate the Geneve packet format transported (for
      example) over Ethernet along with an Ethernet payload.</t>

      <section anchor="geneve4" title="Geneve Packet Format Over IPv4">
        <figure>
          <artwork><![CDATA[   0                   1                   2                   3
   0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

Outer Ethernet Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                 Outer Destination MAC Address                 |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Outer Destination MAC Address |   Outer Source MAC Address    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                   Outer Source MAC Address                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Optional Ethertype=C-Tag 802.1Q|  Outer VLAN Tag Information   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |       Ethertype=0x0800        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Outer IPv4 Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Version|  IHL  |Type of Service|          Total Length         |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |         Identification        |Flags|      Fragment Offset    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |  Time to Live |Protocol=17 UDP|         Header Checksum       |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                     Outer Source IPv4 Address                 |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                   Outer Destination IPv4 Address              |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Outer UDP Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |       Source Port = xxxx      |       Dest Port = 6081        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |           UDP Length          |        UDP Checksum           |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Geneve Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Ver|  Opt Len  |O|C|    Rsvd.  |          Protocol Type        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |        Virtual Network Identifier (VNI)       |    Reserved   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                    Variable Length Options                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Inner Ethernet Header (example payload):
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                 Inner Destination MAC Address                 |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Inner Destination MAC Address |   Inner Source MAC Address    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                   Inner Source MAC Address                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Optional Ethertype=C-Tag 802.1Q|  Inner VLAN Tag Information   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Payload:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Ethertype of Original Payload |                               |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                               |
   |                                  Original Ethernet Payload    |
   |                                                               |
   | (Note that the original Ethernet Frame's FCS is not included) |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Frame Check Sequence:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |   New FCS (Frame Check Sequence) for Outer Ethernet Frame     |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>
      </section>

      <section anchor="geneve6" title="Geneve Packet Format Over IPv6">
        <figure>
          <artwork><![CDATA[0                   1                   2                   3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1

Outer Ethernet Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                 Outer Destination MAC Address                 |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Outer Destination MAC Address |   Outer Source MAC Address    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                   Outer Source MAC Address                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Optional Ethertype=C-Tag 802.1Q|  Outer VLAN Tag Information   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |       Ethertype=0x86DD        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Outer IPv6 Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Version| Traffic Class |           Flow Label                  |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |         Payload Length        | NxtHdr=17 UDP |   Hop Limit   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                                                               |
   +                                                               +
   |                                                               |
   +                     Outer Source IPv6 Address                 +
   |                                                               |
   +                                                               +
   |                                                               |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                                                               |
   +                                                               +
   |                                                               |
   +                  Outer Destination IPv6 Address               +
   |                                                               |
   +                                                               +
   |                                                               |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Outer UDP Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |       Source Port = xxxx      |       Dest Port = 6081        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |           UDP Length          |        UDP Checksum           |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Geneve Header:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Ver|  Opt Len  |O|C|    Rsvd.  |          Protocol Type        |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |        Virtual Network Identifier (VNI)       |    Reserved   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                    Variable Length Options                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Inner Ethernet Header (example payload):
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                 Inner Destination MAC Address                 |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Inner Destination MAC Address |   Inner Source MAC Address    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                   Inner Source MAC Address                    |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |Optional Ethertype=C-Tag 802.1Q|  Inner VLAN Tag Information   |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Payload:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   | Ethertype of Original Payload |                               |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+                               |
   |                                  Original Ethernet Payload    |
   |                                                               |
   | (Note that the original Ethernet Frame's FCS is not included) |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <figure>
          <artwork><![CDATA[Frame Check Sequence:
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |   New FCS (Frame Check Sequence) for Outer Ethernet Frame     |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>
      </section>

      <section anchor="udphdr" title="UDP Header">
        <t>The use of an encapsulating <xref target="RFC0768">UDP</xref>
        header follows the connectionless semantics of Ethernet and IP in
        addition to providing entropy to routers performing ECMP. The header
        fields are therefore interpreted as follows:</t>

        <t><list style="hanging">
            <t hangText="Source port:">A source port selected by the
            originating tunnel endpoint. This source port SHOULD be the same
            for all packets belonging to a single encapsulated flow to prevent
            reordering due to the use of different paths. To encourage an even
            distribution of flows across multiple links, the source port
            SHOULD be calculated using a hash of the encapsulated packet
            headers using, for example, a traditional 5-tuple. Since the port
            represents a flow identifier rather than a true UDP connection,
            the entire 16-bit range MAY be used to maximize entropy.</t>

            <t hangText="Dest port:">IANA has assigned port 6081 as the fixed
            well-known destination port for Geneve. Although the well-known
            value should be used by default, it is RECOMMENDED that
            implementations make this configurable. The chosen port is used
            for identification of Geneve packets and MUST NOT be reversed for
            different ends of a connection as is done with TCP.</t>

            <t hangText="UDP length:">The length of the UDP packet including
            the UDP header.</t>

            <t hangText="UDP checksum:">The checksum MAY be set to zero on
            transmit for packets encapsulated in both IPv4 and IPv6 <xref
            target="RFC6935"/>. When a packet is received with a UDP checksum
            of zero it MUST be accepted and decapsulated. If the originating
            tunnel endpoint optionally encapsulates a packet with a non-zero
            checksum, it MUST be a correctly computed UDP checksum. Upon
            receiving such a packet, the egress endpoint MUST validate the
            checksum. If the checksum is not correct, the packet MUST be
            dropped, otherwise the packet MUST be accepted for decapsulation.
            It is RECOMMENDED that the UDP checksum be computed to protect the
            Geneve header and options in situations where the network
            reliability is not high and the packet is not protected by another
            checksum or CRC.</t>
          </list></t>
      </section>

      <section title="Tunnel Header Fields">
        <t><list style="hanging">
            <t hangText="Ver (2 bits):">The current version number is 0.
            Packets received by an endpoint with an unknown version MUST be
            dropped. Non-terminating devices processing Geneve packets with an
            unknown version number MUST treat them as UDP packets with an
            unknown payload.</t>

            <t hangText="Opt Len (6 bits):">The length of the options fields,
            expressed in four byte multiples, not including the eight byte
            fixed tunnel header. This results in a minimum total Geneve header
            size of 8 bytes and a maximum of 260 bytes. The start of the
            payload headers can be found using this offset from the end of the
            base Geneve header.</t>

            <t hangText="O (1 bit):">OAM packet. This packet contains a
            control message instead of a data payload. Endpoints MUST NOT
            forward the payload and transit devices MUST NOT attempt to
            interpret or process it. Since these are infrequent control
            messages, it is RECOMMENDED that endpoints direct these packets to
            a high priority control queue (for example, to direct the packet
            to a general purpose CPU from a forwarding ASIC or to separate out
            control traffic on a NIC). Transit devices MUST NOT alter
            forwarding behavior on the basis of this bit, such as ECMP link
            selection.</t>

            <t hangText="C (1 bit):">Critical options present. One or more
            options has the critical bit set (see <xref target="options"/>).
            If this bit is set then tunnel endpoints MUST parse the options
            list to interpret any critical options. On endpoints where option
            parsing is not supported the packet MUST be dropped on the basis
            of the 'C' bit in the base header. If the bit is not set tunnel
            endpoints MAY strip all options using 'Opt Len' and forward the
            decapsulated packet. Transit devices MUST NOT drop or modify
            packets on the basis of this bit.</t>

            <t hangText="Rsvd. (6 bits):">Reserved field which MUST be zero on
            transmission and ignored on receipt.</t>

            <t hangText="Protocol Type (16 bits):">The type of the protocol
            data unit appearing after the Geneve header. This follows the
            <xref target="ETYPES">EtherType</xref> convention with Ethernet
            itself being represented by the value 0x6558.</t>

            <t hangText="Virtual Network Identifier (VNI) (24 bits):">An
            identifier for a unique element of a virtual network. In many
            situations this may represent an L2 segment, however, the control
            plane defines the forwarding semantics of decapsulated packets.
            The VNI MAY be used as part of ECMP forwarding decisions or MAY be
            used as a mechanism to distinguish between overlapping address
            spaces contained in the encapsulated packet when load balancing
            across CPUs.</t>

            <t hangText="Reserved (8 bits):">Reserved field which MUST be zero
            on transmission and ignored on receipt.</t>
          </list></t>

        <t>Transit devices MUST maintain consistent forwarding behavior
        irrespective of the value of 'Opt Len', including ECMP link selection.
        These devices SHOULD be able to forward packets containing options
        without resorting to a slow path.</t>
      </section>

      <section anchor="options" title="Tunnel Options">
        <figure title="Geneve Option">
          <artwork><![CDATA[0                   1                   2                   3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|          Option Class         |      Type     |R|R|R| Length  |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                      Variable Option Data                     |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+]]></artwork>
        </figure>

        <t>The base Geneve header is followed by zero or more options in
        Type-Length-Value format. Each option consists of a four byte option
        header and a variable amount of option data interpreted according to
        the type.</t>

        <t><list style="hanging">
            <t hangText="Option Class (16 bits):">Namespace for the 'Type'
            field. IANA will be requested to create a "Geneve Option Class"
            registry to allocate identifiers for organizations, technologies,
            and vendors that have an interest in creating types for options.
            Each organization may allocate types independently to allow
            experimentation and rapid innovation. It is expected that over
            time certain options will become well known and a given
            implementation may use option types from a variety of sources. In
            addition, IANA will be requested to reserve specific ranges for
            standardized and experimental options.</t>

            <t hangText="Type (8 bits):">Type indicating the format of the
            data contained in this option. Options are primarily designed to
            encourage future extensibility and innovation and so standardized
            forms of these options will be defined in a separate document.</t>

            <t>The high order bit of the option type indicates that this is a
            critical option. If the receiving endpoint does not recognize this
            option and this bit is set then the packet MUST be dropped. If the
            critical bit is set in any option then the 'C' bit in the Geneve
            base header MUST also be set. Transit devices MUST NOT drop
            packets on the basis of this bit. The following figure shows the
            location of the 'C' bit in the 'Type' field:<figure
                suppress-title="true">
                <artwork><![CDATA[   0 1 2 3 4 5 6 7 8
   +-+-+-+-+-+-+-+-+
   |C|    Type     |
   +-+-+-+-+-+-+-+-+]]></artwork>
              </figure></t>

            <t>The requirement to drop a packet with an unknown critical
            option applies to the entire tunnel endpoint system and not a
            particular component of the implementation. For example, in a
            system comprised of a forwarding ASIC and a general purpose CPU,
            this does not mean that the packet must be dropped in the ASIC. An
            implementation may send the packet to the CPU using a rate-limited
            control channel for slow-path exception handling.</t>

            <t hangText="R (3 bits):">Option control flags reserved for future
            use. MUST be zero on transmission and ignored on receipt.</t>

            <t hangText="Length (5 bits):">Length of the option, expressed in
            four byte multiples excluding the option header. The total length
            of each option may be between 4 and 128 bytes. Packets in which
            the total length of all options is not equal to the 'Opt Len' in
            the base header are invalid and MUST be silently dropped if
            received by an endpoint.</t>

            <t hangText="Variable Option Data:">Option data interpreted
            according to 'Type'.</t>
          </list></t>

        <section anchor="optionsproc" title="Options Processing">
          <t>Geneve options are primarily intended to be originated and
          processed by tunnel endpoints. However, options MAY be processed by
          transit devices along the tunnel path as well. Transit devices not
          processing Geneve headers SHOULD process Geneve packets as any other
          UDP packet and maintain consistent forwarding behavior.</t>

          <t>In tunnel endpoints, the generation and interpretation of options
          is determined by the control plane, which is out of the scope of
          this document. However, to ensure interoperability between
          heterogeneous devices some requirements are imposed on options and
          the devices that process them:<list style="symbols">
              <t>Receiving endpoints MUST drop packets containing unknown
              options with the 'C' bit set in the option type. Conversely,
              transit devices MUST NOT drop packets as a result of
              encountering unknown options, including those with the 'C' bit
              set.</t>

              <t>Some options may be defined in such a way that the position
              in the option list is significant. Therefore, options MUST NOT
              be reordered by transit devices.</t>

              <t>An option MUST NOT affect the parsing or interpretation of
              any other option.</t>
            </list></t>

          <t>When designing a Geneve option, it is important to consider how
          the option will evolve in the future. Once an option is defined it
          is reasonable to expect that implementations may come to depend on a
          specific behavior. As a result, the scope of any future changes must
          be carefully described upfront.</t>

          <t>Unexpectedly significant interoperability issues may result from
          changing the length of an option that was defined to be a certain
          size. A particular option is specified to have either a fixed
          length, which is constant, or a variable length, which may change
          over time or for different use cases. This property is part of the
          definition of the option and conveyed by the &lsquo;Type&rsquo;. For
          fixed length options, some implementations may choose to ignore the
          length field in the option header and instead parse based on the
          well known length associated with the type. In this case, redefining
          the length will impact not only parsing of the option in question
          but also any options that follow. Therefore, options that are
          defined to be fixed length in size MUST NOT be redefined to a
          different length. Instead, a new 'Type' should be allocated.</t>
        </section>
      </section>
    </section>

    <section title="Implementation and Deployment Considerations">
      <section title="Encapsulation of Geneve in IP">
        <t>As an IP-based tunnel protocol, Geneve shares many properties and
        techniques with existing protocols. The application of some of these
        are described in further detail, although in general most concepts
        applicable to the IP layer or to IP tunnels generally also function in
        the context of Geneve.</t>

        <section title="IP Fragmentation">
          <t>To prevent fragmentation and maximize performance, the best
          practice when using Geneve is to ensure that the MTU of the physical
          network is greater than or equal to the MTU of the encapsulated
          network plus tunnel headers. Manual or upper layer (such as TCP MSS
          clamping) configuration can be used to ensure that fragmentation
          never takes place, however, in some situations this may not be
          feasible.</t>

          <t>It is strongly RECOMMENDED that Path MTU Discovery (<xref
          target="RFC1191"/>, <xref target="RFC1981"/>) be used by setting the
          DF bit in the IP header when Geneve packets are transmitted over
          IPv4 (this is the default with IPv6). The use of Path MTU Discovery
          on the transit network provides the encapsulating endpoint with
          soft-state about the link that it may use to prevent or minimize
          fragmentation depending on its role in the virtualized network.</t>

          <t>Note that some implementations may not be capable of supporting
          fragmentation or other less common features of the IP header, such
          as options and extension headers.</t>
        </section>

        <section title="DSCP and ECN">
          <t>When encapsulating IP (including over Ethernet) packets in
          Geneve, there are several considerations for propagating DSCP and
          ECN bits from the inner header to the tunnel on transmission and the
          reverse on reception.</t>

          <t><xref target="RFC2983"/> provides guidance for mapping DSCP
          between inner and outer IP headers. Network virtualization is
          typically more closely aligned with the Pipe model described, where
          the DSCP value on the tunnel header is set based on a policy (which
          may be a fixed value, one based on the inner traffic class, or some
          other mechanism for grouping traffic). Aspects of the Uniform model
          (which treats the inner and outer DSCP value as a single field by
          copying on ingress and egress) may also apply, such as the ability
          to remark the inner header on tunnel egress based on transit
          marking. However, the Uniform model is not conceptually consistent
          with network virtualization, which seeks to provide strong isolation
          between encapsulated traffic and the physical network.</t>

          <t><xref target="RFC6040"/> describes the mechanism for exposing ECN
          capabilities on IP tunnels and propagating congestion markers to the
          inner packets. This behavior MUST be followed for IP packets
          encapsulated in Geneve.</t>
        </section>

        <section title="Broadcast and Multicast">
          <t>Geneve tunnels may either be point-to-point unicast between two
          endpoints or may utilize broadcast or multicast addressing. It is
          not required that inner and outer addressing match in this respect.
          For example, in physical networks that do not support multicast,
          encapsulated multicast traffic may be replicated into multiple
          unicast tunnels or forwarded by policy to a unicast location
          (possibly to be replicated there).</t>

          <t>With physical networks that do support multicast it may be
          desirable to use this capability to take advantage of hardware
          replication for encapsulated packets. In this case, multicast
          addresses may be allocated in the physical network corresponding to
          tenants, encapsulated multicast groups, or some other factor. The
          allocation of these groups is a component of the control plane and
          therefore outside of the scope of this document. When physical
          multicast is in use, the 'C' bit in the Geneve header may be used
          with groups of devices with heterogeneous capabilities as each
          device can interpret only the options that are significant to it if
          they are not critical.</t>
        </section>

        <section title="Unidirectional Tunnels">
          <t>Generally speaking, a Geneve tunnel is a unidirectional concept.
          IP is not a connection oriented protocol and it is possible for two
          endpoints to communicate with each other using different paths or to
          have one side not transmit anything at all. As Geneve is an IP-based
          protocol, the tunnel layer inherits these same characteristics.</t>

          <t>It is possible for a tunnel to encapsulate a protocol, such as
          TCP, which is connection oriented and maintains session state at
          that layer. In addition, implementations MAY model Geneve tunnels as
          connected, bidirectional links, such as to provide the abstraction
          of a virtual port. In both of these cases, bidirectionality of the
          tunnel is handled at a higher layer and does not affect the
          operation of Geneve itself.</t>
        </section>
      </section>

      <section title="Constraints on Protocol Features">
        <t>Geneve is intended to be flexible to a wide range of current and
        future applications. As a result, certain constraints may be placed on
        the use of metadata or other aspects of the protocol in order to
        optimize for a particular use case. For example, some applications may
        limit the types of options which are supported or enforce a maximum
        number or length of options. Other applications may only handle
        certain encapsulated payload types, such as Ethernet or IP. This could
        be either globally throughout the system or, for example, restricted
        to certain classes of devices or network paths.</t>

        <t>These constraints may be communicated to tunnel endpoints either
        explicitly through a control plane or implicitly by the nature of the
        application. As Geneve is defined as a data plane protocol that is
        control plane agnostic, the exact mechanism is not defined in this
        document.</t>
      </section>

      <section title="NIC Offloads">
        <t>Modern NICs currently provide a variety of offloads to enable the
        efficient processing of packets. The implementation of many of these
        offloads requires only that the encapsulated packet be easily parsed
        (for example, checksum offload). However, optimizations such as LSO
        and LRO involve some processing of the options themselves since they
        must be replicated/merged across multiple packets. In these
        situations, it is desirable to not require changes to the offload
        logic to handle the introduction of new options. To enable this, some
        constraints are placed on the definitions of options to allow for
        simple processing rules:<list style="symbols">
            <t>When performing LSO, a NIC MUST replicate the entire Geneve
            header and all options, including those unknown to the device,
            onto each resulting segment. However, a given option definition
            may override this rule and specify different behavior in
            supporting devices. Conversely, when performing LRO, a NIC MAY
            assume that a binary comparison of the options (including unknown
            options) is sufficient to ensure equality and MAY merge packets
            with equal Geneve headers.</t>

            <t>Options MUST NOT be reordered during the course of offload
            processing, including when merging packets for the purpose of
            LRO.</t>

            <t>NICs performing offloads MUST NOT drop packets with unknown
            options, including those marked as critical.</t>
          </list>There is no requirement that a given implementation of Geneve
        employ the offloads listed as examples above. However, as these
        offloads are currently widely deployed in commercially available NICs,
        the rules described here are intended to enable efficient handling of
        current and future options across a variety of devices.</t>
      </section>

      <section title="Inner VLAN Handling">
        <t>Geneve is capable of encapsulating a wide range of protocols and
        therefore a given implementation is likely to support only a small
        subset of the possibilities. However, as Ethernet is expected to be
        widely deployed, it is useful to describe the behavior of VLANs inside
        encapsulated Ethernet frames.</t>

        <t>As with any protocol, support for inner VLAN headers is OPTIONAL.
        In many cases, the use of encapsulated VLANs may be disallowed due to
        security or implementation considerations. However, in other cases
        trunking of VLAN frames across a Geneve tunnel can prove useful. As a
        result, the processing of inner VLAN tags upon ingress or egress from
        a tunnel endpoint is based upon the configuration of the endpoint
        and/or control plane and not explicitly defined as part of the data
        format.</t>
      </section>
    </section>

    <section title="Interoperability Issues">
      <t>Viewed exclusively from the data plane, Geneve does not introduce any
      interoperability issues as it appears to most devices as UDP packets.
      However, as there are already a number of tunnel protocols deployed in
      network virtualization environments, there is a practical question of
      transition and coexistence.</t>

      <t>Since Geneve is a superset of the functionality of the three most
      common protocols used for network virtualization (VXLAN, NVGRE, and STT)
      it should be straightforward to port an existing control plane to run on
      top of it with minimal effort. With both the old and new packet formats
      supporting the same set of capabilities, there is no need for a hard
      transition - endpoints directly communicating with each other use any
      common protocol, which may be different even within a single overall
      system. As transit devices are primarily forwarding packets on the basis
      of the IP header, all protocols appear similar and these devices do not
      introduce additional interoperability concerns.</t>

      <t>To assist with this transition, it is strongly suggested that
      implementations support simultaneous operation of both Geneve and
      existing tunnel protocols as it is expected to be common for a single
      node to communicate with a mixture of other nodes. Eventually, older
      protocols may be phased out as they are no longer in use.</t>
    </section>

    <section anchor="Security" title="Security Considerations">
      <t>As UDP/IP packets, Geneve does not have any inherent security
      mechanisms. As a result, an attacker with access to the underlay network
      transporting the IP packets has the ability to snoop or inject packets.
      Legitimate but malicious tunnel endpoints may also spoof identifiers in
      the tunnel header to gain access to networks owned by other tenants.</t>

      <t>Within a particular security domain, such as a data center operated
      by a single provider, the most common and highest performing security
      mechanism is isolation of trusted components. Tunnel traffic can be
      carried over a separate VLAN and filtered at any untrusted boundaries.
      In addition, tunnel endpoints should only be operated in environments
      controlled by the service provider, such as the hypervisor itself rather
      than within a customer VM.</t>

      <t>When crossing an untrusted link, such as the public Internet, <xref
      target="RFC4301">IPsec</xref> may be used to provide authentication
      and/or encryption of the IP packets formed as part of Geneve
      encapsulation. If the remote tunnel endpoint is not completely trusted,
      for example it resides on a customer premises, then it may also be
      necessary to sanitize any tunnel metadata to prevent tenant-hopping
      attacks.</t>

      <t>Geneve does not otherwise affect the security of the encapsulated
      packets.</t>
    </section>

    <section anchor="IANA" title="IANA Considerations">
      <t>IANA has allocated UDP port 6081 as the well-known destination port
      for Geneve. Upon publication, the registry should be updated to cite
      this document. The original request was:</t>

      <figure>
        <artwork><![CDATA[Service Name: geneve
Transport Protocol(s): UDP
Assignee: Jesse Gross <jgross@vmware.com>
Contact: Jesse Gross <jgross@vmware.com>
Description: Generic Network Virtualization Encapsulation (Geneve)
Reference: This document
Port Number: 6081]]></artwork>
      </figure>

      <t>In addition, IANA is requested to create a "Geneve Option Class"
      registry to allocate Option Classes. This shall be a registry of 16-bit
      hexadecimal values along with descriptive strings. The identifiers
      0x0-0xFF are to be reserved for standardized options for allocation by
      IETF Review <xref target="RFC5226"/> and 0xFFF0-0xFFFF for Experimental
      Use. Otherwise, identifiers are to be assigned to any organization with
      an interest in creating Geneve options on a First Come First Served
      basis. The registry is to be populated with the following initial
      values:</t>

      <texttable style="full">
        <ttcol align="left">Option Class</ttcol>

        <ttcol align="left">Description</ttcol>

        <c>0x0000..0x00FF</c>

        <c>Unassigned - IETF Review</c>

        <c>0x0100</c>

        <c>Linux</c>

        <c>0x0101</c>

        <c>Open vSwitch</c>

        <c>0x0102</c>

        <c>Open Virtual Networking (OVN)</c>

        <c>0x0103</c>

        <c>In-band Network Telemetry (INT)</c>

        <c>0x0104</c>

        <c>VMware</c>

        <c>0x0105..0xFFEF</c>

        <c>Unassigned - First Come First Served</c>

        <c>0xFFF0..FFFF</c>

        <c>Experimental</c>
      </texttable>
    </section>

    <section title="Contributors">
      <t>The following individuals were authors of an earlier version of this
      document and made significant contributions:</t>

      <figure>
        <artwork><![CDATA[T. Sridhar
VMware, Inc.
3401 Hillview Ave.
Palo Alto, CA  94304
USA

Email: tsridhar@vmware.com


Pankaj Garg
Microsoft Corporation
1 Microsoft Way
Redmond, WA  98052
USA

Email: pankajg@microsoft.com


Chris Wright
Red Hat Inc.
1801 Varsity Drive
Raleigh, NC  27606
USA

Email: chrisw@redhat.com


Puneet Agarwal
Innovium, Inc.
6001 America Center Drive
San Jose, CA  95002
USA

Email: puneet@innovium.com


Kenneth Duda
Arista Networks
5453 Great America Parkway
Santa Clara, CA  95054
USA

Email: kduda@arista.com


Dinesh G. Dutt
Cumulus Networks
140C S. Whisman Road
Mountain View, CA  94041
USA

Email: ddutt@cumulusnetworks.com


Jon Hudson
Brocade Communications Systems, Inc.
130 Holger Way
San Jose, CA  95134
USA

Email: jon.hudson@gmail.com


Ariel Hendel
Broadcom Limited
3151 Zanker Road
San Jose, CA  95134
USA

Email: ariel.hendel@broadcom.com]]></artwork>
      </figure>
    </section>

    <section title="Acknowledgements">
      <t>The authors wish to thank Martin Casado, Bruce Davie and Dave Thaler
      for their input, feedback, and helpful suggestions.</t>
    </section>
  </middle>

  <back>
    <references title="Normative References">
      <?rfc include='reference.RFC.2119'?>

      <?rfc include='reference.RFC.0768'?>

      <?rfc include='reference.RFC.5226'?>
    </references>

    <references title="Informative References">
      <?rfc include='reference.RFC.7348'?>

      <?rfc include='reference.RFC.7637'?>

      <?rfc include='reference.I-D.davie-stt'?>

      <?rfc include='reference.RFC.3031'?>

      <?rfc include='reference.I-D.ietf-nvo3-dataplane-requirements'?>

      <?rfc include='reference.RFC.4301'?>

      <?rfc include='reference.RFC.6935'?>

      <?rfc include='reference.RFC.2983'?>

      <?rfc include='reference.RFC.6040'?>

      <?rfc include='reference.RFC.7365'?>

      <?rfc include='reference.RFC.1191'?>

      <?rfc include='reference.RFC.1981'?>

      <reference anchor="VL2">
        <front>
          <title>VL2: A Scalable and Flexible Data Center Network</title>

          <author fullname="A. Greenberg et al." surname="Greenberg et al"/>

          <date year="2009"/>
        </front>

        <annotation>Proc. ACM SIGCOMM 2009</annotation>
      </reference>

      <reference anchor="IEEE.802.1Q-2014">
        <front>
          <title>IEEE Standard for Local and metropolitan area networks --
          Bridges and Bridged Networks</title>

          <author>
            <organization>IEEE</organization>
          </author>

          <date year="2014"/>
        </front>

        <seriesInfo name="IEEE Std" value="802.1Q"/>
      </reference>

      <reference anchor="ETYPES"
                 target="http://www.iana.org/assignments/ieee-802-numbers/ieee-802-numbers.xml">
        <front>
          <title>IEEE 802 Numbers</title>

          <author>
            <organization>The IEEE Registration Authority</organization>
          </author>

          <date year="2013"/>
        </front>
      </reference>
    </references>
  </back>
</rfc>
