<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE rfc SYSTEM 'rfc2629.dtd' [

      <!ENTITY rfc4997 PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.4997.xml'>
      <!ENTITY rfc6282 PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.6282.xml'>
      <!ENTITY rfc4944 PUBLIC '' 'http://xml.resource.org/public/rfc/bibxml/reference.RFC.4944.xml'>

      <!ENTITY gapAna PUBLIC '' 'http://xml2rfc.ietf.org/public/rfc/bibxml-ids/reference.I-D.draft-minaburo-lp-wan-gap-analysis-01.xml'>      
]>

<?rfc symrefs="yes" ?>
<?rfc sortrefs="yes" ?>
<?rfc strict="yes" ?>
<?rfc compact="yes" ?>
<rfc category="info" docName="draft-toutain-6lpwa-ipv6-static-context-hc-01" ipr="trust200902">
  <front>
    <title abbrev="6LPWA Static Context Header Compression (SCHC)">6LPWA Static Context Header Compression (SCHC) for IPv6 and UDP</title>


<author fullname="Ana Minaburo" initials="A." surname="Minaburo">
<organization>Acklio</organization>

   <address>
    <postal>
    <street>2bis rue de la Chataigneraie</street>


    <city>35510 Cesson-Sevigne Cedex</city>

    <country>France</country>
    </postal>

    <email>ana@ackl.io</email>
  </address>
</author>


    <author fullname="Laurent Toutain" initials="L." surname="Toutain">
      <organization>Institut MINES TELECOM ; TELECOM Bretagne</organization>

      <address>
        <postal>
          <street>2 rue de la Chataigneraie</street>

          <street>CS 17607</street>

          <city>35576 Cesson-Sevigne Cedex</city>

          <country>France</country>
        </postal>

        <email>Laurent.Toutain@telecom-bretagne.eu</email>
      </address>
    </author>

    <date/>

    <!--    <workgroup>v6ops Working Group </workgroup> -->

    <abstract>
      <t>This document describes a header compression scheme for IPv6 and IPv6/UDP headers based on static contexts. The technique is tailored to LPWA networks and could be extended to other protocol stacks.</t>
      <t>    
Several header compression mechanisms have been proposed over the history of the IETF. The first
mechanisms, such as RoHC, use a context to store header field values and send smaller incremental differences on the link. Values in the context evolve dynamically with the information contained in the compressed headers. The challenge is to keep the sender's and receiver's contexts synchronized even when packets are lost. Based on the fact that IPv6 contains only static fields, 6LoWPAN developed an efficient context-free compression mechanism, allowing better flexibility and performance.
     </t>
     
     <t>
The Static Context Header Compression (SCHC) combines the advantages of the RoHC context, which offers great flexibility in the processing of fields, with the 6LoWPAN ability to elide fields that are known to the other side. Static context means that the values in the context do not change during the transmission, which avoids complex resynchronization mechanisms that are incompatible with LPWA characteristics. In most cases, IPv6/UDP headers are reduced to a small context identifier.
</t>
<t>
This document focuses on IPv6/UDP header compression, but the mechanism can be applied to other protocols such as CoAP; this will be described in a separate document.
</t>
    </abstract>
  </front>

<middle>

<section anchor="Introduction" title="Introduction">

<t>
Header compression is mandatory to bring Internet protocols to the nodes of an
LPWA network <xref target="I-D.minaburo-lp-wan-gap-analysis" />. 
</t>
<t>
Nevertheless, LPWA networks offer good properties for efficient header compression:
<list style="symbols">
<t>The topology is star-oriented, so all packets follow the same path. For the needs of this draft, the architecture can be summarized as End-Systems (ES) exchanging information with a single LPWA Compressor (LC). In most cases, the End-Systems and the LC form a star topology. The ESs and the LC maintain a context for compression.</t>
<t>Traffic flows are mostly deterministic, since End-Systems embed built-in applications. Contrary to computers or smartphones, new applications cannot be easily installed.</t> 
</list>
</t>

<t>
The first mechanisms, such as RoHC, use a context to store header field values and send smaller
incremental differences on the link. The first version of RoHC targeted the IP/UDP/RTP stack.
RoHCv2 extends the principle to any protocol and introduces a formal 
notation <xref target="RFC4997"/> describing the header and associating 
a compression function with each field. 

To be efficient, the sender and the receiver must check that their contexts remain synchronized (i.e., contain the same values). Context synchronization
requires periodically sending a full header or at least the dynamic fields. A fully compressed header can be compatible with LPWA constraints. However, the first exchanges and context resynchronizations require sending uncompressed headers, which may be bigger than the original header. This will force the use of inefficient fragmentation mechanisms. For some LPWA technologies, duty cycle limits can also delay the resynchronization.

<xref target="fig-ROHC"/> illustrates this behavior. 
<figure anchor="fig-ROHC" title="RoHC Compressed Header size evolution."><artwork><![CDATA[
                    sync
          ^         +-+         sync     sync             ^
          | IPv6    | |         +-+       +-+             | IPv6
          v         | |         | |       | |             v
   +------------+   | +-+-+     | |       | |    +------------+
   |       +--+ |   | | | |     | |       | |    | +--+       |
   |       | c| |   | | | +-+-+-+ +-+-+-+-+ |    | | c|       |
   |       | t| |   | | | | | | | | | | | | |    | | t|       |
   |       | x| |   +-+-+-+-+-+-+-+-+-+-+-+-+    | | x|       |
   |       | t| | <----------------------------> | | t|       |
   |       +--+ |                                | +--+       |
   +------------+                                +------------+
   

]]></artwork></figure> 
</t>

<t>
On the other hand, 6LoWPAN <xref target="RFC4944"/> is context-free, based on the fact that IPv6, its extension headers and UDP headers do not contain incremental fields. The compression mechanism described in <xref target="RFC6282"/> is based on sending a 2-byte bitmap, which describes how the header should be decompressed, either by using some standard values or by reading information sent after this bitmap. <xref target="RFC6282"/>
also allows for UDP compression. 
</t>

<t>
In the best case, when the Hop Limit is a standard value, the Flow Label and DiffServ fields are set to 0, and link-local addresses are used over a single-hop network, the 6LoWPAN compressed header is reduced to 4 bytes. This compression ratio is possible because the IIDs are derived from the MAC addresses and the link-local prefix is known to both sides.

In that case, the compressed IPv6 header takes 4 bytes and the compressed UDP header 2 bytes, which fills half of the payload of a SIGFOX frame, or more than 10% of a LoRaWAN payload (with spreading factor 12). 
</t>

<t>
The Static Context Header Compression (SCHC) combines the advantages of the RoHC context, which offers great flexibility in the processing of fields, with the 6LoWPAN ability to elide fields that are known to the other side. Static context means that the values in the context do not change during the transmission, which avoids complex resynchronization mechanisms that are incompatible with LPWA characteristics. In most cases, IPv6/UDP headers are reduced to a small context identifier.
</t>

</section>

<section title="Static Context Header Compression">

<t>
Static Context Header Compression (SCHC) avoids context synchronization, which is the most bandwidth-consuming operation in RoHC. Because the nature of data flows is highly predictable in LPWA networks, a static context may be stored on the End-System (ES). The other end, the LPWA Compressor (LC), can learn the context through a provisioning protocol during the identification phase (for instance, in the same way as it learns the encryption key). 
</t>

<t>
The context contains an ordered list of rules. Each rule is a vector of entries. Each entry is composed of a field descriptor, a prescribed matching value, a matching rule for the compression side, a matching rule for the decompression side and a compression/decompression action.
Contexts in the compressor and decompressor are the same.

A rule is identified by a rule identifier. If the layer 2 allows it, the rule id can be carried in the layer 2 header. Otherwise the rule id is located in the first byte of the L2 payload.
</t>
<t>
Being at the boundary between Layer 2 and Layer 3, the rule id is also called a shim id. Different ESs can use the same shim id, each within its own context, so an LC may also use the ES device id to identify the appropriate rule.

<figure anchor="Fig-ctxt"
title="Context in LC"><artwork><![CDATA[
            
            
            +---------------------------------------------------------------------+
            |                      Rule N                                         |
       +---------------------------------------------------------------------+    |
       |                    Rule i                                           |    |
+---------------------------------------------------------------------+      |    |
|                    Rule 1                                           |      |    |
|   +---------+-------+------------+--------------+-----------------+ |      |    |
|   | Field 1 | Value |match. comp.| match decomp | Action function | |      |    | 
|   +---------+-------+------------+--------------+-----------------+ |      |    |
|   | Field 2 | Value |match. comp.| match decomp | Action function | |      |    | 
|   +---------+-------+------------+--------------+-----------------+ |      |    |
|   | ...     | ...   |...         | ...          | ...             | |      |    | 
|   +---------+-------+------------+--------------+-----------------+ |      |----+
|   | Field N | Value |match. comp.| match decomp | Action function | |      |   
|   +---------+-------+------------+--------------+-----------------+ |------+
|                                                                     |
+---------------------------------------------------------------------+
               
]]></artwork></figure>
  

</t>

<t>
The compression/decompression process follows several steps:
<list style="symbols">
<t>compression rule selection: the goal is to identify which rule will be used to compress the headers. A matching rule for compression is associated with each field. Each header field's value is compared to the corresponding value stored in the rule for that field using the matching operator. If all the fields match, the packet is processed using this rule's action functions and the exploration of the rule list stops. Otherwise the next rule is tested. If no rule matches, the packet is dropped.</t>
<t>compression: the action function indicates whether the field is sent on the link or not. A field can also be partially sent, depending on the matching operator. The resulting compressed header must be aligned on byte boundaries.</t>
<t>decompression rule selection: as for compression, a rule has to be selected to decompress incoming packets. A matching operator is defined on the compressed header and works as for compression.</t>
<t>decompression: the same action function indicates how the field value can be rebuilt, either from bits received on the link, from a value stored in the rule, or by using a specific algorithm.</t>
</list> 
</t>
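<t>
The compression-side steps above can be sketched in Python as follows. This is only an illustration under stated assumptions, not a normative algorithm: the names Entry, Rule, select_rule and compress are hypothetical, a header is modelled as a dictionary of field values, and only the "=" and "no" operators and the elided and send-value functions are covered.
</t>
<figure><artwork><![CDATA[
```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Entry:
    field: str      # field descriptor, e.g. "UDP ESport"
    value: int      # prescribed matching value
    match: Callable[[int, int], bool]  # compression-side operator
    action: str     # "elided" or "send-value"


@dataclass
class Rule:
    rule_id: int
    entries: List[Entry]


def equal(field_value: int, rule_value: int) -> bool:
    return field_value == rule_value


def no_check(field_value: int, rule_value: int) -> bool:
    return True


def select_rule(rules: List[Rule],
                header: Dict[str, int]) -> Optional[Rule]:
    """Return the first rule whose entries all match, else None."""
    for rule in rules:
        if all(e.match(header[e.field], e.value)
               for e in rule.entries):
            return rule
    return None  # no rule found: the packet is dropped


def compress(rule: Rule, header: Dict[str, int]) -> List[int]:
    """Rule id followed by the values the actions say to send."""
    sent = [header[e.field] for e in rule.entries
            if e.action == "send-value"]
    return [rule.rule_id] + sent


# Hypothetical two-rule context and one UDP header (assumptions).
rules = [
    Rule(0, [Entry("UDP ESport", 123, equal, "elided"),
             Entry("UDP LCport", 124, equal, "elided")]),
    Rule(1, [Entry("UDP ESport", 5683, equal, "elided"),
             Entry("UDP LCport", 0, no_check, "send-value")]),
]
header = {"UDP ESport": 5683, "UDP LCport": 5683}
rule = select_rule(rules, header)
print(rule.rule_id, compress(rule, header))  # prints: 1 [1, 5683]
```
]]></artwork></figure>
<t>
In this sketch the rule list is explored in order and the exploration stops at the first rule whose entries all match, which mirrors the ordered list of rules held in the context.
</t>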


</section>

<section title="Matching operators">
<t>
Matching a field against a value and compressing a header are related operations: if a field matches a rule containing the value, it is not necessary to send it on the link. Since both ends hold the same context, reading the rule's value is enough to reconstruct the field's value at the other end. 
</t>
<t>
In other cases, the value needs to be sent on the link to inform the other end. The field value may vary from one packet to another, so the field cannot be used to select the rule id.
</t>
<t>
There may be intermediate cases, where part of the value can be used to select a rule and a variable part has to be sent on the link. This is the case for Least Significant Bits (LSB) matching, where the most significant bits are used to select a rule id and the least significant bits have to be sent on the link.
</t>
<t>
Several matching operators are defined:
<list style="symbols">
<t>= : a field value in a packet matches a field value in a rule if they are equal.</t>
<t>no : no check is done between the field value in a packet and the field value in the rule; the field always matches.</t>
<t>lsb(L) : a field value of length T bits in a packet matches a field value in a rule if their most significant T-L bits are equal.</t>
</list>
</t>
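<t>
The three operators can be written as simple predicates. The Python sketch below is illustrative only; the function names and the explicit field length argument are assumptions made for the example.
</t>
<figure><artwork><![CDATA[
```python
def match_equal(field_value: int, rule_value: int) -> bool:
    """'=' : the packet and rule values must be equal."""
    return field_value == rule_value


def match_no(field_value: int, rule_value: int) -> bool:
    """'no' : no check is done, so any value is accepted."""
    return True


def match_lsb(field_value: int, rule_value: int,
              t_bits: int, l_bits: int) -> bool:
    """'lsb(L)' : the most significant T-L bits must be equal."""
    mask = (1 << t_bits) - 1     # keep only the T bits of the field
    return ((field_value & mask) >> l_bits
            == (rule_value & mask) >> l_bits)


# A 16-bit port compared with the rule value 8720 under lsb(4):
# the 16 ports 8720..8735 share the same 12 most significant bits.
print(match_lsb(8725, 8720, 16, 4))   # prints: True
print(match_lsb(8736, 8720, 16, 4))   # prints: False
```
]]></artwork></figure>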

</section>

<section title = "Action functions">
<t>
The action functions describe the action taken by the compressor and, inversely, the action taken by the decompressor to restore the original value.

<figure anchor="Fig-function"
title="SCHC action functions"><artwork><![CDATA[
/--------------------+-------------+--------------------------\
| Function           | Compression | Decompression            | 
|                    |             |                          | 
+--------------------+-------------+--------------------------+
|elided              |not sent     |use value stored in ctxt  |
|send-value          |send         |build field from value    |
|compute-IPv6-length |elided       |compute IPv6 length       |
|compute-UDP-length  |elided       |compute UDP length        |
|compute-UDP-checksum|elided       |compute UDP checksum      |
|ESiid-DID           |elided       |build IID from L2 ES addr |
|LCiid-DID           |elided       |build IID from L2 LC addr |
\--------------------+-------------+--------------------------/
   
]]></artwork></figure>

<xref target="Fig-function"/> lists all the functions defined to compress and decompress 
a field. The first column gives the function's name. The second and third columns outline the compression and decompression processes.
</t>

<t>
As with 6LoWPAN, the compression process may produce some data: fields that were not compressed (or were only partially compressed) are sent in the order in which they appear in the original packet. The information added by the compression phase must be aligned on byte boundaries as a whole, but each individual compression function may generate output of any size. 
</t>
<t>
<figure anchor="Fig--possible-function"
title="SCHC functions' example assignment for IPv6 and UDP"><artwork><![CDATA[
/-----------------+---------------------+----------------------------------------\
| Field           |Function             | Behavior                               |         
+-----------------+---------------------+----------------------------------------+
|IPv6 version     |elided               |The value is not sent, but each end     |
|IPv6 DiffServ    |                     |agrees on a value, which can be         | 
|IPv6 Flow Label  |                     |different from 0.                       |
|IPv6 Next Header |send-value           |Depending on the matching operator, the |
|                 |                     |entire field value is sent or an        |
|                 |                     |adjustment to the context value         |            
+-----------------+---------------------+----------------------------------------+ 
|IPv6 Length      |compute-IPv6-length  |Dedicated function to reconstruct value |
+-----------------+---------------------+----------------------------------------+
|IPv6 Hop Limit   |elided+no matching   |The receiver will put a value stored in |
|                 |                     |the context. It may be different from   |
|                 |                     |one originally sent, but in a star      |
|                 |                     |topology, there is no risk of loops     |
|                 |elided+matching      |Receiver and sender agree on the value. |
|                 |                     |If the value does not match, the rule   |
|                 |                     |is not selected                         |
|                 |send-value           |Explicitly sent                         |
+-----------------+---------------------+----------------------------------------+ 
|IPv6 ESPrefix    |elided               |The 64 bit prefix is stored on the ctxt |
|IPv6 LCPrefix    |send-value           |Explicitly send 64 bits on the link     |
+-----------------+---------------------+----------------------------------------+
|IPv6 ESiid       |elided               |IID is not sent, but stored in the ctxt |
|IPv6 LCiid       |ESiid-DID | LCiid-DID|IID is built from the ES Device ID      |
|                 |send-value           |IID is explicitly sent on the link. The |
|                 |                     |size depends on the L2 technology       |
+-----------------+---------------------+----------------------------------------+
|UDP ESport       |elided               |In the context                          |
|UDP LCport       |send-value           |Send the 2 bytes of the port number     |    
|                 |                     |or less if lsb matching is specified in |
|                 |                     |the matching operator.                  |   
+-----------------+---------------------+----------------------------------------+ 
|UDP length       |compute-UDP-length   |Dedicated function to reconstruct value |
+-----------------+---------------------+----------------------------------------+ 
|UDP Checksum     |compute-UDP-checksum |Dedicated function to reconstruct value |
+-----------------+---------------------+----------------------------------------+                 
]]></artwork></figure>

<xref target="Fig--possible-function"/> gives an example of function assignment to IPv6/UDP fields.
</t>
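<t>
The three compute functions can be illustrated from the decompressor's point of view. The Python sketch below assumes an IPv6 packet without extension headers, so that the IPv6 payload length equals the UDP length, and it uses the standard UDP checksum computed over the IPv6 pseudo-header. The function names and arguments are hypothetical and are not defined by this document.
</t>
<figure><artwork><![CDATA[
```python
import struct


def ones_complement_sum(data: bytes) -> int:
    """16-bit one's complement sum, padding odd input with a zero."""
    if len(data) % 2:
        data += b"\x00"
    total = 0
    for (word,) in struct.iter_unpack("!H", data):
        total += word
    while total >> 16:                      # fold the carries
        total = (total & 0xFFFF) + (total >> 16)
    return total


def rebuild_udp(src: bytes, dst: bytes, src_port: int,
                dst_port: int, payload: bytes):
    """Rebuild the lengths and checksum that were elided."""
    udp_length = 8 + len(payload)   # compute-UDP-length
    ipv6_length = udp_length        # compute-IPv6-length, no ext. hdr
    # Pseudo-header: addresses, 32-bit length, 3 zero bytes, NH = 17
    pseudo = src + dst + struct.pack("!I3xB", udp_length, 17)
    header = struct.pack("!HHHH", src_port, dst_port, udp_length, 0)
    checksum = ~ones_complement_sum(pseudo + header + payload)
    checksum &= 0xFFFF              # compute-UDP-checksum
    if checksum == 0:
        checksum = 0xFFFF           # zero is transmitted as all ones
    return ipv6_length, udp_length, checksum


src = bytes.fromhex("fe80" + "00" * 13 + "02")   # fe80::2
dst = bytes.fromhex("fe80" + "00" * 13 + "01")   # fe80::1
lengths = rebuild_udp(src, dst, 123, 124, b"hello")[:2]
print(lengths)                                   # prints: (13, 13)
```
]]></artwork></figure>
<t>
The payload length used here is assumed to be known from the size of the received Layer 2 frame, which is why none of these three fields needs to be sent on the link.
</t>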

<section title="Action function descriptions">
<section title="Elided">
<t>
The compressor does not send the field value on the link. The decompressor restores the field value from the one stored in the matched rule. 
</t>
</section>

<section title="Send-value">
<t>
The compressor sends the whole field value on the link if the matching operator is "=". Otherwise, the matching operator indicates which part of the value is sent on the link. For the lsb operator, only the least significant bits are sent. 
</t>
</section>
<section title="ESiid-DID, LCiid-DID">
<t>
These functions are used to process the End System and the LC Device Identifiers (DID), respectively.
The IID value is computed from the device ID present in the Layer 2 header. The computation depends on the technology and on the device ID size. 
</t>
</section>
</section>
</section>

<section anchor="compressIPv6" title="Examples">
<t>
This section gives some scenarios of the compression mechanism for IPv6/UDP. 
The goal is to illustrate the SCHC behaviour.
</t>

<section title="IPv6/UDP compression in a star topology">
<t>
The most common case will be an LPWA end-system that embeds some applications running over 
CoAP. In this example, the first flow is used for device management based on CoAP, using 
link-local addresses and UDP ports 123 and 124.

The second flow is a CoAP server for measurements made by the end-system, using port 5683 and global addresses alpha::IID/64 to beta::1/64.


The last flow is for legacy applications using different port numbers; the destination is gamma::1/64. 
</t>
<t>

<xref target="FigStack" /> presents the protocol stack for this end-system. IPv6 and UDP are represented with dotted lines since these protocols are compressed on the radio link.
The rule ID is represented by a shim id (respectively 0, 1 and 2). 

<figure anchor="FigStack"
title="Simplified Protocol Stack for LP-WAN"><artwork><![CDATA[

 Management   Data
+----------+---------+---------+
|   CoAP   |  CoAP   | legacy  |
+----||----+---||----+---||----+
.   UDP    .  UDP    |   UDP   | 
................................
.   IPv6   .  IPv6   .  IPv6   .
+--SHIM0------SHIM1-----SHIM2--+
|      6LPWA L2 technologies   |
+------------------------------+  
      End System or LPWA GW

]]></artwork></figure>


</t>
<t>
Note that in some LPWA technologies, only End Systems have a device ID. Therefore,
it is necessary to statically define an IID for the link-local address of the LPWA Compressor. 
</t>
<t>
<figure anchor="Fig-fields"
title="Context rules for the three example flows"><artwork><![CDATA[
  +----------------+---------+--------+--------+-------------++------+
  | Field          | Value   | Match  | Match  | Function    || Sent |
  |                |         | comp.  | decomp.|             ||      |
  +----------------+---------+-----------------+-------------++------+
  |LPWA SHIM       |0        | No     | =      | send-value  || 0    |
  |ESDevice-ID     |dev-id   | No     | =      | elided      ||      |
  +================+=========+========+========+=============++======+
  |IPv6 version    |6        | =      | No     | elided      ||      |     
  |IPv6 DiffServ   |0        | =      | No     | elided      ||      |
  |IPv6 Flow Label |0        | =      | No     | elided      ||      |
  |IPv6 Length     |XXXXXXXXX| No     | No     | comp-IPv6-l ||      |
  |IPv6 Next Header|17       | =      | No     | elided      ||      |
  |IPv6 Hop Limit  |255      | No     | No     | elided      ||      |
  |IPv6 ESprefix   |FE80::/64| =      | No     | elided      ||      |
  |IPv6 ESiid      |         | No     | No     | ESiid-DID   ||      |
  |IPv6 LCprefix   |FE80::/64| =      | No     | elided      ||      |
  |IPv6 LCiid      |::1      | =      | No     | elided      ||      |
  +================+=========+========+========+=============++======+
  |UDP ESport      |123      | =      | No     | elided      ||      |
  |UDP LCport      |124      | =      | No     | elided      ||      |
  |UDP Length      |XXXXXXXXX| No     | No     | comp-UDP-l  ||      |
  |UDP checksum    |XXXXXXXXX| No     | No     | comp-UDP-c  ||      |
  +================+=========+========+========+=============++======+
  
  +----------------+---------+--------+--------+-------------++------+
  | Field          | Value   | Match  | Match  | Function    || Sent |
  |                |         | comp.  | decomp.|             ||      |
  +----------------+---------+-----------------+-------------++------+
  |LPWA SHIM       |1        | No     | =      | send-value  || 1    |
  |ESDevice-ID     |dev-id   | No     | =      | elided      ||      |
  +================+=========+========+========+=============++======+
  |IPv6 version    |6        | =      | No     | elided      ||      |     
  |IPv6 DiffServ   |0        | =      | No     | elided      ||      |
  |IPv6 Flow Label |0        | =      | No     | elided      ||      |
  |IPv6 Length     |XXXXXXXXX| No     | No     | comp-IPv6-l ||      |
  |IPv6 Next Header|17       | =      | No     | elided      ||      |
  |IPv6 Hop Limit  |255      | No     | No     | elided      ||      |
  |IPv6 ESprefix   |alpha/64 | =      | No     | elided      ||      |
  |IPv6 ESiid      |         | No     | No     | ESiid-DID   ||      |
  |IPv6 LCprefix   |beta/64  | =      | No     | elided      ||      |
  |IPv6 LCiid      |::1000   | =      | No     | elided      ||      |
  +================+=========+========+========+=============++======+
  |UDP ESport      |5683     | =      | No     | elided      ||      |
  |UDP LCport      |5683     | =      | No     | elided      ||      |
  |UDP Length      |XXXXXXXXX| No     | No     | comp-UDP-l  ||      |
  |UDP checksum    |XXXXXXXXX| No     | No     | comp-UDP-c  ||      |
  +================+=========+========+========+=============++======+

  +----------------+---------+--------+--------+-------------++------+
  | Field          | Value   | Match  | Match  | Function    || Sent |
  |                |         | comp.  | decomp.|             ||      |
  +----------------+---------+-----------------+-------------++------+
  |LPWA SHIM       |2        | No     | =      | send-value  || 2    |
  |ESDevice-ID     |dev-id   | No     | =      | elided      ||      |
  +================+=========+========+========+=============++======+
  |IPv6 version    |6        | =      | No     | elided      ||      |     
  |IPv6 DiffServ   |0        | =      | No     | elided      ||      |
  |IPv6 Flow Label |0        | =      | No     | elided      ||      |
  |IPv6 Length     |XXXXXXXXX| No     | No     | comp-IPv6-l ||      |
  |IPv6 Next Header|17       | =      | No     | elided      ||      |
  |IPv6 Hop Limit  |255      | No     | No     | elided      ||      |
  |IPv6 ESprefix   |alpha/64 | =      | No     | elided      ||      |
  |IPv6 ESiid      |         | No     | No     | ESiid-DID   ||      |
  |IPv6 LCprefix   |gamma/64 | =      | No     | elided      ||      |
  |IPv6 LCiid      |::1000   | =      | No     | elided      ||      |
  +================+=========+========+========+=============++======+
  |UDP ESport      |8720     | lsb(4) | No     | send-value  || lsb  |
  |UDP LCport      |8720     | lsb(4) | No     | send-value  || lsb  |
  |UDP Length      |XXXXXXXXX| No     | No     | comp-UDP-l  ||      |
  |UDP checksum    |XXXXXXXXX| No     | No     | comp-UDP-c  ||      |
  +================+=========+========+========+=============++======+

 
]]></artwork></figure>
  
All the fields described in the three rules of <xref target="Fig-fields"/> are present in the IPv6 and UDP headers, except the first two. These two fields have been added at the beginning of each rule; they are used to identify the rule id for decompression when the other end receives the compressed header. The shim id is read either from the L2 header or from the first byte of the payload, depending on the technology. The ESDevice-ID value is found in the L2 header.
</t>
<t>
The second and third rules use global addresses. The way the ES learns the prefix is out of the scope of this document. One possible way is to use a management protocol to set the prefix used on the LPWA network in the rules at both ends.
</t>
<t>
The third rule compresses each port number to 4 bits. This value is selected to maintain alignment on byte boundaries for the compressed header: the two 4-bit residues together fill exactly one byte.
</t>
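<t>
As a worked example of the third rule, the Python sketch below packs the 4 least significant bits of both ports into a single byte and rebuilds the ports from the context value 8720. The function names and the bit ordering (ES port in the high-order bits) are assumptions made for illustration; this document does not define them.
</t>
<figure><artwork><![CDATA[
```python
L_BITS = 4                      # lsb(4), as in the third rule
LOW_MASK = (1 << L_BITS) - 1    # 0x0F


def pack_ports(es_port: int, lc_port: int) -> bytes:
    """Compressor: send only the 4 low bits of each port."""
    return bytes([((es_port & LOW_MASK) << L_BITS)
                  | (lc_port & LOW_MASK)])


def unpack_ports(data: bytes, ctx_es: int, ctx_lc: int):
    """Decompressor: high bits from the context, low bits received."""
    es_low = data[0] >> L_BITS
    lc_low = data[0] & LOW_MASK
    return ((ctx_es & ~LOW_MASK) | es_low,
            (ctx_lc & ~LOW_MASK) | lc_low)


residue = pack_ports(8723, 8730)
print(residue.hex())                        # prints: 3a
print(unpack_ports(residue, 8720, 8720))    # prints: (8723, 8730)
```
]]></artwork></figure>
<t>
With lsb(4), each rule value covers 16 consecutive port numbers (8720 to 8735 here), and the two ports cost one byte on the link instead of four.
</t>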
</section>
</section>

<section title="Acknowledgements">
<t>
Thanks to Dominique Barthel, Alexander Pelov and Juan Carlos Zuniga for useful design
considerations.
</t>
</section>
</middle>
<back>

    <references title="Normative References">

      &rfc4944;
      &rfc4997;
      &rfc6282;

	  &gapAna; 
  
    </references>


</back>

</rfc>
