<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?rfc toc="yes"?>
<?rfc tocompact="yes"?>
<?rfc tocdepth="3"?>
<?rfc tocindent="yes"?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="yes"?>
<?rfc comments="yes"?>
<?rfc inline="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<rfc category="std" docName="draft-ietf-avtext-rtp-stream-pause-03"
     ipr="trust200902" updates="5104">
  <front>
    <title abbrev="RTP Stream Pause">RTP Stream Pause and Resume</title>

    <author fullname="Bo Burman" initials="B." surname="Burman">
      <organization>Ericsson</organization>

      <address>
        <postal>
          <street>Kistavagen 25</street>

          <city>SE - 164 80 Kista</city>

          <region/>

          <code/>

          <country>Sweden</country>
        </postal>

        <phone>+46107141311</phone>

        <facsimile/>

        <email>bo.burman@ericsson.com</email>

        <uri>www.ericsson.com</uri>
      </address>
    </author>

    <author fullname="Azam Akram" initials="A." surname="Akram">
      <organization>Ericsson</organization>

      <address>
        <postal>
          <street>Farogatan 6</street>

          <city>SE - 164 80 Kista</city>

          <region/>

          <code/>

          <country>Sweden</country>
        </postal>

        <phone>+46107142658</phone>

        <facsimile/>

        <email>muhammad.azam.akram@ericsson.com</email>

        <uri>www.ericsson.com</uri>
      </address>
    </author>

    <author fullname="Roni Even" initials="R." surname="Even">
      <organization>Huawei Technologies</organization>

      <address>
        <postal>
          <street/>

          <city>Tel Aviv</city>

          <region/>

          <code/>

          <country>Israel</country>
        </postal>

        <phone/>

        <facsimile/>

        <email>roni.even@mail01.huawei.com</email>

        <uri/>
      </address>
    </author>

    <author fullname="Magnus Westerlund" initials="M." surname="Westerlund">
      <organization>Ericsson</organization>

      <address>
        <postal>
          <street>Farogatan 6</street>

          <city>SE- 164 80 Kista</city>

          <region/>

          <code/>

          <country>Sweden</country>
        </postal>

        <phone>+46107148287</phone>

        <facsimile/>

        <email>magnus.westerlund@ericsson.com</email>
      </address>
    </author>

    <date day="14" month="October" year="2014"/>

    <abstract>
      <t>With the increased popularity of real-time multimedia applications,
      it is desirable to provide good control of resource usage, and users
      also demand more control over communication sessions. This document
      describes how a receiver in a multimedia conversation can pause and
      resume incoming data from a sender by sending real-time feedback
      messages when using Real-time Transport Protocol (RTP) for real time
      data transport. This document extends the Codec Control Messages (CCM)
      RTCP feedback package by explicitly allowing and describing specific use
      of existing CCM messages and adding a group of new real-time feedback
      messages used to pause and resume RTP data streams. This document
      updates RFC 5104.</t>
    </abstract>
  </front>

  <middle>
    <section anchor="sec-intro" title="Introduction">
      <t>As real-time communication attracts more people, more applications
      are created; multimedia conversation applications being one example.
      Multimedia conversation further exists in many forms, for example,
      peer-to-peer chat application and multiparty video conferencing
      controlled by central media nodes, such as RTP Mixers.</t>

      <t>Multimedia conferencing may involve many participants; each has its
      own preferences for the communication session, not only at the start but
      also during the session. This document describes several scenarios in
      multimedia communication where a conferencing node or participant
      chooses to temporarily pause an incoming <xref
      target="RFC3550">RTP</xref> stream and later resume it when needed. The
      receiver does not need to terminate or inactivate the RTP session and
      start all over again by negotiating the session parameters, for example
      using <xref target="RFC3261">SIP</xref> with <xref target="RFC3264">SDP
      Offer/Answer</xref>.</t>

      <t>Centralized nodes, like RTP Mixers or MCUs, which either uses logic
      based on voice activity, other measurements, or user input could reduce
      the resources consumed in both the sender and the network by temporarily
      pausing the RTP streams that aren't required by the RTP Mixer. If the
      number of conference participants are greater than what the conference
      logic has chosen to present simultaneously to receiving participants,
      some participant RTP streams sent to the RTP Mixer may not need to be
      forwarded to any other participant. Those RTP streams could then be
      temporarily paused. This becomes especially useful when the media
      sources are provided in <xref
      target="I-D.westerlund-avtcore-rtp-simulcast">multiple encoding versions
      (Simulcast)</xref> or with Multi-Session Transmission (MST) of scalable
      encoding such as <xref target="RFC6190">SVC</xref>. There may be some of
      the defined encodings or combination of scalable layers that are not
      used or cannot be used all of the time, for example due to temporarily
      limited network or processing resources, and a centralized node may
      choose to pause such RTP streams without being requested to do so, but
      anyway send an explicit indication that the stream is paused.</t>

      <t>As the RTP streams required at any given point in time is highly
      dynamic in such scenarios, using the out-of-band signaling channel for
      pausing, and even more importantly resuming, an RTP stream is difficult
      due to the performance requirements. Instead, the pause and resume
      signaling should be in the media plane and go directly between the
      affected nodes. When using <xref target="RFC3550">RTP </xref> for media
      transport, using <xref target="RFC4585">Extended RTP Profile for
      Real-time Transport Control Protocol (RTCP)-Based Feedback
      (RTP/AVPF)</xref> appears appropriate. No currently existing RTCP
      feedback message explicitly supports pausing and resuming an incoming
      RTP stream. As this affects the generation of packets and may even allow
      the encoding process to be paused, the functionality appears to match
      <xref target="RFC5104">Codec Control Messages in the RTP Audio-Visual
      Profile with Feedback (AVPF)</xref> and it is proposed to define the
      solution as a Codec Control Message (CCM) extension.</t>

      <t>The Temporary Maximum Media Bitrate Request (TMMBR) message of CCM is
      used by video conferencing systems for flow control. It is desirable to
      be able to use that method with a bitrate value of zero for pause,
      whenever possible.</t>
    </section>

    <section anchor="sec-def" title="Definitions">
      <section title="Abbreviations">
        <t><list style="hanging">
            <t hangText="3GPP:">3rd Generation Partnership Project</t>

            <t hangText="AVPF:">Audio-Visual Profile with Feedback (RFC
            4585)</t>

            <t hangText="BGW:">Border Gateway</t>

            <t hangText="CCM:">Codec Control Messages (RFC 5104)</t>

            <t hangText="CNAME:">Canonical Name (RTCP SDES)</t>

            <t hangText="CSRC:">Contributing Source (RTP)</t>

            <t hangText="FB:">Feedback (AVPF)</t>

            <t hangText="FCI:">Feedback Control Information (AVPF)</t>

            <t hangText="FIR:">Full Intra Refresh (CCM)</t>

            <t hangText="FMT:">Feedback Message Type (AVPF)</t>

            <t hangText="LTE:">Long-Term Evolution (3GPP)</t>

            <t hangText="MCU:">Multipoint Control Unit</t>

            <t hangText="MTU:">Maximum Transfer Unit</t>

            <t hangText="PT:">Payload Type (RTP)</t>

            <t hangText="RTP:">Real-time Transport Protocol (RFC 3550)</t>

            <t hangText="RTCP:">RTP Control Protocol (RFC 3550)</t>

            <t hangText="RTCP RR:">RTCP Receiver Report</t>

            <t hangText="SDP:">Session Description Protocol (RFC 4566)</t>

            <t hangText="SGW:">Signaling Gateway</t>

            <t hangText="SIP:">Session Initiation Protocol (RFC 3261)</t>

            <t hangText="SSRC:">Synchronization Source (RTP)</t>

            <t hangText="SVC:">Scalable Video Coding</t>

            <t hangText="TCP:">Transmission Control Protocol (RFC 793)</t>

            <t hangText="TMMBR:">Temporary Maximum Media Bitrate Request
            (CCM)</t>

            <t hangText="TMMBN:">Temporary Maximum Media Bitrate Notification
            (CCM)</t>

            <t hangText="UA:">User Agent (SIP)</t>

            <t hangText="UDP:">User Datagram Protocol (RFC 768)</t>
          </list></t>
      </section>

      <section title="Terminology">
        <t>In addition to the following, the definitions from <xref
        target="RFC3550">RTP</xref>, <xref target="RFC4585">AVPF</xref>, <xref
        target="RFC5104">CCM</xref>, and <xref
        target="I-D.ietf-avtext-rtp-grouping-taxonomy">RTP Taxonomy</xref>
        also apply in this document.</t>

        <t><list style="hanging">
            <t hangText="Feedback Messages:"><xref target="RFC5104">CCM</xref>
            categorized different RTCP feedback messages into four types,
            Request, Command, Indication and Notification. This document
            places the PAUSE and RESUME messages into Request category, PAUSED
            as Indication and REFUSED as Notification.<list style="hanging">
                <t hangText="PAUSE">Request from an RTP stream receiver to
                pause a stream</t>

                <t hangText="RESUME">Request from an RTP stream receiver to
                resume a paused stream</t>

                <t hangText="PAUSED">Indication from an RTP stream sender that
                a stream is paused</t>

                <t hangText="REFUSED">Indication from an RTP stream sender
                that a PAUSE or RESUME request will not be honored</t>
              </list></t>

            <t hangText="Mixer:">The intermediate RTP node which receives an
            RTP stream from different end points, combines them to make one
            RTP stream and forwards to destinations, in the sense described in
            Topo-Mixer of <xref
            target="I-D.ietf-avtcore-rtp-topologies-update">RTP
            Topologies</xref>.</t>

            <t hangText="Participant:">A member which is part of an RTP
            session, acting as receiver, sender or both.</t>

            <t hangText="Paused sender:">An RTP stream sender that has stopped
            its transmission, i.e. no other participant receives its RTP
            transmission, either based on having received a PAUSE request,
            defined in this specification, or based on a local decision.</t>

            <t hangText="Pausing receiver:">An RTP stream receiver which sends
            a PAUSE request, defined in this specification, to other
            participant(s).</t>

            <t hangText="Stream:">Used as a short term for RTP stream, unless
            otherwise noted.</t>

            <t hangText="Stream receiver:">Short for RTP stream receiver; the
            RTP entity responsible for receiving an RTP stream, usually a
            Media Depacketizer.</t>

            <t hangText="Stream sender:">Short for RTP stream sender; the RTP
            entity responsible for creating an RTP stream, usually a Media
            Packetizer.</t>
          </list></t>
      </section>

      <section title="Requirements Language">
        <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
        "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
        document are to be interpreted as described in <xref
        target="RFC2119">RFC 2119</xref>.</t>
      </section>
    </section>

    <section anchor="sec-use-cases" title="Use Cases">
      <t>This section discusses the main use cases for RTP stream pause and
      resume.</t>

      <section anchor="sec-point-to-point" title="Point to Point">
        <t>This is the most basic use case with an RTP session containing two
        End Points. Each End Point sends one or more streams.</t>

        <figure align="center" anchor="fig-point-to-point"
                title="Point to Point">
          <artwork align="center"><![CDATA[
+---+         +---+
| A |<------->| B |
+---+         +---+
]]></artwork>
        </figure>

        <t>The usage of RTP stream pause in this use case is to temporarily
        halt delivery of streams that the sender provides but the receiver
        does not currently use. This can for example be due to minimized
        applications where the video stream is not actually shown on any
        display, and neither is it used in any other way, such as being
        recorded.</t>

        <t>In this case, since there is only a single receiver of the stream,
        pausing or resuming a stream does not impact anyone else than the
        sender and the single receiver of that stream.</t>

        <t>RTCWEB WG's <xref
        target="I-D.ietf-rtcweb-use-cases-and-requirements">use case and
        requirements document</xref> defines the following API requirements in
        Appendix A, used also by W3C WebRTC WG:<list style="hanging">
            <t hangText="A8">The Web API must provide means for the web
            application to mute/unmute a stream or stream component(s). When a
            stream is sent to a peer mute status must be preserved in the
            stream received by the peer.</t>

            <t hangText="A9">The Web API must provide means for the web
            application to cease the sending of a stream to a peer.</t>
          </list>This memo provides means to optimize transport usage by stop
        sending muted streams and start sending again when unmuting.</t>
      </section>

      <section anchor="sec-mixer-to-media-sender"
               title="RTP Mixer to Media Sender">
        <t>One of the most commonly used topologies in centralized
        conferencing is based on the <xref
        target="I-D.ietf-avtcore-rtp-topologies-update">RTP Mixer</xref>. The
        main reason for this is that it provides a very consistent view of the
        RTP session towards each participant. That is accomplished through the
        Mixer originating its&acute; own streams, identified by SSRC, and any
        RTP streams sent to the participants will be sent using those SSRCs.
        If the Mixer wants to identify the underlying media sources for
        its&acute; conceptual streams, it can identify them using CSRC. The
        stream the Mixer provides can be an actual mix of multiple media
        sources, but it might also be switching received streams as described
        in Sections 3.6-3.8 of <xref
        target="I-D.ietf-avtcore-rtp-topologies-update"/>.</t>

        <figure align="center" anchor="fig-mixer"
                title="RTP Mixer in Unicast-only">
          <artwork align="center"><![CDATA[
+---+      +-----------+      +---+
| A |<---->|           |<---->| B |
+---+      |           |      +---+
           |   Mixer   |           
+---+      |           |      +---+
| C |<---->|           |<---->| D |
+---+      +-----------+      +---+
]]></artwork>
        </figure>

        <t>Which streams that are delivered to a given receiver, A, can depend
        on several things. It can either be the RTP Mixer&acute;s own logic
        and measurements such as voice activity on the incoming audio streams.
        It can be that the number of sent media sources exceed what is
        reasonable to present simultaneously at any given receiver. It can
        also be a human controlling the conference that determines how the
        media should be mixed; this would be more common in lecture or similar
        applications where regular listeners may be prevented from breaking
        into the session unless approved by the moderator. The streams may
        also be part of a <xref target="I-D.westerlund-avtcore-rtp-simulcast">
        Simulcast</xref> or <xref target="RFC6190">scalable encoded (for
        Multi-Session Transmission)</xref>, thus providing multiple versions
        that can be delivered by the RTP stream sender. These examples
        indicate that there are numerous reasons why a particular stream would
        not currently be in use, but must be available for use at very short
        notice if any dynamic event occurs that causes a different stream
        selection to be done in the Mixer.</t>

        <t>Because of this, it would be highly beneficial if the Mixer could
        request to pause a particular stream from being delivered to it. It
        also needs to be able to resume delivery with minimal delay.</t>

        <t>In some cases, especially when the Mixer sends multiple RTP streams
        per receiving client, there may be situations that makes it desirable
        to the Mixer to pause some of its sent RTP streams, even without being
        explicitly asked to do so by the receiving client. Such situations can
        for example be caused by a temporary lack of available Mixer network
        or processing resources. An RTP stream receiver that no longer
        receives an RTP stream could interpret this as an error condition and
        try to take action to re-establish the RTP stream. Such action would
        likely be undesirable if the RTP stream was in fact deliberately
        paused by the Mixer. Undesirable RTP stream receiver actions could be
        avoided if the Mixer is able to explicitly indicate that an RTP stream
        is deliberately paused.</t>

        <t>Just as for <xref
        target="sec-point-to-point">point-to-point</xref>, there is only a
        single receiver of the stream, the RTP Mixer, and pausing or resuming
        a stream does not affect anyone else than the sender and single
        receiver of that stream.</t>
      </section>

      <section anchor="sec-mixer-to-media-sender-multipoint"
               title="RTP Mixer to Media Sender in Point-to-Multipoint">
        <t>This use case is similar to the previous section, however the RTP
        Mixer is involved in three domains that need to be separated; the
        Multicast Network (including participants A and C), participant B, and
        participant D. The difference from above is that A and C share a
        multicast domain, which is depicted below.</t>

        <figure align="center" anchor="fig-mixer-multipoint"
                title="RTP Mixer in Point-to-Multipoint">
          <artwork align="center"><![CDATA[           +-----+
+---+     /       \     +-----------+      +---+
| A |<---/         \    |           |<---->| B |
+---+   /   Multi-  \   |           |      +---+
       +    Cast     +->|   Mixer   |
+---+   \  Network  /   |           |      +---+
| C |<---\         /    |           |<---->| D |
+---+     \       /     +-----------+      +---+
           +-----+
]]></artwork>
        </figure>

        <t>If the RTP Mixer pauses a stream from A, it will not only pause the
        stream towards itself, but will also stop the stream from arriving to
        C, which C is heavily impacted by, might not approve of, and should
        thus have a say on.</t>

        <t>If the Mixer resumes a paused stream from A, it will be resumed
        also towards C. In this case, if C is not interested it can simply
        ignore the stream and is not impacted as much as above.</t>

        <t>In this use case there are several receivers of a stream and
        special care must be taken as not to pause a stream that is still
        wanted by some receivers.</t>
      </section>

      <section anchor="sec-media-receiver-to-mixer"
               title="Media Receiver to RTP Mixer">
        <t>An End Point in <xref target="fig-mixer"/> could potentially
        request to pause the delivery of a given stream. Possible reasons
        include the ones in the <xref target="sec-point-to-point">point to
        point case</xref> above.</t>

        <t>When the RTP Mixer is only connected to individual unicast paths,
        the use case and any considerations are identical to the point to
        point use case.</t>

        <t>However, when the End Point requesting stream pause is connected to
        the RTP Mixer through a multicast network, such as A or C in <xref
        target="fig-mixer-multipoint"/>, the use case instead becomes
        identical to the one in <xref
        target="sec-mixer-to-media-sender-multipoint"/>, only with reverse
        direction of the streams and pause/resume requests.</t>
      </section>

      <section anchor="sec-across-mixer"
               title="Media Receiver to Media Sender Across RTP Mixer">
        <t>An End Point, like A in <xref target="fig-mixer"/>, could
        potentially request to pause the delivery of a given stream, like one
        of B's, over any of the SSRCs used by the Mixer by sending a pause
        request for the CSRC identifying the stream. However, the authors are
        of the opinion that this is not a suitable solution, for several
        reasons:<list style="numbers">
            <t>The Mixer might not include CSRC in it&acute;s stream
            indications.</t>

            <t>An End Point cannot rely on the CSRC to correctly identify the
            stream to be paused when the delivered media is some type of mix.
            A more elaborate stream identification solution is needed to
            support this in the general case.</t>

            <t>The End Point cannot determine if a given stream is still
            needed by the RTP Mixer to deliver to another session
            participant.</t>
          </list></t>

        <t>Due to the above reasons, we exclude this use case from further
        consideration.</t>
      </section>
    </section>

    <section anchor="sec-design" title="Design Considerations">
      <t>This section describes the requirements that this specification needs
      to meet.</t>

      <section anchor="sec-real-time" title="Real-time Nature">
        <t><xref target="sec-intro">The first section</xref> of this
        specification describes some possible reasons why a receiver may pause
        an RTP sender. Pausing and resuming is time-dependent, i.e. a receiver
        may choose to pause an RTP stream for a certain duration, after which
        the receiver may want the sender to resume. This time dependency means
        that the messages related to pause and resume must be transmitted to
        the sender in real-time in order for them to be purposeful. The pause
        operation is arguably not very time critical since it mainly provides
        a reduction of resource usage. Timely handling of the resume operation
        is however likely to directly impact the end-user's perceived quality
        experience, since it affects the availability of media that the user
        expects to receive more or less instantly.</t>
      </section>

      <section anchor="sec-direction" title="Message Direction">
        <t>It is the responsibility of an RTP stream receiver, who wants to
        pause or resume a stream from the sender(s), to transmit PAUSE and
        RESUME messages. An RTP stream sender who likes to pause itself, can
        often simply do it, but sometimes this will adversely affect the
        receiver and an explicit indication that the RTP stream is paused may
        then help. Any indication that an RTP stream is paused is the
        responsibility of the RTP stream sender and may in some cases not even
        be needed by the stream receiver.</t>
      </section>

      <section anchor="sec-individual" title="Apply to Individual Sources">
        <t>The PAUSE and RESUME messages apply to single RTP streams
        identified by their SSRC, which means the receiver targets the
        sender's SSRC in the PAUSE and RESUME requests. If a paused sender
        starts sending with a new SSRC, the receivers will need to send a new
        PAUSE request in order to pause it. PAUSED indications refer to a
        single one of the sender's own, paused SSRC.</t>
      </section>

      <section anchor="sec-consensus" title="Consensus">
        <t>An RTP stream sender should not pause an SSRC that some receiver
        still wishes to receive. The reason is that in RTP topologies where
        the stream is shared between multiple receivers, a single receiver on
        that shared network, independent of it being multicast, a mesh with
        joint RTP session or a transport Translator based, must not
        single-handedly cause the stream to be paused without letting all
        other receivers to voice their opinions on whether or not the stream
        should be paused. A consequence of this is that a newly joining
        receiver, for example indicated by an RTCP Receiver Report containing
        both a new SSRC and a CNAME that does not already occur in the
        session, firstly needs to learn the existence of paused streams, and
        secondly should be able to resume any paused stream. Any single
        receiver wanting to resume a stream should also cause it to be
        resumed. An important exception to this is when the RTP stream sender
        is aware of conditions that makes it desirable or even necessitates to
        pause the RTP stream on its own behalf, without being explicitly asked
        to do so. Such local consideration in the RTP sender takes precedence
        over RTP receiver wishes to receive the stream.</t>
      </section>

      <section anchor="sec-acks" title="Message Acknowledgments">
        <t>RTP and RTCP does not guarantee reliable data transmission. It uses
        whatever assurance the lower layer transport protocol can provide.
        However, this is commonly UDP that provides no reliability guarantees.
        Thus it is possible that a PAUSE and/or RESUME message transmitted
        from an RTP End Point does not reach its destination, i.e. the
        targeted RTP stream sender. When PAUSE or RESUME reaches the RTP
        stream sender and are effective, i.e., an active RTP stream sender
        pauses, or a resuming RTP stream sender have media data to transmit,
        it is immediately seen from the arrival or non-arrival of RTP packets
        for that RTP stream. Thus, no explicit acknowledgments are required in
        this case.</t>

        <t>In some cases when a PAUSE or RESUME message reaches the RTP stream
        sender, it will not be able to pause or resume the stream due to some
        local consideration, for example lack of data to transmit. This error
        condition, a negative acknowledgment, may be needed to avoid
        unnecessary <xref target="sec-retransmit">retransmission of
        requests</xref>.</t>
      </section>

      <section anchor="sec-retransmit" title="Request Retransmission">
        <t>When the stream is not affected as expected by a PAUSE or RESUME
        request, the request may have been lost and the sender of the request
        will need to retransmit it. The retransmission should take the round
        trip time into account, and will also need to take the normal RTCP
        bandwidth and timing rules applicable to the RTP session into account,
        when scheduling retransmission of feedback.</t>

        <t>When it comes to resume requests that are more time critical, the
        best resume performance may be achieved by repeating the request as
        often as possible until a sufficient number have been sent to reach a
        high probability of request delivery, or the stream gets
        delivered.</t>
      </section>

      <section anchor="sec-sequence" title="Sequence Numbering">
        <t>A PAUSE request message will need to have a sequence number to
        separate retransmissions from new requests. A retransmission keeps the
        sequence number unchanged, while it is incremented every time a new
        PAUSE request is transmitted that is not a retransmission of a
        previous request.</t>

        <t>Since RESUME always takes precedence over PAUSE and are even
        allowed to avoid pausing a stream, there is a need to keep strict
        ordering of PAUSE and RESUME. Thus, RESUME needs to share sequence
        number space with PAUSE and implicitly references which PAUSE it
        refers to. For the same reasons, the explicit PAUSED indication also
        needs to share sequence number space with PAUSE and RESUME.</t>
      </section>

      <section anchor="sec-other-solutions"
               title="Relation to Other Solutions">
        <t>A performance comparison between SIP/SDP and RTCP signaling
        technologies was made and included in draft versions of this
        specification. Using SIP and <xref target="RFC4566">SDP</xref> to
        carry pause and resume information means that it will need to traverse
        the entire signaling path to reach the signaling destination (either
        the remote End Point or the entity controlling the RTP Mixer), across
        any signaling proxies that potentially also has to process the SDP
        content to determine if they are expected to act on it. The amount of
        bandwidth required for a SIP/SDP-based signaling solution is in the
        order of at least 10 times more than an RTCP-based solution.
        Especially for UA sitting on mobile wireless access, this will risk
        introducing delays that are <xref target="sec-real-time">too
        long</xref> to provide a good user experience, and the bandwidth cost
        may also be considered infeasible compared to an RTCP-based solution.
        RTCP data is sent through the media path, which is likely shorter
        (contains fewer intermediate nodes) than the signaling path, may
        anyway have to traverse a few intermediate nodes. The amount of
        processing and buffering required in intermediate nodes to forward
        those RTCP messages is however believed to be significantly less than
        for intermediate nodes in the signaling path. Based on those
        considerations, RTCP is chosen as signaling protocol for the pause and
        resume functionality.</t>
      </section>
    </section>

    <section anchor="sec-overview" title="Solution Overview">
      <t>The proposed solution implements PAUSE and RESUME functionality based
      on sending AVPF RTCP feedback messages from any RTP session participant
      that wants to pause or resume a stream targeted at the stream sender, as
      identified by the sender SSRC.</t>

      <t>It is proposed to re-use <xref target="RFC5104">CCM TMMBR and
      TMMBN</xref> to the extent possible, and to define a small set of new
      RTCP feedback messages where new semantics is needed.</t>

      <t>A single Feedback message specification is used to implement the new
      messages. The message consists of a number of Feedback Control
      Information (FCI) blocks, where each block can be a PAUSE request, a
      RESUME request, PAUSED indication, a REFUSED response, or an extension
      to this specification. This structure allows a single feedback message
      to handle pause functionality on a number of streams.</t>

      <t>The PAUSED functionality is also defined in such a way that it can be
      used standalone by the RTP stream sender to indicate a local decision to
      pause, and inform any receiver of the fact that halting media delivery
      is deliberate and which RTP packet was the last transmitted.</t>

      <t>Special considerations that apply when using TMMBR/TMMBN for pause
      and resume purposes are described in <xref
      target="sec-overview-tmmbr"/>. This specification applies to both the
      new messages defined in herein as well as their TMMBR/TMMBN
      counterparts, except when explicitly stated otherwise. An obvious
      exception are any reference to the message parameters that are only
      available in the messages defined here. For example, any reference to
      PAUSE in the text below is equally applicable to TMMBR 0, and any
      reference to PAUSED is equally applicable to TMMBN 0. Therefore and for
      brevity, TMMBR/TMMBN will not be mentioned in the text, unless there is
      specific reason to do so.</t>

      <t>This section is intended to be explanatory and therefore
      intentionally contains no mandatory statements. Such statements can
      instead be found in other parts of this specification.</t>

      <section anchor="sec-overview-cap" title="Expressing Capability">
        <t>An End Point can use an extension to CCM SDP signaling to declare
        capability to understand the messages defined in this specification.
        Capability to understand only a subset of messages is possible, to
        support partial implementation, which is specifically believed to be
        feasible for the <xref target="sec-mixer-to-media-sender">RTP Mixer to
        Media Sender use case</xref>.</t>

        <t>For the case when TMMBR/TMMBN are used for pause and resume
        purposes, it is possible to explicitly express joint support for TMMBR
        and TMMBN, but not for TMMBN only.</t>
      </section>

      <section anchor="sec-overview-pause" title="Requesting to Pause">
        <t>An RTP stream receiver can choose to request PAUSE at any time,
        subject to AVPF timing rules.</t>

        <t>The PAUSE request contains a PauseID, which is incremented by one
        (in modulo arithmetic) with each PAUSE request that is not a
        re-transmission. The PauseID is scoped by and thus a property of the
        targeted RTP stream (SSRC).</t>

        <t>When a non-paused RTP stream sender receives the PAUSE request, it
        continues to send the RTP stream while waiting for some time to allow
        other RTP stream receivers in the same RTP session that saw this PAUSE
        request to disapprove by sending a <xref
        target="sec-overview-resume">RESUME</xref> for the same stream and
        with the same PauseID as in the disapproved PAUSE. If such
        disapproving RESUME arrives at the RTP stream sender during the
        hold-off period before the stream is paused, the pause is not
        performed. In point-to-point configurations, the hold-off period may
        be set to zero. Using a hold-off period of zero is also appropriate
        when using TMMBR 0 and in line with the semantics for that
        message.</t>

        <t>If the RTP stream sender receives further PAUSE requests with the
        available PauseID while waiting as described above, those additional
        requests are ignored.</t>

        <t>If the PAUSE request is lost before it reaches the RTP stream
        sender, it will be discovered by the RTP stream receiver because it
        continues to receive the RTP stream. It will also not see any <xref
        target="sec-overview-pausing">PAUSED indication</xref> for the stream.
        The same condition can be caused by the RTP stream sender having
        received a disapproving RESUME from a stream receiver A for a PAUSE
        request sent by a stream sender B, but that the PAUSE sender (B) did
        not receive the RESUME (from A) and may instead think that the PAUSE
        was lost. In both cases, a PAUSE request can be re-transmitted using
        the same PauseID. If using TMMBR 0 the request MAY be re-transmitted
        when the requester fails to receive a TMMBN 0 confirmation.</t>

        <t>If the pending stream pause is aborted due to a disapproving
        RESUME, the PauseID from the disapproved PAUSE is invalidated by the
        RESUME and any new PAUSE must use an incremented PauseID (in modulo
        arithmetic) to be effective.</t>

        <t>An RTP stream sender receiving a PAUSE not using the available
        PauseID informs the RTP stream receiver sending the ineffective PAUSE
        of this condition by sending a REFUSED response that contains the next
        available PauseID value. This REFUSED also informs the RTP stream
        receiver that it is probably not feasible to send another PAUSE for
        some time, not even with the available PauseID, since there are other
        RTP stream receivers that wish to receive the stream.</t>

        <t>A similar situation where an ineffective PauseID is chosen can
        appear when a new RTP stream receiver joins a session and wants to
        PAUSE a stream, but does not yet know the available PauseID to use.
        The REFUSED response will then provide sufficient information to
        create a valid PAUSE. The required extra signaling round-trip is not
        considered harmful, since it is assumed that pausing a stream is not
        <xref target="sec-real-time">time-critical</xref>.</t>

        <t>There may be local considerations making it impossible or
        infeasible to pause the stream, and the RTP stream sender can then
        respond with a REFUSED. In this case, if the used PauseID would
        otherwise have been effective, REFUSED contains the same PauseID as in
        the PAUSE request, and the PauseID is kept as available. Note that
        when using TMMBR 0 as PAUSE, that request cannot be refused (TMMBN
        &gt; 0) due to the existing restriction in section 4.2.2.2 of <xref
        target="RFC5104"/> that TMMBN shall contain the current bounding set,
        and the fact that a TMMBR 0 will always be the most restrictive point
        in any bounding set.</t>

        <t>If the RTP stream sender receives several identical PAUSE for an
        RTP stream that was already at least once responded with REFUSED and
        the condition causing REFUSED remains, those additional REFUSED should
        be sent with regular RTCP timing. A single REFUSED can respond to
        several identical PAUSE requests.</t>
      </section>

      <section anchor="sec-overview-pausing" title="Media Sender Pausing">
        <t>An RTP stream sender can choose to pause the stream at any time.
        This can either be as a result of receiving a PAUSE, or be based on
        some local sender consideration. When it does, it sends a PAUSED
        indication, containing the available PauseID. Note that PauseID is
        incremented when sending an unsolicited PAUSED (without having
        received a PAUSE). It also sends the PAUSED indication in the next two
        regular RTCP reports, given that the pause condition is then still
        effective.</t>

        <t>There is no reply to a PAUSED indication; it is simply an explicit
        indication of the fact that an RTP stream is paused. This can be
        helpful for the RTP stream receiver, for example to understand that
        transmission is deliberately and temporarily suspended and no specific
        corrective action is needed.</t>

        <t>The RTP stream sender may want to apply some local consideration to
        exactly when the RTP stream is paused, for example completing some
        media unit or a forward error correction block, before pausing the
        stream.</t>

        <t>The PAUSED indication also contains information about the RTP
        extended highest sequence number when the pause became effective. This
        provides RTP stream receivers with first hand information allowing
        them to know whether they lost any packets just before the stream
        paused or when the stream is resumed again. This allows RTP stream
        receivers to quickly and safely take into account that the stream is
        paused, in for example retransmission or congestion control
        algorithms.</t>

        <t>If the RTP stream sender receives PAUSE requests with the available
        PauseID while the stream is already paused, those requests are
        ignored.</t>

        <t>As long as the stream is being paused, the PAUSED indication MAY be
        sent together with any regular RTCP SR or RR. Including PAUSED in this
        way allows RTP stream receivers joining while the stream is paused to
        quickly know that there is a paused stream, what the last sent
        extended RTP sequence number was, and what the next available PauseID
        is to be able to construct valid PAUSE and RESUME requests at a later
        stage.</t>

        <t>When the RTP stream sender learns that a new End Point has joined
        the RTP session, for example by a new SSRC and a CNAME that was not
        previously seen in the RTP session, it should send PAUSED indications
        for all its paused streams at its earliest opportunity. It should in
        addition continue to include PAUSED indications in at least two
        regular RTCP reports.</t>
      </section>

      <section anchor="sec-overview-resume" title="Requesting to Resume">
        <t>An RTP stream receiver can request to resume a stream with a RESUME
        request at any time, subject to AVPF timing rules. The RTP stream
        receiver must include the available PauseID in the RESUME request for
        it to be effective.</t>

        <t>A pausing RTP stream sender that receives a RESUME including the
        correct available PauseID resumes the stream at the earliest
        opportunity. Receiving RESUME requests for a stream that is not paused
        does not require any action and can be ignored.</t>

        <t>There may be local considerations at the RTP stream sender, for
        example that the media device is not ready, making it temporarily
        impossible to resume the stream at that point in time, and the RTP
        stream sender MAY then respond with a REFUSED containing the same
        PauseID as in the RESUME. When receiving such REFUSED with a PauseID
        identical to the one in the sent RESUME, RTP stream receivers SHOULD
        then avoid sending further RESUME requests for some reasonable amount
        of time, to allow the condition to clear.</t>

        <t>If the RTP stream sender receives several identical RESUME for an
        RTP stream that was already at least once responded with REFUSED and
        the condition causing REFUSED remains, those additional REFUSED should
        be sent with regular RTCP timing. A single REFUSED can respond to
        several identical RESUME requests.</t>

        <t>A pausing RTP stream sender can apply local considerations and MAY
        resume a paused RTP stream at any time. If TMMBR 0 was used to pause
        the RTP stream, it cannot be resumed due to local considerations,
        unless the RTP stream is paused only due to <xref
        target="sec-overview-pausing">local considerations</xref> and thus no
        RTP stream receiver has requested to pause the stream with TMMBR
        0.</t>

        <t>When resuming a paused stream, especially for media that makes use
        of temporal redundancy between samples such as video, the temporal
        dependency between samples taken before the pause and at the time
        instant the stream is resumed may not be appropriate to use in the
        encoding. Should such temporal dependency between before and after the
        media was paused be used by the RTP stream sender, it requires the RTP
        stream receiver to have saved the sample from before the pause for
        successful continued decoding when resuming. The use of this temporal
        dependency is left up to the RTP stream sender. If temporal dependency
        is not used when the RTP stream is resumed, the first encoded sample
        after the pause will not contain any temporal dependency to samples
        before the pause (for video it may be a so-called intra picture). If
        temporal dependency to before the pause is used by the RTP stream
        sender when resuming, and if the RTP stream receiver did not save any
        sample from before the pause, the RTP stream receiver can use a <xref
        target="RFC5104">FIR request</xref> to explicitly ask for a sample
        without temporal dependency (for video a so-called intra picture),
        even at the same time as sending the RESUME.</t>
      </section>

      <section anchor="sec-overview-tmmbr" title="TMMBR/TMMBN Considerations">
        <t>As stated above, TMMBR/TMMBN may be used to provide pause and
        resume functionality for the point-to-point case. If the topology is
        not point-to-point, TMMBR/TMMBN cannot safely be used for pause or
        resume.</t>

        <t>This is a brief summary of what functionality is provided when
        using TMMBR/TMMBN:<list style="hanging">
            <t hangText="TMMBR 0:">Corresponds to PAUSE, without the
            requirement for any hold-off period to wait for RESUME before
            pausing the RTP stream.</t>

            <t hangText="TMMBR &gt;0:">Corresponds to RESUME when the RTP
            stream was previously paused with TMMBR 0. Since there is only a
            single RTP stream receiver, there is no need for the RTP stream
            sender to delay resuming the stream until after sending TMMBN
            &gt;0, or to apply the hold-off period specified in <xref
            target="RFC5104"/> before increasing the bitrate from zero. The
            bitrate value used when resuming after pausing with TMMBR 0 is
            either according to known limitations, or based on starting a
            stream with the configured maximum for the stream or session, for
            example given by b-parameter in SDP.</t>

            <t hangText="TMMBN 0:">Corresponds to PAUSED when the RTP stream
            was paused with TMMBR 0, but may, just as PAUSED, also be used
            unsolicited. An unsolicited RTP stream pause based on local sender
            considerations uses the RTP stream's own SSRC as TMMBR restriction
            owner in the TMMBN message bounding set. Also corresponds to a
            REFUSED indication when a stream is requested to be resumed with
            TMMBR &gt;0.</t>

            <t hangText="TMMBN &gt;0:">Cannot be used as REFUSED indication
            when a stream is requested to be paused with TMMBR 0, for reasons
            stated in <xref target="sec-overview-pause"/>.</t>
          </list></t>
      </section>
    </section>

    <section anchor="sec-states" title="Participant States">
      <t>This document introduces three new states for a stream in an RTP
      sender, according to the figure and sub-sections below. Any references
      to PAUSE, PAUSED, RESUME and REFUSED in this section SHALL be taken to
      apply to the extent possible also when <xref
      target="sec-overview-tmmbr">TMMBR/TMMBN are used</xref> for this
      functionality.</t>

      <figure align="center" anchor="fig-pause-states"
              title="RTP Pause States">
        <artwork align="center"><![CDATA[
     +------------------------------------------------------+
     |                     Received RESUME                  |
     v                                                      |
+---------+ Received PAUSE  +---------+ Hold-off period +--------+
| Playing |---------------->| Pausing |---------------->| Paused |
|         |<----------------|         |                 |        |
+---------+ Received RESUME +---------+                 +--------+
  ^     |                        | PAUSE decision           |
  |     |                        v                          |
  |     |  PAUSE decision   +---------+    PAUSE decision   |
  |     +------------------>| Local   |<--------------------+
  +-------------------------| Paused  |
          RESUME decision   +---------+

]]></artwork>
      </figure>

      <section anchor="sec-state-playing" title="Playing State">
        <t>This state is not new, but is the normal media sending state from
        <xref target="RFC3550"/>. When entering the state, the PauseID MUST be
        incremented by one in modulo arithmetic. The RTP sequence number for
        the first packet sent after a pause SHALL be incremented by one
        compared to the highest RTP sequence number sent before the pause. The
        first RTP Time Stamp for the first packet sent after a pause SHOULD be
        set according to capture times at the source, meaning the RTP Time
        Stamp difference compared to before the pause reflects the time the
        RTP stream was paused.</t>
      </section>

      <section anchor="sec-state-pausing" title="Pausing State">
        <t>In this state, the RTP stream sender has received at least one
        PAUSE message for the stream in question. The RTP stream sender SHALL
        wait during a hold-off period for the possible reception of RESUME
        messages for the RTP stream being paused before actually pausing RTP
        stream transmission. The hold-off period to wait SHALL be long enough
        to allow another RTP stream receiver to respond to the PAUSE with a
        RESUME, if it determines that it would not like to see the stream
        paused. This hold-off period is determined by the formula:<list
            style="empty">
            <t>2 * RTT + T_dither_max,</t>
          </list></t>

        <t>where RTT is the longest round trip known to the RTP stream sender
        and T_dither_max is defined in section 3.4 of <xref
        target="RFC4585"/>. The hold-off period MAY be set to 0 by some <xref
        target="sec-signaling">signaling</xref> means when it can be
        determined that there is only a single receiver, for example in
        point-to-point or some unicast situations.</t>

        <!--MW: In future version the above values should be more clearly motivated.-->

        <t>If the RTP stream sender has set the hold-off period to 0 and
        receives information that it was an incorrect decision and that there
        are in fact several receivers of the stream, for example by RTCP RR,
        it MUST change the hold-off to instead be based on the above
        formula.</t>
      </section>

      <section anchor="sec-state-paused" title="Paused State">
        <t>An RTP stream is in paused state when the sender pauses its
        transmission after receiving at least one PAUSE message and the
        hold-off period has passed without receiving any RESUME message for
        that stream.</t>

        <t>When entering the state, the RTP stream sender SHALL send a PAUSED
        indication to all known RTP stream receivers, and SHALL also repeat
        PAUSED in the next two regular RTCP reports.</t>

        <t>Pausing an RTP stream MUST NOT affect the sending of RTP keepalive
        <xref target="RFC6263"/><xref target="RFC5245"/> applicable to that
        RTP stream.</t>

        <t>Following sub-sections discusses some potential issues when an RTP
        sender goes into paused state. These conditions are also valid if an
        RTP Translator is used in the communication. When an RTP Mixer
        implementing this specification is involved between the participants
        (which forwards the stream by marking the RTP data with its own SSRC),
        it SHALL be a responsibility of the Mixer to control sending PAUSE and
        RESUME requests to the sender. The below conditions also apply to the
        sender and receiver parts of the RTP Mixer, respectively.</t>

        <section anchor="sec-bye" title="RTCP BYE Message">
          <t>When a participant leaves the RTP session, it sends an RTCP BYE
          message. In addition to the semantics described in section 6.3.4 and
          6.3.7 of <xref target="RFC3550">RTP</xref>, following two conditions
          MUST also be considered when an RTP participant sends an RTCP BYE
          message,</t>

          <t><list style="symbols">
              <t>If a paused sender sends an RTCP BYE message, receivers
              observing this SHALL NOT send further PAUSE or RESUME requests
              to it.</t>

              <t>Since a sender pauses its transmission on receiving the PAUSE
              requests from any receiver in a session, the sender MUST keep
              record of which receiver that caused the RTP stream to pause. If
              that receiver sends an RTCP BYE message observed by the sender,
              the sender SHALL resume the RTP stream.</t>
            </list></t>
        </section>

        <section anchor="sec-time-out" title="SSRC Time-out">
          <t>Section 6.3.5 in <xref target="RFC3550">RTP</xref> describes the
          SSRC time-out of an RTP participant. Every RTP participant maintains
          a sender and receiver list in a session. If a participant does not
          get any RTP or RTCP packets from some other participant for the last
          five RTCP reporting intervals it removes that participant from the
          receiver list. Any streams that were paused by that removed
          participant SHALL be resumed.</t>
        </section>
      </section>

      <section anchor="sec-state-local-paused" title="Local Paused State">
        <t>This state can be entered at any time, based on local decision from
        the RTP stream sender. As for <xref target="sec-state-paused">Paused
        State</xref>, the RTP stream sender SHALL send a PAUSED indication to
        all known RTP stream receivers, when entering the state, and repeat it
        in the next two regular RTCP reports, unless the stream was already in
        <xref target="sec-state-paused">paused state</xref>. When using TMMBN
        0 as PAUSED indication, being in paused state, and entering local
        paused state, the RTP stream sender SHALL send TMMBN 0 with itself
        included in the TMMBN bounding set.</t>

        <t>As indicated in <xref target="fig-pause-states"/>, this state has
        higher precedence than <xref target="sec-state-paused">paused
        state</xref> and RESUME messages alone cannot resume a paused RTP
        stream as long as the local decision still applies.</t>

        <t>Pausing an RTP stream MUST NOT affect the sending of RTP keepalive
        <xref target="RFC6263"/><xref target="RFC5245"/> applicable to that
        RTP stream.</t>

        <t>When leaving the state, the stream state SHALL become Playing,
        regardless whether or not there were any RTP stream receivers that
        sent PAUSE for that stream, effectively clearing the RTP stream
        sender's memory for that stream. This does however not apply when the
        stream was paused by a TMMBR 0, either before entering or during the
        Local Paused State, in which case leaving Local Paused State just
        removes the RTP sender from the TMMBN bounding set, and a new TMMBN
        with the updated bounding set MUST be sent accordingly. The stream
        state can become Playing only when there is no entry with a bitrate
        value of 0 in the stream's bounding set.</t>
      </section>
    </section>

    <section anchor="sec-format" title="Message Format">
      <t>Section 6 of <xref target="RFC4585">AVPF</xref> defines three types
      of low-delay RTCP feedback messages, i.e. Transport layer,
      Payload-specific, and Application layer feedback messages. This document
      defines a new Transport layer feedback message, this message is either a
      PAUSE request, a RESUME request, or one of four different types of
      acknowledgments in response to either PAUSE or RESUME requests.</t>

      <t>The Transport layer feedback messages are identified by having the
      RTCP payload type be RTPFB (205) as defined by <xref
      target="RFC4585">AVPF</xref>. The PAUSE and RESUME messages are
      identified by Feedback Message Type (FMT) value in common packet header
      for feedback message defined in section 6.1 of <xref
      target="RFC4585">AVPF</xref>. The PAUSE and RESUME transport feedback
      message is identified by the FMT value = TBA1.</t>

      <t>The Common Packet Format for Feedback Messages defined by <xref
      target="RFC4585">AVPF</xref> is:</t>

      <figure>
        <artwork align="center"><![CDATA[ 0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|V=2|P|   FMT   |       PT      |          Length               |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                  SSRC of packet sender                        |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                  SSRC of media source                         |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
:            Feedback Control Information (FCI)                 :
:                                                               :]]></artwork>
      </figure>

      <t>For the PAUSE and RESUME messages, the following interpretation of
      the packet fields will be:</t>

      <t><list style="hanging">
          <t hangText="FMT:">The FMT value identifying the PAUSE and RESUME
          message: TBA1</t>

          <t hangText="PT:">Payload Type = 205 (RTPFB)</t>

          <t hangText="Length:">As defined by AVPF, i.e. the length of this
          packet in 32-bit words minus one, including the header and any
          padding.</t>

          <t hangText="SSRC of packet sender:">The SSRC of the RTP session
          participant sending the messages in the FCI. Note, for End Points
          that have multiple SSRCs in an RTP session, any of its SSRCs MAY be
          used to send any of the pause message types.</t>

          <t hangText="SSRC of media source:">Not used, SHALL be set to 0. The
          FCI identifies the SSRC the message is targeted for.</t>
        </list>The Feedback Control Information (FCI) field consist of one or
      more PAUSE, RESUME, PAUSED, REFUSED, or any future extension. These
      messages have the following FCI format:</t>

      <figure anchor="fig-syntax"
              title="Syntax of FCI Entry in the PAUSE and RESUME message">
        <artwork align="center"><![CDATA[0                   1                   2                   3
 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|                           Target SSRC                         |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type  |  Res  | Parameter Len |           PauseID             | 
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
:                         Type Specific                         :
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

]]></artwork>
      </figure>

      <t>The FCI fields have the following definitions:<list style="hanging">
          <t hangText="Target SSRC (32 bits):">For a PAUSE and RESUME
          messages, this value is the SSRC that the request is intended for.
          For PAUSED, it MUST be the SSRC being paused. If pausing is the
          result of a PAUSE request, the value in PAUSED is effectively the
          same as Target SSRC in a related PAUSE request. For REFUSED, it MUST
          be the Target SSRC of the PAUSE or RESUME request that cannot change
          state. A CSRC MUST NOT be used as a target as the interpretation of
          such a request is unclear.</t>

          <t hangText="Type (4 bits):">The pause feedback type. The values
          defined in this specification are as follows,<list style="hanging">
              <t hangText="0:">PAUSE request message</t>

              <t hangText="1:">RESUME request message</t>

              <t hangText="2:">PAUSED indication message</t>

              <t hangText="3:">REFUSED indication message</t>

              <t hangText="4-15:">Reserved for future use</t>
            </list></t>

          <t hangText="Res: (4 bits):">Type specific reserved. SHALL be
          ignored by receivers implementing this specification and MUST be set
          to 0 by senders implementing this specification.</t>

          <t hangText="Parameter Len: (8 bits):">Length of the Type Specific
          field in 32-bit words. MAY be 0.</t>

          <t hangText="PauseID (16 bits):">Message sequence identification.
          SHALL be incremented by one modulo 2^16 for each new PAUSE message,
          unless the message is re-transmitted. The initial value SHOULD be 0.
          The PauseID is scoped by the Target SSRC, meaning that PAUSE,
          RESUME, and PAUSED messages therefore share the same PauseID space
          for a specific Target SSRC.</t>

          <t hangText="Type Specific: (variable):">Defined per pause feedback
          Type. MAY be empty.</t>
        </list></t>

      <t/>
    </section>

    <section anchor="sec-details" title="Message Details">
      <t>This section contains detailed explanations of each message defined
      in this specification. All transmissions of request and indications are
      governed by the transmission rules as defined by <xref
      target="sec-transmission"/>.</t>

      <t>Any references to PAUSE, PAUSED, RESUME and REFUSED in this section
      SHALL be taken to apply to the extent possible also when <xref
      target="sec-overview-tmmbr">TMMBR/TMMBN are used</xref> for this
      functionality. TMMBR/TMMBN MAY be used instead of the messages defined
      in this specification when the effective topology is point-to-point. If
      either sender or receiver learns that the topology is not
      point-to-point, TMMBR/TMMBN MUST NOT be used for pause/resume
      functionality. If the messages defined in this specification are
      supported in addition to TMMBR/TMMBN, pause/resume signaling MUST use
      messages from this specification. If the topology is not point-to-point
      and the messages defined in this specification are not supported,
      pause/resume functionality with TMMBR/TMMBN MUST NOT be used.</t>

      <section anchor="sec-pause" title="PAUSE">
        <t>An RTP stream receiver MAY schedule PAUSE for transmission at any
        time.</t>

        <t>PAUSE has no defined Type Specific parameters and Parameter Len
        MUST be set to 0.</t>

        <t>PauseID SHOULD be the available PauseID, as indicated by <xref
        target="sec-paused">PAUSED</xref> or implicitly determined by
        previously received PAUSE or <xref target="sec-resume">RESUME</xref>
        requests. A randomly chosen PauseID MAY be used if it was not possible
        to retrieve PauseID information, in which case the PAUSE will either
        succeed, or the correct PauseID can be found in the returned <xref
        target="sec-refuse">REFUSED</xref>. A PauseID that is matching the
        available PauseID is henceforth also called a valid PauseID.</t>

        <t>PauseID needs to be incremented by one, in modulo arithmetic, for
        each PAUSE request that is not a retransmission, compared to what was
        used in the last PAUSED indication sent by the media sender. This is
        to ensure that the PauseID matches what is the current available
        PauseID at the RTP stream sender. The RTP stream sender increments
        what it considers to be the available PauseID when entering <xref
        target="sec-state-playing">Playing State</xref>.</t>

        <t>For the scope of this specification, a PauseID larger than the
        current one is defined as having a value between and including
        (PauseID + 1) MOD 2^16 and (PauseID + 2^14) MOD 2^16, where "MOD" is
        the modulo operator. Similarly, a PauseID smaller than the current one
        is defined as having a value between and including (PauseID - 2^15)
        MOD 2^16 and (PauseID - 1) MOD 2^16.</t>

        <t>If an RTP stream receiver that sent a PAUSE with a certain PauseID
        receives a RESUME with the same PauseID, it is RECOMMENDED that it
        refrains from sending further PAUSE requests for some appropriate time
        since the RESUME indicates that there are other receivers that still
        wishes to receive the stream.</t>

        <t>If the targeted RTP stream does not pause, if no PAUSED indication
        with a larger PauseID than the one used in PAUSE, and if no REFUSED is
        received within 2 * RTT + T_dither_max, the PAUSE MAY be scheduled for
        retransmission, using the same PauseID. RTT is the observed round-trip
        to the RTP stream sender and T_dither_max is defined in section 3.4 of
        <xref target="RFC4585"/>.</t>

        <t>When an RTP stream sender in <xref
        target="sec-state-playing">Playing State</xref> receives a valid
        PAUSE, and unless local considerations currently makes it impossible
        to pause the stream, it SHALL enter <xref
        target="sec-state-pausing">Pausing State</xref> when reaching an
        appropriate place to pause in the stream, and act accordingly.</t>

        <t>If an RTP stream sender receives a valid PAUSE while in Pausing,
        <xref target="sec-state-paused">Paused</xref> or <xref
        target="sec-state-local-paused">Local Paused</xref> States, the
        received PAUSE SHALL be ignored.</t>
      </section>

      <section anchor="sec-paused" title="PAUSED">
        <t>The PAUSED indication MUST be sent whenever entering <xref
        target="sec-state-paused">Paused State</xref> as a result of receiving
        a valid <xref target="sec-pause">PAUSE</xref> request, or when
        entering <xref target="sec-state-local-paused">Local Paused
        State</xref> based on a RTP stream sender local decision.</t>

        <t>PauseID MUST contain the available, valid value to be included in a
        subsequent <xref target="sec-resume">RESUME</xref>.</t>

        <t>PAUSED SHALL contain a 32 bit parameter with the RTP extended
        highest sequence number valid when the RTP stream was paused.
        Parameter Len MUST be set to 1.</t>

        <t>After having entered Paused or Local Paused State and thus having
        sent PAUSED once, PAUSED MUST also be included in the next two regular
        RTCP reports, given that the pause condition is then still
        effective.</t>

        <t>While remaining in Paused or Local Paused States, PAUSED MAY be
        included in all regular RTCP reports.</t>

        <t>When in Paused or Local Paused States, It is RECOMMENDED to send
        PAUSED at the earliest opportunity and also to include it in the next
        two regular RTCP reports, whenever the RTP stream sender learns that
        there are End Points that did not previously receive the stream, for
        example by RTCP reports with an SSRC and a CNAME that was not
        previously seen in the RTP session.</t>
      </section>

      <section anchor="sec-resume" title="RESUME">
        <t>An RTP stream receiver MAY schedule RESUME for transmission
        whenever it wishes to resume a paused stream, or to disapprove a
        stream from being paused.</t>

        <t>PauseID SHOULD be the valid PauseID, as indicated by <xref
        target="sec-paused">PAUSED</xref> or implicitly determined by
        previously received <xref target="sec-pause">PAUSE</xref> or RESUME
        requests. A randomly chosen PauseID MAY be used if it was not possible
        to retrieve PauseID information, in which case the RESUME will either
        succeed, or the correct PauseID can be found in a returned <xref
        target="sec-refuse">REFUSED</xref>.</t>

        <t>RESUME has no defined Type Specific parameters and Parameter Len
        MUST be set to 0.</t>

        <t>When an RTP stream sender in <xref
        target="sec-state-pausing">Pausing</xref>, <xref
        target="sec-state-paused">Paused</xref> or <xref
        target="sec-state-local-paused">Local Paused State</xref> receives a
        valid RESUME, and unless local considerations currently makes it
        impossible to resume the stream, it SHALL enter <xref
        target="sec-state-playing">Playing State</xref> and act accordingly.
        If the RTP stream sender is incapable of honoring the RESUME request
        with a valid PauseID, or receives a RESUME request with an invalid
        PauseID while in Paused or Pausing state, the RTP stream sender sends
        a REFUSED message as specified below.</t>

        <t>If an RTP stream sender in Playing State receives a RESUME
        containing either a valid PauseID or a PauseID that is less than the
        valid PauseID, the received RESUME SHALL be ignored.</t>
      </section>

      <section anchor="sec-refuse" title="REFUSED">
        <t>REFUSED has no defined Type Specific parameters and Parameter Len
        MUST be set to 0.</t>

        <t>If an RTP stream sender receives a valid <xref
        target="sec-pause">PAUSE</xref> or <xref
        target="sec-resume">RESUME</xref> request that cannot be fulfilled by
        the sender due to some local consideration, it SHALL schedule
        transmission of a REFUSED indication containing the valid PauseID from
        the rejected request.</t>

        <t>If an RTP stream sender receives PAUSE or RESUME requests with a
        non-valid PauseID it SHALL schedule a REFUSED response containing the
        available, valid PauseID, except if the RTP stream sender is in
        Playing State and receives a RESUME with a PauseID less than the valid
        one, in which case the RESUME SHALL be ignored.</t>

        <t>If several PAUSE or RESUME that would render identical REFUSED
        responses are received before the scheduled REFUSED is sent, duplicate
        REFUSED MUST NOT be scheduled for transmission. This effectively lets
        a single REFUSED respond to several invalid PAUSE or RESUME
        requests.</t>

        <t>If REFUSED containing a certain PauseID was already sent and yet
        more PAUSE or RESUME messages are received that require additional
        REFUSED with that specific PauseID to be scheduled, and unless the
        PauseID number space has wrapped since REFUSED was last sent with that
        PauseID, further REFUSED messages with that PauseID SHOULD be sent in
        regular RTCP reports.</t>

        <t>An RTP stream receiver that sent a PAUSE or RESUME request and
        receives a REFUSED containing the same PauseID as in the request
        SHOULD refrain from sending an identical request for some appropriate
        time to allow the condition that caused REFUSED to clear.</t>

        <t>An RTP stream receiver that sent a PAUSE or RESUME request and
        receives a REFUSED containing a PauseID different from the request MAY
        schedule another request using the PauseID from the REFUSED
        indication.</t>
      </section>

      <section anchor="sec-transmission" title="Transmission Rules">
        <t>The transmission of any RTCP feedback messages defined in this
        specification MUST follow the normal AVPF defined timing rules and
        depends on the session's mode of operation.</t>

        <t>All messages defined in this specification, as well as TMMBR/TMMBN
        used for <xref target="sec-overview-tmmbr">pause/resume
        purposes</xref>, MAY use either Regular, Early or Immediate timings,
        taking the following into consideration:<list style="symbols">
            <t>PAUSE SHOULD use Early or Immediate timing, except for
            retransmissions that SHOULD use Regular timing.</t>

            <t>The first transmission of PAUSED for each (non-wrapped) PauseID
            SHOULD be sent with Immediate or Early timing, while subsequent
            PAUSED for that PauseID SHOULD use Regular timing.</t>

            <t>RESUME SHOULD always use Immediate or Early timing.</t>

            <t>The first transmission of REFUSED for each (non-wrapped)
            PauseID SHOULD be sent with Immediate or Early timing, while
            subsequent REFUSED for that PauseID SHOULD use Regular timing.</t>
          </list></t>
      </section>
    </section>

    <section anchor="sec-signaling" title="Signaling">
      <t>The capability of handling messages defined in this specification MAY
      be exchanged at a higher layer such as SDP. This document extends the
      rtcp-fb attribute defined in section 4 of <xref
      target="RFC4585">AVPF</xref> to include the request for pause and
      resume. This specification follows all the rules defined in <xref
      target="RFC4585">AVPF</xref> and <xref target="RFC5104">CCM</xref> for
      an rtcp-fb attribute relating to payload type in a session
      description.</t>

      <t>This specification defines a new parameter "pause" to the "ccm"
      feedback value defined in <xref target="RFC5104">CCM</xref>,
      representing the capability to understand the RTCP feedback message and
      all of the defined FCIs of PAUSE, RESUME, PAUSED and REFUSED.<list
          style="empty">
          <t>Note: When TMMBR 0 / TMMBN 0 are used to implement pause and
          resume functionality (with the restrictions described in this
          specification), signaling rtcp-fb attribute with ccm tmmbr parameter
          is sufficient and no further signaling is necessary. There is
          however no guarantee that TMMBR/TMMBN implementations pre-dating
          this specification work exactly as described here when used with a
          bitrate value of 0.</t>
        </list></t>

      <t>The "pause" parameter has two optional attributes, "nowait" and
      "config":<list style="symbols">
          <t>"nowait" indicates that the hold-off period defined in <xref
          target="sec-state-pausing"/> can be set to 0, reducing the latency
          before the stream can paused after receiving a PAUSE request. This
          condition occurs when there will be only a single receiver per
          direction in the RTP session, for example in point-to-point
          sessions. It is also possible to use in scenarios using
          unidirectional media. The conditions that allow "nowait" to be set
          also indicate that it would be possible to use CCM TMMBR/TMMBN as
          pause/resume signaling.</t>

          <t>"config" allows for partial implementation of this specification
          according to the different roles in the <xref
          target="sec-use-cases">use cases section</xref>, and takes a value
          that describes what sub-set is implemented:<list style="hanging">
              <t hangText="1">Full implementation of this specification. This
              is the default configuration. A missing config attribute MUST be
              treated equivalent to providing a config value of 1.</t>

              <t hangText="2">The implementation intends to send PAUSE and
              RESUME requests for received RTP streams and is thus also
              capable of receiving PAUSED and REFUSED. It does not support
              receiving PAUSE and RESUME requests, but may pause sent RTP
              streams due to local considerations and then intends to send
              PAUSED for them.</t>

              <t hangText="3">The implementation supports receiving PAUSE and
              RESUME requests targeted for RTP streams it sends. It will send
              PAUSED and REFUSED as needed. The node will not send any PAUSE
              and RESUME requests, but supports and desires receiving PAUSED
              if received RTP streams are paused.</t>

              <t hangText="4">The implementation intends to send PAUSE and
              RESUME requests for received RTP streams and is thus also
              capable of receiving PAUSED and REFUSED. It cannot pause any RTP
              streams it sends, and thus does not support receiving PAUSE and
              RESUME requests, and also does not support sending PAUSED
              indications.</t>

              <t hangText="5">The implementation supports receiving PAUSE and
              RESUME requests targeted for RTP streams it sends. It will send
              PAUSED and REFUSED as needed. It does not support sending PAUSE
              and RESUME requests to pause received RTP streams, and also does
              not support receiving PAUSED indications.</t>

              <t hangText="6">The implementation supports sent and received
              RTP streams being paused due to local considerations, and thus
              supports sending and receiving PAUSED indications.</t>

              <t hangText="7">The implementation supports and desires to
              receive PAUSED indications for received RTP streams, but does
              not pause or send PAUSED indications for sent RTP streams. It
              does not support any other messages defined in this
              specification.</t>

              <t hangText="8">The implementation supports pausing sent RTP
              streams and sending PAUSED indications for them, but does not
              support receiving PAUSED indications for received RTP streams.
              It does not support any other messages defined in this
              specification.</t>
            </list></t>
        </list></t>

      <t>When signaling a config value other than 1, an implementation MAY
      ignore non-supported messages on reception, and MAY omit sending
      non-supported messages. The below table summarizes per-message send and
      receive support for the different config attribute values ("X"
      indicating support and "-" indicating non-support):</t>

      <figure align="center" anchor="fig-config-list"
              title="Supported messages for different config values">
        <artwork align="center"><![CDATA[
+---+-----------------------------+-----------------------------+
| # | Send                        | Receive                     |
|   | PAUSE RESUME PAUSED REFUSED | PAUSE RESUME PAUSED REFUSED |
+---+-----------------------------+-----------------------------+
| 1 |   X      X      X      X    |   X      X      X      X    |
| 2 |   X      X      X      -    |   -      -      X      X    |
| 3 |   -      -      X      X    |   X      X      X      -    |
| 4 |   X      X      -      -    |   -      -      X      X    |
| 5 |   -      -      X      X    |   X      X      -      -    |
| 6 |   -      -      X      -    |   -      -      X      -    |
| 7 |   -      -      -      -    |   -      -      X      -    |
| 8 |   -      -      X      -    |   -      -      -      -    |
+---+-----------------------------+-----------------------------+
]]></artwork>
      </figure>

      <t>This is the resulting <xref target="RFC5234">ABNF</xref>, extending
      existing ABNF in section 7.1 of <xref target="RFC5104">CCM</xref>:</t>

      <figure anchor="fig-abnf" title="ABNF">
        <artwork align="left"><![CDATA[
rtcp-fb-ccm-param  =/ SP "pause" [SP pause-attr]
pause-attr         = [pause-config] [SP "nowait"] [SP byte-string]
pause-config       = "config=" pause-config-value
pause-config-value = %x31-38
; byte-string as defined in RFC 4566, for future extensions

]]></artwork>
      </figure>

      <t>An endpoint implementing this specification and using SDP to signal
      capability SHOULD indicate the new "pause" parameter with ccm signaling,
      but MAY use existing <xref target="RFC5104">ccm tmmbr signaling </xref>
      if the limitations in functionality as described in this specification
      coming from such usage are considered acceptable. The messages from this
      specification SHOULD NOT be used towards receivers that did not declare
      capability to receive those messages.</t>

      <t>There MUST NOT be more than one "a=rtcp-fb" line with "pause"
      applicable to a single payload type in the SDP, unless the additional
      line uses "*" as payload type, in which case "*" SHALL be interpreted as
      applicable to all listed payload types that does not have an explicit
      "pause" specification.</t>

      <section anchor="sec-offer-answer" title="Offer-Answer Use">
        <t>An offerer implementing this specification needs to include "pause"
        CCM parameter with suitable configuration attribute ("config") in the
        SDP, according to what messages it intends to send and desires to
        receive in the session.</t>

        <t>In SDP offer/answer, the "config" attribute and its message
        directions are interpreted based on the agent providing the SDP. The
        offerer is described in an offer, and the answerer is described in an
        answer.</t>

        <t>An answerer receiving an offer with a "pause" CCM parameter and a
        config attribute with a certain value, describing a certain capability
        to send and receive messages, MAY change the config attribute value in
        the answer to another configuration. The permitted answers are listed
        in the below table.</t>

        <figure align="center" anchor="fig-config-offer-answer"
                title="Config values in Offer/Answer">
          <artwork align="center"><![CDATA[SDP Offer config value | Permitted SDP Answer config values
-----------------------+-----------------------------------
            1          | 1, 2, 3, 4, 5, 6, 7, 8
            2          | 3, 4, 5, 6, 7, 8
            3          | 2, 4, 5, 6, 7, 8
            4          | 5, 6, 7, 8
            5          | 4, 6, 7, 8
            6          | 6, 7, 8
            7          | 8
            8          | 7

]]></artwork>
        </figure>

        <t>An offer or answer omitting the config attribute, MUST be
        interpreted as equivalent to config=1. In all cases the answerer MAY
        also completely remove any "pause" CCM parameter to indicate that it
        does not understand or desire to use any pause functionality for the
        affected payload types.</t>

        <t>If the offerer believes that itself and the intended answerer are
        likely the only End Points in the RTP session, it MAY include the
        "nowait" sub-parameter on the "pause" line in the offer. If an
        answerer receives the "nowait" sub-parameter on the "pause" line in
        the SDP, and if it has information that the offerer and itself are not
        the only End Points in the RTP session, it MUST NOT include any
        "nowait" sub-parameter on its "pause" line in the SDP answer. The
        answerer MUST NOT add "nowait" on the "pause" line in the answer
        unless it is present on the "pause" line in the offer. If both offer
        and answer contained a "nowait" parameter, then the hold-off period is
        configured to 0 at both offerer and answerer.</t>
      </section>

      <section title="Declarative Use">
        <t>In declarative use, the SDP is used to configure the node receiving
        the SDP. This has implications on the interpretation of the SDP
        signaling extensions defined in this specification.</t>

        <t>First, the "config" attribute and its message directions are
        interpreted based on the node receiving the SDP.</t>

        <t>Second, the "nowait" parameter, if included, is followed as
        specified. It is the responsibility of the declarative SDP sender to
        determine if a configured node will participate in a session that will
        be point to point, based on the usage. For example, a conference
        client being configured for an any source multicast session using
        <xref target="RFC2974">SAP</xref> will not be in a point to point
        session, thus "nowait" cannot be included. An <xref
        target="RFC2326">RTSP</xref> client receiving a declarative SDP may
        very well be in a point to point session, although it is highly
        doubtful that an RTSP client would need to support this specification,
        considering the inherent PAUSE support in RTSP.</t>
      </section>
    </section>

    <section anchor="sec-examples" title="Examples">
      <t>The following examples shows use of PAUSE and RESUME messages,
      including use of offer-answer:</t>

      <t><list style="numbers">
          <t>Offer-Answer</t>

          <t>Point-to-Point session</t>

          <t>Point-to-Multipoint using Mixer</t>

          <t>Point-to-Multipoint using Translator</t>
        </list></t>

      <section title="Offer-Answer">
        <t>The below figures contains an example how to show support for
        pausing and resuming the streams, as well as indicating whether or not
        the hold-off period can be set to 0.</t>

        <figure anchor="fig-example-sdp-offer"
                title="SDP Offer With Pause and Resume Capability">
          <artwork align="left"><![CDATA[v=0
o=alice 3203093520 3203093520 IN IP4 alice.example.com
s=Pausing Media
t=0 0
c=IN IP4 alice.example.com
m=audio 49170 RTP/AVPF 98 99
a=rtpmap:98 G719/48000
a=rtpmap:99 PCMA/8000
a=rtcp-fb:* ccm pause nowait

]]></artwork>
        </figure>

        <t>The offerer supports all of the messages defined in this
        specification, leaving out the optional config attribute. The offerer
        also believes that it will be the sole receiver of the answerer's
        stream as well as that the answerer will be the sole receiver of the
        offerer's stream and thus includes the "nowait" sub-parameter for the
        "pause" parameter.</t>

        <t>This is the SDP answer:</t>

        <figure anchor="fig-example-sdp-answer"
                title="SDP Answer With Pause and Resume Capability">
          <artwork align="left"><![CDATA[v=0
o=bob 293847192 293847192 IN IP4 bob.example.com
s=-
t=0 0
c=IN IP4 bob.example.com
m=audio 49202 RTP/AVPF 98
a=rtpmap:98 G719/48000
a=rtcp-fb:98 ccm pause config=2

]]></artwork>
        </figure>

        <t>The answerer will not allow its sent streams to be paused or
        resumed and thus restricts the answer to indicate config=2. It also
        supports pausing its own RTP streams due to local considerations,
        which is why config=2 is chosen rather than config=4. The answerer
        somehow knows that it will not be a point-to-point RTP session and has
        therefore removed "nowait" from the "pause" line, meaning that the
        offerer must use a non-zero hold-off period when being requested to
        pause the stream.</t>

        <t>When using TMMBR 0 / TMMBN 0 to achieve pause and resume
        functionality, there are no differences in SDP compared to <xref
        target="RFC5104">CCM</xref> and therefore no such examples are
        included here.</t>
      </section>

      <section title="Point-to-Point Session">
        <t>This is the most basic scenario, which involves two participants,
        each acting as a sender and/or receiver. Any RTP data receiver sends
        PAUSE or RESUME messages to the sender, which pauses or resumes
        transmission accordingly. The hold-off period before pausing a stream
        is 0.</t>

        <figure align="center" anchor="fig-pause-resume"
                title="Pause and Resume Operation in Point-to-Point">
          <artwork align="center"><![CDATA[
+---------------+                   +---------------+
|  RTP Sender   |                   | RTP Receiver  |
+---------------+                   +---------------+
       :           t1: RTP data           :
       | -------------------------------> |
       |           t2: PAUSE(3)           |
       | <------------------------------- |
       |       < RTP data paused >        |
       |           t3: PAUSED(3)          |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t4: RESUME(3)          |
       | <------------------------------- |
       |           t5: RTP data           |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t6: PAUSE(4)           |
       | <------------------------------- |
       |       < RTP data paused >        |
       :                                  :

]]></artwork>
        </figure>

        <t><xref target="fig-pause-resume"/> shows the basic pause and resume
        operation in Point-to-Point scenario. At time t1, an RTP sender sends
        data to a receiver. At time t2, the RTP receiver requests the sender
        to pause the stream, using PauseID 3 (which it knew since before in
        this example). The sender pauses the data and replies with a PAUSED
        containing the same PauseID. Some time later (at time t4) the receiver
        requests the sender to resume, which resumes its transmission. The
        next PAUSE, sent at time t6, contains an updated PauseID (4).</t>

        <figure align="center" anchor="fig-pause-resume-tmmbr"
                title="TMMBR Pause and Resume in Point-to-Point">
          <artwork align="center"><![CDATA[
+---------------+                   +---------------+
|  RTP Sender   |                   | RTP Receiver  |
+---------------+                   +---------------+
       :           t1: RTP data           :
       | -------------------------------> |
       |           t2: TMMBR 0            |
       | <------------------------------- |
       |       < RTP data paused >        |
       |           t3: TMMBN 0            |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t4: TMMBR 150000       |
       | <------------------------------- |
       |           t5: RTP data           |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t6: TMMBR 0            |
       | <------------------------------- |
       |       < RTP data paused >        |
       :                                  :

]]></artwork>
        </figure>

        <t><xref target="fig-pause-resume-tmmbr"/> describes the same
        point-to-point scenario as above, but using TMMBR/TMMBN signaling.</t>

        <figure align="center" anchor="fig-paused-tmmbn"
                title="Unsolicited PAUSED using TMMBN">
          <artwork align="center"><![CDATA[
+---------------+                 +----------------+
| RTP Sender A  |                 | RTP Receiver B |
+---------------+                 +----------------+
       :           t1: RTP data           :
       | -------------------------------> |
       |       < RTP data paused >        |
       |           t2: TMMBN {A:0}        |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t3: TMMBR 0            |
       | <------------------------------- |
       |           t4: TMMBN {A:0,B:0}    |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t5: TMMBN {B:0}        |
       | -------------------------------> |
       :       < Some time passes >       :
       |           t6: TMMBR 80000        |
       | <------------------------------- |
       |           t7: RTP data           |
       | -------------------------------> |
       :                                  :

]]></artwork>
        </figure>

        <t><xref target="fig-paused-tmmbn"/> describes the case when an RTP
        stream sender (A) chooses to pause an RTP stream due to local
        considerations. Both the RTP stream sender (A) and the RTP stream
        receiver (B) use TMMBR/TMMBN signaling for pause/resume purposes. A
        decides to pause the RTP stream at time t2 and uses TMMBN 0 to signal
        PAUSED, including itself in the TMMBN bounding set. At time t3,
        despite the fact that the RTP stream is still paused, B decides that
        it is no longer interested to receive the RTP stream and signals PAUSE
        by sending a TMMBR 0. As a result of that, the bounding set now
        contains both A and B, and A sends out a new TMMBN reflecting that.
        After a while, at time t5, the local considerations that caused A to
        pause the RTP stream no longer apply, causing it to remove itself from
        the bounding set and to send a new TMMBN indicating this. At time t6,
        B decides that it is now interested to receive the RTP stream again
        and signals RESUME by sending a TMMBR containing a bitrate value
        greater than 0, causing A to resume sending RTP data.</t>

        <figure align="center" anchor="fig-pause-lost"
                title="Pause and Resume Operation With Messages Lost">
          <artwork align="center"><![CDATA[
+---------------+                       +---------------+
|  RTP Sender   |                       | RTP Receiver  |
+---------------+                       +---------------+
       :           t1: RTP data                :
       | ------------------------------------> |
       |                   t2: PAUSE(7), lost  |
       |                   <---X-------------- |
       |                                       |
       |           t3: RTP data                |
       | ------------------------------------> |
       :                                       :
       |    <Timeout, still receiving data>    |
       |           t4: PAUSE(7)                |
       | <------------------------------------ |
       |          < RTP data paused >          |
       |           t5: PAUSED(7)               | 
       | ------------------------------------> |
       :          < Some time passes >         :
       |                   t6: RESUME(7), lost |
       |                   <---X-------------- |
       |           t7: RESUME(7)               |
       | <------------------------------------ |
       |           t8: RTP data                |
       | ------------------------------------> |
       |           t9: RESUME(7)               |
       | <------------------------------------ |
       :                                       :

]]></artwork>
        </figure>

        <t><xref target="fig-pause-lost"/> describes what happens if a PAUSE
        message from an RTP stream receiver does not reach the RTP stream
        sender. After sending a PAUSE message, the RTP stream receiver waits
        for a time-out to detect if the RTP stream sender has paused the data
        transmission or has sent PAUSED indication according to the rules
        discussed in <xref target="sec-state-paused"/>. As the PAUSE message
        is lost on the way (at time t2), RTP data continues to reach to the
        RTP stream receiver. When the timer expires, the RTP stream receiver
        schedules a retransmission of the PAUSE message, which is sent at time
        t4. If the PAUSE message now reaches the RTP stream sender, it pauses
        the RTP stream and replies with PAUSED.</t>

        <t>At time t6, the RTP stream receiver wishes to resume the stream
        again and sends a RESUME, which is lost. This does not cause any
        severe effect, since there is no requirement to wait until further
        RESUME are sent and another RESUME are sent already at time t7, which
        now reaches the RTP stream sender that consequently resumes the stream
        at time t8. The time interval between t6 and t7 can vary, but may for
        example be one RTCP feedback transmission interval as determined by
        the AVPF rules.</t>

        <t>The RTP stream receiver did not realize that the RTP stream was
        resumed in time to stop yet another scheduled RESUME from being sent
        at time t9. This is however harmless since the RESUME PauseID is less
        than the valid one and will be ignored by the RTP stream sender. It
        will also not cause any unwanted resume even if the stream was paused
        based on a PAUSE from some other receiver before receiving the RESUME,
        since the valid PauseID is now larger than the one in the stray RESUME
        and will only cause a REFUSED containing the new valid PauseID from
        the RTP stream sender.</t>

        <figure align="center" anchor="fig-pause-refused"
                title="Pause Request is Refused in Point-to-Point">
          <artwork align="center"><![CDATA[
+---------------+                 +---------------+
|  RTP Sender   |                 | RTP Receiver  |
+---------------+                 +---------------+
       :           t1: RTP data          :
       | ------------------------------> |
       |           t2: PAUSE(11)         |
       | <------------------------------ |
       |                                 | 
       |  < Can not pause RTP data >     |
       |           t3: REFUSED(11)       | 
       | ------------------------------> |
       |                                 |
       |           t4: RTP data          |
       | ------------------------------> |
       :                                 :

]]></artwork>
        </figure>

        <t>In <xref target="fig-pause-refused"/>, the receiver requests to
        pause the sender, which refuses to pause due to some consideration
        local to the sender and responds with a REFUSED message.</t>
      </section>

      <section title="Point-to-Multipoint using Mixer">
        <t>An RTP Mixer is an intermediate node connecting different
        transport-level clouds. The Mixer receives streams from different RTP
        sources, selects or combines them based on the application&acute;s
        needs and forwards the generated stream(s) to the destination. The
        Mixer typically puts its&acute; own SSRC(s) in RTP data packets
        instead of the original source(s).</t>

        <t>The Mixer keeps track of all the streams delivered to the Mixer and
        how they are currently used. In this example, it selects the video
        stream to deliver to the receiver R based on the voice activity of the
        RTP stream senders. The video stream will be delivered to R using M's
        SSRC and with an CSRC indicating the original source.</t>

        <t>Note that PauseID is not of any significance for the example and is
        therefore omitted in the description.</t>

        <figure align="center" anchor="fig-vad-mixer"
                title="Pause and Resume Operation for a Voice Activated Mixer">
          <artwork align="center"><![CDATA[
+-----+            +-----+            +-----+            +-----+
|  R  |            |  M  |            | S1  |            | S2  |
+-----+            +-----|            +-----+            +-----+
   :                  :   t1:RTP(S1)     :                  :
   | t2:RTP(M:S1)     |<-----------------|                  |
   |<-----------------|                  |                  |
   |                  | t3:RTP(S2)       |                  |
   |                  |<------------------------------------|
   |                  |  t4: PAUSE(S2)   |                  |
   |                  |------------------------------------>|
   |                  |                  |   t5: PAUSED(S2) |
   |                  |<------------------------------------|
   |                  |                  | <S2:No RTP to M> |
   |                  | t6: RESUME(S2)   |                  |
   |                  |------------------------------------>|
   |                  |                  | t7: RTP to M     |
   |                  |<------------------------------------|
   |   t8:RTP(M:S2)   |                  |                  |
   |<-----------------|                  |                  |
   |                  | t9:PAUSE(S1)     |                  |
   |                  |----------------->|                  |
   |                  | t10:PAUSED(S1)   |                  |
   |                  |<-----------------|                  |
   |                  | <S1:No RTP to M> |                  |
   :                  :                  :                  :

]]></artwork>
        </figure>

        <t>The session starts at t1 with S1 being the most active speaker and
        thus being selected as the single video stream to be delivered to R
        (t2) using the Mixer SSRC but with S1 as CSRC (indicated after the
        colon in the figure). Then S2 joins the session at t3 and starts
        delivering an RTP stream to the Mixer. As S2 has less voice activity
        then S1, the Mixer decides to pause S2 at t4 by sending S2 a PAUSE
        request. At t5, S2 acknowledges with a PAUSED and at the same instant
        stops delivering RTP to the Mixer. At t6, the user at S2 starts
        speaking and becomes the most active speaker and the Mixer decides to
        switch the video stream to S2, and therefore quickly sends a RESUME
        request to S2. At t7, S2 has received the RESUME request and acts on
        it by resuming RTP stream delivery to M. When the RTP stream from t7
        arrives at the Mixer, it switches this RTP stream into its SSRC (M) at
        t8 and changes the CSRC to S2. As S1 now becomes unused, the Mixer
        issues a PAUSE request to S1 at t9, which is acknowledged at t10 with
        a PAUSED and the RTP stream from S1 stops being delivered.</t>
      </section>

      <section title="Point-to-Multipoint using Translator ">
        <t>A transport Translator in an RTP session forwards the message from
        one peer to all the others. Unlike Mixer, the Translator does not mix
        the streams or change the SSRC of the messages or RTP media. These
        examples are to show that the messages defined in this specification
        can be safely used also in a transport Translator case. The
        parentheses in the figures contains (Target SSRC, PauseID) information
        for the messages defined in this specification.</t>

        <figure align="center" anchor="fig-translator"
                title="Pause and Resume Operation Between Two Participants Using a Translator">
          <artwork align="center"><![CDATA[
+-------------+     +-------------+     +--------------+
|  Sender(S)  |     | Translator  |     | Receiver(R)  |
+-------------+     +-------------|     +--------------+
       : t1: RTP(S)        :                   :
       |------------------>|                   |
       |                   | t2: RTP (S)       |
       |                   |------------------>|
       |                   | t3: PAUSE(S,3)    |
       |                   |<------------------|
       | t4:PAUSE(S,3)     |                   |
       |<------------------|                   |
       : < Sender waiting for possible RESUME> :
       |          < RTP data paused >          |              
       | t5: PAUSED(S,3)   |                   |
       |------------------>|                   |
       |                   | t6: PAUSED(S,3)   |
       |                   |------------------>|
       :                   :                   :
       |                   | t7: RESUME(S,3)   |
       |                   |<------------------|
       | t8: RESUME(S,3)   |                   |
       |<------------------|                   |
       | t9: RTP (S)       |                   |
       |------------------>|                   |
       |                   | t10: RTP (S)      |
       |                   |------------------>|
       :                   :                   :

]]></artwork>
        </figure>

        <t><xref target="fig-translator"/> describes how a Translator can help
        the receiver in pausing and resuming the sender. The sender S sends
        RTP data to the receiver R through Translator, which just forwards the
        data without modifying the SSRCs. The receiver sends a PAUSE request
        to the sender, which in this example knows that there may be more
        receivers of the stream and waits a non-zero hold-off period to see if
        there is any other receiver that wants to receive the data, does not
        receive any disapproving RESUME, hence pauses itself and replies with
        PAUSED. Similarly the receiver resumes the sender by sending RESUME
        request through Translator. Since this describes only a single pause
        operation for a single RTP stream sender, all messages uses a single
        PauseID, in this example 3.</t>

        <figure align="center" anchor="fig-translator-two-receivers"
                title="Pause and Resume Operation Between One Sender and Two Receivers Through Translator">
          <artwork align="center"><![CDATA[
+-----+            +-----+            +-----+            +-----+
|  S  |            |  T  |            | R1  |            | R2  |
+-----+            +-----|            +-----+            +-----+
   : t1:RTP(S)        :                  :                  :
   |----------------->|                  |                  |
   |                  | t2:RTP(S)        |                  |
   |                  |----------------->------------------>|
   |                  | t3:PAUSE(S,7)    |                  |
   |                  |<-----------------|                  |
   | t4:PAUSE(S,7)    |                  |                  |
   |<-----------------|------------------------------------>|
   |                  |                  |   t5:RESUME(S,7) |
   |                  |<------------------------------------|
   | t6:RESUME(S,7)   |                  |                  |
   |<-----------------|                  |                  |
   |                  |<RTP stream continues to R1 and R2>  |
   |                  |                  |   t7: PAUSE(S,8) |
   |                  |<------------------------------------|
   | t8:PAUSE(S,8)    |                  |                  |
   |<-----------------|                  |                  |
   :                  :                  :                  :
   | < Pauses RTP Stream >               |                  |
   | t9:PAUSED(S,8)   |                  |                  |
   |----------------->|                  |                  |
   |                  | t10:PAUSED(S,8)  |                  |
   |                  |----------------->------------------>|
   :                  :                  :                  :
   |                  | t11:RESUME(S,8)  |                  |
   |                  |<-----------------|                  |
   | t12:RESUME(S,8)  |                  |                  |
   |<-----------------|                  |                  |
   | t13:RTP(S)       |                  |                  |
   |----------------->|                  |                  |
   |                  | t14:RTP(S)       |                  |
   |                  |----------------->------------------>|
   :                  :                  :                  :

]]></artwork>
        </figure>

        <t><xref target="fig-translator-two-receivers"/> explains the pause
        and resume operations when a transport Translator is involved between
        a sender and two receivers in an RTP session. Each message exchange is
        represented by the time it happens. At time t1, Sender (S) starts
        sending an RTP stream to the Translator, which is forwarded to R1 and
        R2 through the Translator, T. R1 and R2 receives RTP data from
        Translator at t2. At this point, both R1 and R2 will send RTCP
        Receiver Reports to S informing that they receive S's stream.</t>

        <t>After some time (at t3), R1 chooses to pause the stream. On
        receiving the PAUSE request from R1 at t4, S knows that there are at
        least one receiver that may still want to receive the data and uses a
        non-zero hold-off period to wait for possible RESUME messages. R2 did
        also receive the PAUSE request at time t4 and since it still wants to
        receive the stream, it sends a RESUME for it at time t5, which is
        forwarded to the sender S by the translator T. The sender S sees the
        RESUME at time t6 and continues to send data to T which forwards to
        both R1 and R2. At t7, the receiver R2 chooses to pause the stream by
        sending a PAUSE request with an updated PauseID. The sender S still
        knows that there are more than one receiver (R1 and R2) that may want
        the stream and again waits a non-zero hold-off period, after which and
        not having received any disapproving RESUME, it concludes that the
        stream must be paused. S now stops sending the stream and replies with
        PAUSED to R1 and R2. When any of the receivers (R1 or R2) chooses to
        resume the stream from S, in this example R1, it sends a RESUME
        request to the sender. The RTP sender immediately resumes the
        stream.</t>

        <t>Consider also an RTP session which includes one or more receivers,
        paused sender(s), and a Translator. Further assume that a new
        participant joins the session, which is not aware of the paused
        sender(s). On receiving knowledge about the newly joined participant,
        e.g. any RTP traffic or RTCP report (i.e. either SR or RR) from the
        newly joined participant, the paused sender(s) immediately sends
        PAUSED indications for the paused streams since there is now a
        receiver in the session that did not pause the sender(s) and may want
        to receive the streams. Having this information, the newly joined
        participant has the same possibility as any other participant to
        resume the paused streams.</t>
      </section>
    </section>

    <section anchor="IANA" title="IANA Considerations">
      <t>This specification requests the following registrations from
      IANA:<list style="numbers">
          <t>A new value for media stream pause / resume to be registered with
          IANA in the "FMT Values for RTPFB Payload Types" registry located at
          the time of publication at:
          http://www.iana.org/assignments/rtp-parameters/rtp-parameters.xhtml#rtp-parameters-8<list
              style="hanging">
              <t hangText="Value:">TBA1</t>

              <t hangText="Name:">PAUSE-RESUME</t>

              <t hangText="Long Name:">Media Pause / Resume</t>

              <t hangText="Reference:">This RFC</t>
            </list></t>

          <t>A new value "pause" to be registered with IANA in the "Codec
          Control Messages" registry located at the time of publication at:
          http://www.iana.org/assignments/sdp-parameters/sdp-parameters.xhtml#sdp-parameters-19<list
              style="hanging">
              <t hangText="Value Name:">pause</t>

              <t hangText="Long Name:">Media Pause / Resume</t>

              <t hangText="Usable with:">ccm</t>

              <t hangText="Reference:">This RFC</t>
            </list></t>
        </list></t>

      <t/>
    </section>

    <section anchor="Security" title="Security Considerations">
      <t>This document extends the <xref target="RFC5104">CCM</xref> and
      defines new messages, i.e. PAUSE and RESUME. The exchange of these new
      messages MAY have some security implications, which need to be addressed
      by the user. Following are some important implications,</t>

      <t><list style="numbers">
          <t>Identity spoofing - An attacker can spoof him/herself as an
          authenticated user and can falsely pause or resume any source
          transmission. In order to prevent this type of attack, a strong
          authentication and integrity protection mechanism is needed.</t>

          <t>Denial of Service (DoS) - An attacker can falsely pause all
          source streams which MAY result in Denial of Service (DoS). An
          Authentication protocol may prevent this attack.</t>

          <t>Man-in-Middle Attack (MiMT) - The pausing and resuming of an RTP
          source is prone to a Man-in-Middle attack. Public key authentication
          may be used to prevent MiMT.</t>
        </list></t>
    </section>

    <section title="Contributors">
      <t>Daniel Grondal contributed in the creation and writing of early
      versions of this specification. Christian Groves contributed
      significantly to the SDP config attribute and its use in
      Offer/Answer.</t>
    </section>

    <section anchor="Acknowledgements" title="Acknowledgements">
      <t>Daniel Grondal made valuable contributions during the initial
      versions of this draft. Christian Groves and Bernard Aboba provided
      valuable review comments.</t>
    </section>
  </middle>

  <back>
    <references title="Normative References">
      <?rfc include="reference.RFC.3550"?>

      <?rfc include="reference.RFC.4585"?>

      <?rfc include="reference.RFC.5104"?>

      <?rfc include='reference.RFC.5234'?>

      <?rfc include='reference.RFC.5245'?>

      <?rfc include='reference.RFC.6263'?>

      <?rfc include="reference.RFC.2119"?>
    </references>

    <references title="Informative References">
      <?rfc include='reference.RFC.2326'?>

      <?rfc include='reference.RFC.2974'?>

      <?rfc include='reference.RFC.3261'?>

      <?rfc include='reference.RFC.3264'?>

      <?rfc include='reference.RFC.4566'?>

      <?rfc include='reference.RFC.6190'?>

      <?rfc include='reference.I-D.westerlund-avtcore-rtp-simulcast'?>

      <?rfc include='reference.I-D.ietf-rtcweb-use-cases-and-requirements'?>

      <?rfc include='reference.I-D.ietf-avtext-rtp-grouping-taxonomy'?>

      <?rfc include='reference.I-D.ietf-avtcore-rtp-topologies-update'?>
    </references>

    <section title="Changes From Earlier Versions">
      <t>NOTE TO RFC EDITOR: Please remove this section prior to
      publication.</t>

      <section title="Modifications Between Version -02 and -03">
        <t><list style="symbols">
            <t>Changed the section on SDP signaling to be more explicit and
            clear in what is supported, replacing the 'paused' parameter and
            the 'dir' attribute with a 'config' parameter that can take a
            value, and an explicit listing of what each value means.</t>

            <t>Added a sentence in section on <xref
            target="sec-state-paused">paused state</xref> that pause must not
            affect RTP keepalive.</t>

            <t>Replaced REFUSE message name with REFUSED throughout, to better
            indicate that it is not a command but a notification.</t>

            <t>Added text in a few places, clarifying that PAUSED message may
            be used unsolicited due to RTP sender local considerations, and
            also clarified the interaction between this usage and an RTP
            stream receiver pausing the stream. Also added an example
            describing this case.</t>

            <t>Clarified that when TMMBN 0 is used as PAUSED message, and when
            sent unsolicited due to RTP sender local considerations, the TMMBN
            message includes the RTP stream sender itself as part of the
            bounding set.</t>

            <t>Clarified that there is no reply to a PAUSED indication.</t>

            <t>Improved the IANA section.</t>

            <t>Editorial improvements.</t>
          </list></t>
      </section>

      <section title="Modifications Between Version -01 and -02">
        <t><list style="symbols">
            <t>Replaced most text on relation with other signaling
            technologies in previous section 5 with a single, summarizing
            paragraph, as discussed at IETF 90 in Toronto, and placed it as
            the last sub-section of section 4 (design considerations).</t>

            <t>Removed unused references.</t>
          </list></t>
      </section>

      <section title="Modifications Between Version -00 and -01">
        <t><list style="symbols">
            <t>Corrected text in section 6.5 and 6.2 to indicate that a PAUSE
            signaled via TMMBR 0 cannot be REFUSED using TMMBN &gt; 0</t>

            <t>Improved alignment with RTP Taxonomy draft, including the
            change of Packet Stream to RTP Stream</t>

            <t>Editorial improvements</t>
          </list></t>
      </section>
    </section>
  </back>
</rfc>
