Speech and multimedia Transmission Quality (STQ) - Transmission requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as perceived by the user

The present document provides speech transmission performance requirements for 8 kHz wideband VoIP handset and
headset terminals; it addresses all types of IP based terminals, including wireless and soft phones.
In contrast to other standards, which define minimum performance requirements, the present document is intended
to specify terminal equipment requirements that enable manufacturers and service providers to deliver good
quality end-to-end speech performance as perceived by the user.
In addition to basic testing procedures, the present document describes advanced testing procedures taking into account
further quality parameters as perceived by the user.

General Information

Status: Published
Publication Date: 25-Mar-2018
Current Stage: 6060 - National Implementation/Publication (Adopted Project)
Start Date: 19-Mar-2018
Due Date: 24-May-2018
Completion Date: 26-Mar-2018

Buy Standard

Standard
ETSI ES 202 739 V1.7.1 (2017-09) - Speech and multimedia Transmission Quality (STQ); Transmission requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as perceived by the user
English language
51 pages
Standard
ETSI ES 202 739 V1.7.1 (2017-07) - Speech and multimedia Transmission Quality (STQ); Transmission requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as perceived by the user
English language
51 pages
Standardization document
ES 202 739 V1.7.1:2018
English language
51 pages

Standards Content (Sample)

ETSI ES 202 739 V1.7.1 (2017-09)






ETSI STANDARD
Speech and multimedia Transmission Quality (STQ);
Transmission requirements for wideband
VoIP terminals (handset and headset)
from a QoS perspective as perceived by the user

Reference
RES/STQ-258
Keywords
quality, speech, telephony, terminal, VoIP,
wideband
ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE

Tel.: +33 4 92 94 42 00  Fax: +33 4 93 65 47 16

Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or
print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any
existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the
print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.
Users of the present document should be aware that the document may be subject to revision or change of status.
Information on the current status of this and other ETSI documents is available at
https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx
If you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx
Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying
and microfilm except as authorized by written permission of ETSI.
The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2017.
All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are trademarks of ETSI registered for the benefit of its Members.
3GPP™ and LTE™ are trademarks of ETSI registered for the benefit of its Members and
of the 3GPP Organizational Partners.
oneM2M logo is protected for the benefit of its Members.
GSM® and the GSM logo are trademarks registered and owned by the GSM Association.
Contents
Intellectual Property Rights
Foreword
Modal verbs terminology
Introduction
1 Scope
2 References
2.1 Normative references
2.2 Informative references
3 Definitions and abbreviations
3.1 Definitions
3.2 Abbreviations
4 General considerations
4.1 Coding algorithm
4.2 End-to-end considerations
5 Test equipment
5.1 IP half channel measurement adaptor
5.2 Environmental conditions for tests
5.3 Accuracy of measurements and test signal generation
5.4 Network impairment simulation
5.5 Acoustic environment
5.6 Influence of terminal delay on measurements
6 Requirements and associated measurement methodologies
6.1 Notes
6.2 Test setup
6.2.1 General
6.2.2 Setup for handsets and headsets
6.2.3 Position and calibration of HATS
6.2.4 Test signal levels
6.2.5 Setup of background noise simulation
6.2.6 Setup of variable echo path
6.3 Coding independent parameters
6.3.1 Send frequency response
6.3.2 Send Loudness Rating (SLR)
6.3.3 Mic mute
6.3.4 Linearity range for SLR
6.3.5 Send distortion
6.3.6 Out-of-band signals in send direction
6.3.7 Send noise
6.3.8 SideTone Masking Rating STMR (mouth to ear)
6.3.9 Sidetone delay
6.3.10 Terminal Coupling Loss (TCL)
6.3.11 Stability loss
6.3.12 Receive frequency response
6.3.13 Receive Loudness Rating (RLR)
6.3.14 Receive distortion
6.3.15 Out-of-band signals in receive direction
6.3.16 Minimum activation level and sensitivity in receive direction
6.3.17 Receive noise
6.3.18 Automatic level control in receive
6.3.19 Double talk performance
6.3.19.1 General
6.3.19.2 Attenuation range in send direction during double talk A_H,S,dt
6.3.19.3 Attenuation range in receive direction during double talk A_H,R,dt
6.3.19.4 Detection of echo components during double talk
6.3.19.5 Minimum activation level and sensitivity of double talk detection
6.3.20 Switching characteristics
6.3.20.1 Note
6.3.20.2 Activation in send direction
6.3.20.3 Silence suppression and comfort noise generation
6.3.21 Background noise performance
6.3.21.1 Performance in send in the presence of background noise
6.3.21.2 Speech quality in the presence of background noise
6.3.21.3 Quality of background noise transmission (with far end speech)
6.3.22 Quality of echo cancellation
6.3.22.1 Temporal echo effects
6.3.22.2 Spectral echo attenuation
6.3.22.3 Occurrence of artefacts
6.3.22.4 Variable echo path
6.3.23 Variant impairments; network dependant
6.3.23.1 Clock accuracy send
6.3.23.2 Clock accuracy receive
6.3.23.3 Send packet delay variation
6.3.24 Send and receive delay - round trip delay
6.4 Codec specific requirements
6.4.1 Objective listening speech quality MOS-LQO in send direction
6.4.2 Objective listening quality MOS-LQO in receive direction
6.4.3 Quality of jitter buffer adjustment
Annex A (informative): Processing delays in VoIP terminals
Annex B (informative): Optimum frequency responses for wideband transmission in receive direction - underlying subjective experiments
Annex C (informative): Bibliography
History

Intellectual Property Rights
Essential patents
IPRs essential or potentially essential to the present document may have been declared to ETSI. The information
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web
server (https://ipr.etsi.org/).
Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web
server) which are, or may be, or may become, essential to the present document.
Trademarks
The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners.
ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no
right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does
not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.
Foreword
This ETSI Standard (ES) has been produced by ETSI Technical Committee Speech and multimedia Transmission
Quality (STQ).
Modal verbs terminology
In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and
"cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of
provisions).
"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.
Introduction
Traditionally, analogue and digital telephones interfaced with circuit-switched 64 kbit/s PCM networks. With the
fast growth of IP networks, wideband terminals providing higher audio bandwidth and directly interfacing
packet-switched networks (VoIP) are being rapidly introduced. Such IP network edge devices may include gateways,
specifically designed IP phones, soft phones or other devices connected to IP based networks and providing
telephony service. Since IP networks will in many cases interwork with the traditional PSTN and private
networks, many of the basic transmission requirements have to be harmonised with specifications for traditional digital
terminals. However, due to the unique characteristics of IP networks, including packet loss and delay, new
performance specifications, as well as appropriate measuring methods, will have to be developed. Terminals are becoming
increasingly complex, and advanced signal processing is used to address the IP specific issues.
The advanced signal processing of terminals is targeted at speech signals. Therefore, wherever possible, speech signals
are used for testing in order to achieve the most realistic test conditions and meaningful results.
The present document provides speech transmission performance requirements for wideband VoIP handset and headset
terminals.
NOTE: Requirement limits are given in tables; the associated curve, when provided, is given for illustration.
1 Scope
The present document provides speech transmission performance requirements for 8 kHz wideband VoIP handset and
headset terminals; it addresses all types of IP based terminals, including wireless and soft phones.
In contrast to other standards, which define minimum performance requirements, the present document is intended
to specify terminal equipment requirements that enable manufacturers and service providers to deliver good
quality end-to-end speech performance as perceived by the user.
In addition to basic testing procedures, the present document describes advanced testing procedures taking into account
further quality parameters as perceived by the user.
2 References
2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
Referenced documents which are not found to be publicly available in the expected location might be found at
http://docbox.etsi.org/Reference.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are necessary for the application of the present document.
[1] Recommendation ITU-T G.107: "The E-model, a computational model for use in transmission
planning".
[2] Recommendation ITU-T G.108: "Application of the E-model: A planning guide".
[3] Recommendation ITU-T G.109: "Definition of categories of speech transmission quality".
[4] Void.
[5] Recommendation ITU-T G.722: "7 kHz audio-coding within 64 kbit/s".
[6] Recommendation ITU-T G.722.1: "Low-complexity coding at 24 and 32 kbit/s for hands-free
operation in systems with low frame loss".
[7] Recommendation ITU-T G.729.1: "G.729 based Embedded Variable bit-rate coder: An 8-32 kbit/s
scalable wideband coder bitstream interoperable with G.729".
[8] Recommendation ITU-T P.56: "Objective measurement of active speech level".
[9] Recommendation ITU-T P.57: "Artificial ears".
[10] Recommendation ITU-T P.58: "Head and torso simulator for telephonometry".
[11] Recommendation ITU-T P.64: "Determination of sensitivity/frequency characteristics of local
telephone systems".
[12] Recommendation ITU-T P.79: "Calculation of loudness ratings for telephone sets".
[13] Recommendation ITU-T P.340: "Transmission characteristics and speech quality parameters of
hands-free terminals".
[14] Recommendation ITU-T P.380: "Electro-acoustic measurements on headsets".
[15] Recommendation ITU-T P.501: "Test signals for use in telephonometry".
[16] Recommendation ITU-T P.502: "Objective test methods for speech communication systems using
complex test signals".
[17] Recommendation ITU-T P.581: "Use of head and torso simulator (HATS) for hands-free terminal
testing".
[18] IEC 61260-1: "Electroacoustics - Octave-band and fractional-octave-band filters - Part 1:
Specifications".
[19] TIA-920.130-A: "Telecommunications Telephone Terminal Equipment Transmission
Requirements for Wideband Digital Wireline Telephones with Headset".
[20] ETSI TS 103 224: "Speech and multimedia Transmission Quality (STQ); A sound field
reproduction method for terminal testing including a background noise database".
[21] Recommendation ITU-T P.863: "Perceptual objective listening quality assessment".
[22] Recommendation ITU-T P.863.1: "Application Guide for Recommendation ITU-T P.863".
[23] ETSI ES 202 737: "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for narrowband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
[24] Recommendation ITU-T P.1010: "Fundamental voice transmission objectives for VoIP terminals
and gateways".
[25] Recommendation ITU-T G.722.2: "Wideband coding of speech at around 16 kbit/s using Adaptive
Multi-Rate Wideband (AMR-WB)".
[26] IETF RFC 3550: "RTP: A Transport Protocol for Real-Time Applications".
[27] Recommendation ITU-T G.122: "Influence of national systems on stability and talker echo in
international connections".
2.2 Informative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are not necessary for the application of the present document but they assist the
user with regard to a particular subject area.
[i.1] ETSI EG 201 377-1: "Speech and multimedia Transmission Quality (STQ); Specification and
measurement of speech transmission quality; Part 1: Introduction to objective comparison
measurement methods for one-way speech quality across networks".
[i.2] ETSI EG 202 425: "Speech Processing, Transmission and Quality Aspects (STQ); Definition and
implementation of VoIP reference point".
[i.3] ETSI EG 202 396-3: "Speech and multimedia Transmission Quality (STQ); Speech Quality
performance in the presence of background noise; Part 3: Background noise transmission -
Objective test methods".
[i.4] Recommendation ITU-T P.800.1: "Mean Opinion Score (MOS) Terminology".
[i.5] NIST Net™.
NOTE: Available at https://www-x.antd.nist.gov/itg/nistnet/.
[i.6] Netem™.
NOTE: Available at http://www.linuxfoundation.org/en/Net:Netem.
[i.7] DAGA 2008: "Testing Wideband Terminals", March 10-13, Dresden, Proceedings. Poschen S.,
Kettler F., Raake A., Spors S.
[i.8] Trace Control for Netem (TCN): A. Keller, "Trace Control for Netem", Semester Thesis
SA-2006-15, ETH Zürich, 2006.
[i.9] ETSI ES 202 739 (V1.2.1): "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
3 Definitions and abbreviations
3.1 Definitions
For the purposes of the present document, the following terms and definitions apply:
artificial ear: device for the calibration of earphones incorporating an acoustic coupler and a calibrated microphone for
the measurement of the sound pressure and having an overall acoustic impedance similar to that of the median adult
human ear over a given frequency band
codec: combination of an analogue-to-digital encoder and a digital-to-analogue decoder operating in opposite directions
of transmission in the same equipment
diffuse field equalization: equalization of the HATS sound pick-up, equalization of the difference, in dB, between the
spectrum level of the acoustic pressure at the ear Drum Reference Point (DRP) and the spectrum level of the acoustic
pressure at the HATS Reference Point (HRP) in a diffuse sound field with the HATS absent using the reverse nominal
curve given in table 3 of Recommendation ITU-T P.58 [10]
ear-Drum Reference Point (DRP): point located at the end of the ear canal, corresponding to the ear-drum position
freefield reference point: point located in the free sound field, at a distance of at least 1,5 m from a sound source radiating
in free air (in the case of a head and torso simulator (HATS), in the centre of the artificial head with no artificial head
present)
Head And Torso Simulator (HATS) for telephonometry: manikin extending downward from the top of the head to
the waist, designed to simulate the sound pick-up characteristics and the acoustic diffraction produced by a median
human adult and to reproduce the acoustic field generated by the human mouth
Mouth Reference Point (MRP): point located on axis and 25 mm in front of the lip plane of a mouth simulator
nominal setting of the volume control: when a receive volume control is provided, the setting which is closest to the
nominal RLR of 2 dB
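The definition of the nominal volume control setting can be illustrated with a minimal sketch. It assumes that the RLR has already been measured for each volume step; the function name, the step numbering and the measured values are hypothetical and are not taken from the present document.

```python
NOMINAL_RLR_DB = 2.0  # nominal RLR quoted in the definition above


def nominal_volume_setting(measured_rlr_db):
    """Return the volume step whose measured RLR is closest to the nominal RLR.

    measured_rlr_db maps a volume step (any hashable label) to its RLR in dB.
    """
    return min(
        measured_rlr_db,
        key=lambda step: abs(measured_rlr_db[step] - NOMINAL_RLR_DB),
    )


# Hypothetical measurements: volume step -> RLR in dB (illustration only)
example = {1: 11.5, 2: 8.0, 3: 4.6, 4: 1.2, 5: -2.3}
print(nominal_volume_setting(example))
```

With these illustrative values, step 4 would be treated as the nominal setting because its RLR deviates least from 2 dB.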
3.2 Abbreviations
For the purposes of the present document, the following abbreviations apply:
AM-FM Amplitude Modulation-Frequency Modulation
AMR-WB Adaptive Multi Rate - Wideband
CS Composite Source
CSS Composite Source Signal
DRP ear Drum Reference Point
EC Echo Canceller
ELR Echo Loudness Rating
ERP Ear Reference Point
ETH Eidgenössische Technische Hochschule
FFT Fast Fourier Transform
G-MOS-LQOw Overall transmission quality wideband
GSM Global System for Mobile communications
HATS Head And Torso Simulator
HRP HATS Reference Point
IEC International Electrotechnical Commission
IP Internet Protocol
IPDV IP Packet Delay Variation
ITU-T International Telecommunication Union - Telecommunication standardization sector
MOS Mean Opinion Score
MOS-LQOy Mean Opinion Score - Listening Quality Objective
NOTE: y being N for narrow-band, M for mixed and S for superwideband. See Recommendation ITU-T
P.800.1 [i.4].
MRP Mouth Reference Point
NIST National Institute of Standards and Technology
NLP Non Linear Processor
N-MOS-LQOw Transmission quality of the background noise wideband
PBX Private Branch eXchange
PC Personal Computer
PCM Pulse Code Modulation
POI Point Of Interconnect
PSTN Public Switched Telephone Network
QoS Quality of Service
RLR Receive Loudness Rating
RMS Root Mean Square
RTP Real-time Transport Protocol
SLR Send Loudness Rating
S-MOS-LQOw Transmission quality of the speech wideband
STMR SideTone Masking Rating
TCL Terminal Coupling Loss
TCN Trace Control for Netem
TDM Time Division Multiplex
TOSQA Telecommunication Objective Speech Quality Assessment
VAD Voice Activity Detection
VoIP Voice over IP
4 General considerations
4.1 Coding algorithm
The assumed coding algorithm is according to Recommendation ITU-T G.722 [5]. VoIP terminals may support other
coding algorithms.
NOTE: Associated Packet Loss Concealment, e.g. as defined in Recommendation ITU-T G.722 [5],
Appendixes 3 and 4 should be used.
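As a rough illustration of what the assumed codec implies for packetization, the following sketch computes the speech payload carried per packet for a constant bit rate codec. The 64 kbit/s rate comes from the title of Recommendation ITU-T G.722 [5]; the 10 ms and 20 ms packet intervals are assumptions chosen for the example and are not requirements of the present document.

```python
def payload_bytes(bitrate_bps, packet_ms):
    """Speech payload per packet, in bytes, for a constant bit rate codec.

    bitrate_bps: codec bit rate in bit/s (64000 for G.722 at 64 kbit/s).
    packet_ms:   packetization interval in milliseconds (assumed value).
    """
    bits = bitrate_bps * packet_ms // 1000  # bits produced per packet interval
    return bits // 8


G722_BITRATE_BPS = 64_000  # from the title of ITU-T G.722: "within 64 kbit/s"

for interval_ms in (10, 20):  # assumed packet intervals, for illustration
    print(interval_ms, "ms ->", payload_bytes(G722_BITRATE_BPS, interval_ms), "bytes")
```

The figures cover the codec payload only; RTP, UDP and IP headers add to the size of each packet on the wire.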
4.2 End-to-end considerations
In order to achieve a desired end-to-end speech transmission performance (mouth-to-ear) it is recommended that the
general rules of transmission planning are carried out with the E-model of Recommendation ITU-T G.107 [1] taking
into account that the E-model does not yet address wideband transmission planning; this includes the a-priori
determination of the desired category of speech transmission quality as defined in Recommendation ITU-T G.109 [3].
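A minimal sketch of the planning step described above: it converts an E-model transmission rating R into an estimated MOS using the mapping published in Recommendation ITU-T G.107 [1], and looks up the speech transmission quality category of Recommendation ITU-T G.109 [3]. The function names are hypothetical. Since the E-model does not yet address wideband transmission planning, this is the narrowband mapping and is shown for illustration only.

```python
def r_to_mos(r):
    """Estimated MOS for a transmission rating R (narrowband G.107 mapping)."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7.0e-6


def g109_category(r):
    """Speech transmission quality category for R, following ITU-T G.109."""
    for lower_bound, name in ((90, "Best"), (80, "High"), (70, "Medium"),
                              (60, "Low"), (50, "Poor")):
        if r >= lower_bound:
            return name
    return "Not recommended"  # G.109 does not recommend R below 50


for rating in (93, 85, 72, 55):  # arbitrary example ratings
    print(rating, round(r_to_mos(rating), 2), g109_category(rating))
```

In practice, R would come from a full E-model calculation over the planned connection; here it is simply supplied as an input.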
While, in general, the transmission characteristics of single circuit-oriented network elements, such as switches or
terminals can be assumed to have a single input value for the planning
...

Final draft ETSI ES 202 739 V1.7.1 (2017-07)






Foreword
This final draft ETSI Standard (ES) has been produced by ETSI Technical Committee Speech and multimedia
Transmission Quality (STQ), and is now submitted for the ETSI standards Membership Approval Procedure.
Modal verbs terminology
In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and
"cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of
provisions).
"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.
Introduction
Traditionally, the analogue and digital telephones were interfacing switched-circuit 64 kbit/s PCM networks. With the
fast growth of IP networks, wideband terminals providing higher audio-bandwidth and directly interfacing
packet-switched networks (VoIP) are being rapidly introduced. Such IP network edge devices may include gateways,
specifically designed IP phones, soft phones or other devices connected to the IP based networks and providing
telephony service. Since the IP networks will be in many cases interworking with the traditional PSTN and private
networks, many of the basic transmission requirements have to be harmonised with specifications for traditional digital
terminals. However, due to the unique characteristics of the IP networks including packet loss, delay, etc. New
performance specification, as well as appropriate measuring methods, will have to be developed. Terminals are getting
increasingly complex, advanced signal processing is used to address the IP specific issues.
The advanced signal processing of terminals is targeted at speech signals. Therefore, wherever possible, speech signals
are used for testing in order to achieve the most realistic test conditions and meaningful results.
The present document provides speech transmission performance requirements for wideband VoIP handset and headset
terminals.
NOTE: Requirement limits are given in tables; the associated curve, when provided, is given for illustration.
1 Scope
The present document provides speech transmission performance requirements for 8 kHz wideband VoIP handset and
headset terminals; it addresses all types of IP based terminals, including wireless and soft phones.
In contrast to other standards which define minimum performance requirements, it is the intention of the present
document to specify terminal equipment requirements which allow manufacturers and service providers to deliver good
quality end-to-end speech performance as perceived by the user.
In addition to basic testing procedures, the present document describes advanced testing procedures taking into account
further quality parameters as perceived by the user.
2 References
2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
Referenced documents which are not found to be publicly available in the expected location might be found at
http://docbox.etsi.org/Reference.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are necessary for the application of the present document.
[1] Recommendation ITU-T G.107: "The E-model, a computational model for use in transmission
planning".
[2] Recommendation ITU-T G.108: "Application of the E-model: A planning guide".
[3] Recommendation ITU-T G.109: "Definition of categories of speech transmission quality".
[4] Void.
[5] Recommendation ITU-T G.722: "7 kHz audio-coding within 64 kbit/s".
[6] Recommendation ITU-T G.722.1: "Low-complexity coding at 24 and 32 kbit/s for hands-free
operation in systems with low frame loss".
[7] Recommendation ITU-T G.729.1: "G.729 based Embedded Variable bit-rate coder: An 8-32 kbit/s
scalable wideband coder bitstream interoperable with G.729".
[8] Recommendation ITU-T P.56: "Objective measurement of active speech level".
[9] Recommendation ITU-T P.57: "Artificial ears".
[10] Recommendation ITU-T P.58: "Head and torso simulator for telephonometry".
[11] Recommendation ITU-T P.64: "Determination of sensitivity/frequency characteristics of local
telephone systems".
[12] Recommendation ITU-T P.79: "Calculation of loudness ratings for telephone sets".
[13] Recommendation ITU-T P.340: "Transmission characteristics and speech quality parameters of
hands-free terminals".
[14] Recommendation ITU-T P.380: "Electro-acoustic measurements on headsets".
[15] Recommendation ITU-T P.501: "Test signals for use in telephonometry".
[16] Recommendation ITU-T P.502: "Objective test methods for speech communication systems using
complex test signals".
[17] Recommendation ITU-T P.581: "Use of head and torso simulator (HATS) for hands-free terminal
testing".
[18] IEC 61260-1: "Electroacoustics - Octave-band and fractional-octave-band filters - Part 1:
Specifications".
[19] TIA-920.130-A: "Telecommunications Telephone Terminal Equipment Transmission
Requirements for Wideband Digital Wireline Telephones with Headset".
[20] ETSI TS 103 224: "Speech and multimedia Transmission Quality (STQ); A sound field
reproduction method for terminal testing including a background noise database".
[21] Recommendation ITU-T P.863: "Perceptual objective listening quality assessment".
[22] Recommendation ITU-T P.863.1: "Application Guide for Recommendation ITU-T P.863".
[23] ETSI ES 202 737: "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for narrowband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
[24] Recommendation ITU-T P.1010: "Fundamental voice transmission objectives for VoIP terminals
and gateways".
[25] Recommendation ITU-T G.722.2: "Wideband coding of speech at around 16 kbit/s using Adaptive
Multi-Rate Wideband (AMR-WB)".
[26] IETF RFC 3550: "RTP: A Transport Protocol for Real-Time Applications".
[27] Recommendation ITU-T G.122: "Influence of national systems on stability and talker echo in
international connections".
2.2 Informative references
References are either specific (identified by date of publication and/or edition number or version number) or
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the
referenced document (including any amendments) applies.
NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee
their long term validity.
The following referenced documents are not necessary for the application of the present document but they assist the
user with regard to a particular subject area.
[i.1] ETSI EG 201 377-1: "Speech and multimedia Transmission Quality (STQ); Specification and
measurement of speech transmission quality; Part 1: Introduction to objective comparison
measurement methods for one-way speech quality across networks".
[i.2] ETSI EG 202 425: "Speech Processing, Transmission and Quality Aspects (STQ); Definition and
implementation of VoIP reference point".
[i.3] ETSI EG 202 396-3: "Speech and multimedia Transmission Quality (STQ); Speech Quality
performance in the presence of background noise; Part 3: Background noise transmission -
Objective test methods".
[i.4] Recommendation ITU-T P.800.1: "Mean Opinion Score (MOS) Terminology".
[i.5] NIST Net™.
NOTE: Available at https://www-x.antd.nist.gov/itg/nistnet/.
[i.6] Netem™.
NOTE: Available at http://www.linuxfoundation.org/en/Net:Netem.
[i.7] DAGA 2008: "Testing Wideband Terminals", March 10-13, Dresden, Proceedings. Poschen S.,
Kettler F., Raake A., Spors S.
[i.8] Trace Control for Netem (TCN): A. Keller, "Trace Control for Netem", Semester Thesis
SA-2006-15, ETH Zürich, 2006.
[i.9] ETSI ES 202 739 (V1.2.1): "Speech and multimedia Transmission Quality (STQ); Transmission
requirements for wideband VoIP terminals (handset and headset) from a QoS perspective as
perceived by the user".
3 Definitions and abbreviations
3.1 Definitions
For the purposes of the present document, the following terms and definitions apply:
artificial ear: device for the calibration of earphones incorporating an acoustic coupler and a calibrated microphone for
the measurement of the sound pressure and having an overall acoustic impedance similar to that of the median adult
human ear over a given frequency band
codec: combination of an analogue-to-digital encoder and a digital-to-analogue decoder operating in opposite directions
of transmission in the same equipment
diffuse field equalization: equalization of the HATS sound pick-up, equalization of the difference, in dB, between the
spectrum level of the acoustic pressure at the ear Drum Reference Point (DRP) and the spectrum level of the acoustic
pressure at the HATS Reference Point (HRP) in a diffuse sound field with the HATS absent using the reverse nominal
curve given in table 3 of Recommendation ITU-T P.58 [10]
ear-Drum Reference Point (DRP): point located at the end of the ear canal, corresponding to the ear-drum position
freefield reference point: point located in the free sound field, at a distance of at least 1,5 m from a sound source
radiating in free air (in the case of a head and torso simulator (HATS), in the centre of the artificial head with no
artificial head present)
Head And Torso Simulator (HATS) for telephonometry: manikin extending downward from the top of the head to
the waist, designed to simulate the sound pick-up characteristics and the acoustic diffraction produced by a median
human adult and to reproduce the acoustic field generated by the human mouth
Mouth Reference Point (MRP): point located on axis and 25 mm in front of the lip plane of a mouth simulator
nominal setting of the volume control: when a receive volume control is provided, the setting which is closest to the
nominal RLR of 2 dB
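The definition of the nominal volume control setting amounts to a nearest-value selection. The following minimal Python sketch illustrates it, assuming the RLR has already been measured at each available setting; the function name, the setting labels and the measured values are hypothetical and are not taken from the present document.

```python
from typing import Mapping

NOMINAL_RLR_DB = 2.0  # nominal RLR stated in the definition above


def nominal_volume_setting(measured_rlr_db: Mapping[str, float]) -> str:
    """Return the volume setting whose measured RLR is closest to the nominal RLR.

    `measured_rlr_db` maps a setting label to its measured RLR in dB
    (hypothetical input format chosen for this illustration).
    """
    return min(
        measured_rlr_db,
        key=lambda setting: abs(measured_rlr_db[setting] - NOMINAL_RLR_DB),
    )


if __name__ == "__main__":
    # Illustrative values only, not measurement results.
    example = {"step 1": 8.0, "step 2": 4.5, "step 3": 1.2, "step 4": -2.0}
    print(nominal_volume_setting(example))
```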
3.2 Abbreviations
For the purposes of the present document, the following abbreviations apply:
AM-FM Amplitude Modulation-Frequency Modulation
AMR-WB Adaptive Multi Rate - Wideband
CS Composite Source
CSS Composite Source Signal
DRP ear Drum Reference Point
EC Echo Canceller
ELR Echo Loudness Rating
ERP Ear Reference Point
ETH Eidgenössische Technische Hochschule
FFT Fast Fourier Transform
G-MOS-LQOw Overall transmission quality wideband
GSM Global System for Mobile Communications
HATS Head And Torso Simulator
HRP HATS Reference Point
IEC International Electrotechnical Commission
IP Internet Protocol
IPDV IP Packet Delay Variation
ITU-T International Telecommunication Union - Telecommunication Standardization Sector
MOS Mean Opinion Score
MOS-LQOy Mean Opinion Score - Listening Quality Objective
NOTE: y being N for narrow-band, M for mixed and S for superwideband. See Recommendation
ITU-T P.800.1 [i.4].
MRP Mouth Reference Point
NIST National Institute of Standards and Technology
NLP Non Linear Processor
N-MOS-LQOw Transmission quality of the background noise wideband
PBX Private Branch eXchange
PC Personal Computer
PCM Pulse Code Modulation
POI Point Of Interconnect
PSTN Public Switched Telephone Network
QoS Quality of Service
RLR Receive Loudness Rating
RMS Root Mean Square
RTP Real-time Transport Protocol
SLR Send Loudness Rating
S-MOS-LQOw Transmission quality of the speech wideband
STMR SideTone Masking Rating
TCL Terminal Coupling Loss
TCN Trace Control for Netem
TDM Time Division Multiplex
TOSQA Telecommunication Objective Speech Quality Assessment
VAD Voice Activity Detection
VoIP Voice over IP
4 General considerations
4.1 Coding algorithm
The assumed coding algorithm is according to Recommendation ITU-T G.722 [5]. VoIP terminals may support other
coding algorithms.
NOTE: The associated Packet Loss Concealment, e.g. as defined in Appendices III and IV of
Recommendation ITU-T G.722 [5], should be used.
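To give a feel for what the assumed codec means on the IP side, the following Python sketch computes the RTP payload size and the resulting IP bit rate for a 64 kbit/s stream, the rate named in the title of Recommendation ITU-T G.722 [5]. The 20 ms packetization interval and the plain IPv4/UDP/RTP header sizes (no options, no header extensions) are assumptions made for this illustration; the present document does not prescribe them here.

```python
G722_BITRATE_BPS = 64_000  # from the title of ITU-T G.722 [5]
# Assumption: IPv4 (20) + UDP (8) + RTP fixed header (12), no options or extensions.
IPV4_UDP_RTP_HEADER_BYTES = 20 + 8 + 12


def payload_bytes(bitrate_bps: int, packet_ms: int) -> int:
    """Codec payload carried in one packet of `packet_ms` milliseconds."""
    return bitrate_bps * packet_ms // 8000  # bits/s * ms / (8 bits * 1000 ms/s)


def ip_bitrate_bps(bitrate_bps: int, packet_ms: int,
                   header_bytes: int = IPV4_UDP_RTP_HEADER_BYTES) -> float:
    """Bit rate at IP level, including per-packet header overhead."""
    packets_per_second = 1000 / packet_ms
    return (payload_bytes(bitrate_bps, packet_ms) + header_bytes) * 8 * packets_per_second


if __name__ == "__main__":
    packet_ms = 20  # assumed packetization interval
    print(payload_bytes(G722_BITRATE_BPS, packet_ms), "payload bytes per packet")
    print(ip_bitrate_bps(G722_BITRATE_BPS, packet_ms), "bit/s at IP level")
```

Shorter packets reduce packetization delay but increase the relative header overhead, which is why the interval is kept as an explicit parameter.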
4.2 End-to-end considerations
In order to achieve a desired end-to-end speech transmission performance (mouth-to-ear), it is recommended that the
general rules of transmission planning be carried out with the E-model of Recommendation ITU-T G.107 [1], taking
into account that the E-model does not yet address wideband transmission planning. This includes the a-priori
determination of the desired category of speech transmission quality as defined in Recommendation ITU-T G.109 [3].
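As an illustration of this planning step, the Python sketch below converts an E-model transmission rating factor R into an estimated MOS using the mapping given in Recommendation ITU-T G.107 [1], and assigns the quality category ranges of Recommendation ITU-T G.109 [3]. It is a sketch under stated assumptions: the function names are hypothetical, the value of R is taken as already computed, and the scale is the narrowband one, since the E-model as referenced here does not address wideband planning.

```python
def r_to_mos(r: float) -> float:
    """Estimated MOS for a rating factor R (narrowband E-model mapping)."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    mos = 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6
    # The polynomial dips marginally below 1 for very small R; clamp to the scale floor.
    return max(1.0, mos)


def g109_category(r: float) -> str:
    """Speech transmission quality category for a rating factor R."""
    if r >= 90:
        return "best"
    if r >= 80:
        return "high"
    if r >= 70:
        return "medium"
    if r >= 60:
        return "low"
    if r >= 50:
        return "poor"
    return "not recommended"


if __name__ == "__main__":
    for r in (93.2, 85.0, 72.0, 55.0):  # illustrative R values only
        print(f"R = {r:5.1f}  MOS ~ {r_to_mos(r):.2f}  category: {g109_category(r)}")
```

Fixing the target category first and then checking the planned R against it mirrors the a-priori determination described above.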
...

SLOVENIAN STANDARD
SIST ES 202 739 V1.7.1:2018
01-May-2018
Speech and multimedia Transmission Quality (STQ) - Transmission requirements for
wideband VoIP terminals (handset and headset) from a QoS perspective as perceived by
the user
This Slovenian standard is identical to: ETSI ES 202 739 V1.7.1 (2017-09)
ICS:
33.050.01 Telecommunication terminal equipment in general
SIST ES 202 739 V1.7.1:2018 en
2003-01. Slovenian Institute for Standardization. Reproduction of this standard in whole or in part is not permitted.

ETSI ES 202 739 V1.7.1 (2017-09)
ETSI STANDARD
Speech and multimedia Transmission Quality (STQ);
Transmission requirements for wideband
VoIP terminals (handset and headset)
from a QoS perspective as perceived by the user

Reference
RES/STQ-258
Keywords
quality, speech, telephony, terminal, VoIP,
wideband
ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE

Tel.: +33 4 92 94 42 00  Fax: +33 4 93 65 47 16

Siret N° 348 623 562 00017 - NAF 742 C
Non-profit association registered at the
Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from:
http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or
print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any
existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the
print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.
Users of the present document should be aware that the document may be subject to revision or change of status.
Information on the current status of this and other ETSI documents is available at
https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx
If you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx
Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying
and microfilm except as authorized by written permission of ETSI.
The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2017.
All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are trademarks of ETSI registered for the benefit of its Members.
3GPP™ and LTE™ are trademarks of ETSI registered for the benefit of its Members and
of the 3GPP Organizational Partners.
oneM2M logo is protected for the benefit of its Members.
GSM® and the GSM logo are trademarks registered and owned by the GSM Association.
Contents
Intellectual Property Rights
Foreword
Modal verbs terminology
Introduction
1 Scope
2 References
2.1 Normative references
2.2 Informative references
3 Definitions and abbreviations
3.1 Definitions
3.2 Abbreviations
4 General considerations
4.1 Coding algorithm
4.2 End-to-end considerations
5 Test equipment
5.1 IP half channel measurement adaptor
5.2 Environmental conditions for tests
5.3 Accuracy of measurements and test signal generation
5.4 Network impairment simulation
5.5 Acoustic environment
5.6 Influence of terminal delay on measurements
6 Requirements and associated measurement methodologies
6.1 Notes
6.2 Test setup
6.2.1 General
6.2.2 Setup for handsets and headsets
6.2.3 Position and calibration of HATS
6.2.4 Test signal levels
6.2.5 Setup of background noise simulation
6.2.6 Setup of variable echo path
6.3 Coding independent parameters
6.3.1 Send frequency response
6.3.2 Send Loudness Rating (SLR)
6.3.3 Mic mute
6.3.4 Linearity range for SLR
6.3.5 Send distortion
6.3.6 Out-of-band signals in send direction
6.3.7 Send noise
6.3.8 SideTone Masking Rating STMR (mouth to ear)
6.3.9 Sidetone delay
6.3.10 Terminal Coupling Loss (TCL)
6.3.11 Stability loss
6.3.12 Receive frequency response
6.3.13 Receive Loudness Rating (RLR)
6.3.14 Receive distortion
6.3.15 Out-of-band signals in receive direction
6.3.16 Minimum activation level and sensitivity in receive direction
6.3.17 Receive noise
6.3.18 Automatic level control in receive
6.3.19 Double talk performance
6.3.19.1 General
6.3.19.2 Attenuation range in send direction during double talk A_H,S,dt
6.3.19.3 Attenuation range in receive direction during double talk A_H,R,dt
6.3.19.4 Detection of echo components during double talk
6.3.19.5 Minimum activation level and sensitivity of double talk detection
6.3.20 Switching characteristics
6.3.20.1 Note
6.3.20.2 Activation in send direction
6.3.20.3 Silence suppression and comfort noise generation
6.3.21 Background noise performance
6.3.21.1 Performance in send in the presence of background noise
6.3.21.2 Speech quality in the presence of background noise
6.3.21.3 Quality of background noise transmission (with far end speech)
6.3.22 Quality of echo cancellation
6.3.22.1 Temporal echo effects
6.3.22.2 Spectral echo attenuation
6.3.22.3 Occurrence of artefacts
6.3.22.4 Variable echo path
6.3.23 Variant impairments; network dependant
6.3.23.1 Clock accuracy send
6.3.23.2 Clock accuracy receive
6.3.23.3 Send packet delay variation
6.3.24 Send and receive delay - round trip delay
6.4 Codec specific requirements
6.4.1 Objective listening speech quality MOS-LQO in send direction
6.4.2 Objective listening quality MOS-LQO in receive direction
6.4.3 Quality of jitter buffer adjustment
Annex A (informative): Processing delays in VoIP terminals
Annex B (informative): Optimum frequency responses for wideband transmission in receive direction - underlying subjective experiments
Annex C (informative): Bibliography
History

Intellectual Property Rights
Essential patents
IPRs essential or potentially essential to the present document may have been declared to ETSI. The information
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web
server (https://ipr.etsi.org/).
Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web
server) which are, or may be, or may become, essential to the present document.
