(version 5.0) enables voice communication in extremely challenging conditions where it was previously impossible, delivering speech quality and intelligibility comparable to that of MELPe at 600 bps, even while operating at an exceptionally low bit rate.

For Digital HF Radio and other markets.

TWELP Technology Features. The vocoder is based on the newest speech coding technology, called "Tri-Wave Excited Linear Prediction" (TWELP), which was developed by DSPINI's experts.

TWELP technology is a new class of vocoders that differs from other LPC-based vocoders in the following:

  • advanced, reliable method of pitch estimation
  • pitch-synchronous analysis
  • advanced tri-wave model of excitation
  • newest quantization schemes
  • pitch-synchronous synthesis

Thanks to these unique features, TWELP technology provides significantly better speech quality than other well-known technologies (including AMBE+2, MELPe, ACELP, and others) at equivalent bit rates from 300 bps to 4800 bps and beyond. In addition, unlike other low-bitrate vocoders such as MELPe, TWELP delivers much higher quality for non-speech signals, including sirens, background music, and similar audio.

Speech Quality. This section compares the TWELP 300 bps vocoder with the MELPe vocoder operating at 600 bps, twice the bit rate. Both vocoders were tested using the ITU-T P.50 speech base in 20 different languages.
The speech base was updated by removing all non-speech pauses.
The ITU-T P.862 tool was used to evaluate speech quality in terms of PESQ scores:

 
The diagram illustrates the difference in speech quality between the TWELP 300 and MELPe 600 vocoders. Exact numbers are shown in the table below.
Language     MELPe 600 bps   TWELP 300 bps
American     2.211           2.157
Arabic       2.168           2.116
British      2.270           2.212
Chinese      2.055           2.048
Danish       2.174           2.165
Dutch        2.145           2.081
Finnish      2.243           2.127
French       2.258           2.187
German       2.227           2.241
Greek        2.164           2.169
Hindi        2.358           2.243
Hungarian    2.293           2.151
Italian      2.417           2.359
Japanese     2.308           2.248
Norwegian    2.197           2.110
Polish       2.274           2.207
Portuguese   2.370           2.273
Russian      2.119           2.103
Spanish      2.322           2.196
Swedish      2.437           2.339
Average      2.251           2.187

The average difference is 0.064 PESQ.
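The averages and the difference quoted above can be reproduced with a few lines of Python. This is only a minimal sketch that re-enters the table values; the names `pesq_scores` and `mean` are illustrative and are not part of any DSPINI or ITU-T tool.

```python
# PESQ scores copied from the table above:
# language -> (MELPe 600 bps, TWELP 300 bps)
pesq_scores = {
    "American": (2.211, 2.157),
    "Arabic": (2.168, 2.116),
    "British": (2.270, 2.212),
    "Chinese": (2.055, 2.048),
    "Danish": (2.174, 2.165),
    "Dutch": (2.145, 2.081),
    "Finnish": (2.243, 2.127),
    "French": (2.258, 2.187),
    "German": (2.227, 2.241),
    "Greek": (2.164, 2.169),
    "Hindi": (2.358, 2.243),
    "Hungarian": (2.293, 2.151),
    "Italian": (2.417, 2.359),
    "Japanese": (2.308, 2.248),
    "Norwegian": (2.197, 2.110),
    "Polish": (2.274, 2.207),
    "Portuguese": (2.370, 2.273),
    "Russian": (2.119, 2.103),
    "Spanish": (2.322, 2.196),
    "Swedish": (2.437, 2.339),
}


def mean(values):
    """Arithmetic mean of a sequence of numbers."""
    values = list(values)
    return sum(values) / len(values)


melpe_avg = mean(melpe for melpe, _ in pesq_scores.values())
twelp_avg = mean(twelp for _, twelp in pesq_scores.values())
gap = melpe_avg - twelp_avg

print(f"MELPe 600 bps average: {melpe_avg:.3f}")
print(f"TWELP 300 bps average: {twelp_avg:.3f}")
print(f"Average difference:    {gap:.3f} PESQ")
```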

Speech Intelligibility. This section compares the TWELP 300 bps vocoder with the MELPe vocoder operating at 600 bps, twice the bit rate. Both vocoders were tested using the ITU-T P.50 speech base in 20 different languages.
STOI (Short-Time Objective Intelligibility) and ESTOI (Extended Short-Time Objective Intelligibility) metrics were used to estimate speech intelligibility: 

 
The diagram illustrates the difference in speech intelligibility between the TWELP 300 and MELPe 600 vocoders in the STOI metric. Exact numbers are shown in the table below.
Language     MELPe 600 bps   TWELP 300 bps
American     79.24           77.50
Arabic       78.50           75.35
British      75.67           75.29
Chinese      77.40           75.53
Danish       79.12           76.97
Dutch        77.04           75.49
Finnish      74.76           71.22
French       78.79           77.00
German       79.00           76.80
Greek        77.89           75.01
Hindi        78.35           75.96
Hungarian    78.14           76.86
Italian      78.03           75.73
Japanese     79.30           76.40
Norwegian    79.24           77.26
Polish       78.37           76.92
Portuguese   78.04           76.69
Russian      75.74           75.15
Spanish      77.82           73.77
Swedish      76.89           73.57
Average      77.90           75.72

The average difference is 2.18%.

Considering that a low-bitrate vocoder is a nonlinear device that significantly distorts the spectrum of the original speech signal, the ESTOI metric provides more accurate assessments of speech intelligibility after vocoding:
 
The diagram illustrates the difference in speech intelligibility between the TWELP 300 and MELPe 600 vocoders in the ESTOI metric. Exact numbers are shown in the table below.
Language     MELPe 600 bps   TWELP 300 bps
American     69.76           63.94
Arabic       70.04           64.11
British      67.54           62.67
Chinese      69.85           65.59
Danish       69.81           64.33
Dutch        69.58           63.05
Finnish      66.23           58.80
French       70.28           64.70
German       68.55           63.24
Greek        70.59           64.82
Hindi        67.87           61.23
Hungarian    67.46           61.49
Italian      68.60           62.65
Japanese     71.22           65.31
Norwegian    70.59           66.46
Polish       70.79           65.46
Portuguese   69.53           64.10
Russian      66.88           61.73
Spanish      70.33           62.39
Swedish      67.17           59.12
Average      69.13           63.26

The average difference is 5.87%.
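The average hides some spread between languages. As a rough illustration, the sketch below re-enters the ESTOI table and reports the smallest, largest, and mean per-language gap. The variable names are illustrative only, and the script is not one of the utilities offered for download.

```python
# ESTOI scores (%) copied from the table above:
# language -> (MELPe 600 bps, TWELP 300 bps)
estoi_scores = {
    "American": (69.76, 63.94),
    "Arabic": (70.04, 64.11),
    "British": (67.54, 62.67),
    "Chinese": (69.85, 65.59),
    "Danish": (69.81, 64.33),
    "Dutch": (69.58, 63.05),
    "Finnish": (66.23, 58.80),
    "French": (70.28, 64.70),
    "German": (68.55, 63.24),
    "Greek": (70.59, 64.82),
    "Hindi": (67.87, 61.23),
    "Hungarian": (67.46, 61.49),
    "Italian": (68.60, 62.65),
    "Japanese": (71.22, 65.31),
    "Norwegian": (70.59, 66.46),
    "Polish": (70.79, 65.46),
    "Portuguese": (69.53, 64.10),
    "Russian": (66.88, 61.73),
    "Spanish": (70.33, 62.39),
    "Swedish": (67.17, 59.12),
}

# Per-language gap in ESTOI points (MELPe score minus TWELP score)
gaps = {lang: melpe - twelp for lang, (melpe, twelp) in estoi_scores.items()}

smallest = min(gaps, key=gaps.get)
largest = max(gaps, key=gaps.get)
average_gap = sum(gaps.values()) / len(gaps)

print(f"Smallest gap: {smallest} ({gaps[smallest]:.2f})")
print(f"Largest gap:  {largest} ({gaps[largest]:.2f})")
print(f"Average gap:  {average_gap:.2f}")
```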

You can download the P.862 and STOI/ESTOI utilities, along with all speech samples, by using the links in the 'Downloads' section at the bottom of the page, and then check all the numbers presented above.


Speech Samples (WAV-files). 
A few independent experts compared the TWELP 300 bps vocoder with the MELPe 600 bps vocoder using the preference method.
All experts noted difficulties in recognizing unfamiliar speech in both cases, and none expressed a clear preference for one vocoder over the other.
Despite this, independent experts with HF communication experience observed satisfactory speech intelligibility with the TWELP 300 bps vocoder under conditions of very low (negative) SNR, where analog communication is nearly impossible due to high noise levels completely masking the speech signal.
Many users described the quality of voice communication with this vocoder as “impressive”.

You can play and listen to short samples of the source speech, as well as the speech processed by the MELPe 600 bps vocoder and the TWELP 300 bps vocoder, which operates at half the bit rate, using the links in the table below.

You can also download the complete set of P.50 samples as zip files for all languages simultaneously by using the links in the 'Downloads' section at the bottom of the page.

Language   Source speech   MELPe 600 bps   TWELP 300 bps
American
Arabic
British
Chinese
Danish
Dutch
Finnish
French
German
Greek
Hindi
Hungarian
Italian
Japanese
Norwegian
Polish
Portuguese
Russian
Spanish
Swedish

Superiority In Quality Of The Non-speech Signals. In contrast to other LBR (low-bit-rate) vocoders such as MELPe and AMBE+2, TWELP vocoders reproduce non-speech signals with high quality, including police, ambulance, and fire sirens. Combined with natural, human-sounding voice quality, this makes TWELP vocoders well suited to replacing analog radio with digital radio, and to other applications where non-speech signals must be transmitted with high quality alongside speech.

Source type   Source signal   MELPe 600 bps   TWELP 300 bps
Siren only
With voice

High Robustness To Acoustic Noise. In contrast to other LBR vocoders, TWELP vocoders are highly robust to acoustic noise, thanks to a reliable method of pitch estimation and other features of TWELP technology.

Moreover, the vocoder includes built-in Noise Cancellation for Speech Enhancement (NCSE) functionality that improves speech quality in noisy acoustic environments.

NCSE Mode   Source signal   MELPe 600 bps   TWELP 300 bps
Disabled
Enabled

High Robustness To The Channel Errors. 
The diagram below illustrates the sensitivity of bits at the output of the vocoder to communication channel errors.
Essentially, the diagram shows by what percentage speech quality is reduced when a specific bit is distorted. The first bits in order cause catastrophic distortions, while the latter bits have significantly less impact on quality.

 

We strongly recommend using FEC (Forward Error Correction) with unequal protection of the bits, in strict accordance with their sensitivity to errors, together with 'Soft Decisions' decoding. This provides the highest robustness of the vocoder against channel errors, because the first bits in order cause catastrophic distortions when corrupted, whereas the last bits have almost no effect on quality.
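To illustrate what unequal protection in accordance with bit sensitivity might look like, the sketch below ranks the bits of one frame by sensitivity and assigns them to protection classes. Everything here is hypothetical: the sensitivity values are invented for illustration (the real ones come from the diagram), and the function name, class names, and class sizes are assumptions rather than DSPINI's FEC design.

```python
def assign_protection(sensitivity, strong_count, medium_count):
    """Map each bit to a protection class, most sensitive bits first.

    sensitivity: per-bit quality loss (%) when that bit is corrupted.
    Returns a list of class names, one per bit, in original bit order.
    """
    # Bit indices ordered from most to least sensitive
    order = sorted(range(len(sensitivity)),
                   key=lambda i: sensitivity[i], reverse=True)
    classes = ["unprotected"] * len(sensitivity)
    for rank, bit in enumerate(order):
        if rank < strong_count:
            classes[bit] = "strong"
        elif rank < strong_count + medium_count:
            classes[bit] = "medium"
    return classes


# Hypothetical sensitivities for an 8-bit toy frame (NOT measured TWELP data)
toy_sensitivity = [90.0, 75.0, 40.0, 35.0, 12.0, 8.0, 3.0, 1.0]
classes = assign_protection(toy_sensitivity, strong_count=2, medium_count=3)

for bit, (loss, cls) in enumerate(zip(toy_sensitivity, classes)):
    print(f"bit {bit}: quality loss {loss:5.1f}% -> {cls}")
```

In a real design, each class would be mapped to an error-correcting code of matching strength, with the strongest code reserved for the first, most sensitive bits.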

Special "robust" versions of the TWELP vocoders include FEC that is integrated with the vocoder on the basis of a "joint source-channel coding" approach, which provides high speech quality in both noisy and noiseless channels. The FEC can operate with either "soft decisions" or "hard decisions" from a modem. The "soft decisions" mode provides much better robustness than the "hard decisions" mode.
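A toy example shows why soft decisions help. The sketch below uses a 3x repetition code purely for illustration (the page does not say which FEC code the robust versions use) and compares a majority vote on hard decisions with a sum of the soft values. The received values are invented, and the function names are illustrative.

```python
def decode_hard(soft_values):
    """Majority vote over hard decisions (the sign of each received value)."""
    ones = sum(1 for v in soft_values if v > 0)
    return 1 if ones > len(soft_values) / 2 else 0


def decode_soft(soft_values):
    """Sum the soft values so that confident samples outweigh weak ones."""
    return 1 if sum(soft_values) > 0 else 0


# A '1' bit is sent three times as +1.0. Noise leaves one copy clearly
# positive and pushes the other two slightly below zero.
received = [0.9, -0.1, -0.2]

print("hard decision:", decode_hard(received))  # two of three signs are negative -> 0
print("soft decision:", decode_soft(received))  # the sum is positive -> 1
```

The hard-decision decoder discards the information that two of the three samples were barely negative, while the soft-decision decoder weighs each sample by its reliability and recovers the transmitted bit.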

Additional Functionalities. The following additional functionalities are developed by DSPINI and integrated into TWELP vocoders:

  • Automatic Gain Control (AGC)
  • Noise Cancellation for Speech Enhancement (NCSE)
  • Voice Activity Detector (VAD)
  • Tone Detection/Generation (single tones and dual tones). The tones are transmitted by means of the vocoder.

Each functionality has unique features, performance and characteristics, providing significant superiority over any well-known implementations on the market.

Technical Characteristics And Resource Requirements:

Technical characteristics

  • Bit rate: 300 bps
  • Algorithm: TWELP
  • Frame size: 100 ms
  • Algorithmic delay (including frame size): 120 ms
  • Sampling rate: 8 kHz
  • Signal format: linear 16-bit PCM
  • Bit stream format: 30
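
The technical characteristics above imply a few derived quantities. The following is a small worked example. Only the inputs (300 bps, 100 ms frames, 8 kHz sampling, 16-bit PCM) come from the table; the variable names are illustrative.

```python
BIT_RATE_BPS = 300        # vocoder bit rate
FRAME_MS = 100            # frame size
SAMPLE_RATE_HZ = 8000     # 8 kHz sampling rate
PCM_BITS_PER_SAMPLE = 16  # linear 16-bit PCM

# Coded bits produced for each 100 ms frame
bits_per_frame = BIT_RATE_BPS * FRAME_MS // 1000

# PCM samples consumed for each frame
samples_per_frame = SAMPLE_RATE_HZ * FRAME_MS // 1000

# Bit rate of the uncompressed PCM signal
pcm_bit_rate = SAMPLE_RATE_HZ * PCM_BITS_PER_SAMPLE

# How many times smaller the coded stream is than the PCM stream
compression_ratio = pcm_bit_rate / BIT_RATE_BPS

print(f"bits per frame:    {bits_per_frame}")
print(f"samples per frame: {samples_per_frame}")
print(f"PCM bit rate:      {pcm_bit_rate} bps")
print(f"compression ratio: {compression_ratio:.1f}:1")
```

Each frame therefore carries 30 bits for 800 input samples. This may be what the "30" listed as the bit stream format refers to, although the table does not state a unit.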

Additional functionalities

  • AGC (Automatic Gain Control). Control range: 0 ... +20 dB.
  • NCSE (Noise Canceller - Speech Enhancer). SNR increase: > 6 dB. Speech quality improvement: > 0.1 PESQ.
  • Tone Detector (single/dual tone detection). In accordance with international standards.
  • Tone Generator (single/dual tone generation). Special generator that keeps the signal continuous (phase and amplitude of the previous frame's signal).
  • VAD (Voice Activity Detection). Reliable detection of speech in background noise.
  • CNG (Comfort Noise Generation). Type of noise: "white". Level: -60 dB.

Resources for ARM Cortex-M4 platform
(memory in KBytes; Constants, Channel, Heap, and Stack are data memory)
Module                          MIPS* (peak)   Program   Constants   Channel   Heap   Stack
Voice Encoder                   72.5           41        195         4.7       6.7    1.0
NCSE                            6.4
AGC                             0.2
Voice Decoder                   13.3
Voice Encoder + Voice Decoder   85.8
Total                           92.4


Resources for TI's C64 DSP platform
(memory in KBytes; Constants, Channel, Heap, and Stack are data memory)
Module                          MIPS* (peak)   Program   Constants   Channel   Heap   Stack
Voice Encoder                   25.9           79        195         4.7       6.7    1.0
NCSE                            2.8
AGC                             0.1
Voice Decoder                   3.8
Voice Encoder + Voice Decoder   29.7
Total                           32.6

Resources (estimated) for TI's C55 DSP platform
(memory in KBytes; Constants, Channel, Heap, and Stack are data memory)
Module                          MIPS* (peak)   Program   Constants   Channel   Heap   Stack
Voice Encoder                   44             51        195         4.7       6.7    1.0
NCSE                            6.8
AGC                             0.2
Voice Decoder                   9.0
Voice Encoder + Voice Decoder   53
Total                           60

* DSPINI continues to optimize the TWELP algorithm and code in order to minimize the computational complexity of the vocoder.
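
The "Voice Encoder + Voice Decoder" and "Total" rows in the three resource tables are sums of the per-module peak MIPS figures. The quick check below uses only numbers from the tables; the dictionary layout and names are just for illustration.

```python
# Peak MIPS per module, copied from the resource tables above
peak_mips = {
    "ARM Cortex-M4": {
        "Voice Encoder": 72.5, "NCSE": 6.4, "AGC": 0.2, "Voice Decoder": 13.3,
    },
    "TI C64": {
        "Voice Encoder": 25.9, "NCSE": 2.8, "AGC": 0.1, "Voice Decoder": 3.8,
    },
    "TI C55 (estimated)": {
        "Voice Encoder": 44.0, "NCSE": 6.8, "AGC": 0.2, "Voice Decoder": 9.0,
    },
}

totals = {}
for platform, modules in peak_mips.items():
    # Encoder and decoder only, without the optional NCSE and AGC modules
    codec = modules["Voice Encoder"] + modules["Voice Decoder"]
    # All four modules running together
    total = sum(modules.values())
    totals[platform] = (round(codec, 1), round(total, 1))
    print(f"{platform}: encoder + decoder = {codec:.1f} MIPS, "
          f"total = {total:.1f} MIPS")
```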

Vulnerability / Security. DSPINI guarantees that the software is ABSOLUTELY free of undocumented features, undeclared capabilities, etc. Our customers can be sure that none of our software or code contains secret functions or features hidden from the user. We are ready to provide the source code of our software products for appropriate certification if needed.

Guarantee And Support.  DSPINI guarantees the quality of the product and the conformance of all its technical characteristics to the requirements of the current specifications. Testing and other quality control methods are used to support this guarantee.

Any Platforms.  DSPINI can port this vocoder software to any other DSP, RISC, or general-purpose platform in a short time: 1-2 months.

Licensing Terms.  To use the vocoder, a customer must obtain a license, which is available only from DSPINI.

Customization.  The vocoder can be customized to specific requirements: a different bit rate, frame size, level of robustness to channel errors, etc. Please contact us for details.

Prospects.  DSPINI is continuously improving and developing a set of new vocoders, based on TWELP technology, with bit rates ranging from 300 bps up to 9600 bps.

Related Software.  This vocoder can be used effectively in a bundle with other DSPINI products:

  • Linear and acoustic echo cancellers,
  • Multichannel noise cancellers (including two-microphone adaptive array),
  • Wired or radio modems for any types of channels and bitrates,
  • Other products.

Downloads: