(version 5.0) enables voice communication in extremely challenging conditions where it was previously impossible, delivering speech quality and intelligibility comparable to that of MELPe at 600 bps, even while operating at an exceptionally low bit rate.
For Digital HF Radio and other markets.
TWELP Technology Features. The vocoder is based on newest technology of speech coding called "Tri-Wave Excited Linear Prediction" (TWELP) that was developed by experts of DSPINI.
TWELP technology is a new class of vocoders that differs from any other LPC-based vocoders by:
- advance reliable method of pitch estimation
- pitch-synchronous analysis
- advance tri-wave model of excitation
- newest quantization schemes
- pitch-synchronous synthesis
Thanks to these unique features, TWELP technology provides significantly better speech quality than other well-known technologies—including AMBE+2, MELPe, ACELP, and others—at equivalent bit rates ranging from 300 bps to 4800 bps and beyond.
Additionally, unlike other low-bitrate vocoders (such as MELPe, for example), TWELP delivers much higher quality for non-speech signals, including sirens, background music, and similar audio.
The TWELP 300 bps and MELPe 600 bps vocoders were tested using the ITU-T P.50 speech base in 20 different languages.
Note:
We have updated the speech database by minimizing inter-speech pauses to eliminate their impact on the evaluation results. Therefore, the numbers obtained from the quality measurements using this updated speech database differ from those previously obtained with the original speech database, where speech pauses were not removed.
Language | MELPe 600 bps | TWELP 300 bps |
---|---|---|
American | 2.211 | 2.157 |
Arabic | 2.168 | 2.116 |
British | 2.270 | 2.212 |
Chinese | 2.055 | 2.048 |
Danish | 2.174 | 2.165 |
Dutch | 2.145 | 2.081 |
Finnish | 2.243 | 2.127 |
French | 2.258 | 2.187 |
German | 2.227 | 2.241 |
Greek | 2.164 | 2.169 |
Hindi | 2.358 | 2.243 |
Hungarian | 2.293 | 2.151 |
Italian | 2.417 | 2.359 |
Japanese | 2.308 | 2.248 |
Norwegian | 2.197 | 2.110 |
Polish | 2.274 | 2.207 |
Portuguese | 2.370 | 2.273 |
Russian | 2.119 | 2.103 |
Spanish | 2.322 | 2.196 |
Swedish | 2.437 | 2.339 |
Average | 2.251 | 2.187 |
A difference is on average 0.064 PESQ |
STOI (Short-Time Objective Intelligibility) and ESTOI (Extended Short-Time Objective Intelligibility) metrics were used to assess speech intelligibility:
Language | MELPe 600 bps | TWELP 300 bps |
---|---|---|
American | 79.24 | 77.50 |
Arabic | 78.50 | 75.35 |
British | 75.67 | 75.29 |
Chinese | 77.40 | 75.53 |
Danish | 79.12 | 76.97 |
Dutch | 77.04 | 75.49 |
Finnish | 74.76 | 71.22 |
French | 78.79 | 77.00 |
German | 79.00 | 76.80 |
Greek | 77.89 | 75.01 |
Hindi | 78.35 | 75.96 |
Hungarian | 78.14 | 76.86 |
Italian | 78.03 | 75.73 |
Japanese | 79.30 | 76.40 |
Norwegian | 79.24 | 77.26 |
Polish | 78.37 | 76.92 |
Portuguese | 78.04 | 76.69 |
Russian | 75.74 | 75.15 |
Spanish | 77.82 | 73.77 |
Swedish | 76.89 | 73.57 |
Average | 77.90 | 75.72 |
A difference is on average 2.18 % |
Language | MELPe 600 bps | TWELP 300 bps |
---|---|---|
American | 69.76 | 63.94 |
Arabic | 70.04 | 64.11 |
British | 67.54 | 62.67 |
Chinese | 69.85 | 65.59 |
Danish | 69.81 | 64.33 |
Dutch | 69.58 | 63.05 |
Finnish | 66.23 | 58.80 |
French | 70.28 | 64.70 |
German | 68.55 | 63.24 |
Greek | 70.59 | 64.82 |
Hindi | 67.87 | 61.23 |
Hungarian | 67.46 | 61.49 |
Italian | 68.60 | 62.65 |
Japanese | 71.22 | 65.31 |
Norwegian | 70.59 | 66.46 |
Polish | 70.79 | 65.46 |
Portuguese | 69.53 | 64.10 |
Russian | 66.88 | 61.73 |
Spanish | 70.33 | 62.39 |
Swedish | 67.17 | 59.12 |
Average | 69.13 | 63.26 |
A difference is on average 5.87 % |
Speech Samples (WAV-files).
A few independent experts compared the TWELP 300 bps vocoder with the MELPe 600 bps vocoder using the preference method.
All experts noted difficulties in recognizing unfamiliar speech in both cases, and none expressed a clear preference for one vocoder over the other.
Despite this, independent experts with HF communication experience observed satisfactory speech intelligibility with the TWELP 300 bps vocoder under conditions of very low (negative) SNR, where analog communication is nearly impossible due to high noise levels completely masking the speech signal.
Many users described the quality of voice communication with this vocoder as “impressive”.
You can play and listen to short samples of the source speech, as well as the speech processed by the MELPe 600 bps vocoder and the TWELP 300 bps vocoder, which operates at half the bit rate, using the links in the table below.
For the best listening experience, we recommend using high-quality headphones or premium audio equipment to hear the nuances and differences in the vocoder sound more clearly.
You can also download the complete set of P.50 samples as zip files for all languages simultaneously by using the links in the 'Downloads' section at the bottom of the page.
Superiority In Quality Of The Non-speech Signals. In contrast to other LBR vocoders (MELPe, AMBE+2, etc.), TWELP vocoders provide high quality of non-speech signals, including police, ambulance, fire sirens, etc. This feature in conjunction with high quality natural human-sounding of voice makes TWELP vocoders well suitable for replacement of analog radio by digital radio and also for other applications where high quality transmitting of non-speech signals is relevant along with high quality transmitting of speech signals.
Source signal | MELPe 600 bps | TWELP 300 bps |
---|---|---|
Note:
Of course, at such an ultra-low bitrate 300 bps, the quality of non-speech signal processing is much worse than at higher bitrates and is essentially not much better than that of MELPe 600 bps.
Language | MELPe 600 bps | TWELP 300 bps |
---|---|---|
American | 58.47 | 55.76 |
Arabic | 59.83 | 55.54 |
British | 56.73 | 54.09 |
Chinese | 60.42 | 58.51 |
Danish | 57.98 | 55.57 |
Dutch | 56.63 | 54.02 |
Finnish | 55.25 | 51.26 |
French | 60.07 | 56.58 |
German | 56.79 | 54.25 |
Greek | 59.48 | 56.64 |
Hindi | 55.59 | 51.82 |
Hungarian | 59.22 | 54.99 |
Italian | 57.39 | 53.62 |
Japanese | 61.64 | 58.13 |
Norwegian | 61.09 | 59.48 |
Polish | 59.62 | 56.63 |
Portuguese | 58.66 | 55.93 |
Russian | 57.10 | 54.10 |
Spanish | 59.90 | 56.40 |
Swedish | 53.69 | 49.37 |
Average | 58.28 | 55.13 |
A difference is on average 3.15 % |
The fact that the difference in speech intelligibility between the MELPe and TWELP vocoders is smaller for noisy speech (3.15%) than for clean speech (5.87%) indicates better resistance of the TWELP vocoder to acoustic noise.
Below, you can listen to a short fragment of heavily noisy English speech after passing through MELPe and TWELP vocoders, with NPP (Noise Pre-Processor) and NCSE disabled and enabled, respectively.
The NCSE integrated into the TWELP vocoder is described in more detail on the webpage for our standalone product, 'NCSE-AGC Preprocessor'.
These vocoders are based on an effective Joint Source-Channel Coding approach. Each vocoder is equipped with a custom-designed FEC, tailored to its specific characteristics and operational conditions.
TWELP Robust vocoders provide high speech quality simultaneously in noisy channel as well as in noiseless channel. FEC can operate with "soft decisions" as well as with "hard decisions" from a modem. "Soft decisions" mode provides much better robustness in comparison with the "hard decisions" mode.
For all users of our non-robust vocoder versions, we offer the following recommendations.
Essentially, the diagram shows by what percentage speech quality is reduced when a specific bit is distorted. The first bits in order cause catastrophic distortions, while the latter bits have significantly less impact on quality.