WARNING!
This page contains information about the previous version (4.0).
The current version (5.0) provides significantly improved speech quality.
We will update the information in the near future.
Includes a few modes (bit rates): 3600, 2400, 1600, 1200, 700, 600, 600/900 (scalable), 480/800 (scalable), 480, 300 bps as well as NRT mode (variable bit rate).
Of course, quantity and presence of the bit rates can be modified under specific requirements of customer.
Provides very high speech quality - much better in comparison with any competitors on the market.
For Digital HF Radio and other markets.
TWELP Technology Features. The vocoder is based on newest technology of speech coding called "Tri-Wave Excited Linear Prediction" (TWELP) that was developed by experts of DSPINI.
TWELP technology is a new class of vocoders that differs from any other LPC-based vocoders by:
- advance reliable method of pitch estimation
- pitch-synchronous analysis
- advance tri-wave model of excitation
- newest quantization schemes
- pitch-synchronous synthesis
Thanks to these unique features, TWELP technology provides much better speech quality in comparison with any well-known technologies, including AMBE+2, MELPe, ACELP, etc. on the same bit rate in range from 300 bps up to 9600 bps and more.
TWELP technology allowes to reduce bit rate two times in practice. Moreover, TWELP vocoder sounds much more naturally and in contrast to other LBR vocoders (like MELPe, etc.) TWELP provides much better quality for non-speech signals like sirens, background music, etc.
Superiority In Speech Quality. Here is the comparison with MELPe and GSM AMR vocoders in noiseless channel. TWELP 2400 bps vocoder, MELPe 2400 bps vocoder and GSM AMR 4.75 kbps vocoder were tested, using ITU-T P.50 speech base for 20 different languages. ITU-T P.862 utility was used for estimation of the speech quality in PESQ terms:
Language | AMR 4750 | TWELP 2400 | MELPe 2400 |
---|---|---|---|
American | 3.351 | 3.330 | 3.077 |
Arabic | 3.277 | 3.253 | 3.053 |
British | 3.272 | 3.181 | 3.019 |
Chinese | 3.267 | 3.307 | 2.970 |
Danish | 3.311 | 3.275 | 3.022 |
Dutch | 3.089 | 3.114 | 2.830 |
Finnich | 3.166 | 3.049 | 2.791 |
French | 3.277 | 3.325 | 3.106 |
German | 3.321 | 3.183 | 2.998 |
Greek | 3.206 | 3.275 | 3.004 |
Hindi | 3.286 | 3.246 | 3.089 |
Hungarian | 3.305 | 3.279 | 3.086 |
Italian | 3.462 | 3.363 | 3.226 |
Japanese | 3.369 | 3.407 | 3.188 |
Norwegian | 3.267 | 3.286 | 3.032 |
Polish | 3.263 | 3.267 | 3.029 |
Portuguese | 3.377 | 3.350 | 3.146 |
Russian | 3.186 | 3.165 | 2.952 |
Spanish | 3.343 | 3.294 | 3.048 |
Swedish | 3.391 | 3.269 | 3.147 |
Average | 3.289 | 3.261 | 3.041 |
Superiority of the TWELP 2400 bps vocoder over MELPe 2400 bps is on average 0.220 PESQ and just -0.028 PESQ in comparison with AMR 4750 bps |
Please find infomation about superiority in speech quality of the TWELP MR vocoder for all other bit rates on appropriate web-pages for specific bit rates.
Speech Samples (WAV-files). A few independent experts listened TWELP 2400 bps vocoder in comparison with MELPe 2400 bps vocoder and GSM AMR 4750 bps vocoder, using preference method. All experts preferred TWELP to the MELPe, having noted much more natural human-sounding of voice in the TWELP vocoder. A majority of experts haven't given preferences to GSM AMR or TWELP, but a majority of the remained prefered TWELP, having noted more clear sounding of speech in TWELP vocoder.
You can play and listen short samples of the source speech as well as the speech processed by these vocoders for any of 20 languages, using links in the table below.
Also, you can download full set of the P.50 samples as zip-files for all languages simultaneously, using the links in the "Downloads" para in a bottom of the page.
Please find samples of the TWELP MR vocoder for all other bit rates on appropriate web-pages for specific bit rates.
Superiority In Quality Of The Non-speech Signals. In contrast to other LBR vocoders (MELPe, AMBE+2, etc.), TWELP vocoders provide high quality of non-speech signals, including police, ambulance, fire sirens, etc. This feature in conjunction with high quality natural human-sounding of voice makes TWELP vocoders well suitable for replacement of analog radio by digital radio and also for other applications where high quality transmitting of non-speech signals is relevant along with high quality transmitting of speech signals.
Source type | Source signal | MELPe 2400 bps | TWELP 2400 bps |
---|---|---|---|
Siren only | |||
With voice |
Please find non-speech samples of the TWELP MR vocoder for all other bit rates on appropriate web-pages for specific bit rates.
High Robustness To Acoustic Noise. In contrast to other LBR vocoders, TWELP vocoders are well robust to acoustic noise thanks to robust reliable method of pitch estimation and other features of TWELP technology.
Moreover, vocoder includes in-built Noise Cancellation—Speech Enhancement (NCSE) functionality that improves speech quality in noisy acoustic environment.
NCSE Mode | Source signal | MELPe 2400 bps | TWELP 2400 bps Robust |
---|---|---|---|
Disabled | |||
Enabled |
Please find similar samples of the TWELP MR vocoder for all other bit rates on appropriate web-pages for specific bit rates.
High Robustness To The Channel Errors. The diagram and table below show a dependence of the averaged speech quality for AWGN-noisy channel for different BER in comparison with other vocoders.
We recommend to use regular versions of the TWELP vocoders in good channels only, where BER is not more than 0.15%.
In case you use the channels, where a majority of calls are at BER > 0.15%, we recommend to use "TWELP Robust" versions of the vocoders (with in-built specific FEC) and primeraly with "Soft Decisions" output from a modem. You can compare the robustness of the vocoders on the diagramm below.
BER % | MELPe 2400 | TWELP 2400 Robust | TWELP 2400 |
---|---|---|---|
0.00 | 3.041 | 3.135 | 3.261 |
0.10 | 2.963 | 3.130 | 3.157 |
0.20 | 2.890 | 3.125 | 3.067 |
0.30 | 2.830 | 3.120 | 3.009 |
0.40 | 2.781 | 3.115 | 2.946 |
0.50 | 2.734 | 3.109 | 2.882 |
0.60 | 2.688 | 3.102 | 2.811 |
0.70 | 2.633 | 3.097 | 2.759 |
0.80 | 2.587 | 3.092 | 2.709 |
0.90 | 2.552 | 3.087 | 2.652 |
1.00 | 2.502 | 3.079 | 2.614 |
You can play and listen short samples of the source speech as well as the speech processed by MELPe and TWELP Robust vocoders on the same bit rate 2400 bps in AWGN channel with BER = 5% for any of 20 languages, using links in the table below.
Please find information about robustness to errors of the TWELP MR vocoder for all other bit rates on appropriate web-pages for specific bit rates.
Additional Functionalities. The following additional functionalities are developed by DSPINI and integrated into TWELP vocoders:
- Automatic Gain Control (AGC),
- Noise Cancellation for Speech Enhancement (NCSE)
- Voice Activity Detector (VAD),
- Tone Detection/Generation (Single tones and Dual tones). The tones are transmitted by the vocoder facilities.
Each functionality has unique features, performance and characteristics, providing significant superiority over any well-known implementations on the market.
Technical Characteristics And Resource Requirements:
Bit Rate (bps) |
Algorithm | Frame size (ms) |
Algorithmic delay (including frame size) (ms) |
Sampling rate (kHz) |
Signal format | Bit stream format |
3600 | TWELPTM | 20 | 40 | 8 | Linear 16-bit PCM |
72 |
2400 | 20 | 40 | 48 | |||
1600 | 40 | 60 | 64 | |||
1200 | 40 | 60 | 48 | |||
700 | 80 | 100 | 56 | |||
600 | 80 | 100 | 48 | |||
600/900 scalable | 100 | 120 | 60+30=90 (DTX is disabled) 11...91 (DTX is enabled) | |||
480/800 scalable | 100 | 120 | 48+32=80 (DTX is disabled) 11...81 (DTX is enabled) | |||
480 | 100 | 120 | 48 (DTX is disabled) 11...49 (DTX is enabled) |
|||
300 | 320 | 340 | 96 | |||
NRT (VBR) | 100хN | 100хN+20 | 11хN...61хN |
Name | Functionality | Technical characteristics | |
---|---|---|---|
Name | Value | ||
AGC | Automatic Gain Control |
Control range: | 0 ... +20 dB |
NCSE | Noise Canceller - Speech Enhancer |
SNR increasing | > 6 dB |
Speech quality improvement |
> 0.1 PESQ | ||
Tone Detector |
Single/Dual tones detection |
In accordance with international standards | |
Tone Generator |
Single/Dual tones generation |
Special generator, kept continuity of signal (phase and amplitude of signal of previous frame) |
|
VAD | Voice Activity Detection |
Reliable detection speech in background noise |
|
CNG | Comfort Noise Generation |
Type of noise | "white" |
Level | - 60 dB |
Bitrate | Module | MIPS* peak | Memory (KBytes) | ||||
---|---|---|---|---|---|---|---|
Program | Data | ||||||
Constants | Channel | Heap | Stack | ||||
3600 | Voice Encoder | 60.2 | 125 | 1825 | 4.9 | 17.9 | 2.1 |
NCSE | 5.8 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 14.6 | ||||||
Voice Encoder + Voice Decoder |
74.8 | ||||||
Total | 81.0 | ||||||
2400 | Voice Encoder | 46.6 | |||||
NCSE | 5.8 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.5 | ||||||
Voice Encoder + Voice Decoder |
60.1 | ||||||
Total | 66.3 | ||||||
1600 | Voice Encoder | 105.0 | |||||
NCSE | 6.2 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 10.3 | ||||||
Voice Encoder + Voice Decoder |
115.3 | ||||||
Total | 121.9 | ||||||
1200 | Voice Encoder | 96.9 | |||||
NCSE | 6.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 14.0 | ||||||
Voice Encoder + Voice Decoder |
110.9 | ||||||
Total | 117.7 | ||||||
700 | Voice Encoder | 91.3 | |||||
NCSE | 6.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.7 | ||||||
Voice Encoder + Voice Decoder |
105.0 | ||||||
Total | 111.8 | ||||||
600 | Voice Encoder | 62.4 | |||||
NCSE | 6.0 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.7 | ||||||
Voice Encoder + Voice Decoder |
76.1 | ||||||
Total | 82.5 | ||||||
600/900 scalable | Voice Encoder | 94.6 | |||||
NCSE | 6.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.7 | ||||||
Voice Encoder + Voice Decoder |
108.3 | ||||||
Total | 115.1 | ||||||
480/800 scalable | Voice Encoder | 40.0 | |||||
NCSE | 7.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 10.0 | ||||||
Voice Encoder + Voice Decoder |
50.0 | ||||||
Total | 57.8 | ||||||
480 | Voice Encoder | 56.3 | |||||
NCSE | 6.5 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.7 | ||||||
Voice Encoder + Voice Decoder |
70.0 | ||||||
Total | 76.9 | ||||||
300 | Voice Encoder | 72.5 | |||||
NCSE | 6.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.3 | ||||||
Voice Encoder + Voice Decoder |
85.8 | ||||||
Total | 92.6 | ||||||
NRT (VBR) | Voice Encoder | 89.8 | |||||
NCSE | 6.4 | ||||||
AGC | 0.4 | ||||||
Voice Decoder | 13.7 | ||||||
Voice Encoder + Voice Decoder |
103.5 | ||||||
Total | 110.3 |
Bitrate | Module | MIPS* peak | Memory (KBytes) | ||||
---|---|---|---|---|---|---|---|
Program | Data | ||||||
Constants | Channel | Heap | Stack | ||||
3600 | Voice Encoder | 22.1 | 240 | 1825 | 4.9 | 17.9 | 2.1 |
NCSE | 2.9 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 5.4 | ||||||
Voice Encoder + Voice Decoder |
27.5 | ||||||
Total | 30.5 | ||||||
2400 | Voice Encoder | 18.7 | |||||
NCSE | 2.9 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.3 | ||||||
Voice Encoder + Voice Decoder |
23.0 | ||||||
Total | 26.0 | ||||||
1600 | Voice Encoder | 40.0 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.0 | ||||||
Voice Encoder + Voice Decoder |
44.0 | ||||||
Total | 46.9 | ||||||
1200 | Voice Encoder | 35.6 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.2 | ||||||
Voice Encoder + Voice Decoder |
39.8 | ||||||
Total | 42.7 | ||||||
700 | Voice Encoder | 33.9 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
38.0 | ||||||
Total | 40.9 | ||||||
600 | Voice Encoder | 23.5 | |||||
NCSE | 2.6 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
27.6 | ||||||
Total | 30.3 | ||||||
600/900 scalable | Voice Encoder | 34.8 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
38.9 | ||||||
Total | 41.8 | ||||||
480/800 scalable | Voice Encoder | 24.4 | |||||
NCSE | 3.1 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
28.5 | ||||||
Total | 31.7 | ||||||
480 | Voice Encoder | 21.2 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
25.3 | ||||||
Total | 28.2 | ||||||
300 | Voice Encoder | 26.8 | |||||
NCSE | 2.8 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.0 | ||||||
Voice Encoder + Voice Decoder |
30.8 | ||||||
Total | 33.7 | ||||||
NRT (VBR) | Voice Encoder | 31.1 | |||||
NCSE | 2.9 | ||||||
AGC | 0.1 | ||||||
Voice Decoder | 4.1 | ||||||
Voice Encoder + Voice Decoder |
35.2 | ||||||
Total | 38.2 |
Module | MIPS* peak | Memory (KBytes) | ||||
---|---|---|---|---|---|---|
Program | Data | |||||
Constants | Channel | Heap | Stack | |||
Voice Encoder | ||||||
NCSE | ||||||
AGC | ||||||
Voice Decoder | ||||||
Voice Encoder + Voice Decoder |
||||||
Total |
Resources for TI's DSP C55 platform are available on request
* DSPINI continues optimization of the TWELP algorithm and code in order to minimize computational complexity of the vocoder.
Vulnerability / Security. DSPINI guarantees ABSOLUTE cleanliness of the software from any undocumented features, undeclared capabilities, etc. All our customers can be sure that any our software/ code doesn't contain any secret functions and features hidden from user. We are ready to provide source codes of our software products for an appropriate certification if need.
Guarantee And Support. DSPINI guarantees a quality and accordance of all technical characteristics of the product to requirement of current specifications. Testing and other method of quality control are used for guarantee support.
Any Platforms. DSPINI can port this vocoder software into any other DSP, RISC or general- purposes platform inshort time: 1-2 months.
Licensing Terms. To use the vocoder, customer should obtain a license from DSPINI only.
Customization. The vocoder can be customized under any specific requirements- other bit rate, frame size, any other robustness to channel errors, etc. Please contact with us for details.
Prospects. DSPINI is impoving and developing continuously a set of new vocoders with range from 300 bps up to 9600 bps, based on TWELP technology.
Related Software. This vocoder may be effectively used in a bundle with other DSPINI's products:
- Linear and acoustic echo cancellers,
- Multichannel noise cancellers (including two-microphone adaptive array),
- Wired or radiomodems for any types of channels and bitrates,
- Other products.
- Datasheet (pdf)
- ITU-T P.50 source speech samples (zip)
- AMBE+2 2450 bps speech samples (zip)
- MELPe 2400 bps speech samples (zip)
- TWELP 2400 bps speech samples (zip)
- PC-evaluation package (zip) — on request
- User's Guide document (pdf) — on request