SPR 2400 bps Vocoder - DSP Innovations

Details: Published: 2016 November 09; Last Updated: 2026 March 19

Provides high speech quality at very low consumption of the computing resources and memory (~2.5 times less than MELPe).

For low cost Digital HF Radio, Digital Mobile Radio (DMR), Voice Storage devices and other markets.

SPR Technology Features. The SPR 2400 bps Vocoder is based on Sinusoidal-Pulsed Representation (SPR) model developed by DSP Innovations.

SPR Vocoder^TM model is based simultaneously on two well-known models: Sinusoidal (Harmonic) Coding (SHC) and Linear Predictive Coding (LPC), where system function is presented by LPC-filter and excitation function is formed by Sinusoidal Harmonic model.

However, SPR model uses more complex excitation of the synthesize LPC-filter that contains three components:

voiced (sinusoidal harmonics of the “fundamental frequency”),
unvoiced (noise),
pulsed (aperiodic pulses)

As rule, the first two components are used in Sinusoidal Coding, and third component is used in Multi-Pulse Excitation coding on high bit rates.

We combined advantages of these models to achieve high speech quality at low bit rate and low resources costs.

So, for example, SHC model can’t represent complex speech intervals with aperiodic fluctuations of vocal chords. However, such intervals can be represented by pulse excitation very well. And vise versa, MPE- model can’t provide high speech quality if number of pulses is not enough, therefore this model is used on relatively high bit rates (more 4500 bps).

The combining of these models to form an excitation signal for LPC- model allows achieving unique characteristics of vocoders based on the SPR model.

Therefore, bit stream on output of the Voice Encoder contains the following quantized parameters:

LPC-parameters (as LSF),
“Fundamental frequency” (Pitch),
Frequency-depended Voiced/Unvoiced decisions,
Pulse parameters,
Gains

Speech Quality. Here is the comparison with MELPe 2400 bps vocoder in noiseless channel. SPR 2400 bps vocoder, MELPe 2400 bps vocoder and TWELP 2400 bps vocoder were tested, using ITU-T P.50 speech base for 20 different languages. ITU-T P.862 utility was used for estimation of the speech quality in PESQ terms:

A diagram demonstrates superiority of the TWELP 2400 bps over MELPe 2400 bps and SPR 2400 bps in speech quality in clear channel. TWELP 2400 bps vocoder provides the best speech quality, but consumes much more (~2 times more) computing resources and memory in comparison with SPR 2400 bps vocoder. Exact numbers of the speech quality are presented in the table below.

Language	TWELP 2400	MELPe 2400	SPR 2400
American	3.217	3.077	3.044
Arabic	3.156	3.053	2.988
British	3.104	3.019	2.960
Chinese	3.190	2.970	2.921
Danish	3.185	3.022	2.978
Dutch	3.035	2.830	2.845
Finnish	2.956	2.791	2.732
French	3.225	3.106	3.051
German	3.127	2.998	2.951
Greek	3.149	3.004	2.996
Hindi	3.169	3.089	3.003
Hungarian	3.205	3.086	3.069
Italian	3.310	3.226	3.187
Japanese	3.316	3.188	3.210
Norwegian	3.176	3.032	3.001
Polish	3.176	3.029	2.985
Portuguese	3.224	3.146	3.072
Russian	3.076	2.952	2.954
Spanish	3.214	3.048	3.030
Swedish	3.176	3.147	3.067
Average	3.169	3.041	3.002
Superiority of the TWELP 2400 bps vocoder is on average 0.13 and 0.17 PESQ appropriately

Speech Samples (WAV-files). A few independent experts listened SPR 2400 bps vocoder in comparison with MELPe 2400 bps vocoder and TWELP 2400 bps vocoder, using method of preferences. Majority of experts preferred TWELP to SPR and MELPe, having noted much more natural human-sounding of voice in the TWELP vocoder. Experts haven't found significant difference in sounding between SPR 2400 and MELPe 2400 vocoders.
You can play and listen short samples of the source speech as well as the speech processed by these vocoders for any of 20 languages, using links in the table below.
Also, you can download full set of the P.50 samples as zip-files for all languages simultaneously, using the links in the "Downloads" para in a bottom of the page.

Language	Source speech	SPR 2400 bps	MELPe 2400 bps	TWELP 2400 bps
American
Arabic
British
Chinese
Danish
Dutch
Finnish
French
German
Greek
Hindi
Hungarian
Italian
Japanese
Norwegian
Polish
Portuguese
Russian
Spanish
Swedish

Superiority In Quality Of The Non-speech Signals. In contrast to other LBR vocoders (MELPe, AMBE+2, etc.), TWELP vocoders provide high quality of non-speech signals, including police, ambulance, fire sirens, etc. This feature in conjunction with high quality natural human-sounding of voice makes TWELP vocoders well suitable for replacement of analog radio by digital radio and also for other applications where high quality transmitting of non-speech signals is relevant along with high quality transmitting of speech signals.

Source type	Source signal	MELPe 2400 bps	SPR 2400 bps	TWELP 2400 bps
Siren only
With voice

High Robustness To Acoustic Noise. In contrast to other LBR vocoders, TWELP vocoders are well robust to acoustic noise thanks to robust reliable method of pitch estimation and other features of TWELP technology.

Moreover, vocoder includes in-built Noise Cancellation—Speech Enhancement (NCSE) functionality that improves speech quality in noisy acoustic environment.

NCSE Mode	Source signal	MELPe 2400 bps	SPR 2400 bps	TWELP 2400 bps
Disabled
Enabled

High Robustness To The Channel Errors. The diagram and table below show a dependence of the averaged speech quality for AWGN-noisy channel for different BER in comparison with other vocoders.

MELPe 2400 bps vocoder has a bit better robustness to the channel errors with comparison with SPR 2400 vocoder because has some internal simple FEC. However, special "robust" versions of the SPR and TWELP vocoders include FEC that are integrated with vocoder on base of "joint source-channel coding" approach that provides high speech quality simultaneously in noisy channel as well as in noiseless channel. FEC can operate with "soft decisions" as well as with "hard decisions" from a modem. "Soft decisions" mode provides much better robustness in comparison with the "hard decisions" mode.

BER %	MELPe 2400	TWELP 3600 Robust	SPR 2400
0.00	3.041	3.169	3.002
0.10	2.963	3.168	2.918
0.20	2.890	3.167	2.836
0.30	2.830	3.165	2.775
0.40	2.781	3.164	2.710
0.50	2.734	3.163	2.664
0.60	2.688	3.162	2.599
0.70	2.633	3.161	2.550
0.80	2.587	3.159	2.505
0.90	2.552	3.158	2.455
1.00	2.502	3.157	2.415

Additional Functionalities. The following additional functionalities are developed by DSPINI and can be integrated (on a request) into vocoders:

Automatic Gain Control (AGC),
Noise Cancellation for Speech Enhancement (NCSE)
Voice Activity Detector (VAD),
Tone Detection/Generation (Single tones and Dual tones). The tones are transmitted by the vocoder facilities.

Each functionality has unique features, performance and characteristics, providing significant superiority over any well-known implementations on the market.

Technical Characteristics And Resource Requirements:

Technical characteristics
Bit Rate (bps)	Algorithm	Frame size (ms)	Algorithmic delay (including frame size) (ms)	Sampling rate (kHz)	Signal format	Bit stream format
2400	SPR^TM	20	40	8	Linear 16-bit PCM	48

Additional functionalities
Name	Functionality	Technical characteristics
Name	Functionality	Name	Value
AGC	Automatic Gain Control	Control range:	0 ... +20 dB
NCSE	Noise Canceller - Speech Enhancer	SNR increasing	> 6 dB
NCSE	Noise Canceller - Speech Enhancer	Speech quality improvement	> 0.1 PESQ
Tone Detector	Single/Dual tones detection	In accordance with international standards
Tone Generator	Single/Dual tones generation	Special generator, kept continuity of signal (phase and amplitude of signal of previous frame)
VAD	Voice Activity Detection	Reliable detection speech in background noise
CNG	Comfort Noise Generation	Type of noise	"white"
CNG	Comfort Noise Generation	Level	- 60 dB

Resources for ARM Cortex-M4 platform
Module	MIPS* peak	Memory (KBytes)
		Program	Data
		Program	Constants	Channel	Heap	Stack
Voice Encoder	37.7	30.2	10.0	5.0	2.0	0.5
Voice Decoder	13.9
Total	51.6

Resources for TI's C64 DSP platform
Module	MIPS* peak	Memory (KBytes)
		Program	Data
		Program	Constants	Channel	Heap	Stack
Voice Encoder	10.9	40.3	10.0	5.0	2.0	0.5
Voice Decoder	3.6
Total	14.5

Resources for TI's C55 DSP platform
Module	MIPS* peak	Memory (KBytes)
		Program	Data
		Program	Constants	Channel	Heap	Stack
Voice Encoder	17.1	14.5	10.0	5.0	2.0	0.5
Voice Decoder	5.9
Total	23.0

* DSPINI continues optimization of the SPR algorithm and code in order to minimize computational complexity of the vocoder.

Vulnerability / Security. DSPINI guarantees ABSOLUTE cleanliness of the software from any undocumented features, undeclared capabilities, etc. All our customers can be sure that any our software/ code doesn't contain any secret functions and features hidden from user. We are ready to provide source codes of our software products for an appropriate certification if need.

Guarantee And Support. DSPINI guarantees a quality and accordance of all technical characteristics of the product to requirement of current specifications. Testing and other method of quality control are used for guarantee support.

Any Platforms. DSPINI can port this vocoder software into any other DSP, RISC or general- purposes platform inshort time: 1-2 months.

Licensing Terms. To use the vocoder, customer should obtain a license from DSPINI only.

Customization. The vocoder can be customized under any specific requirements- other bit rate, frame size, any other robustness to channel errors, etc. Please contact with us for details.

Prospects. DSPINI is impoving and developing continuously a set of new vocoders with range from 300 bps up to 9600 bps, based on SPR and TWELP technologies.

Related Software. This vocoder may be effectively used in a bundle with other DSPINI's products:

Linear and acoustic echo cancellers,
Multichannel noise cancellers (including two-microphone adaptive array),
Wired or radiomodems for any types of channels and bitrates,
Other products.

Downloads:

Datasheet (pdf)
ITU-T P.50 source speech samples (zip)
MELPe 2400 bps speech samples (zip)
SPR 2400 bps speech samples (zip)
TWELP 2400 bps speech samples (zip)
PC-evaluation package (zip) — on request
User's Guide document (pdf) — on request