Audio and Sound

Solutions for Applications Requiring Audio and Speech Playback


Audio applications that require playback of voice often fall in one of two classes:

  • Playback of pre-recorded and stored audio
  • Playback of streaming audio

In both cases, the quality of audio depends on both the compression scheme (G.726A, Speex, etc.) used in firmware as well as the hardware peripherals used (PWM, DAC etc.) for reproducing the sound. Some applications that require playback of streaming audio may also require encoding of recorded speech data for duplex transmission. In such cases, the processor throughput (MIPS) required by the MCU or DSC is largely dependent on the encoding algorithm. These applications are discussed in detail in the Communications section. In this section, we present the solutions required for playback-only applications in two parts:

Software Algorithm Libraries


The choice of software algorithm used to playback audio & speech signals depends on the compression algorithm used to encode the raw speech data prior to storage. The graph below compares a variety of compression schemes in use today. Microchip provides software libraries to support algorithms that require no payment of royalties, such as IMA ADPCM for 8 &16-bit PIC® MCUs, G.711, Speex & G.726A for PIC24 MCU & dsPIC DSCs & Speex & ADPCM for PIC32 MCUs.

Click image to enlarge

Click image to enlarge

Note: The MIPS usage statistics shown for algorithms within gray ellipses in the graph above represent the requirements of Microchip’s implementation of these algorithms on 16-bit PIC MCUs and dsPIC DSCs.

A decision on the algorithm to use is often based on a tradeoff between quality and system cost incurred in saving large amounts of audio data. The table below shows how the various algorithms perform in terms of actual seconds of speech they can store into memory.

Algorithm G.711 G.726A Speex
Memory needed to store 1 second of encoded speech 8 KB 2,3,4 or 5 KB 1 KB
How does this affect your choice of MCU, DSC or Memory component?
Use the device below: To store N seconds of speech in on-chip Flash memory using
the following encoding algorithms:
4 7
Algorithm G.711 G.726A Speex
25XX1024 Serial EEPROM 16 Sec 25 to 64 sec 128 sec
PIC24FJ256GA or PIC24HJ256GP610 32 sec 52 to 128 sec n/a
dsPIC33FJ256GP710 32 sec 51 to 128 sec 256 sec
PIC32MX360F512L 64 sec 102 to 256 sec 512 sec

Hardware Design Options


A few variations are available for implementation in hardware based on trade-offs on quality of audio versus system cost, simplicity versus system-integration etc. Popular methods supported by Microchip’s PIC MCUs and dsPIC DSCs are shown below:

  1. Use on-chip A/D Converter
  2. Use off-chip Codecs or A/D Converters
  3. Use off-chip Codecs or D/A Converters

Hardware Option 1: Use on-chip A/D Converter


In the option shown here, all hardware except for the speaker and analog amplification circuitry resides within the MCU or DSC. A 16-bit Audio D/A converter peripheral is available in some dsPIC Digital Signal Controllers. Learn more about the 16-bit Audio DAC peripheral in our web seminar.

Click image to enlarge

Hardware Option 2: Use on-chip PWM followed by external low-pass filtering


Click image to enlarge

The option shown above can be used with many of Microchip’s 8-, 16- & 32-bit PIC® MCUs and dsPIC® DSCs. PWM peripherals are used to drive speakers. The PWM output is filtered prior to amplification. AN5xx describes this technique in greater detail. This option can be prototyped using the Explorer 16 development board and the Speech Playback PICtail™ Plus accessory.

Hardware Option 3: Use off-chip Codecs of D/A Converters


A representative design exercising this option can be found in the dsPICDEM™ 1.1 Plus Development Board. Schematics and source code for a demo are available in the zip file provided with the board.

Click image to enlarge