SXEmacs Lisp Reference Manual: Prerequisites

53.1 Prerequisites — What do I need to start?

In order to use MM features with SXEmacs you need at least two libraries. One of which is responsible for handling different types of media files, that is parses them, demuxes them and decodes them to a raw form suitable for your audio hardware. The other library cares for the actual audio output, that is takes some raw audio data and feeds it to your speakers (or somewhere else).

In the land of ASCII-arts diagrams this would translate to:

+------------+   +-----------+   +-------------+  +------------+
| media file |   | media lib |   |   SXEmacs   |  | audio lib  |
|------------|-->|-----------|-->|-------------|->|------------|->...
| e.g.       |   | parser    |   | bind to var |  | connect to |
| .wav  .mp3 |   | demuxer   |   | start/stop  |  | soundcard  |
| .ogg  .mka |   | decoder   |   |             |  | and play   |
+------------+   +-----------+   +-------------+  +------------+

As can be seen, MM features will not work if either a media lib or an audio lib is missing. SXEmacs supports a bunch of libraries in either category. We discuss the supported audio libraries, their properties and their availability first, afterwards we discuss the different media handling libraries.

53.1.1 Audio Library: OSS (Open Sound System)

Availability: Linux and BSD systems only; ships with the linux kernel
Dependencies: none
Download:
http://www.kernel.org
Pros: easy consistent interface
Cons: old, ugly, depends too much on hardware
Known caveats with SXE: none

Since OSS was one of the most widespread architectures for audio many of the new generation audio infrastructures support OSS with at least a compatibility layer. For instance PulseAudio provides a tool ‘padsp’, and Esound calls it ‘esddsp’. All these are intended to provide an OSS device emulation for applications which only speak OSS. All read/write accesses are rerouted to the respective audio server.

53.1.2 Audio Library: NAS (Network Audio System)

Availability: Unix-wide
Dependencies: X, OSS
Webpage: http://nas.codebrilliance.com/
Download:
http://nas.codebrilliance.com/nas/nas-1.8.src.tar.gz
Pros: device independent, network-mode possible, mixing possible, small
Cons: integrates to X, not recent, not very configurable
Known caveats with SXE: none

NAS was one of the audio systems which seized the concept of X-Forwarding for audio data. Hence its name. However, large parts of NAS depend on X which disqualifies it for non-local or tty-only use.

53.1.3 Audio Library: ESD (the enlightenment sound daemon)

Availability: Unix-wide
Dependencies: libaudiofile; optional: ALSA
Webpage: http://developer.gnome.org/doc/whitepapers/esd/
Download:
ftp://ftp.gnome.org/pub/gnome/sources/esound/0.2/esound-0.2.36.tar.bz2
Pros: device independent (if used with ALSA), network-mode possible, mixing possible, small
Cons: high latency, not recent, not very configurable
Known caveats with SXE: none

Esound is a more decoupled approach but similar to NAS. Furthermore, it provides transparent mixing facilities, applications just connect to the Esound daemon and transfer the stream data, esd itself will downmix the parallel streams and send it to the local sound card. Hence it is well suited for local and network use.

53.1.4 Audio Library: PulseAudio (formerly known as PolypAudio)

Availability: Unix-wide
Dependencies: OSS, liboil, samplerate, libatomic_ops; optional: ALSA, libasyncns, sndfile
Webpage: http://pulseaudio.org/
Download:
http://0pointer.de/lennart/projects/pulseaudio/pulseaudio-0.9.5.tar.gz

svn co svn://0pointer.de/pulseaudio/trunk pulseaudio
Pros: device independent (if used with ALSA), network-mode possible, mixing possible, multiple inputs, multiple outputs, low latency, very configurable, developers’ choice
Cons: unstable with many simultaneous connections
Known caveats with SXE: none

PulseAudio is one of the most advanced new-generation audio servers. It is modular, supports local and network connections, provides transparent downmixing of incoming streams (like esd) and is fully compatible to esd. Furthermore, you can use sound in both directions, i.e. record from pulse sources. Pulse provides modules to not only directly attach to local hardware but also to other remotely running pulses or other running audio servers (like jack, esd, etc.).

53.1.5 Audio Library: Jack (a low-latency audio server)

Availability: Unix-wide
Dependencies: ALSA
Webpage: http://jackit.sourceforge.net/
Download:
http://prdownloads.sourceforge.net/jackit/,

cvs -z3 -d:pserver:anonymous@cvs.sourceforge.net:/cvsroot/jackit co jack
Pros: high accuracy, extreme low latency, device independent, mixing possible
Cons: not network-aware
Known caveats with SXE: none

JACK is a low-latency audio server, written for POSIX conformant operating systems such as GNU/Linux and Apple’s OS X. It can connect a number of different applications to an audio device, as well as allowing them to share audio between themselves. Its clients can run in their own processes (ie. as normal applications), or can they can run within the JACK server (ie. as a "plugin").

JACK was designed from the ground up for professional audio work, and its design focuses on two key areas: synchronous execution of all clients, and low latency operation.

53.1.6 Audio Library: ao (generic and portable audio output)

Availability: Unix-wide
Optional Dependencies: OSS, ALSA, polyp, esd, sunaudio, NAS
Webpage: http://www.xiph.org/ao/
Download:
http://downloads.xiph.org/releases/ao/libao-0.8.6.tar.gz,

svn co http://svn.xiph.org/trunk/ao ao
Pros: portable, wrapper library around system libraries
Cons:
Known caveats with SXE: none

Libao is a cross-platform audio library that allows programs to output audio using a simple API on a wide variety of platforms. It currently supports Null output (handy for testing without a sound device), OSS, ALSA, polypaudio (next generation GNOME sound server), esd (EsounD or Enlightened Sound Daemon), AIX, Sun/NetBSD/OpenBSD, IRIX, NAS

53.1.7 Audio Library: alsa (Advanced Linux Sound Architecture)

Availability: Linux
Dependencies: ALSA kernel modules
Webpage: http://www.alsa-project.org/
Download:
ftp://ftp.alsa-project.org/pub/lib/

hg clone http://hg-mirror.alsa-project.org/alsa-lib alsa-lib
Pros: mature, SMP and thread-safe design
Cons: only available under linux, needs kernel support
Known caveats with SXE: none

53.1.8 Media Library: sndfile

Availability: Unix-wide
Dependencies: none
Webpage: http://www.mega-nerd.com/libsndfile/
Download:
http://www.mega-nerd.com/libsndfile/libsndfile-1.0.15.tar.gz
Maximally provided formats:
Notes:
Known caveats with SXE: none

53.1.9 Media Library: ffmpeg

Availability: Unix-wide
Optional Dependencies: mp3lame, libogg, libvorbis, theora, faad, faac, xvid, x264, a52dec, libdts, amr_nb, amr_wb, amr_if2, Flac, libmatroska
Webpage: http://ffmpeg.sourceforge.net/
Download:
cvs -z3 -d:pserver:anonymous@mplayerhq.hu:/cvsroot/ffmpeg co ffmpeg
Maximally provided formats: a52, ac3, adpcm, adx, .mp2, .mp3, Ogg/Vorbis, theora, AAC, xvid, mpeg1-video, mpeg-audio, h.264, h.263, h.263p, FLV, RealVideo 1.0, RealVideo 2.0, MPEG-4, WMV1, WMV2, SVQ, MJPEG, LJPEG, JPEGls, .jpeg, .png, .ppm, .pgm, YUV, .pbm, .pam, .bmp, Huffman-YUV, ASV, Snow, Sonic, DV captures, x264, GSM, Indeo2/3, TSCC, CSCD, nuppel-video, Qdraw, Qpeg, Loco, Fraps, Xvmc, MACE3/6, CLJR, ROQ, ROQ Dpcm, interplay video, interplay Dpcm, Xan-WC3, RPZA, Cinepak, MS-RLE, VQA, 8bps, SMC, flac, truemotion1/2, VMD-Video, VMD-Audio, ZMBV, Smacker, .dts, RealAudio-144, RealAudio-288, Qt-RLE, Cook, Truespeech, TTA, AVS, AMR Narrowband, AMR Wideband, ADPCM WAV, PCM/WAV, DVD-Subtitles, h.261, ASF, matroska, ShockWave Flash, Apple .mov, MP4, Westwood, V4L, V4L2, MPEG-PS, DV1394, RealMedia, RTP/RTSP, SGI .aiff, Flic, TTA
Notes: Only recent CVS versions are fully supported
Known caveats with SXE: none

FFmpeg has always been a very experimental and developer-driven project. It is a key component in many multimedia projects and has new features added constantly. New, official "releases" are few and far between. In short, if you want to work with FFmpeg, you are advised to go along with CVS development rather than relying on formal releases. CVS snapshots work really well 99% of the time so people are not afraid to use them.

Sample ‘./configure’-line:

./configure --enable-shared --enable-static --enable-mp3lame \
--enable-libogg --enable-vorbis --enable-theora --enable-faad \
--enable-faadbin --enable-faac --enable-xvid --enable-x264 \
--enable-a52 --enable-a52bin --enable-dts --enable-pp \
--enable-amr_nb --enable-amr_wb --enable-amr_if2 \
--enable-pthreads --enable-gpl

53.1.10 Media Library: mad

Availability: Unix-wide
Dependencies: none
Webpage: http://www.underbit.com/products/mad/
Download:
ftp://ftp.mars.org/pub/mpeg/libmad-0.15.1b.tar.gz
Maximally provided formats: mpeg-audio .mpa, .mp2, .mp3
Notes: seems discontinued, not recent
Known caveats with SXE: none

MAD is a high-quality MPEG audio decoder. It currently supports MPEG-1 and the MPEG-2 extension to lower sampling frequencies, as well as the de facto MPEG 2.5 format. All three audio layers – Layer I, Layer II, and Layer III (i.e. MP3) – are fully implemented.

MAD does not yet support MPEG-2 multichannel audio (although it should be backward compatible with such streams) nor does it currently support AAC.

53.1.11 Media Library: SoX

Availability: Unix-wide
Dependencies: none
Webpage: http://sox.sourceforge.net/
Download:
http://prdownloads.sourceforge.net/sox/sox-12.17.9.tar.gz
Maximally provided formats: raw, 8svx, SGI .aiff, Sun .au, .snd, AVR, GSM raw, HCOM, MAUD, mp3, TX-16w, .voc, ADPCM .vox, .wav, RIFX, ADPCM WAV, Ogg/Vorbis, A-law, .wve
Notes: must do ‘make install-lib’
Known caveats with SXE: none

53.1.12 Media Library: xine

Availability: Unix-wide
Dependencies: none
Webpage: http://xinehq.de/
Download:
http://prdownloads.sourceforge.net/xine/xine-lib-1.1.1.tar.gz

cvs -z3 -d:pserver:anonymous@cvs.sf.net:/cvsroot/xine co xine-lib
Maximally provided formats:
Notes:
Known caveats with SXE: not working

53.1.13 Media Library: gstreamer

Availability: Unix-wide
Dependencies: none
Webpage: http://gstreamer.freedesktop.org/
Download:
http://gstreamer.freedesktop.org/src/gstreamer/gstreamer-0.10.4.tar.bz2

cvs -z3 -d:pserver:anoncvs@anoncvs.freedesktop.org:/cvs/gstreamer co gstreamer
Maximally provided formats:
Notes:
Known caveats with SXE: not working

53.1.14 Built-in media file handling

Availability: Unix-wide
Dependencies: none
Webpage: n/a
Download: n/a
Maximally provided formats: .wav, RIFX, Sun .au
Notes: ugly and old
Known caveats with SXE: very limited, very slow