NAME
cdda2wav - a sampling utility that dumps CD audio data into
wav sound files
SYNOPSIS
cdda2wav [-c chans] [-s] [-m] [-b bits] [-r rate] [-a
divider] [-t track[+endtrack]] [-i index] [-o offset] [-d
duration] [-x] [-q] [-w] [-v] [-V] [-Q] [-J] [-R] [-P sec-
tors] [-F] [-G] [-T] [-e] [-p percentage] [-n sectors] [-l
buffers] [-N] [-J] [-H] [-g] [-B] [-D device] [-A auxdevice]
[-I interface] [-O audiotype] [-C input-endianess] [-E
output-endianess] [-M count] [-S speed] [audio.wav]
DESCRIPTION
cdda2wav can retrieve audio tracks from CDROM drives (see
README for a list of drives) that are capable of reading
audio data digitally to the host (CDDA).
OPTIONS
-D device --device
uses device as the source for CDDA reading: For exam-
ple /dev/cdrom for the cooked_ioctl interface and
Bus,ID,Lun for the generic_scsi interface. The device
has to correspond with the interface setting (see
below). The setting of the environment variable
CDDA_DEVICE is overridden by this option.
-A auxdevice --auxdevice
uses auxdevice as CDROM drive for ioctl usage.
-I interface --interface
specifies the interface for CDROM access: generic_scsi
or (on Linux, and FreeBSD systems) cooked_ioctl.
-c channels --channels
uses 1 for mono, or 2 for stereo recording, or s for
stereo recording with both channels swapped.
-s --stereo
sets to stereo recording.
-m --mono
sets to mono recording.
-x --max
sets maximum (CD) quality.
-b bits --bits-per-sample
sets bits per sample per channel: 8, 12 or 16.
-r rate --rate
sets rate in samples per second. Possible values are
listed with the -R option.
-a divider --divider
sets rate to 44100Hz / divider. Possible values are
listed with the -R option.
-R --dump-rates
shows a list of all sample rates and their dividers.
-P sectors --set-overlap
sets the initial number of overlap sectors for jitter
correction.
-n sectors --sectors-per-request
reads sectors per request.
-l buffers --buffers-in-ring
uses a ring buffer with buffers total.
-t track+endtrack --track
selects the start track and optionally the end track.
-i index --index
selects the start index.
-o offset --offset
starts offset sectors behind start track (one sector
equivalents 1/75 seconds).
-O audiotype --output-format
can be wav (for wav files) or aiff (for apple/sgi aiff
files) or aifc (for apple/sgi aifc files) or au or sun
(for sun .au PCM files) or cdr or raw (for headerless
files to be used for cd writers).
-C endianess --cdrom-endianess
sets endianess of the input samples to 'little', 'big'
or 'guess' to override defaults.
-E endianess --output-endianess
sets endianess of the output samples to 'little' or
'big' to override defaults.
-d duration --duration
sets recording time in seconds or frames. Frames (sec-
tors) are indicated by a 'f' suffix (like 75f for 75
sectors). 0 sets the time for whole track.
-B --bulk
copies each track into a seperate file.
-w --wait
waits for signal, then start recording.
-F --find-extremes
finds extrem amplitudes in samples.
-G --find-mono
finds if input samples are in mono.
-T --deemphasize
undo the effect of pre-emphasis in the input samples.
-e --echo
copies audio data to sound device e.g. /dev/dsp.
-p percentage --set-pitch
changes pitch of audio data copied to sound device.
-v level --verbose-level
prints verbose information about the CD. Level is a
sum of powers of two in the range 0 up to 63. Each
power of two controls the type of information to be
reported. center box ; r| l|.
Power Description _ 1 show table of contents
2 show a summary of the recording parameters 4 deter-
mine and display index offsets 8 retrieve and display the
media catalog number MCN 16 retrieve and display all
Intern. Standard Recording Codes ISRC 32 show the table of
contents in start sector notation 64 show the table of
contents with track titles (when available)
-N --no-write
does not write to a file, it just reads (for debugging
purposes).
-J --info-only
does not write to a file, it just gives information
about the disc.
-H --no-infofile
does not write an info file and a cddb file.
-g --gui
formats the output to be better parsable by gui fron-
tends.
-M count --md5
enables calculation of MD-5 checksum for 'count' bytes
from a beginning of a track.
-S speed --speed-select
sets the cdrom device to one of the selectable speeds
for reading.
-q --quiet
quiet operation, no screen output.
-V --verbose-SCSI
enable SCSI command logging to the console. This is
mainly used for debugging.
-Q --silent-SCSI
suppress SCSI command error reports to the console.
This is mainly used for guis.
-J --version
display version of cdda2wav on standard output.
Defaults depend on the
Makefile and environment variable settings (currently
CDDA_DEVICE ).
ENVIRONMENT VARIABLES
CDDA_DEVICE is used to set the device name. The device nam-
ing is compatible with Jörg Schilling's cdrecord package.
DISCUSSION
cdda2wav is able to read parts of an audio CD or multimedia
CDROM (containing audio parts) directly digitally. These
parts can be written to a file, a pipe, or to a sound dev-
ice.
cdda2wav stands for CDDA to WAV (where CDDA stands for com-
pact disc digital audio and WAV is a sound sample format
introduced by MS Windows). It allows copying CDDA audio
data from the CDROM drive into a file on your ram-, hard-,
floppy- or whatever-disk in WAV or other formats.
The latest versions try to get higher real-time scheduling
priorities to ensure smooth (uninterrupted) operation. These
priorities are available for super users and are higher than
those of 'normal' processes. Thus delays are minimized.
Please note that you need newer kernels and c libraries to
take advantage of this feature.
If your CDROM (not yet for SCSI drives) is on device
/dev/cdrom and it is loaded with an audio CD, you may simply
invoke cdda2wav and it will create the sound file audio.wav
recording the whole track beginning with track 1 in stereo
at 16 bit at 44100 Hz sample rate, if your file system has
enough space free. Otherwise recording time will be lim-
ited. SCSI drives have to use different devices (see files
README and README.INSTALL for details).
HINTS ON OPTIONS
Options
Most of the options are used to control the format of
the WAV file. In the following text all of them are
described.
Select Device
-D device selects the CDROM drive device to be used.
The specifier given should correspond to the selected
interface (see below). CHANGE! For the cooked_ioctl
interface this is the cdrom device descriptor as
before. The SCSI devices used with the addressed with
their SCSI-Bus, SCSI-Id, and SCSI device descriptor!!!
One example for a SCSI CDROM drive on bus 0 with SCSI
ID 3 and lun 0 is -D0,3,0.
Select Auxiliary device
-A auxdevice is necessary for CD-Extra handling. For
Non-SCSI-CDROM drives this is the same device as given
by -D (see above). For SCSI-CDROM drives it is the
CDROM drive (SCSI) device (i.e. /dev/sr0 ) correspond-
ing to the SCSI device (i.e. 0,3,0 ). It has to match
the device used for sampling.
Select Interface
-I interface selects the CDROM drive interface. For
SCSI drives use generic_scsi (cooked_ioctl may not yet
be available for all devices): generic_scsi and
cooked_ioctl. The first uses the generic SCSI inter-
face, the latter uses the ioctl of the CDROM driver.
The latter variant works only when the kernel driver
supports CDDA reading. This entry has to match the
selected CDROM device (see above).
Enable echo to soundcard
-e copies audio data to the sound card while recording,
so you hear it nearly simultaneously. The soundcard
gets the same data that is recorded. This is time crit-
ical, so it works best with the -q option. To use
cdda2wav as a pseudo CD player without recording in a
file you could use cdda2wav -q -e -t2 -d0 -N to play
the whole second track. This feature reduces the
recording speed to at most onefold speed. You cannot
make better recordings than your sound card can play
(since the same data is used).
Change pitch of echoed audio
-p percentage changes the pitch of all audio echoed to
a sound card. Only the copy to the soundcard is
affected, the recorded audio samples in a file remain
the same. Normal pitch, which is the default, is given
by 100%. Lower percentages correspond to lower
pitches, i.e. -p 50 transposes the audio output one
octave lower. See also the script pitchplay as an
example. This option was contributed by Raul Sobon.
Select mono or stereo recording
-m or -c 1 selects mono recording (both stereo channels
are mixed), -s or -c 2 or -c s selects stereo recording
(doubles file size). Parameter s will swap both sound
channels.
Select maximum quality
-x will set stereo, 16 bits per sample at 44.1 KHz
(full CD quality). Note that other format options
given later can change this setting.
Select sample quality
-b 8 specifies 8 bit (1 Byte) for each sample in each
channel; -b 12 specifies 12 bit (2 Byte) for each sam-
ple in each channel; -b 16 specifies 16 bit (2 Byte)
for each sample in each channel (Ensure that your sam-
ple player or sound card is capable of playing 12-bit
or 16-bit samples). Selecting 12 or 16 bits doubles
file size. 12-bit samples are aligned to 16-bit sam-
ples, so they waste some disk space.
Select sample rate
-r samplerate selects a sample rate. samplerate can be
in a range between 44100 and 900. Option -R lists all
available rates.
Select sample rate divider
-a divider selects a sample rate divider. divider can
be minimally 1 and maximally 50.5 and everything
between in steps of 0.5. Option -R lists all available
rates.
To make the sound smoother at lower sampling rates,
cdda2wav sums over n samples (where n is the specific
dividend). So for 22050 Hertz output we have to sum
over 2 samples, for 900 Hertz we have to sum over 49
samples. This cancels higher frequencies. Standard
sector size of an audio CD (ignoring additional infor-
mation) is 2352 Bytes. In order to finish summing for
an output sample at sector boundaries the rates above
have to be choosen. Arbitrary sampling rates in high
quality would require some interpolation scheme, which
needs much more sophisticated programming.
List a table of all sampling rates
-R shows a list of all sample rates and their dividers.
Dividers can range from 1 to 50.5 in steps of 0.5.
Select start track and optionally end track
-t n+m selects n as the start track and optionally m as
the last track of a range to be recorded. These tracks
must be from the table of contents. This sets the
track where recording begins. Recording can advance
through the following tracks as well (limited by the
optional end track or otherwise depending on recording
time). Whether one file or different files are then
created depends on the -B option (see below).
Select start index
-i n selects the index to start recording with.
Indices other than 1 will invoke the index scanner,
which will take some time to find the correct start
position. An offset may be given additionally (see
below).
Set recording time
-d n sets recording time to n seconds or set recording
time for whole track if n is zero. In order to specify
the duration in frames (sectors) also, the argument can
have an appended 'f'. Then the numerical argument is to
be taken as frames (sectors) rather than seconds.
Please note that if track ranges are being used they
define the recording time as well thus overriding any -
d option specified times.
Recording time is defined as the time the generated
sample will play (at the defined sample rate). Since
it's related to the amount of generated samples, it's
not the time of the sampling process itself (which can
be less or more). It's neither strictly coupled with
the time information on the audio CD (shown by your
hifi CD player). Differences can occur by the usage of
the -o option (see below). Notice that recording time
will be shortened, unless enough disk space exists.
Recording can be aborted at anytime by pressing the
break character (signal SIGQUIT).
Record a complete audio CD in one go in different files
-B copies each track into a seperate file. A base name
can be given. File names have an appended track number
and an extension corresponding to the audio format. To
record all audio tracks of a CD, use a sufficient high
duration (i.e. -d99999).
Set start sector offset
-o sectors increments start sector of the track by sec-
tors. By this option you are able to skip a certain
amount at the beginning of a track so you can pick
exactly the part you want. Each sector runs for 1/75
seconds, so you have very fine control. If your offset
is so high that it would not fit into the current
track, a warning message is issued and the offset is
ignored. Recording time is not reduced. (To skip
introductory quiet passages automagically, use the - w
option see below.)
Wait for signal option
-w Turning on this option will suppress all silent out-
put at startup, reducing possibly file size. cdda2wav
will watch for any signal in the output signal and
switches on writing to file.
Find extrem samples
-F Turning on this option will display the most nega-
tive and the most positive sample value found during
recording for both channels. This can be useful for
readjusting the volume. The values shown are not reset
at track boundaries, they cover the complete sampling
process. They are taken from the original samples and
have the same format (i.e. they are independent of the
selected output format).
Find if input samples are in mono
-G If this option is given, input samples for both
channels will be compared. At the end of the program
the result is printed. Differences in the channels
indicate stereo, otherwise when both channels are equal
it will indicate mono.
Undo the pre-emphasis in the input samples
-T Some older audio CDs are recorded with a modified
frequency response called pre-emphasis. This is found
mostly in classical recordings. The correction can be
seen in the flags of the Table Of Contents often. But
there are recordings, that show this setting only in
the subchannels. If this option is given, the index
scanner will be started, which reads the q-subchannel
of each track. If pre-emphasis is indicated in the q-
subchannel of a track, but not in the TOC, pre-emphasis
will be assumed to be present, and subsequently a
reverse filtering is done for this track before the
samples are written into the audio file.
Set audio format
-O audiotype can be wav (for wav files) or au or sun
(for sun PCM files) or cdr or raw (for headerless files
to be used for cd writers). All file samples are coded
in linear pulse code modulation (as done in the audio
compact disc format). This holds for all audio formats.
Wav files are compatible to Wind*ws sound files, they
have lsb,msb byte order as being used on the audio cd.
The default filename extension is '.wav'. Sun type
files are not like the older common logarithmically
coded .au files, but instead as mentioned above linear
PCM is used. The byte order is msb,lsb to be compati-
ble. The default filename extension is '.au'. The AIFF
and the newer variant AIFC from the Apple/SGI world
store their samples in bigendian format (msb,lsb). In
AIFC no compression is used. Finally the easiest 'for-
mat', the cdr aka raw format. It is done per default in
msb,lsb byte order to satisfy the order wanted by most
cd writers. Since there is no header information in
this format, the sample parameters can only be identi-
fied by playing the samples on a soundcard or similiar.
The default filename extension is '.cdr' or '.raw'.
Select cdrom drive reading speed
-S speed allows to switch the cdrom drive to a certain
level of speed in order to reduce read errors. The
argument is transfered verbatim to the drive. Details
depend very much on the cdrom drives. An argument of 0
for example is often the default speed of the drive, a
value of 1 often selects single speed.
Enable MD5 checksums
-M count enables calculation of MD-5 checksum for
'count' bytes from the beginning of a track. This was
introduced for quick comparisons of tracks.
Do linear or overlapping reading of sectors
-P sectors sets the given number of sectors for ini-
tial overlap sampling for jitter correction. Two cases
are to be distinguished. For nonzero values, some sec-
tors are read twice to enable cdda2wav's jitter correc-
tion. If an argument of zero is given, no overlap sam-
pling will be used. For nonzero overlap sectors
cdda2wav dynamically adjusts the setting during sam-
pling (like cdparanoia does). If no match can be
found, cdda2wav retries the read with an increased
overlap. If the amount of jitter is lower than the
current overlapped samples, cdda2wav reduces the over-
lap setting, resulting in a higher reading speed. The
argument given has to be lower than the total number of
sectors per request (see option -n below). Cdda2wav
will check this setting and issues a error message oth-
erwise. The case of zero sectors is nice on low load
situations or errorfree (perfect) cdrom drives and per-
fect (not scratched) audio cds.
Set the transfer size
-n sectors will set the transfer size to the specified
sectors per request.
Set number of ring buffer elements
-l buffers will allocate the specified number of ring
buffer elements.
Set endianess of input samples
-C endianess will override the default settings of the
input format. Endianess can be set explicitly to "lit-
tle" or "big" or to the automatic endianess detection
based on voting with "guess".
Set endianess of output samples
-E endianess (endianess can be "little" or "big") will
override the default settings of the output format.
Verbose option
-v level prints more information. A binary mask allows
selection of different information.
0 keeps quiet
1 displays the table of contents
2 displays a summary of recording parameters
4 invokes the index scanner and displays start posi-
tions of indices
8 retrieves and displays a media catalog number
16 retrieves and displays international standard
recording codes
32 displays track start positions in absolute sector
notation
To combine several requests just add the numbers and
give the sum as argument.
The table of contents
The display will show the table of contents with number
of tracks and total time (displayed in mm:ss.hh format,
mm=minutes, ss=seconds, hh=rounded 1/100 seconds). The
following list displays track number and track time for
each entry. The summary gives a line per track
describing the type of the track.
track preemphasis copypermitted tracktype chans
The track column holds the track number. preemphasis
shows if that track has been given a non linear fre-
quency response. NOTE: You can undo this effect with
the -T option. copy-permitted indicates if this track
is allowed to copy. tracktype can be data or audio. On
multimedia CDs (except hidden track CDs) both of them
should be present. channels is defined for audio
tracks only. There can be two or four channels.
No file output
-N this debugging option switches off writing to a
file.
No infofile generation
-H this option switches off creation of an info file
and a cddb file.
Generation of simple output for gui frontends
-g this option switches on simple line formatting,
which is needed to support gui frontends (like xcd-
roast).
Verbose SCSI logging
-V this option switches on logging of SCSI commands.
This will produce a lot of output (when SCSI devices
are being used), which is needed for debugging pur-
poses. The format is the same as being used with the
cdrecord package from J ö rg Schilling. I will not
describe it here.
Quiet option
-q suppresses all screen output except error messages.
That reduces cpu time resources.
Just show information option
-J does not write a file, it only prints information
about the disc (depending on the -v option). This is
just for information purposes.
HINTS ON USAGE
Don't create samples you cannot read. First check your sam-
ple player software and sound card hardware. I experienced
problems with very low sample rates (stereo <= 1575 Hz, mono
<= 3675 Hz) when trying to play them with standard WAV
players for sound blaster (maybe they are not legal in WAV
format). Most CD-Writers insist on audio samples in a bigen-
dian format. Now cdda2wav supports the -E endianess option
to control the endianess of the written samples.
If your hardware is fast enough to run cdda2wav uninter-
rupted and your CD drive is one of the 'perfect' ones, you
will gain speed when switching all overlap sampling off with
the -P 0 option. Further fine tuning can be done with the -
n sectors option. You can specify how much sectors should
be requested in one go.
Cdda2wav supports pipes now. Use a filename of - to let
cdda2wav output its samples to standard output.
Conversion to other sound formats can be done using the sox
program package (although the use of sox -x to change the
byte order of samples should be no more necessary; see
option -E to change the output byteorder).
If you really want to sample more than one track into dif-
ferent files in one run, this is currently possible with the
-B option. When recording time exceeds the track limit a new
file will be opened for the next track.
FILES
Cdda2wav can generate a lot of files for various purposes.
Audio files:
There are audio files containing samples with default exten-
sions These files are not generated when option (-N) is
given. Multiple files may be written when the bulk copy
option (-B) is used. Individual file names can be given as
arguments. If the number of file names given is sufficient
to cover all included audio tracks, the file names will be
used verbatim. Otherwise, if there are less file names than
files needed to write the included tracks, the part of the
file name before the extension is extended with '_dd' where
dd represents the current track number.
Cddb and Cdindex files:
If cdda2wav detects cd-extra or cd-text (album/track) title
information, then .cddb and .cdindex files are generated
unless suppressed by the option -H. They contain suitable
formatted entries for submission to audio cd track title
databases in the internet. The CDINDEX and CDDB(tm) systems
are currently supported. For more information please visit
www.cdindex.org and www.cddb.com.
Inf files:
The inf files are describing the sample files and the part
from the audio cd, it was taken from. They are a means to
transfer information to a cd burning program like cdrecord.
For example, if the original audio cd had pre-emphasis
enabled, and cdda2wav -T did remove the pre-emphase, then
the inf file has pre-emphasis not set (since the audio file
does not have it anymore), while the .cddb and the .cdindex
have pre-emphase set as the original does.
WARNING
IMPORTANT: it is prohibited to sell copies of copyrighted
material by noncopyright holders. This program may not be
used to circumvent copyrights. The user acknowledges this
constraint when using the software.
BUGS
Performance may not be optimal on slower systems.
The index scanner may give timeouts.
The resampling (rate conversion code) uses polynomial inter-
polation, which is not optimal.
Cdda2wav should use threads.
Cdda2wav currently cannot sample hidden audio tracks (track
1 index 0).
ACKNOWLEDGEMENTS
Thanks goto Project MODE (http://www.mode.net/) and
Fraunhofer Institut f ür integrierte Schaltungen (FhG-IIS)
(http://www.iis.fhg.de/) for financial support. Plextor
Europe and Ricoh Japan provided cdrom disk drives and cd
burners which helped a lot to develop this software. Rammi
has helped a lot with the debugging and showed a lot of
stamina when hearing 100 times the first 16 seconds of the
first track of the Krupps CD. Paranoia patches contributed
by Monty xiphmont@mit.edu.
AUTHOR
Heiko Eissfeldt heiko@colossus.escape.de
DATE
11 Feb 2000
Man(1) output converted with
man2html
GMD Homepage
FOKUS Homepage
Schily's Homepage