Welcome to Video Converter documentation!¶
Contents:
Introduction¶
Video Converter is a Python module for converting video files from one format and codec to another.
It uses the FFmpeg multimedia framework for actual file processing, and adds an easy-to-use API for probing and converting media files on top of it.
Licensing and Patents¶
Although FFmpeg is licensed under LGPL/GPL, Video Converter only invokes the existing ffmpeg executables on the system (ie. doesn’t link to the ffmpeg libraries), so it doesn’t need to be LGPL/GPL as well.
The same applies to patents. If you’re in a country which recognizes software patents, it’s up to you to ensure you’re complying with the patent laws. Please read the FFMpeg Legal FAQ for more information.
Installation and Requirements¶
Video Converter requires a working Python installation, and a fairly recent version of ffmpeg libraries and utilities (ffmpeg and ffprobe).
Video Converter installation¶
To build the library, run:
python setup.py build
To run automated tests:
python setup.py test
To create this documentation:
python setup.py doc
To install the library:
python setup.py install
Custom compiling ffmpeg¶
The supported formats and codecs depend on the support compiled in to ffmpeg. Many distributors choose to enable only a subset of the supported codecs, so if the version installed by your OS/distribution doesn’t support a particular feature, it’s advisable to recompile ffmpeg yourself.
The latest development version of FFmpeg can be downloaded from the official git repository.
To build all the codecs that Video Converter can use, you can use the following configure options:
./configure --prefix=${TARGET_PREFIX} \
--extra-cflags=-I${TARGET_PREFIX}/include \
--extra-ldflags=-L${TARGET_PREFIX}/lib \
--enable-libmp3lame \
--enable-libvorbis \
--enable-libtheora \
--enable-libx264 --enable-gpl \
--enable-libvpx \
--enable-libxvid
make
You will need to install (either the version built by your OS distributor if it’s new enough, or a custom-compiled one) the mentioned extra libraries so ffmpeg can make use of them.
Tutorial¶
Most of the tasks (video probing, converting, thumbnail generation) can be done using the high-level Converter object.
Creating the Converter object¶
First we need to import the module and create the object:
>>> from converter import Converter
>>> c = Converter()
By default, the converter uses ffmpeg and ffprobe binaries in your path. If it should use them from another location, you can specify the paths to them in the Converter constructor.
Getting existing multimedia file properties¶
Probing the media file will return a MediaInfo
object, containing
various information about the file format, media streams, codecs and properties:
>>> info = c.probe('test1.ogg')
>>> info.format.format
'ogg'
>>> info.format.duration
33.00
>>> len(info.streams)
2
>>> info.video.codec
'theora'
>>> info.video.video_width
720
>>> info.video.video_height
400
>>> info.audio.codec
'vorbis'
>>> info.audio.audio_channels
2
A full list of properties can be found in MediaFormatInfo
and
MediaStreamInfo
documentation.
Converting a video into another format¶
To convert a media file into some other format (or to use some other codecs), you need to create a dictionary (map) of options specifying what to convert to.
The options dictionary looks like this:
{
'format': 'mkv',
'audio': {
'codec': 'mp3',
'samplerate': 11025,
'channels': 2
},
'video': {
'codec': 'h264',
'width': 720,
'height': 400,
'fps': 15
},
'subtitle': {
'codec': 'copy'
},
'map': 0
}
The full list of options can be found in Converter
documentation.
To prepare the conversion process:
>>> conv = c.convert('test1.ogg', '/tmp/output.ogg', options)
This won’t start the conversion, it will just prepare everything and return a generator. To run the conversion process, iterate the generator until it’s finished. On each iteration, the generator will yield a timecode, specifying how far into the media file is the conversion process at the moment (ie. at which second in the movie is the process).
To just drive the conversion without using the timecode information:
>>> for timecode in conv:
... pass
Getting audio from a video file¶
To just get the audio content from a video file, you can use the conversion as above, specifying in the options that the video should be dropped:
{
'format': 'mp3',
'audio': {
'codec': 'mp3',
'bitrate': '22050',
'channels': 1
}
}
Since the video is not specified in the output, the video stream will be dropped. Likewise, you can drop the audio stream from the output.
If you just want to copy audio or video stream as is, without conversion, you can do that by specifying the ‘copy’ codec.
Creating a thumbnail¶
To create a thumbnail form a video file (from 10 seconds in the movie):
>>> c.thumbnail('test1.ogg', 10, '/tmp/shot.png')
You can specify the screenshot dimensions:
>>> c.thumbnail('test1.ogg', 10, '/tmp/shot.png', '320x200')
API Reference¶
Converter high-level API¶
-
class
converter.
Converter
(ffmpeg_path=None, ffprobe_path=None)¶ Converter class, encapsulates formats and codecs.
>>> c = Converter()
-
convert
(infile, outfile, options, twopass=False, timeout=10)¶ Convert media file (infile) according to specified options, and save it to outfile. For two-pass encoding, specify the pass (1 or 2) in the twopass parameter.
- Options should be passed as a dictionary. The keys are:
- format (mandatory, string) - container format; see formats.BaseFormat for list of supported formats
- audio (optional, dict) - audio codec and options; see avcodecs.AudioCodec for list of supported options
- video (optional, dict) - video codec and options; see avcodecs.VideoCodec for list of supported options
- map (optional, int) - can be used to map all content of stream 0
Multiple audio/video streams are not supported. The output has to have at least an audio or a video stream (or both).
Convert returns a generator that needs to be iterated to drive the conversion process. The generator will periodically yield timecode of currently processed part of the file (ie. at which second in the content is the conversion process currently).
The optional timeout argument specifies how long should the operation be blocked in case ffmpeg gets stuck and doesn’t report back. This doesn’t limit the total conversion time, just the amount of time Converter will wait for each update from ffmpeg. As it’s usually less than a second, the default of 10 is a reasonable default. To disable the timeout, set it to None. You may need to do this if using Converter in a threading environment, since the way the timeout is handled (using signals) has special restriction when using threads.
>>> conv = Converter().convert('test1.ogg', '/tmp/output.mkv', { ... 'format': 'mkv', ... 'audio': { 'codec': 'aac' }, ... 'video': { 'codec': 'h264' } ... })
>>> for timecode in conv: ... pass # can be used to inform the user about the progress
-
parse_options
(opt, twopass=None)¶ Parse format/codec options and prepare raw ffmpeg option list.
-
probe
(fname, posters_as_video=True)¶ Examine the media file. See the documentation of converter.FFMpeg.probe() for details.
Parameters: posters_as_video – Take poster images (mainly for audio files) as A video stream, defaults to True
-
thumbnail
(fname, time, outfile, size=None, quality=4)¶ Create a thumbnail of the media file. See the documentation of converter.FFMpeg.thumbnail() for details.
-
thumbnails
(fname, option_list)¶ Create one or more thumbnail of the media file. See the documentation of converter.FFMpeg.thumbnails() for details.
-
Container formats¶
-
class
converter.formats.
AviFormat
¶ Avi container format, often used vith DivX video.
-
class
converter.formats.
BaseFormat
¶ Base format class.
Supported formats are: ogg, avi, mkv, webm, flv, mov, mp4, mpeg
-
class
converter.formats.
FlvFormat
¶ Flash Video container format.
-
class
converter.formats.
MkvFormat
¶ Matroska format, often used with H.264 video.
-
class
converter.formats.
MovFormat
¶ Mov container format, used mostly with H.264 video content, often for mobile platforms.
-
class
converter.formats.
Mp3Format
¶ Mp3 container, used audio-only mp3 files
-
class
converter.formats.
Mp4Format
¶ Mp4 container format, the default Format for H.264 video content.
-
class
converter.formats.
MpegFormat
¶ MPEG(TS) container, used mainly for MPEG 1/2 video codecs.
-
class
converter.formats.
OggFormat
¶ Ogg container format, mostly used with Vorbis and Theora.
-
class
converter.formats.
WebmFormat
¶ WebM is Google’s variant of Matroska containing only VP8 for video and Vorbis for audio content.
Audio and video codecs¶
-
class
converter.avcodecs.
AacCodec
¶ AAC audio codec.
-
class
converter.avcodecs.
Ac3Codec
¶ AC3 audio codec.
-
class
converter.avcodecs.
AudioCodec
¶ Base audio codec class handles general audio options. Possible parameters are:
- codec (string) - audio codec name
- channels (integer) - number of audio channels
- bitrate (integer) - stream bitrate
- samplerate (integer) - sample rate (frequency)
Supported audio codecs are: null (no audio), copy (copy from original), vorbis, aac, mp3, mp2
-
class
converter.avcodecs.
AudioCopyCodec
¶ Copy audio stream directly from the source.
-
class
converter.avcodecs.
AudioNullCodec
¶ Null audio codec (no audio).
-
class
converter.avcodecs.
BaseCodec
¶ Base audio/video codec class.
-
class
converter.avcodecs.
DVBSub
¶ DVB subtitles.
-
class
converter.avcodecs.
DVDSub
¶ DVD subtitles.
-
class
converter.avcodecs.
DivxCodec
¶ DivX video codec.
-
class
converter.avcodecs.
DtsCodec
¶ DTS audio codec.
-
class
converter.avcodecs.
FdkAacCodec
¶ AAC audio codec.
-
class
converter.avcodecs.
FlacCodec
¶ FLAC audio codec.
-
class
converter.avcodecs.
FlvCodec
¶ Flash Video codec.
-
class
converter.avcodecs.
H263Codec
¶ H.263 video codec.
-
class
converter.avcodecs.
H264Codec
¶ H.264/AVC video codec. @see http://ffmpeg.org/trac/ffmpeg/wiki/x264EncodingGuide
-
class
converter.avcodecs.
MOVTextCodec
¶ mov_text subtitle codec.
-
class
converter.avcodecs.
Mp2Codec
¶ MP2 (MPEG layer 2) audio codec.
-
class
converter.avcodecs.
Mp3Codec
¶ MP3 (MPEG layer 3) audio codec.
-
class
converter.avcodecs.
Mpeg1Codec
¶ MPEG-1 video codec.
-
class
converter.avcodecs.
Mpeg2Codec
¶ MPEG-2 video codec.
-
class
converter.avcodecs.
MpegCodec
¶ Base MPEG video codec.
-
class
converter.avcodecs.
SSA
¶ SSA (SubStation Alpha) subtitle.
-
class
converter.avcodecs.
SubRip
¶ SubRip subtitle.
-
class
converter.avcodecs.
SubtitleCodec
¶ Base subtitle codec class handles general subtitle options. Possible parameters are:
- codec (string) - subtitle codec name (mov_text, subrib, ssa only supported currently)
- language (string) - language of subtitle stream (3 char code)
- forced (int) - force subtitles (1 true, 0 false)
- default (int) - default subtitles (1 true, 0 false)
Supported subtitle codecs are: null (no subtitle), mov_text
-
class
converter.avcodecs.
SubtitleCopyCodec
¶ Copy subtitle stream directly from the source.
-
class
converter.avcodecs.
SubtitleNullCodec
¶ Null video codec (no video).
-
class
converter.avcodecs.
TheoraCodec
¶ Theora video codec. @see http://ffmpeg.org/trac/ffmpeg/wiki/TheoraVorbisEncodingGuide
-
class
converter.avcodecs.
VideoCodec
¶ Base video codec class handles general video options. Possible parameters are:
- codec (string) - video codec name
- bitrate (string) - stream bitrate
- fps (integer) - frames per second
- width (integer) - video width
- height (integer) - video height
- mode (string) - aspect preserval mode; one of:
- stretch (default) - don’t preserve aspect
- crop - crop extra w/h
- pad - pad with black bars
- src_width (int) - source width
- src_height (int) - source height
Aspect preserval mode is only used if both source and both destination sizes are specified. If source dimensions are not specified, aspect settings are ignored.
If source dimensions are specified, and only one of the destination dimensions is specified, the other one is calculated to preserve the aspect ratio.
Supported video codecs are: null (no video), copy (copy directly from the source), Theora, H.264/AVC, DivX, VP8, H.263, Flv, MPEG-1, MPEG-2.
-
class
converter.avcodecs.
VideoCopyCodec
¶ Copy video stream directly from the source.
-
class
converter.avcodecs.
VideoNullCodec
¶ Null video codec (no video).
-
class
converter.avcodecs.
VorbisCodec
¶ Vorbis audio codec. @see http://ffmpeg.org/trac/ffmpeg/wiki/TheoraVorbisEncodingGuide
-
class
converter.avcodecs.
Vp8Codec
¶ Google VP8 video codec.
Low-level ffmpeg wrapper¶
-
class
converter.ffmpeg.
FFMpeg
(ffmpeg_path=None, ffprobe_path=None)¶ FFMPeg wrapper object, takes care of calling the ffmpeg binaries, passing options and parsing the output.
>>> f = FFMpeg()
-
convert
(infile, outfile, opts, timeout=10)¶ Convert the source media (infile) according to specified options (a list of ffmpeg switches as strings) and save it to outfile.
Convert returns a generator that needs to be iterated to drive the conversion process. The generator will periodically yield timecode of currently processed part of the file (ie. at which second in the content is the conversion process currently).
The optional timeout argument specifies how long should the operation be blocked in case ffmpeg gets stuck and doesn’t report back. See the documentation in Converter.convert() for more details about this option.
>>> conv = FFMpeg().convert('test.ogg', '/tmp/output.mp3', ... ['-acodec libmp3lame', '-vn']) >>> for timecode in conv: ... pass # can be used to inform the user about conversion progress
-
probe
(fname, posters_as_video=True)¶ Examine the media file and determine its format and media streams. Returns the MediaInfo object, or None if the specified file is not a valid media file.
>>> info = FFMpeg().probe('test1.ogg') >>> info.format 'ogg' >>> info.duration 33.00 >>> info.video.codec 'theora' >>> info.video.width 720 >>> info.video.height 400 >>> info.audio.codec 'vorbis' >>> info.audio.channels 2 :param posters_as_video: Take poster images (mainly for audio files) as A video stream, defaults to True
-
thumbnail
(fname, time, outfile, size=None, quality=4)¶ Create a thumbnal of media file, and store it to outfile @param time: time point (in seconds) (float or int) @param size: Size, if specified, is WxH of the desired thumbnail.
If not specified, the video resolution is used.- @param quality: quality of jpeg file in range 2(best)-31(worst)
- recommended range: 2-6
>>> FFMpeg().thumbnail('test1.ogg', 5, '/tmp/shot.png', '320x240')
-
thumbnails
(fname, option_list)¶ Create one or more thumbnails of video. @param option_list: a list of tuples like:
(time, outfile, size=None, quality=DEFAULT_JPEG_QUALITY) see documentation of converter.FFMpeg.thumbnail() for details.>>> FFMpeg().thumbnails('test1.ogg', [(5, '/tmp/shot.png', '320x240'), >>> (10, '/tmp/shot2.png', None, 5)])
-
-
class
converter.ffmpeg.
MediaFormatInfo
¶ - Describes the media container format. The attributes are:
- format - format (short) name (eg. “ogg”)
- fullname - format full (descriptive) name
- bitrate - total bitrate (bps)
- duration - media duration in seconds
- filesize - file size
-
parse_ffprobe
(key, val)¶ Parse raw ffprobe output (key=value).
-
class
converter.ffmpeg.
MediaInfo
(posters_as_video=True)¶ Information about media object, as parsed by ffprobe. The attributes are:
- format - a MediaFormatInfo object
- streams - a list of MediaStreamInfo objects
-
audio
¶ First audio stream, or None if there are no audio streams.
-
parse_ffprobe
(raw)¶ Parse raw ffprobe output.
-
video
¶ First video stream, or None if there are no video streams.
-
class
converter.ffmpeg.
MediaStreamInfo
¶ Describes one stream inside a media file. The general attributes are:
- index - stream index inside the container (0-based)
- type - stream type, either ‘audio’ or ‘video’
- codec - codec (short) name (e.g “vorbis”, “theora”)
- codec_desc - codec full (descriptive) name
- duration - stream duration in seconds
- metadata - optional metadata associated with a video or audio stream
- bitrate - stream bitrate in bytes/second
- attached_pic - (0, 1 or None) is stream a poster image? (e.g. in mp3)
- Video-specific attributes are:
- video_width - width of video in pixels
- video_height - height of video in pixels
- video_fps - average frames per second
- Audio-specific attributes are:
- audio_channels - the number of channels in the stream
- audio_samplerate - sample rate (Hz)
-
parse_ffprobe
(key, val)¶ Parse raw ffprobe output (key=value).