1 @chapter Filtering Introduction
2 @c man begin FILTERING INTRODUCTION
4 Filtering in FFmpeg is enabled through the libavfilter library.
6 In libavfilter, a filter can have multiple inputs and multiple
8 To illustrate the sorts of things that are possible, we consider the
13 input --> split ---------------------> overlay --> output
16 +-----> crop --> vflip -------+
19 This filtergraph splits the input stream in two streams, then sends one
20 stream through the crop filter and the vflip filter, before merging it
21 back with the other stream by overlaying it on top. You can use the
22 following command to achieve this:
25 ffmpeg -i INPUT -vf "split [main][tmp]; [tmp] crop=iw:ih/2:0:0, vflip [flip]; [main][flip] overlay=0:H/2" OUTPUT
28 The result will be that the top half of the video is mirrored
29 onto the bottom half of the output video.
31 Filters in the same linear chain are separated by commas, and distinct
32 linear chains of filters are separated by semicolons. In our example,
33 @var{crop,vflip} are in one linear chain, @var{split} and
34 @var{overlay} are separately in another. The points where the linear
35 chains join are labelled by names enclosed in square brackets. In the
36 example, the split filter generates two outputs that are associated to
37 the labels @var{[main]} and @var{[tmp]}.
39 The stream sent to the second output of @var{split}, labelled as
40 @var{[tmp]}, is processed through the @var{crop} filter, which crops
41 away the lower half part of the video, and then vertically flipped. The
42 @var{overlay} filter takes in input the first unchanged output of the
43 split filter (which was labelled as @var{[main]}), and overlay on its
44 lower half the output generated by the @var{crop,vflip} filterchain.
46 Some filters take in input a list of parameters: they are specified
47 after the filter name and an equal sign, and are separated from each other
50 There exist so-called @var{source filters} that do not have an
51 audio/video input, and @var{sink filters} that will not have audio/video
54 @c man end FILTERING INTRODUCTION
57 @c man begin GRAPH2DOT
59 The @file{graph2dot} program included in the FFmpeg @file{tools}
60 directory can be used to parse a filtergraph description and issue a
61 corresponding textual representation in the dot language.
68 to see how to use @file{graph2dot}.
70 You can then pass the dot description to the @file{dot} program (from
71 the graphviz suite of programs) and obtain a graphical representation
74 For example the sequence of commands:
76 echo @var{GRAPH_DESCRIPTION} | \
77 tools/graph2dot -o graph.tmp && \
78 dot -Tpng graph.tmp -o graph.png && \
82 can be used to create and display an image representing the graph
83 described by the @var{GRAPH_DESCRIPTION} string. Note that this string must be
84 a complete self-contained graph, with its inputs and outputs explicitly defined.
85 For example if your command line is of the form:
87 ffmpeg -i infile -vf scale=640:360 outfile
89 your @var{GRAPH_DESCRIPTION} string will need to be of the form:
91 nullsrc,scale=640:360,nullsink
93 you may also need to set the @var{nullsrc} parameters and add a @var{format}
94 filter in order to simulate a specific input file.
98 @chapter Filtergraph description
99 @c man begin FILTERGRAPH DESCRIPTION
101 A filtergraph is a directed graph of connected filters. It can contain
102 cycles, and there can be multiple links between a pair of
103 filters. Each link has one input pad on one side connecting it to one
104 filter from which it takes its input, and one output pad on the other
105 side connecting it to one filter accepting its output.
107 Each filter in a filtergraph is an instance of a filter class
108 registered in the application, which defines the features and the
109 number of input and output pads of the filter.
111 A filter with no input pads is called a "source", and a filter with no
112 output pads is called a "sink".
114 @anchor{Filtergraph syntax}
115 @section Filtergraph syntax
117 A filtergraph has a textual representation, which is
118 recognized by the @option{-filter}/@option{-vf} and @option{-filter_complex}
119 options in @command{ffmpeg} and @option{-vf} in @command{ffplay}, and by the
120 @code{avfilter_graph_parse()}/@code{avfilter_graph_parse2()} functions defined in
121 @file{libavfilter/avfilter.h}.
123 A filterchain consists of a sequence of connected filters, each one
124 connected to the previous one in the sequence. A filterchain is
125 represented by a list of ","-separated filter descriptions.
127 A filtergraph consists of a sequence of filterchains. A sequence of
128 filterchains is represented by a list of ";"-separated filterchain
131 A filter is represented by a string of the form:
132 [@var{in_link_1}]...[@var{in_link_N}]@var{filter_name}=@var{arguments}[@var{out_link_1}]...[@var{out_link_M}]
134 @var{filter_name} is the name of the filter class of which the
135 described filter is an instance of, and has to be the name of one of
136 the filter classes registered in the program.
137 The name of the filter class is optionally followed by a string
140 @var{arguments} is a string which contains the parameters used to
141 initialize the filter instance. It may have one of two forms:
145 A ':'-separated list of @var{key=value} pairs.
148 A ':'-separated list of @var{value}. In this case, the keys are assumed to be
149 the option names in the order they are declared. E.g. the @code{fade} filter
150 declares three options in this order -- @option{type}, @option{start_frame} and
151 @option{nb_frames}. Then the parameter list @var{in:0:30} means that the value
152 @var{in} is assigned to the option @option{type}, @var{0} to
153 @option{start_frame} and @var{30} to @option{nb_frames}.
156 A ':'-separated list of mixed direct @var{value} and long @var{key=value}
157 pairs. The direct @var{value} must precede the @var{key=value} pairs, and
158 follow the same constraints order of the previous point. The following
159 @var{key=value} pairs can be set in any preferred order.
163 If the option value itself is a list of items (e.g. the @code{format} filter
164 takes a list of pixel formats), the items in the list are usually separated by
167 The list of arguments can be quoted using the character "'" as initial
168 and ending mark, and the character '\' for escaping the characters
169 within the quoted text; otherwise the argument string is considered
170 terminated when the next special character (belonging to the set
171 "[]=;,") is encountered.
173 The name and arguments of the filter are optionally preceded and
174 followed by a list of link labels.
175 A link label allows one to name a link and associate it to a filter output
176 or input pad. The preceding labels @var{in_link_1}
177 ... @var{in_link_N}, are associated to the filter input pads,
178 the following labels @var{out_link_1} ... @var{out_link_M}, are
179 associated to the output pads.
181 When two link labels with the same name are found in the
182 filtergraph, a link between the corresponding input and output pad is
185 If an output pad is not labelled, it is linked by default to the first
186 unlabelled input pad of the next filter in the filterchain.
187 For example in the filterchain
189 nullsrc, split[L1], [L2]overlay, nullsink
191 the split filter instance has two output pads, and the overlay filter
192 instance two input pads. The first output pad of split is labelled
193 "L1", the first input pad of overlay is labelled "L2", and the second
194 output pad of split is linked to the second input pad of overlay,
195 which are both unlabelled.
197 In a complete filterchain all the unlabelled filter input and output
198 pads must be connected. A filtergraph is considered valid if all the
199 filter input and output pads of all the filterchains are connected.
201 Libavfilter will automatically insert @ref{scale} filters where format
202 conversion is required. It is possible to specify swscale flags
203 for those automatically inserted scalers by prepending
204 @code{sws_flags=@var{flags};}
205 to the filtergraph description.
207 Here is a BNF description of the filtergraph syntax:
209 @var{NAME} ::= sequence of alphanumeric characters and '_'
210 @var{LINKLABEL} ::= "[" @var{NAME} "]"
211 @var{LINKLABELS} ::= @var{LINKLABEL} [@var{LINKLABELS}]
212 @var{FILTER_ARGUMENTS} ::= sequence of chars (possibly quoted)
213 @var{FILTER} ::= [@var{LINKLABELS}] @var{NAME} ["=" @var{FILTER_ARGUMENTS}] [@var{LINKLABELS}]
214 @var{FILTERCHAIN} ::= @var{FILTER} [,@var{FILTERCHAIN}]
215 @var{FILTERGRAPH} ::= [sws_flags=@var{flags};] @var{FILTERCHAIN} [;@var{FILTERGRAPH}]
218 @section Notes on filtergraph escaping
220 Filtergraph description composition entails several levels of
221 escaping. See @ref{quoting_and_escaping,,the "Quoting and escaping"
222 section in the ffmpeg-utils(1) manual,ffmpeg-utils} for more
223 information about the employed escaping procedure.
225 A first level escaping affects the content of each filter option
226 value, which may contain the special character @code{:} used to
227 separate values, or one of the escaping characters @code{\'}.
229 A second level escaping affects the whole filter description, which
230 may contain the escaping characters @code{\'} or the special
231 characters @code{[],;} used by the filtergraph description.
233 Finally, when you specify a filtergraph on a shell commandline, you
234 need to perform a third level escaping for the shell special
235 characters contained within it.
237 For example, consider the following string to be embedded in
238 the @ref{drawtext} filter description @option{text} value:
240 this is a 'string': may contain one, or more, special characters
243 This string contains the @code{'} special escaping character, and the
244 @code{:} special character, so it needs to be escaped in this way:
246 text=this is a \'string\'\: may contain one, or more, special characters
249 A second level of escaping is required when embedding the filter
250 description in a filtergraph description, in order to escape all the
251 filtergraph special characters. Thus the example above becomes:
253 drawtext=text=this is a \\\'string\\\'\\: may contain one\, or more\, special characters
255 (note that in addition to the @code{\'} escaping special characters,
256 also @code{,} needs to be escaped).
258 Finally an additional level of escaping is needed when writing the
259 filtergraph description in a shell command, which depends on the
260 escaping rules of the adopted shell. For example, assuming that
261 @code{\} is special and needs to be escaped with another @code{\}, the
262 previous string will finally result in:
264 -vf "drawtext=text=this is a \\\\\\'string\\\\\\'\\\\: may contain one\\, or more\\, special characters"
267 @chapter Timeline editing
269 Some filters support a generic @option{enable} option. For the filters
270 supporting timeline editing, this option can be set to an expression which is
271 evaluated before sending a frame to the filter. If the evaluation is non-zero,
272 the filter will be enabled, otherwise the frame will be sent unchanged to the
273 next filter in the filtergraph.
275 The expression accepts the following values:
278 timestamp expressed in seconds, NAN if the input timestamp is unknown
281 sequential number of the input frame, starting from 0
284 the position in the file of the input frame, NAN if unknown
288 width and height of the input frame if video
291 Additionally, these filters support an @option{enable} command that can be used
292 to re-define the expression.
294 Like any other filtering option, the @option{enable} option follows the same
297 For example, to enable a blur filter (@ref{smartblur}) from 10 seconds to 3
298 minutes, and a @ref{curves} filter starting at 3 seconds:
300 smartblur = enable='between(t,10,3*60)',
301 curves = enable='gte(t,3)' : preset=cross_process
304 @c man end FILTERGRAPH DESCRIPTION
306 @chapter Audio Filters
307 @c man begin AUDIO FILTERS
309 When you configure your FFmpeg build, you can disable any of the
310 existing filters using @code{--disable-filters}.
311 The configure output will show the audio filters included in your
314 Below is a description of the currently available audio filters.
318 Delay one or more audio channels.
320 Samples in delayed channel are filled with silence.
322 The filter accepts the following option:
326 Set list of delays in milliseconds for each channel separated by '|'.
327 At least one delay greater than 0 should be provided.
328 Unused delays will be silently ignored. If number of given delays is
329 smaller than number of channels all remaining channels will not be delayed.
336 Delay first channel by 1.5 seconds, the third channel by 0.5 seconds and leave
337 the second channel (and any other channels that may be present) unchanged.
345 Apply echoing to the input audio.
347 Echoes are reflected sound and can occur naturally amongst mountains
348 (and sometimes large buildings) when talking or shouting; digital echo
349 effects emulate this behaviour and are often used to help fill out the
350 sound of a single instrument or vocal. The time difference between the
351 original signal and the reflection is the @code{delay}, and the
352 loudness of the reflected signal is the @code{decay}.
353 Multiple echoes can have different delays and decays.
355 A description of the accepted parameters follows.
359 Set input gain of reflected signal. Default is @code{0.6}.
362 Set output gain of reflected signal. Default is @code{0.3}.
365 Set list of time intervals in milliseconds between original signal and reflections
366 separated by '|'. Allowed range for each @code{delay} is @code{(0 - 90000.0]}.
367 Default is @code{1000}.
370 Set list of loudnesses of reflected signals separated by '|'.
371 Allowed range for each @code{decay} is @code{(0 - 1.0]}.
372 Default is @code{0.5}.
379 Make it sound as if there are twice as many instruments as are actually playing:
381 aecho=0.8:0.88:60:0.4
385 If delay is very short, then it sound like a (metallic) robot playing music:
391 A longer delay will sound like an open air concert in the mountains:
393 aecho=0.8:0.9:1000:0.3
397 Same as above but with one more mountain:
399 aecho=0.8:0.9:1000|1800:0.3|0.25
405 Modify an audio signal according to the specified expressions.
407 This filter accepts one or more expressions (one for each channel),
408 which are evaluated and used to modify a corresponding audio signal.
410 It accepts the following parameters:
414 Set the '|'-separated expressions list for each separate channel. If
415 the number of input channels is greater than the number of
416 expressions, the last specified expression is used for the remaining
419 @item channel_layout, c
420 Set output channel layout. If not specified, the channel layout is
421 specified by the number of expressions. If set to @samp{same}, it will
422 use by default the same input channel layout.
425 Each expression in @var{exprs} can contain the following constants and functions:
429 channel number of the current expression
432 number of the evaluated sample, starting from 0
438 time of the evaluated sample expressed in seconds
441 @item nb_out_channels
442 input and output number of channels
445 the value of input channel with number @var{CH}
448 Note: this filter is slow. For faster processing you should use a
457 aeval=val(ch)/2:c=same
461 Invert phase of the second channel:
469 Apply fade-in/out effect to input audio.
471 A description of the accepted parameters follows.
475 Specify the effect type, can be either @code{in} for fade-in, or
476 @code{out} for a fade-out effect. Default is @code{in}.
478 @item start_sample, ss
479 Specify the number of the start sample for starting to apply the fade
480 effect. Default is 0.
483 Specify the number of samples for which the fade effect has to last. At
484 the end of the fade-in effect the output audio will have the same
485 volume as the input audio, at the end of the fade-out transition
486 the output audio will be silence. Default is 44100.
489 Specify the start time of the fade effect. Default is 0.
490 The value must be specified as a time duration; see
491 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
492 for the accepted syntax.
493 If set this option is used instead of @var{start_sample}.
496 Specify the duration of the fade effect. See
497 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
498 for the accepted syntax.
499 At the end of the fade-in effect the output audio will have the same
500 volume as the input audio, at the end of the fade-out transition
501 the output audio will be silence.
502 By default the duration is determined by @var{nb_samples}.
503 If set this option is used instead of @var{nb_samples}.
506 Set curve for fade transition.
508 It accepts the following values:
511 select triangular, linear slope (default)
513 select quarter of sine wave
515 select half of sine wave
517 select exponential sine wave
521 select inverted parabola
537 Fade in first 15 seconds of audio:
543 Fade out last 25 seconds of a 900 seconds audio:
545 afade=t=out:st=875:d=25
552 Set output format constraints for the input audio. The framework will
553 negotiate the most appropriate format to minimize conversions.
555 It accepts the following parameters:
559 A '|'-separated list of requested sample formats.
562 A '|'-separated list of requested sample rates.
564 @item channel_layouts
565 A '|'-separated list of requested channel layouts.
567 See @ref{channel layout syntax,,the Channel Layout section in the ffmpeg-utils(1) manual,ffmpeg-utils}
568 for the required syntax.
571 If a parameter is omitted, all values are allowed.
573 Force the output to either unsigned 8-bit or signed 16-bit stereo
575 aformat=sample_fmts=u8|s16:channel_layouts=stereo
580 Apply a two-pole all-pass filter with central frequency (in Hz)
581 @var{frequency}, and filter-width @var{width}.
582 An all-pass filter changes the audio's frequency to phase relationship
583 without changing its frequency to amplitude relationship.
585 The filter accepts the following options:
592 Set method to specify band-width of filter.
605 Specify the band-width of a filter in width_type units.
610 Merge two or more audio streams into a single multi-channel stream.
612 The filter accepts the following options:
617 Set the number of inputs. Default is 2.
621 If the channel layouts of the inputs are disjoint, and therefore compatible,
622 the channel layout of the output will be set accordingly and the channels
623 will be reordered as necessary. If the channel layouts of the inputs are not
624 disjoint, the output will have all the channels of the first input then all
625 the channels of the second input, in that order, and the channel layout of
626 the output will be the default value corresponding to the total number of
629 For example, if the first input is in 2.1 (FL+FR+LF) and the second input
630 is FC+BL+BR, then the output will be in 5.1, with the channels in the
631 following order: a1, a2, b1, a3, b2, b3 (a1 is the first channel of the
632 first input, b1 is the first channel of the second input).
634 On the other hand, if both input are in stereo, the output channels will be
635 in the default order: a1, a2, b1, b2, and the channel layout will be
636 arbitrarily set to 4.0, which may or may not be the expected value.
638 All inputs must have the same sample rate, and format.
640 If inputs do not have the same duration, the output will stop with the
647 Merge two mono files into a stereo stream:
649 amovie=left.wav [l] ; amovie=right.mp3 [r] ; [l] [r] amerge
653 Multiple merges assuming 1 video stream and 6 audio streams in @file{input.mkv}:
655 ffmpeg -i input.mkv -filter_complex "[0:1][0:2][0:3][0:4][0:5][0:6] amerge=inputs=6" -c:a pcm_s16le output.mkv
661 Mixes multiple audio inputs into a single output.
663 Note that this filter only supports float samples (the @var{amerge}
664 and @var{pan} audio filters support many formats). If the @var{amix}
665 input has integer samples then @ref{aresample} will be automatically
666 inserted to perform the conversion to float samples.
670 ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex amix=inputs=3:duration=first:dropout_transition=3 OUTPUT
672 will mix 3 input audio streams to a single output with the same duration as the
673 first input and a dropout transition time of 3 seconds.
675 It accepts the following parameters:
679 The number of inputs. If unspecified, it defaults to 2.
682 How to determine the end-of-stream.
686 The duration of the longest input. (default)
689 The duration of the shortest input.
692 The duration of the first input.
696 @item dropout_transition
697 The transition time, in seconds, for volume renormalization when an input
698 stream ends. The default value is 2 seconds.
704 Pass the audio source unchanged to the output.
708 Pad the end of an audio stream with silence.
710 This can be used together with @command{ffmpeg} @option{-shortest} to
711 extend audio streams to the same length as the video stream.
713 A description of the accepted options follows.
717 Set silence packet size. Default value is 4096.
720 Set the number of samples of silence to add to the end. After the
721 value is reached, the stream is terminated. This option is mutually
722 exclusive with @option{whole_len}.
725 Set the minimum total number of samples in the output audio stream. If
726 the value is longer than the input audio length, silence is added to
727 the end, until the value is reached. This option is mutually exclusive
728 with @option{pad_len}.
731 If neither the @option{pad_len} nor the @option{whole_len} option is
732 set, the filter will add silence to the end of the input stream
739 Add 1024 samples of silence to the end of the input:
745 Make sure the audio output will contain at least 10000 samples, pad
746 the input with silence if required:
752 Use @command{ffmpeg} to pad the audio input with silence, so that the
753 video stream will always result the shortest and will be converted
754 until the end in the output file when using the @option{shortest}
757 ffmpeg -i VIDEO -i AUDIO -filter_complex "[1:0]apad" -shortest OUTPUT
762 Add a phasing effect to the input audio.
764 A phaser filter creates series of peaks and troughs in the frequency spectrum.
765 The position of the peaks and troughs are modulated so that they vary over time, creating a sweeping effect.
767 A description of the accepted parameters follows.
771 Set input gain. Default is 0.4.
774 Set output gain. Default is 0.74
777 Set delay in milliseconds. Default is 3.0.
780 Set decay. Default is 0.4.
783 Set modulation speed in Hz. Default is 0.5.
786 Set modulation type. Default is triangular.
788 It accepts the following values:
798 Resample the input audio to the specified parameters, using the
799 libswresample library. If none are specified then the filter will
800 automatically convert between its input and output.
802 This filter is also able to stretch/squeeze the audio data to make it match
803 the timestamps or to inject silence / cut out audio to make it match the
804 timestamps, do a combination of both or do neither.
806 The filter accepts the syntax
807 [@var{sample_rate}:]@var{resampler_options}, where @var{sample_rate}
808 expresses a sample rate and @var{resampler_options} is a list of
809 @var{key}=@var{value} pairs, separated by ":". See the
810 ffmpeg-resampler manual for the complete list of supported options.
816 Resample the input audio to 44100Hz:
822 Stretch/squeeze samples to the given timestamps, with a maximum of 1000
823 samples per second compensation:
829 @section asetnsamples
831 Set the number of samples per each output audio frame.
833 The last output packet may contain a different number of samples, as
834 the filter will flush all the remaining samples when the input audio
837 The filter accepts the following options:
841 @item nb_out_samples, n
842 Set the number of frames per each output audio frame. The number is
843 intended as the number of samples @emph{per each channel}.
844 Default value is 1024.
847 If set to 1, the filter will pad the last audio frame with zeroes, so
848 that the last frame will contain the same number of samples as the
849 previous ones. Default value is 1.
852 For example, to set the number of per-frame samples to 1234 and
853 disable padding for the last frame, use:
855 asetnsamples=n=1234:p=0
860 Set the sample rate without altering the PCM data.
861 This will result in a change of speed and pitch.
863 The filter accepts the following options:
867 Set the output sample rate. Default is 44100 Hz.
872 Show a line containing various information for each input audio frame.
873 The input audio is not modified.
875 The shown line contains a sequence of key/value pairs of the form
876 @var{key}:@var{value}.
878 The following values are shown in the output:
882 The (sequential) number of the input frame, starting from 0.
885 The presentation timestamp of the input frame, in time base units; the time base
886 depends on the filter input pad, and is usually 1/@var{sample_rate}.
889 The presentation timestamp of the input frame in seconds.
892 position of the frame in the input stream, -1 if this information in
893 unavailable and/or meaningless (for example in case of synthetic audio)
902 The sample rate for the audio frame.
905 The number of samples (per channel) in the frame.
908 The Adler-32 checksum (printed in hexadecimal) of the audio data. For planar
909 audio, the data is treated as if all the planes were concatenated.
911 @item plane_checksums
912 A list of Adler-32 checksums for each data plane.
917 Display time domain statistical information about the audio channels.
918 Statistics are calculated and displayed for each audio channel and,
919 where applicable, an overall figure is also given.
921 It accepts the following option:
924 Short window length in seconds, used for peak and trough RMS measurement.
925 Default is @code{0.05} (50 miliseconds). Allowed range is @code{[0.1 - 10]}.
928 A description of each shown parameter follows:
932 Mean amplitude displacement from zero.
935 Minimal sample level.
938 Maximal sample level.
942 Standard peak and RMS level measured in dBFS.
946 Peak and trough values for RMS level measured over a short window.
949 Standard ratio of peak to RMS level (note: not in dB).
952 Flatness (i.e. consecutive samples with the same value) of the signal at its peak levels
953 (i.e. either @var{Min level} or @var{Max level}).
956 Number of occasions (not the number of samples) that the signal attained either
957 @var{Min level} or @var{Max level}.
962 Forward two audio streams and control the order the buffers are forwarded.
964 The filter accepts the following options:
968 Set the expression deciding which stream should be
969 forwarded next: if the result is negative, the first stream is forwarded; if
970 the result is positive or zero, the second stream is forwarded. It can use
971 the following variables:
975 number of buffers forwarded so far on each stream
977 number of samples forwarded so far on each stream
979 current timestamp of each stream
982 The default value is @code{t1-t2}, which means to always forward the stream
983 that has a smaller timestamp.
988 Stress-test @code{amerge} by randomly sending buffers on the wrong
989 input, while avoiding too much of a desynchronization:
991 amovie=file.ogg [a] ; amovie=file.mp3 [b] ;
992 [a] [b] astreamsync=(2*random(1))-1+tanh(5*(t1-t2)) [a2] [b2] ;
998 Synchronize audio data with timestamps by squeezing/stretching it and/or
999 dropping samples/adding silence when needed.
1001 This filter is not built by default, please use @ref{aresample} to do squeezing/stretching.
1003 It accepts the following parameters:
1007 Enable stretching/squeezing the data to make it match the timestamps. Disabled
1008 by default. When disabled, time gaps are covered with silence.
1011 The minimum difference between timestamps and audio data (in seconds) to trigger
1012 adding/dropping samples. The default value is 0.1. If you get an imperfect
1013 sync with this filter, try setting this parameter to 0.
1016 The maximum compensation in samples per second. Only relevant with compensate=1.
1017 The default value is 500.
1020 Assume that the first PTS should be this value. The time base is 1 / sample
1021 rate. This allows for padding/trimming at the start of the stream. By default,
1022 no assumption is made about the first frame's expected PTS, so no padding or
1023 trimming is done. For example, this could be set to 0 to pad the beginning with
1024 silence if an audio stream starts after the video stream or to trim any samples
1025 with a negative PTS due to encoder delay.
1033 The filter accepts exactly one parameter, the audio tempo. If not
1034 specified then the filter will assume nominal 1.0 tempo. Tempo must
1035 be in the [0.5, 2.0] range.
1037 @subsection Examples
1041 Slow down audio to 80% tempo:
1047 To speed up audio to 125% tempo:
1055 Trim the input so that the output contains one continuous subpart of the input.
1057 It accepts the following parameters:
1060 Timestamp (in seconds) of the start of the section to keep. I.e. the audio
1061 sample with the timestamp @var{start} will be the first sample in the output.
1064 Specify time of the first audio sample that will be dropped, i.e. the
1065 audio sample immediately preceding the one with the timestamp @var{end} will be
1066 the last sample in the output.
1069 Same as @var{start}, except this option sets the start timestamp in samples
1073 Same as @var{end}, except this option sets the end timestamp in samples instead
1077 The maximum duration of the output in seconds.
1080 The number of the first sample that should be output.
1083 The number of the first sample that should be dropped.
1086 @option{start}, @option{end}, and @option{duration} are expressed as time
1087 duration specifications; see
1088 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}.
1090 Note that the first two sets of the start/end options and the @option{duration}
1091 option look at the frame timestamp, while the _sample options simply count the
1092 samples that pass through the filter. So start/end_pts and start/end_sample will
1093 give different results when the timestamps are wrong, inexact or do not start at
1094 zero. Also note that this filter does not modify the timestamps. If you wish
1095 to have the output timestamps start at zero, insert the asetpts filter after the
1098 If multiple start or end options are set, this filter tries to be greedy and
1099 keep all samples that match at least one of the specified constraints. To keep
1100 only the part that matches all the constraints at once, chain multiple atrim
1103 The defaults are such that all the input is kept. So it is possible to set e.g.
1104 just the end values to keep everything before the specified time.
1109 Drop everything except the second minute of input:
1111 ffmpeg -i INPUT -af atrim=60:120
1115 Keep only the first 1000 samples:
1117 ffmpeg -i INPUT -af atrim=end_sample=1000
1124 Apply a two-pole Butterworth band-pass filter with central
1125 frequency @var{frequency}, and (3dB-point) band-width width.
1126 The @var{csg} option selects a constant skirt gain (peak gain = Q)
1127 instead of the default: constant 0dB peak gain.
1128 The filter roll off at 6dB per octave (20dB per decade).
1130 The filter accepts the following options:
1134 Set the filter's central frequency. Default is @code{3000}.
1137 Constant skirt gain if set to 1. Defaults to 0.
1140 Set method to specify band-width of filter.
1153 Specify the band-width of a filter in width_type units.
1158 Apply a two-pole Butterworth band-reject filter with central
1159 frequency @var{frequency}, and (3dB-point) band-width @var{width}.
1160 The filter roll off at 6dB per octave (20dB per decade).
1162 The filter accepts the following options:
1166 Set the filter's central frequency. Default is @code{3000}.
1169 Set method to specify band-width of filter.
1182 Specify the band-width of a filter in width_type units.
1187 Boost or cut the bass (lower) frequencies of the audio using a two-pole
1188 shelving filter with a response similar to that of a standard
1189 hi-fi's tone-controls. This is also known as shelving equalisation (EQ).
1191 The filter accepts the following options:
1195 Give the gain at 0 Hz. Its useful range is about -20
1196 (for a large cut) to +20 (for a large boost).
1197 Beware of clipping when using a positive gain.
1200 Set the filter's central frequency and so can be used
1201 to extend or reduce the frequency range to be boosted or cut.
1202 The default value is @code{100} Hz.
1205 Set method to specify band-width of filter.
1218 Determine how steep is the filter's shelf transition.
1223 Apply a biquad IIR filter with the given coefficients.
1224 Where @var{b0}, @var{b1}, @var{b2} and @var{a0}, @var{a1}, @var{a2}
1225 are the numerator and denominator coefficients respectively.
1228 Bauer stereo to binaural transformation, which improves headphone listening of
1229 stereo audio records.
1231 It accepts the following parameters:
1235 Pre-defined crossfeed level.
1239 Default level (fcut=700, feed=50).
1242 Chu Moy circuit (fcut=700, feed=60).
1245 Jan Meier circuit (fcut=650, feed=95).
1250 Cut frequency (in Hz).
1259 Remap input channels to new locations.
1261 It accepts the following parameters:
1263 @item channel_layout
1264 The channel layout of the output stream.
1267 Map channels from input to output. The argument is a '|'-separated list of
1268 mappings, each in the @code{@var{in_channel}-@var{out_channel}} or
1269 @var{in_channel} form. @var{in_channel} can be either the name of the input
1270 channel (e.g. FL for front left) or its index in the input channel layout.
1271 @var{out_channel} is the name of the output channel or its index in the output
1272 channel layout. If @var{out_channel} is not given then it is implicitly an
1273 index, starting with zero and increasing by one for each mapping.
1276 If no mapping is present, the filter will implicitly map input channels to
1277 output channels, preserving indices.
1279 For example, assuming a 5.1+downmix input MOV file,
1281 ffmpeg -i in.mov -filter 'channelmap=map=DL-FL|DR-FR' out.wav
1283 will create an output WAV file tagged as stereo from the downmix channels of
1286 To fix a 5.1 WAV improperly encoded in AAC's native channel order
1288 ffmpeg -i in.wav -filter 'channelmap=1|2|0|5|3|4:channel_layout=5.1' out.wav
1291 @section channelsplit
1293 Split each channel from an input audio stream into a separate output stream.
1295 It accepts the following parameters:
1297 @item channel_layout
1298 The channel layout of the input stream. The default is "stereo".
1301 For example, assuming a stereo input MP3 file,
1303 ffmpeg -i in.mp3 -filter_complex channelsplit out.mkv
1305 will create an output Matroska file with two audio streams, one containing only
1306 the left channel and the other the right channel.
1308 Split a 5.1 WAV file into per-channel files:
1310 ffmpeg -i in.wav -filter_complex
1311 'channelsplit=channel_layout=5.1[FL][FR][FC][LFE][SL][SR]'
1312 -map '[FL]' front_left.wav -map '[FR]' front_right.wav -map '[FC]'
1313 front_center.wav -map '[LFE]' lfe.wav -map '[SL]' side_left.wav -map '[SR]'
1318 Compress or expand the audio's dynamic range.
1320 It accepts the following parameters:
1326 A list of times in seconds for each channel over which the instantaneous level
1327 of the input signal is averaged to determine its volume. @var{attacks} refers to
1328 increase of volume and @var{decays} refers to decrease of volume. For most
1329 situations, the attack time (response to the audio getting louder) should be
1330 shorter than the decay time, because the human ear is more sensitive to sudden
1331 loud audio than sudden soft audio. A typical value for attack is 0.3 seconds and
1332 a typical value for decay is 0.8 seconds.
1335 A list of points for the transfer function, specified in dB relative to the
1336 maximum possible signal amplitude. Each key points list must be defined using
1337 the following syntax: @code{x0/y0|x1/y1|x2/y2|....} or
1338 @code{x0/y0 x1/y1 x2/y2 ....}
1340 The input values must be in strictly increasing order but the transfer function
1341 does not have to be monotonically rising. The point @code{0/0} is assumed but
1342 may be overridden (by @code{0/out-dBn}). Typical values for the transfer
1343 function are @code{-70/-70|-60/-20}.
1346 Set the curve radius in dB for all joints. It defaults to 0.01.
1349 Set the additional gain in dB to be applied at all points on the transfer
1350 function. This allows for easy adjustment of the overall gain.
1354 Set an initial volume, in dB, to be assumed for each channel when filtering
1355 starts. This permits the user to supply a nominal level initially, so that, for
1356 example, a very large gain is not applied to initial signal levels before the
1357 companding has begun to operate. A typical value for audio which is initially
1358 quiet is -90 dB. It defaults to 0.
1361 Set a delay, in seconds. The input audio is analyzed immediately, but audio is
1362 delayed before being fed to the volume adjuster. Specifying a delay
1363 approximately equal to the attack/decay times allows the filter to effectively
1364 operate in predictive rather than reactive mode. It defaults to 0.
1368 @subsection Examples
1372 Make music with both quiet and loud passages suitable for listening to in a
1375 compand=.3|.3:1|1:-90/-60|-60/-40|-40/-30|-20/-20:6:0:-90:0.2
1379 A noise gate for when the noise is at a lower level than the signal:
1381 compand=.1|.1:.2|.2:-900/-900|-50.1/-900|-50/-50:.01:0:-90:.1
1385 Here is another noise gate, this time for when the noise is at a higher level
1386 than the signal (making it, in some ways, similar to squelch):
1388 compand=.1|.1:.1|.1:-45.1/-45.1|-45/-900|0/-900:.01:45:-90:.1
1394 Make audio easier to listen to on headphones.
1396 This filter adds `cues' to 44.1kHz stereo (i.e. audio CD format) audio
1397 so that when listened to on headphones the stereo image is moved from
1398 inside your head (standard for headphones) to outside and in front of
1399 the listener (standard for speakers).
1405 Apply a two-pole peaking equalisation (EQ) filter. With this
1406 filter, the signal-level at and around a selected frequency can
1407 be increased or decreased, whilst (unlike bandpass and bandreject
1408 filters) that at all other frequencies is unchanged.
1410 In order to produce complex equalisation curves, this filter can
1411 be given several times, each with a different central frequency.
1413 The filter accepts the following options:
1417 Set the filter's central frequency in Hz.
1420 Set method to specify band-width of filter.
1433 Specify the band-width of a filter in width_type units.
1436 Set the required gain or attenuation in dB.
1437 Beware of clipping when using a positive gain.
1440 @subsection Examples
1443 Attenuate 10 dB at 1000 Hz, with a bandwidth of 200 Hz:
1445 equalizer=f=1000:width_type=h:width=200:g=-10
1449 Apply 2 dB gain at 1000 Hz with Q 1 and attenuate 5 dB at 100 Hz with Q 2:
1451 equalizer=f=1000:width_type=q:width=1:g=2,equalizer=f=100:width_type=q:width=2:g=-5
1456 Apply a flanging effect to the audio.
1458 The filter accepts the following options:
1462 Set base delay in milliseconds. Range from 0 to 30. Default value is 0.
1465 Set added swep delay in milliseconds. Range from 0 to 10. Default value is 2.
1468 Set percentage regeneneration (delayed signal feedback). Range from -95 to 95.
1472 Set percentage of delayed signal mixed with original. Range from 0 to 100.
1476 Set sweeps per second (Hz). Range from 0.1 to 10. Default value is 0.5.
1479 Set swept wave shape, can be @var{triangular} or @var{sinusoidal}.
1480 Default value is @var{sinusoidal}.
1483 Set swept wave percentage-shift for multi channel. Range from 0 to 100.
1484 Default value is 25.
1487 Set delay-line interpolation, @var{linear} or @var{quadratic}.
1488 Default is @var{linear}.
1493 Apply a high-pass filter with 3dB point frequency.
1494 The filter can be either single-pole, or double-pole (the default).
1495 The filter roll off at 6dB per pole per octave (20dB per pole per decade).
1497 The filter accepts the following options:
1501 Set frequency in Hz. Default is 3000.
1504 Set number of poles. Default is 2.
1507 Set method to specify band-width of filter.
1520 Specify the band-width of a filter in width_type units.
1521 Applies only to double-pole filter.
1522 The default is 0.707q and gives a Butterworth response.
1527 Join multiple input streams into one multi-channel stream.
1529 It accepts the following parameters:
1533 The number of input streams. It defaults to 2.
1535 @item channel_layout
1536 The desired output channel layout. It defaults to stereo.
1539 Map channels from inputs to output. The argument is a '|'-separated list of
1540 mappings, each in the @code{@var{input_idx}.@var{in_channel}-@var{out_channel}}
1541 form. @var{input_idx} is the 0-based index of the input stream. @var{in_channel}
1542 can be either the name of the input channel (e.g. FL for front left) or its
1543 index in the specified input stream. @var{out_channel} is the name of the output
1547 The filter will attempt to guess the mappings when they are not specified
1548 explicitly. It does so by first trying to find an unused matching input channel
1549 and if that fails it picks the first unused input channel.
1551 Join 3 inputs (with properly set channel layouts):
1553 ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex join=inputs=3 OUTPUT
1556 Build a 5.1 output from 6 single-channel streams:
1558 ffmpeg -i fl -i fr -i fc -i sl -i sr -i lfe -filter_complex
1559 'join=inputs=6:channel_layout=5.1:map=0.0-FL|1.0-FR|2.0-FC|3.0-SL|4.0-SR|5.0-LFE'
1565 Load a LADSPA (Linux Audio Developer's Simple Plugin API) plugin.
1567 To enable compilation of this filter you need to configure FFmpeg with
1568 @code{--enable-ladspa}.
1572 Specifies the name of LADSPA plugin library to load. If the environment
1573 variable @env{LADSPA_PATH} is defined, the LADSPA plugin is searched in
1574 each one of the directories specified by the colon separated list in
1575 @env{LADSPA_PATH}, otherwise in the standard LADSPA paths, which are in
1576 this order: @file{HOME/.ladspa/lib/}, @file{/usr/local/lib/ladspa/},
1577 @file{/usr/lib/ladspa/}.
1580 Specifies the plugin within the library. Some libraries contain only
1581 one plugin, but others contain many of them. If this is not set filter
1582 will list all available plugins within the specified library.
1585 Set the '|' separated list of controls which are zero or more floating point
1586 values that determine the behavior of the loaded plugin (for example delay,
1588 Controls need to be defined using the following syntax:
1589 c0=@var{value0}|c1=@var{value1}|c2=@var{value2}|..., where
1590 @var{valuei} is the value set on the @var{i}-th control.
1591 If @option{controls} is set to @code{help}, all available controls and
1592 their valid ranges are printed.
1594 @item sample_rate, s
1595 Specify the sample rate, default to 44100. Only used if plugin have
1599 Set the number of samples per channel per each output frame, default
1600 is 1024. Only used if plugin have zero inputs.
1603 Set the minimum duration of the sourced audio. See
1604 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
1605 for the accepted syntax.
1606 Note that the resulting duration may be greater than the specified duration,
1607 as the generated audio is always cut at the end of a complete frame.
1608 If not specified, or the expressed duration is negative, the audio is
1609 supposed to be generated forever.
1610 Only used if plugin have zero inputs.
1614 @subsection Examples
1618 List all available plugins within amp (LADSPA example plugin) library:
1624 List all available controls and their valid ranges for @code{vcf_notch}
1625 plugin from @code{VCF} library:
1627 ladspa=f=vcf:p=vcf_notch:c=help
1631 Simulate low quality audio equipment using @code{Computer Music Toolkit} (CMT)
1634 ladspa=file=cmt:plugin=lofi:controls=c0=22|c1=12|c2=12
1638 Add reverberation to the audio using TAP-plugins
1639 (Tom's Audio Processing plugins):
1641 ladspa=file=tap_reverb:tap_reverb
1645 Generate white noise, with 0.2 amplitude:
1647 ladspa=file=cmt:noise_source_white:c=c0=.2
1651 Generate 20 bpm clicks using plugin @code{C* Click - Metronome} from the
1652 @code{C* Audio Plugin Suite} (CAPS) library:
1654 ladspa=file=caps:Click:c=c1=20'
1658 Apply @code{C* Eq10X2 - Stereo 10-band equaliser} effect:
1660 ladspa=caps:Eq10X2:c=c0=-48|c9=-24|c3=12|c4=2
1664 @subsection Commands
1666 This filter supports the following commands:
1669 Modify the @var{N}-th control value.
1671 If the specified value is not valid, it is ignored and prior one is kept.
1676 Apply a low-pass filter with 3dB point frequency.
1677 The filter can be either single-pole or double-pole (the default).
1678 The filter roll off at 6dB per pole per octave (20dB per pole per decade).
1680 The filter accepts the following options:
1684 Set frequency in Hz. Default is 500.
1687 Set number of poles. Default is 2.
1690 Set method to specify band-width of filter.
1703 Specify the band-width of a filter in width_type units.
1704 Applies only to double-pole filter.
1705 The default is 0.707q and gives a Butterworth response.
1710 Mix channels with specific gain levels. The filter accepts the output
1711 channel layout followed by a set of channels definitions.
1713 This filter is also designed to remap efficiently the channels of an audio
1716 The filter accepts parameters of the form:
1717 "@var{l}:@var{outdef}:@var{outdef}:..."
1721 output channel layout or number of channels
1724 output channel specification, of the form:
1725 "@var{out_name}=[@var{gain}*]@var{in_name}[+[@var{gain}*]@var{in_name}...]"
1728 output channel to define, either a channel name (FL, FR, etc.) or a channel
1729 number (c0, c1, etc.)
1732 multiplicative coefficient for the channel, 1 leaving the volume unchanged
1735 input channel to use, see out_name for details; it is not possible to mix
1736 named and numbered input channels
1739 If the `=' in a channel specification is replaced by `<', then the gains for
1740 that specification will be renormalized so that the total is 1, thus
1741 avoiding clipping noise.
1743 @subsection Mixing examples
1745 For example, if you want to down-mix from stereo to mono, but with a bigger
1746 factor for the left channel:
1748 pan=1:c0=0.9*c0+0.1*c1
1751 A customized down-mix to stereo that works automatically for 3-, 4-, 5- and
1752 7-channels surround:
1754 pan=stereo: FL < FL + 0.5*FC + 0.6*BL + 0.6*SL : FR < FR + 0.5*FC + 0.6*BR + 0.6*SR
1757 Note that @command{ffmpeg} integrates a default down-mix (and up-mix) system
1758 that should be preferred (see "-ac" option) unless you have very specific
1761 @subsection Remapping examples
1763 The channel remapping will be effective if, and only if:
1766 @item gain coefficients are zeroes or ones,
1767 @item only one input per channel output,
1770 If all these conditions are satisfied, the filter will notify the user ("Pure
1771 channel mapping detected"), and use an optimized and lossless method to do the
1774 For example, if you have a 5.1 source and want a stereo audio stream by
1775 dropping the extra channels:
1777 pan="stereo: c0=FL : c1=FR"
1780 Given the same source, you can also switch front left and front right channels
1781 and keep the input channel layout:
1783 pan="5.1: c0=c1 : c1=c0 : c2=c2 : c3=c3 : c4=c4 : c5=c5"
1786 If the input is a stereo audio stream, you can mute the front left channel (and
1787 still keep the stereo channel layout) with:
1792 Still with a stereo audio stream input, you can copy the right channel in both
1793 front left and right:
1795 pan="stereo: c0=FR : c1=FR"
1800 ReplayGain scanner filter. This filter takes an audio stream as an input and
1801 outputs it unchanged.
1802 At end of filtering it displays @code{track_gain} and @code{track_peak}.
1806 Convert the audio sample format, sample rate and channel layout. It is
1807 not meant to be used directly.
1809 @section silencedetect
1811 Detect silence in an audio stream.
1813 This filter logs a message when it detects that the input audio volume is less
1814 or equal to a noise tolerance value for a duration greater or equal to the
1815 minimum detected noise duration.
1817 The printed times and duration are expressed in seconds.
1819 The filter accepts the following options:
1823 Set silence duration until notification (default is 2 seconds).
1826 Set noise tolerance. Can be specified in dB (in case "dB" is appended to the
1827 specified value) or amplitude ratio. Default is -60dB, or 0.001.
1830 @subsection Examples
1834 Detect 5 seconds of silence with -50dB noise tolerance:
1836 silencedetect=n=-50dB:d=5
1840 Complete example with @command{ffmpeg} to detect silence with 0.0001 noise
1841 tolerance in @file{silence.mp3}:
1843 ffmpeg -i silence.mp3 -af silencedetect=noise=0.0001 -f null -
1847 @section silenceremove
1849 Remove silence from the beginning, middle or end of the audio.
1851 The filter accepts the following options:
1855 This value is used to indicate if audio should be trimmed at beginning of
1856 the audio. A value of zero indicates no silence should be trimmed from the
1857 beginning. When specifying a non-zero value, it trims audio up until it
1858 finds non-silence. Normally, when trimming silence from beginning of audio
1859 the @var{start_periods} will be @code{1} but it can be increased to higher
1860 values to trim all audio up to specific count of non-silence periods.
1861 Default value is @code{0}.
1863 @item start_duration
1864 Specify the amount of time that non-silence must be detected before it stops
1865 trimming audio. By increasing the duration, bursts of noises can be treated
1866 as silence and trimmed off. Default value is @code{0}.
1868 @item start_threshold
1869 This indicates what sample value should be treated as silence. For digital
1870 audio, a value of @code{0} may be fine but for audio recorded from analog,
1871 you may wish to increase the value to account for background noise.
1872 Can be specified in dB (in case "dB" is appended to the specified value)
1873 or amplitude ratio. Default value is @code{0}.
1876 Set the count for trimming silence from the end of audio.
1877 To remove silence from the middle of a file, specify a @var{stop_periods}
1878 that is negative. This value is then threated as a positive value and is
1879 used to indicate the effect should restart processing as specified by
1880 @var{start_periods}, making it suitable for removing periods of silence
1881 in the middle of the audio.
1882 Default value is @code{0}.
1885 Specify a duration of silence that must exist before audio is not copied any
1886 more. By specifying a higher duration, silence that is wanted can be left in
1888 Default value is @code{0}.
1890 @item stop_threshold
1891 This is the same as @option{start_threshold} but for trimming silence from
1893 Can be specified in dB (in case "dB" is appended to the specified value)
1894 or amplitude ratio. Default value is @code{0}.
1897 This indicate that @var{stop_duration} length of audio should be left intact
1898 at the beginning of each period of silence.
1899 For example, if you want to remove long pauses between words but do not want
1900 to remove the pauses completely. Default value is @code{0}.
1904 @subsection Examples
1908 The following example shows how this filter can be used to start a recording
1909 that does not contain the delay at the start which usually occurs between
1910 pressing the record button and the start of the performance:
1912 silenceremove=1:5:0.02
1918 Boost or cut treble (upper) frequencies of the audio using a two-pole
1919 shelving filter with a response similar to that of a standard
1920 hi-fi's tone-controls. This is also known as shelving equalisation (EQ).
1922 The filter accepts the following options:
1926 Give the gain at whichever is the lower of ~22 kHz and the
1927 Nyquist frequency. Its useful range is about -20 (for a large cut)
1928 to +20 (for a large boost). Beware of clipping when using a positive gain.
1931 Set the filter's central frequency and so can be used
1932 to extend or reduce the frequency range to be boosted or cut.
1933 The default value is @code{3000} Hz.
1936 Set method to specify band-width of filter.
1949 Determine how steep is the filter's shelf transition.
1954 Adjust the input audio volume.
1956 It accepts the following parameters:
1960 Set audio volume expression.
1962 Output values are clipped to the maximum value.
1964 The output audio volume is given by the relation:
1966 @var{output_volume} = @var{volume} * @var{input_volume}
1969 The default value for @var{volume} is "1.0".
1972 This parameter represents the mathematical precision.
1974 It determines which input sample formats will be allowed, which affects the
1975 precision of the volume scaling.
1979 8-bit fixed-point; this limits input sample format to U8, S16, and S32.
1981 32-bit floating-point; this limits input sample format to FLT. (default)
1983 64-bit floating-point; this limits input sample format to DBL.
1987 Choose the behaviour on encountering ReplayGain side data in input frames.
1991 Remove ReplayGain side data, ignoring its contents (the default).
1994 Ignore ReplayGain side data, but leave it in the frame.
1997 Prefer the track gain, if present.
2000 Prefer the album gain, if present.
2003 @item replaygain_preamp
2004 Pre-amplification gain in dB to apply to the selected replaygain gain.
2006 Default value for @var{replaygain_preamp} is 0.0.
2009 Set when the volume expression is evaluated.
2011 It accepts the following values:
2014 only evaluate expression once during the filter initialization, or
2015 when the @samp{volume} command is sent
2018 evaluate expression for each incoming frame
2021 Default value is @samp{once}.
2024 The volume expression can contain the following parameters.
2028 frame number (starting at zero)
2031 @item nb_consumed_samples
2032 number of samples consumed by the filter
2034 number of samples in the current frame
2036 original frame position in the file
2042 PTS at start of stream
2044 time at start of stream
2050 last set volume value
2053 Note that when @option{eval} is set to @samp{once} only the
2054 @var{sample_rate} and @var{tb} variables are available, all other
2055 variables will evaluate to NAN.
2057 @subsection Commands
2059 This filter supports the following commands:
2062 Modify the volume expression.
2063 The command accepts the same syntax of the corresponding option.
2065 If the specified expression is not valid, it is kept at its current
2067 @item replaygain_noclip
2068 Prevent clipping by limiting the gain applied.
2070 Default value for @var{replaygain_noclip} is 1.
2074 @subsection Examples
2078 Halve the input audio volume:
2082 volume=volume=-6.0206dB
2085 In all the above example the named key for @option{volume} can be
2086 omitted, for example like in:
2092 Increase input audio power by 6 decibels using fixed-point precision:
2094 volume=volume=6dB:precision=fixed
2098 Fade volume after time 10 with an annihilation period of 5 seconds:
2100 volume='if(lt(t,10),1,max(1-(t-10)/5,0))':eval=frame
2104 @section volumedetect
2106 Detect the volume of the input video.
2108 The filter has no parameters. The input is not modified. Statistics about
2109 the volume will be printed in the log when the input stream end is reached.
2111 In particular it will show the mean volume (root mean square), maximum
2112 volume (on a per-sample basis), and the beginning of a histogram of the
2113 registered volume values (from the maximum value to a cumulated 1/1000 of
2116 All volumes are in decibels relative to the maximum PCM value.
2118 @subsection Examples
2120 Here is an excerpt of the output:
2122 [Parsed_volumedetect_0 @ 0xa23120] mean_volume: -27 dB
2123 [Parsed_volumedetect_0 @ 0xa23120] max_volume: -4 dB
2124 [Parsed_volumedetect_0 @ 0xa23120] histogram_4db: 6
2125 [Parsed_volumedetect_0 @ 0xa23120] histogram_5db: 62
2126 [Parsed_volumedetect_0 @ 0xa23120] histogram_6db: 286
2127 [Parsed_volumedetect_0 @ 0xa23120] histogram_7db: 1042
2128 [Parsed_volumedetect_0 @ 0xa23120] histogram_8db: 2551
2129 [Parsed_volumedetect_0 @ 0xa23120] histogram_9db: 4609
2130 [Parsed_volumedetect_0 @ 0xa23120] histogram_10db: 8409
2136 The mean square energy is approximately -27 dB, or 10^-2.7.
2138 The largest sample is at -4 dB, or more precisely between -4 dB and -5 dB.
2140 There are 6 samples at -4 dB, 62 at -5 dB, 286 at -6 dB, etc.
2143 In other words, raising the volume by +4 dB does not cause any clipping,
2144 raising it by +5 dB causes clipping for 6 samples, etc.
2146 @c man end AUDIO FILTERS
2148 @chapter Audio Sources
2149 @c man begin AUDIO SOURCES
2151 Below is a description of the currently available audio sources.
2155 Buffer audio frames, and make them available to the filter chain.
2157 This source is mainly intended for a programmatic use, in particular
2158 through the interface defined in @file{libavfilter/asrc_abuffer.h}.
2160 It accepts the following parameters:
2164 The timebase which will be used for timestamps of submitted frames. It must be
2165 either a floating-point number or in @var{numerator}/@var{denominator} form.
2168 The sample rate of the incoming audio buffers.
2171 The sample format of the incoming audio buffers.
2172 Either a sample format name or its corresponging integer representation from
2173 the enum AVSampleFormat in @file{libavutil/samplefmt.h}
2175 @item channel_layout
2176 The channel layout of the incoming audio buffers.
2177 Either a channel layout name from channel_layout_map in
2178 @file{libavutil/channel_layout.c} or its corresponding integer representation
2179 from the AV_CH_LAYOUT_* macros in @file{libavutil/channel_layout.h}
2182 The number of channels of the incoming audio buffers.
2183 If both @var{channels} and @var{channel_layout} are specified, then they
2188 @subsection Examples
2191 abuffer=sample_rate=44100:sample_fmt=s16p:channel_layout=stereo
2194 will instruct the source to accept planar 16bit signed stereo at 44100Hz.
2195 Since the sample format with name "s16p" corresponds to the number
2196 6 and the "stereo" channel layout corresponds to the value 0x3, this is
2199 abuffer=sample_rate=44100:sample_fmt=6:channel_layout=0x3
2204 Generate an audio signal specified by an expression.
2206 This source accepts in input one or more expressions (one for each
2207 channel), which are evaluated and used to generate a corresponding
2210 This source accepts the following options:
2214 Set the '|'-separated expressions list for each separate channel. In case the
2215 @option{channel_layout} option is not specified, the selected channel layout
2216 depends on the number of provided expressions. Otherwise the last
2217 specified expression is applied to the remaining output channels.
2219 @item channel_layout, c
2220 Set the channel layout. The number of channels in the specified layout
2221 must be equal to the number of specified expressions.
2224 Set the minimum duration of the sourced audio. See
2225 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
2226 for the accepted syntax.
2227 Note that the resulting duration may be greater than the specified
2228 duration, as the generated audio is always cut at the end of a
2231 If not specified, or the expressed duration is negative, the audio is
2232 supposed to be generated forever.
2235 Set the number of samples per channel per each output frame,
2238 @item sample_rate, s
2239 Specify the sample rate, default to 44100.
2242 Each expression in @var{exprs} can contain the following constants:
2246 number of the evaluated sample, starting from 0
2249 time of the evaluated sample expressed in seconds, starting from 0
2256 @subsection Examples
2266 Generate a sin signal with frequency of 440 Hz, set sample rate to
2269 aevalsrc="sin(440*2*PI*t):s=8000"
2273 Generate a two channels signal, specify the channel layout (Front
2274 Center + Back Center) explicitly:
2276 aevalsrc="sin(420*2*PI*t)|cos(430*2*PI*t):c=FC|BC"
2280 Generate white noise:
2282 aevalsrc="-2+random(0)"
2286 Generate an amplitude modulated signal:
2288 aevalsrc="sin(10*2*PI*t)*sin(880*2*PI*t)"
2292 Generate 2.5 Hz binaural beats on a 360 Hz carrier:
2294 aevalsrc="0.1*sin(2*PI*(360-2.5/2)*t) | 0.1*sin(2*PI*(360+2.5/2)*t)"
2301 The null audio source, return unprocessed audio frames. It is mainly useful
2302 as a template and to be employed in analysis / debugging tools, or as
2303 the source for filters which ignore the input data (for example the sox
2306 This source accepts the following options:
2310 @item channel_layout, cl
2312 Specifies the channel layout, and can be either an integer or a string
2313 representing a channel layout. The default value of @var{channel_layout}
2316 Check the channel_layout_map definition in
2317 @file{libavutil/channel_layout.c} for the mapping between strings and
2318 channel layout values.
2320 @item sample_rate, r
2321 Specifies the sample rate, and defaults to 44100.
2324 Set the number of samples per requested frames.
2328 @subsection Examples
2332 Set the sample rate to 48000 Hz and the channel layout to AV_CH_LAYOUT_MONO.
2334 anullsrc=r=48000:cl=4
2338 Do the same operation with a more obvious syntax:
2340 anullsrc=r=48000:cl=mono
2344 All the parameters need to be explicitly defined.
2348 Synthesize a voice utterance using the libflite library.
2350 To enable compilation of this filter you need to configure FFmpeg with
2351 @code{--enable-libflite}.
2353 Note that the flite library is not thread-safe.
2355 The filter accepts the following options:
2360 If set to 1, list the names of the available voices and exit
2361 immediately. Default value is 0.
2364 Set the maximum number of samples per frame. Default value is 512.
2367 Set the filename containing the text to speak.
2370 Set the text to speak.
2373 Set the voice to use for the speech synthesis. Default value is
2374 @code{kal}. See also the @var{list_voices} option.
2377 @subsection Examples
2381 Read from file @file{speech.txt}, and synthetize the text using the
2382 standard flite voice:
2384 flite=textfile=speech.txt
2388 Read the specified text selecting the @code{slt} voice:
2390 flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
2394 Input text to ffmpeg:
2396 ffmpeg -f lavfi -i flite=text='So fare thee well, poor devil of a Sub-Sub, whose commentator I am':voice=slt
2400 Make @file{ffplay} speak the specified text, using @code{flite} and
2401 the @code{lavfi} device:
2403 ffplay -f lavfi flite=text='No more be grieved for which that thou hast done.'
2407 For more information about libflite, check:
2408 @url{http://www.speech.cs.cmu.edu/flite/}
2412 Generate an audio signal made of a sine wave with amplitude 1/8.
2414 The audio signal is bit-exact.
2416 The filter accepts the following options:
2421 Set the carrier frequency. Default is 440 Hz.
2423 @item beep_factor, b
2424 Enable a periodic beep every second with frequency @var{beep_factor} times
2425 the carrier frequency. Default is 0, meaning the beep is disabled.
2427 @item sample_rate, r
2428 Specify the sample rate, default is 44100.
2431 Specify the duration of the generated audio stream.
2433 @item samples_per_frame
2434 Set the number of samples per output frame, default is 1024.
2437 @subsection Examples
2442 Generate a simple 440 Hz sine wave:
2448 Generate a 220 Hz sine wave with a 880 Hz beep each second, for 5 seconds:
2452 sine=frequency=220:beep_factor=4:duration=5
2457 @c man end AUDIO SOURCES
2459 @chapter Audio Sinks
2460 @c man begin AUDIO SINKS
2462 Below is a description of the currently available audio sinks.
2464 @section abuffersink
2466 Buffer audio frames, and make them available to the end of filter chain.
2468 This sink is mainly intended for programmatic use, in particular
2469 through the interface defined in @file{libavfilter/buffersink.h}
2470 or the options system.
2472 It accepts a pointer to an AVABufferSinkContext structure, which
2473 defines the incoming buffers' formats, to be passed as the opaque
2474 parameter to @code{avfilter_init_filter} for initialization.
2477 Null audio sink; do absolutely nothing with the input audio. It is
2478 mainly useful as a template and for use in analysis / debugging
2481 @c man end AUDIO SINKS
2483 @chapter Video Filters
2484 @c man begin VIDEO FILTERS
2486 When you configure your FFmpeg build, you can disable any of the
2487 existing filters using @code{--disable-filters}.
2488 The configure output will show the video filters included in your
2491 Below is a description of the currently available video filters.
2493 @section alphaextract
2495 Extract the alpha component from the input as a grayscale video. This
2496 is especially useful with the @var{alphamerge} filter.
2500 Add or replace the alpha component of the primary input with the
2501 grayscale value of a second input. This is intended for use with
2502 @var{alphaextract} to allow the transmission or storage of frame
2503 sequences that have alpha in a format that doesn't support an alpha
2506 For example, to reconstruct full frames from a normal YUV-encoded video
2507 and a separate video created with @var{alphaextract}, you might use:
2509 movie=in_alpha.mkv [alpha]; [in][alpha] alphamerge [out]
2512 Since this filter is designed for reconstruction, it operates on frame
2513 sequences without considering timestamps, and terminates when either
2514 input reaches end of stream. This will cause problems if your encoding
2515 pipeline drops frames. If you're trying to apply an image as an
2516 overlay to a video stream, consider the @var{overlay} filter instead.
2520 Same as the @ref{subtitles} filter, except that it doesn't require libavcodec
2521 and libavformat to work. On the other hand, it is limited to ASS (Advanced
2522 Substation Alpha) subtitles files.
2524 This filter accepts the following option in addition to the common options from
2525 the @ref{subtitles} filter:
2529 Set the shaping engine
2531 Available values are:
2534 The default libass shaping engine, which is the best available.
2536 Fast, font-agnostic shaper that can do only substitutions
2538 Slower shaper using OpenType for substitutions and positioning
2541 The default is @code{auto}.
2546 Compute the bounding box for the non-black pixels in the input frame
2549 This filter computes the bounding box containing all the pixels with a
2550 luminance value greater than the minimum allowed value.
2551 The parameters describing the bounding box are printed on the filter
2554 The filter accepts the following option:
2558 Set the minimal luminance value. Default is @code{16}.
2561 @section blackdetect
2563 Detect video intervals that are (almost) completely black. Can be
2564 useful to detect chapter transitions, commercials, or invalid
2565 recordings. Output lines contains the time for the start, end and
2566 duration of the detected black interval expressed in seconds.
2568 In order to display the output lines, you need to set the loglevel at
2569 least to the AV_LOG_INFO value.
2571 The filter accepts the following options:
2574 @item black_min_duration, d
2575 Set the minimum detected black duration expressed in seconds. It must
2576 be a non-negative floating point number.
2578 Default value is 2.0.
2580 @item picture_black_ratio_th, pic_th
2581 Set the threshold for considering a picture "black".
2582 Express the minimum value for the ratio:
2584 @var{nb_black_pixels} / @var{nb_pixels}
2587 for which a picture is considered black.
2588 Default value is 0.98.
2590 @item pixel_black_th, pix_th
2591 Set the threshold for considering a pixel "black".
2593 The threshold expresses the maximum pixel luminance value for which a
2594 pixel is considered "black". The provided value is scaled according to
2595 the following equation:
2597 @var{absolute_threshold} = @var{luminance_minimum_value} + @var{pixel_black_th} * @var{luminance_range_size}
2600 @var{luminance_range_size} and @var{luminance_minimum_value} depend on
2601 the input video format, the range is [0-255] for YUV full-range
2602 formats and [16-235] for YUV non full-range formats.
2604 Default value is 0.10.
2607 The following example sets the maximum pixel threshold to the minimum
2608 value, and detects only black intervals of 2 or more seconds:
2610 blackdetect=d=2:pix_th=0.00
2615 Detect frames that are (almost) completely black. Can be useful to
2616 detect chapter transitions or commercials. Output lines consist of
2617 the frame number of the detected frame, the percentage of blackness,
2618 the position in the file if known or -1 and the timestamp in seconds.
2620 In order to display the output lines, you need to set the loglevel at
2621 least to the AV_LOG_INFO value.
2623 It accepts the following parameters:
2628 The percentage of the pixels that have to be below the threshold; it defaults to
2631 @item threshold, thresh
2632 The threshold below which a pixel value is considered black; it defaults to
2639 Blend two video frames into each other.
2641 It takes two input streams and outputs one stream, the first input is the
2642 "top" layer and second input is "bottom" layer.
2643 Output terminates when shortest input terminates.
2645 A description of the accepted options follows.
2653 Set blend mode for specific pixel component or all pixel components in case
2654 of @var{all_mode}. Default value is @code{normal}.
2656 Available values for component modes are:
2689 Set blend opacity for specific pixel component or all pixel components in case
2690 of @var{all_opacity}. Only used in combination with pixel component blend modes.
2697 Set blend expression for specific pixel component or all pixel components in case
2698 of @var{all_expr}. Note that related mode options will be ignored if those are set.
2700 The expressions can use the following variables:
2704 The sequential number of the filtered frame, starting from @code{0}.
2708 the coordinates of the current sample
2712 the width and height of currently filtered plane
2716 Width and height scale depending on the currently filtered plane. It is the
2717 ratio between the corresponding luma plane number of pixels and the current
2718 plane ones. E.g. for YUV4:2:0 the values are @code{1,1} for the luma plane, and
2719 @code{0.5,0.5} for chroma planes.
2722 Time of the current frame, expressed in seconds.
2725 Value of pixel component at current location for first video frame (top layer).
2728 Value of pixel component at current location for second video frame (bottom layer).
2732 Force termination when the shortest input terminates. Default is @code{0}.
2734 Continue applying the last bottom frame after the end of the stream. A value of
2735 @code{0} disable the filter after the last frame of the bottom layer is reached.
2736 Default is @code{1}.
2739 @subsection Examples
2743 Apply transition from bottom layer to top layer in first 10 seconds:
2745 blend=all_expr='A*(if(gte(T,10),1,T/10))+B*(1-(if(gte(T,10),1,T/10)))'
2749 Apply 1x1 checkerboard effect:
2751 blend=all_expr='if(eq(mod(X,2),mod(Y,2)),A,B)'
2755 Apply uncover left effect:
2757 blend=all_expr='if(gte(N*SW+X,W),A,B)'
2761 Apply uncover down effect:
2763 blend=all_expr='if(gte(Y-N*SH,0),A,B)'
2767 Apply uncover up-left effect:
2769 blend=all_expr='if(gte(T*SH*40+Y,H)*gte((T*40*SW+X)*W/H,W),A,B)'
2775 Apply a boxblur algorithm to the input video.
2777 It accepts the following parameters:
2781 @item luma_radius, lr
2782 @item luma_power, lp
2783 @item chroma_radius, cr
2784 @item chroma_power, cp
2785 @item alpha_radius, ar
2786 @item alpha_power, ap
2790 A description of the accepted options follows.
2793 @item luma_radius, lr
2794 @item chroma_radius, cr
2795 @item alpha_radius, ar
2796 Set an expression for the box radius in pixels used for blurring the
2797 corresponding input plane.
2799 The radius value must be a non-negative number, and must not be
2800 greater than the value of the expression @code{min(w,h)/2} for the
2801 luma and alpha planes, and of @code{min(cw,ch)/2} for the chroma
2804 Default value for @option{luma_radius} is "2". If not specified,
2805 @option{chroma_radius} and @option{alpha_radius} default to the
2806 corresponding value set for @option{luma_radius}.
2808 The expressions can contain the following constants:
2812 The input width and height in pixels.
2816 The input chroma image width and height in pixels.
2820 The horizontal and vertical chroma subsample values. For example, for the
2821 pixel format "yuv422p", @var{hsub} is 2 and @var{vsub} is 1.
2824 @item luma_power, lp
2825 @item chroma_power, cp
2826 @item alpha_power, ap
2827 Specify how many times the boxblur filter is applied to the
2828 corresponding plane.
2830 Default value for @option{luma_power} is 2. If not specified,
2831 @option{chroma_power} and @option{alpha_power} default to the
2832 corresponding value set for @option{luma_power}.
2834 A value of 0 will disable the effect.
2837 @subsection Examples
2841 Apply a boxblur filter with the luma, chroma, and alpha radii
2844 boxblur=luma_radius=2:luma_power=1
2849 Set the luma radius to 2, and alpha and chroma radius to 0:
2851 boxblur=2:1:cr=0:ar=0
2855 Set the luma and chroma radii to a fraction of the video dimension:
2857 boxblur=luma_radius=min(h\,w)/10:luma_power=1:chroma_radius=min(cw\,ch)/10:chroma_power=1
2863 Visualize information exported by some codecs.
2865 Some codecs can export information through frames using side-data or other
2866 means. For example, some MPEG based codecs export motion vectors through the
2867 @var{export_mvs} flag in the codec @option{flags2} option.
2869 The filter accepts the following option:
2873 Set motion vectors to visualize.
2875 Available flags for @var{mv} are:
2879 forward predicted MVs of P-frames
2881 forward predicted MVs of B-frames
2883 backward predicted MVs of B-frames
2887 @subsection Examples
2891 Visualizes multi-directionals MVs from P and B-Frames using @command{ffplay}:
2893 ffplay -flags2 +export_mvs input.mpg -vf codecview=mv=pf+bf+bb
2897 @section colorbalance
2898 Modify intensity of primary colors (red, green and blue) of input frames.
2900 The filter allows an input frame to be adjusted in the shadows, midtones or highlights
2901 regions for the red-cyan, green-magenta or blue-yellow balance.
2903 A positive adjustment value shifts the balance towards the primary color, a negative
2904 value towards the complementary color.
2906 The filter accepts the following options:
2912 Adjust red, green and blue shadows (darkest pixels).
2917 Adjust red, green and blue midtones (medium pixels).
2922 Adjust red, green and blue highlights (brightest pixels).
2924 Allowed ranges for options are @code{[-1.0, 1.0]}. Defaults are @code{0}.
2927 @subsection Examples
2931 Add red color cast to shadows:
2937 @section colorchannelmixer
2939 Adjust video input frames by re-mixing color channels.
2941 This filter modifies a color channel by adding the values associated to
2942 the other channels of the same pixels. For example if the value to
2943 modify is red, the output value will be:
2945 @var{red}=@var{red}*@var{rr} + @var{blue}*@var{rb} + @var{green}*@var{rg} + @var{alpha}*@var{ra}
2948 The filter accepts the following options:
2955 Adjust contribution of input red, green, blue and alpha channels for output red channel.
2956 Default is @code{1} for @var{rr}, and @code{0} for @var{rg}, @var{rb} and @var{ra}.
2962 Adjust contribution of input red, green, blue and alpha channels for output green channel.
2963 Default is @code{1} for @var{gg}, and @code{0} for @var{gr}, @var{gb} and @var{ga}.
2969 Adjust contribution of input red, green, blue and alpha channels for output blue channel.
2970 Default is @code{1} for @var{bb}, and @code{0} for @var{br}, @var{bg} and @var{ba}.
2976 Adjust contribution of input red, green, blue and alpha channels for output alpha channel.
2977 Default is @code{1} for @var{aa}, and @code{0} for @var{ar}, @var{ag} and @var{ab}.
2979 Allowed ranges for options are @code{[-2.0, 2.0]}.
2982 @subsection Examples
2986 Convert source to grayscale:
2988 colorchannelmixer=.3:.4:.3:0:.3:.4:.3:0:.3:.4:.3
2991 Simulate sepia tones:
2993 colorchannelmixer=.393:.769:.189:0:.349:.686:.168:0:.272:.534:.131
2997 @section colormatrix
2999 Convert color matrix.
3001 The filter accepts the following options:
3006 Specify the source and destination color matrix. Both values must be
3009 The accepted values are:
3025 For example to convert from BT.601 to SMPTE-240M, use the command:
3027 colormatrix=bt601:smpte240m
3032 Copy the input source unchanged to the output. This is mainly useful for
3037 Crop the input video to given dimensions.
3039 It accepts the following parameters:
3043 The width of the output video. It defaults to @code{iw}.
3044 This expression is evaluated only once during the filter
3048 The height of the output video. It defaults to @code{ih}.
3049 This expression is evaluated only once during the filter
3053 The horizontal position, in the input video, of the left edge of the output
3054 video. It defaults to @code{(in_w-out_w)/2}.
3055 This expression is evaluated per-frame.
3058 The vertical position, in the input video, of the top edge of the output video.
3059 It defaults to @code{(in_h-out_h)/2}.
3060 This expression is evaluated per-frame.
3063 If set to 1 will force the output display aspect ratio
3064 to be the same of the input, by changing the output sample aspect
3065 ratio. It defaults to 0.
3068 The @var{out_w}, @var{out_h}, @var{x}, @var{y} parameters are
3069 expressions containing the following constants:
3074 The computed values for @var{x} and @var{y}. They are evaluated for
3079 The input width and height.
3083 These are the same as @var{in_w} and @var{in_h}.
3087 The output (cropped) width and height.
3091 These are the same as @var{out_w} and @var{out_h}.
3094 same as @var{iw} / @var{ih}
3097 input sample aspect ratio
3100 input display aspect ratio, it is the same as (@var{iw} / @var{ih}) * @var{sar}
3104 horizontal and vertical chroma subsample values. For example for the
3105 pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
3108 The number of the input frame, starting from 0.
3111 the position in the file of the input frame, NAN if unknown
3114 The timestamp expressed in seconds. It's NAN if the input timestamp is unknown.
3118 The expression for @var{out_w} may depend on the value of @var{out_h},
3119 and the expression for @var{out_h} may depend on @var{out_w}, but they
3120 cannot depend on @var{x} and @var{y}, as @var{x} and @var{y} are
3121 evaluated after @var{out_w} and @var{out_h}.
3123 The @var{x} and @var{y} parameters specify the expressions for the
3124 position of the top-left corner of the output (non-cropped) area. They
3125 are evaluated for each frame. If the evaluated value is not valid, it
3126 is approximated to the nearest valid value.
3128 The expression for @var{x} may depend on @var{y}, and the expression
3129 for @var{y} may depend on @var{x}.
3131 @subsection Examples
3135 Crop area with size 100x100 at position (12,34).
3140 Using named options, the example above becomes:
3142 crop=w=100:h=100:x=12:y=34
3146 Crop the central input area with size 100x100:
3152 Crop the central input area with size 2/3 of the input video:
3154 crop=2/3*in_w:2/3*in_h
3158 Crop the input video central square:
3165 Delimit the rectangle with the top-left corner placed at position
3166 100:100 and the right-bottom corner corresponding to the right-bottom
3167 corner of the input image.
3169 crop=in_w-100:in_h-100:100:100
3173 Crop 10 pixels from the left and right borders, and 20 pixels from
3174 the top and bottom borders
3176 crop=in_w-2*10:in_h-2*20
3180 Keep only the bottom right quarter of the input image:
3182 crop=in_w/2:in_h/2:in_w/2:in_h/2
3186 Crop height for getting Greek harmony:
3188 crop=in_w:1/PHI*in_w
3192 Appply trembling effect:
3194 crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(n/10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(n/7)
3198 Apply erratic camera effect depending on timestamp:
3200 crop=in_w/2:in_h/2:(in_w-out_w)/2+((in_w-out_w)/2)*sin(t*10):(in_h-out_h)/2 +((in_h-out_h)/2)*sin(t*13)"
3204 Set x depending on the value of y:
3206 crop=in_w/2:in_h/2:y:10+10*sin(n/10)
3212 Auto-detect the crop size.
3214 It calculates the necessary cropping parameters and prints the
3215 recommended parameters via the logging system. The detected dimensions
3216 correspond to the non-black area of the input video.
3218 It accepts the following parameters:
3223 Set higher black value threshold, which can be optionally specified
3224 from nothing (0) to everything (255). An intensity value greater
3225 to the set value is considered non-black. It defaults to 24.
3228 The value which the width/height should be divisible by. It defaults to
3229 16. The offset is automatically adjusted to center the video. Use 2 to
3230 get only even dimensions (needed for 4:2:2 video). 16 is best when
3231 encoding to most video codecs.
3233 @item reset_count, reset
3234 Set the counter that determines after how many frames cropdetect will
3235 reset the previously detected largest video area and start over to
3236 detect the current optimal crop area. Default value is 0.
3238 This can be useful when channel logos distort the video area. 0
3239 indicates 'never reset', and returns the largest area encountered during
3246 Apply color adjustments using curves.
3248 This filter is similar to the Adobe Photoshop and GIMP curves tools. Each
3249 component (red, green and blue) has its values defined by @var{N} key points
3250 tied from each other using a smooth curve. The x-axis represents the pixel
3251 values from the input frame, and the y-axis the new pixel values to be set for
3254 By default, a component curve is defined by the two points @var{(0;0)} and
3255 @var{(1;1)}. This creates a straight line where each original pixel value is
3256 "adjusted" to its own value, which means no change to the image.
3258 The filter allows you to redefine these two points and add some more. A new
3259 curve (using a natural cubic spline interpolation) will be define to pass
3260 smoothly through all these new coordinates. The new defined points needs to be
3261 strictly increasing over the x-axis, and their @var{x} and @var{y} values must
3262 be in the @var{[0;1]} interval. If the computed curves happened to go outside
3263 the vector spaces, the values will be clipped accordingly.
3265 If there is no key point defined in @code{x=0}, the filter will automatically
3266 insert a @var{(0;0)} point. In the same way, if there is no key point defined
3267 in @code{x=1}, the filter will automatically insert a @var{(1;1)} point.
3269 The filter accepts the following options:
3273 Select one of the available color presets. This option can be used in addition
3274 to the @option{r}, @option{g}, @option{b} parameters; in this case, the later
3275 options takes priority on the preset values.
3276 Available presets are:
3279 @item color_negative
3282 @item increase_contrast
3284 @item linear_contrast
3285 @item medium_contrast
3287 @item strong_contrast
3290 Default is @code{none}.
3292 Set the master key points. These points will define a second pass mapping. It
3293 is sometimes called a "luminance" or "value" mapping. It can be used with
3294 @option{r}, @option{g}, @option{b} or @option{all} since it acts like a
3295 post-processing LUT.
3297 Set the key points for the red component.
3299 Set the key points for the green component.
3301 Set the key points for the blue component.
3303 Set the key points for all components (not including master).
3304 Can be used in addition to the other key points component
3305 options. In this case, the unset component(s) will fallback on this
3306 @option{all} setting.
3308 Specify a Photoshop curves file (@code{.asv}) to import the settings from.
3311 To avoid some filtergraph syntax conflicts, each key points list need to be
3312 defined using the following syntax: @code{x0/y0 x1/y1 x2/y2 ...}.
3314 @subsection Examples
3318 Increase slightly the middle level of blue:
3320 curves=blue='0.5/0.58'
3326 curves=r='0/0.11 .42/.51 1/0.95':g='0.50/0.48':b='0/0.22 .49/.44 1/0.8'
3328 Here we obtain the following coordinates for each components:
3331 @code{(0;0.11) (0.42;0.51) (1;0.95)}
3333 @code{(0;0) (0.50;0.48) (1;1)}
3335 @code{(0;0.22) (0.49;0.44) (1;0.80)}
3339 The previous example can also be achieved with the associated built-in preset:
3341 curves=preset=vintage
3351 Use a Photoshop preset and redefine the points of the green component:
3353 curves=psfile='MyCurvesPresets/purple.asv':green='0.45/0.53'
3359 Denoise frames using 2D DCT (frequency domain filtering).
3361 This filter is not designed for real time.
3363 The filter accepts the following options:
3367 Set the noise sigma constant.
3369 This @var{sigma} defines a hard threshold of @code{3 * sigma}; every DCT
3370 coefficient (absolute value) below this threshold with be dropped.
3372 If you need a more advanced filtering, see @option{expr}.
3374 Default is @code{0}.
3377 Set number overlapping pixels for each block. Since the filter can be slow, you
3378 may want to reduce this value, at the cost of a less effective filter and the
3379 risk of various artefacts.
3381 If the overlapping value doesn't allow to process the whole input width or
3382 height, a warning will be displayed and according borders won't be denoised.
3384 Default value is @var{blocksize}-1, which is the best possible setting.
3387 Set the coefficient factor expression.
3389 For each coefficient of a DCT block, this expression will be evaluated as a
3390 multiplier value for the coefficient.
3392 If this is option is set, the @option{sigma} option will be ignored.
3394 The absolute value of the coefficient can be accessed through the @var{c}
3398 Set the @var{blocksize} using the number of bits. @code{1<<@var{n}} defines the
3399 @var{blocksize}, which is the width and height of the processed blocks.
3401 The default value is @var{3} (8x8) and can be raised to @var{4} for a
3402 @var{blocksize} of 16x16. Note that changing this setting has huge consequences
3403 on the speed processing. Also, a larger block size does not necessarily means a
3407 @subsection Examples
3409 Apply a denoise with a @option{sigma} of @code{4.5}:
3414 The same operation can be achieved using the expression system:
3416 dctdnoiz=e='gte(c, 4.5*3)'
3419 Violent denoise using a block size of @code{16x16}:
3427 Drop duplicated frames at regular intervals.
3429 The filter accepts the following options:
3433 Set the number of frames from which one will be dropped. Setting this to
3434 @var{N} means one frame in every batch of @var{N} frames will be dropped.
3435 Default is @code{5}.
3438 Set the threshold for duplicate detection. If the difference metric for a frame
3439 is less than or equal to this value, then it is declared as duplicate. Default
3443 Set scene change threshold. Default is @code{15}.
3447 Set the size of the x and y-axis blocks used during metric calculations.
3448 Larger blocks give better noise suppression, but also give worse detection of
3449 small movements. Must be a power of two. Default is @code{32}.
3452 Mark main input as a pre-processed input and activate clean source input
3453 stream. This allows the input to be pre-processed with various filters to help
3454 the metrics calculation while keeping the frame selection lossless. When set to
3455 @code{1}, the first stream is for the pre-processed input, and the second
3456 stream is the clean source from where the kept frames are chosen. Default is
3460 Set whether or not chroma is considered in the metric calculations. Default is
3466 Remove judder produced by partially interlaced telecined content.
3468 Judder can be introduced, for instance, by @ref{pullup} filter. If the original
3469 source was partially telecined content then the output of @code{pullup,dejudder}
3470 will have a variable frame rate. May change the recorded frame rate of the
3471 container. Aside from that change, this filter will not affect constant frame
3474 The option available in this filter is:
3478 Specify the length of the window over which the judder repeats.
3480 Accepts any integer greater than 1. Useful values are:
3484 If the original was telecined from 24 to 30 fps (Film to NTSC).
3487 If the original was telecined from 25 to 30 fps (PAL to NTSC).
3490 If a mixture of the two.
3493 The default is @samp{4}.
3498 Suppress a TV station logo by a simple interpolation of the surrounding
3499 pixels. Just set a rectangle covering the logo and watch it disappear
3500 (and sometimes something even uglier appear - your mileage may vary).
3502 It accepts the following parameters:
3507 Specify the top left corner coordinates of the logo. They must be
3512 Specify the width and height of the logo to clear. They must be
3516 Specify the thickness of the fuzzy edge of the rectangle (added to
3517 @var{w} and @var{h}). The default value is 4.
3520 When set to 1, a green rectangle is drawn on the screen to simplify
3521 finding the right @var{x}, @var{y}, @var{w}, and @var{h} parameters.
3522 The default value is 0.
3524 The rectangle is drawn on the outermost pixels which will be (partly)
3525 replaced with interpolated values. The values of the next pixels
3526 immediately outside this rectangle in each direction will be used to
3527 compute the interpolated pixel values inside the rectangle.
3531 @subsection Examples
3535 Set a rectangle covering the area with top left corner coordinates 0,0
3536 and size 100x77, and a band of size 10:
3538 delogo=x=0:y=0:w=100:h=77:band=10
3545 Attempt to fix small changes in horizontal and/or vertical shift. This
3546 filter helps remove camera shake from hand-holding a camera, bumping a
3547 tripod, moving on a vehicle, etc.
3549 The filter accepts the following options:
3557 Specify a rectangular area where to limit the search for motion
3559 If desired the search for motion vectors can be limited to a
3560 rectangular area of the frame defined by its top left corner, width
3561 and height. These parameters have the same meaning as the drawbox
3562 filter which can be used to visualise the position of the bounding
3565 This is useful when simultaneous movement of subjects within the frame
3566 might be confused for camera motion by the motion vector search.
3568 If any or all of @var{x}, @var{y}, @var{w} and @var{h} are set to -1
3569 then the full frame is used. This allows later options to be set
3570 without specifying the bounding box for the motion vector search.
3572 Default - search the whole frame.
3576 Specify the maximum extent of movement in x and y directions in the
3577 range 0-64 pixels. Default 16.
3580 Specify how to generate pixels to fill blanks at the edge of the
3581 frame. Available values are:
3584 Fill zeroes at blank locations
3586 Original image at blank locations
3588 Extruded edge value at blank locations
3590 Mirrored edge at blank locations
3592 Default value is @samp{mirror}.
3595 Specify the blocksize to use for motion search. Range 4-128 pixels,
3599 Specify the contrast threshold for blocks. Only blocks with more than
3600 the specified contrast (difference between darkest and lightest
3601 pixels) will be considered. Range 1-255, default 125.
3604 Specify the search strategy. Available values are:
3607 Set exhaustive search
3609 Set less exhaustive search.
3611 Default value is @samp{exhaustive}.
3614 If set then a detailed log of the motion search is written to the
3618 If set to 1, specify using OpenCL capabilities, only available if
3619 FFmpeg was configured with @code{--enable-opencl}. Default value is 0.
3625 Draw a colored box on the input image.
3627 It accepts the following parameters:
3632 The expressions which specify the top left corner coordinates of the box. It defaults to 0.
3636 The expressions which specify the width and height of the box; if 0 they are interpreted as
3637 the input width and height. It defaults to 0.
3640 Specify the color of the box to write. For the general syntax of this option,
3641 check the "Color" section in the ffmpeg-utils manual. If the special
3642 value @code{invert} is used, the box edge color is the same as the
3643 video with inverted luma.
3646 The expression which sets the thickness of the box edge. Default value is @code{3}.
3648 See below for the list of accepted constants.
3651 The parameters for @var{x}, @var{y}, @var{w} and @var{h} and @var{t} are expressions containing the
3652 following constants:
3656 The input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}.
3660 horizontal and vertical chroma subsample values. For example for the
3661 pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
3665 The input width and height.
3668 The input sample aspect ratio.
3672 The x and y offset coordinates where the box is drawn.
3676 The width and height of the drawn box.
3679 The thickness of the drawn box.
3681 These constants allow the @var{x}, @var{y}, @var{w}, @var{h} and @var{t} expressions to refer to
3682 each other, so you may for example specify @code{y=x/dar} or @code{h=w/dar}.
3686 @subsection Examples
3690 Draw a black box around the edge of the input image:
3696 Draw a box with color red and an opacity of 50%:
3698 drawbox=10:20:200:60:red@@0.5
3701 The previous example can be specified as:
3703 drawbox=x=10:y=20:w=200:h=60:color=red@@0.5
3707 Fill the box with pink color:
3709 drawbox=x=10:y=10:w=100:h=100:color=pink@@0.5:t=max
3713 Draw a 2-pixel red 2.40:1 mask:
3715 drawbox=x=-t:y=0.5*(ih-iw/2.4)-t:w=iw+t*2:h=iw/2.4+t*2:t=2:c=red
3721 Draw a grid on the input image.
3723 It accepts the following parameters:
3728 The expressions which specify the coordinates of some point of grid intersection (meant to configure offset). Both default to 0.
3732 The expressions which specify the width and height of the grid cell, if 0 they are interpreted as the
3733 input width and height, respectively, minus @code{thickness}, so image gets
3734 framed. Default to 0.
3737 Specify the color of the grid. For the general syntax of this option,
3738 check the "Color" section in the ffmpeg-utils manual. If the special
3739 value @code{invert} is used, the grid color is the same as the
3740 video with inverted luma.
3743 The expression which sets the thickness of the grid line. Default value is @code{1}.
3745 See below for the list of accepted constants.
3748 The parameters for @var{x}, @var{y}, @var{w} and @var{h} and @var{t} are expressions containing the
3749 following constants:
3753 The input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}.
3757 horizontal and vertical chroma subsample values. For example for the
3758 pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
3762 The input grid cell width and height.
3765 The input sample aspect ratio.
3769 The x and y coordinates of some point of grid intersection (meant to configure offset).
3773 The width and height of the drawn cell.
3776 The thickness of the drawn cell.
3778 These constants allow the @var{x}, @var{y}, @var{w}, @var{h} and @var{t} expressions to refer to
3779 each other, so you may for example specify @code{y=x/dar} or @code{h=w/dar}.
3783 @subsection Examples
3787 Draw a grid with cell 100x100 pixels, thickness 2 pixels, with color red and an opacity of 50%:
3789 drawgrid=width=100:height=100:thickness=2:color=red@@0.5
3793 Draw a white 3x3 grid with an opacity of 50%:
3795 drawgrid=w=iw/3:h=ih/3:t=2:c=white@@0.5
3802 Draw a text string or text from a specified file on top of a video, using the
3803 libfreetype library.
3805 To enable compilation of this filter, you need to configure FFmpeg with
3806 @code{--enable-libfreetype}.
3807 To enable default font fallback and the @var{font} option you need to
3808 configure FFmpeg with @code{--enable-libfontconfig}.
3809 To enable the @var{text_shaping} option, you need to configure FFmpeg with
3810 @code{--enable-libfribidi}.
3814 It accepts the following parameters:
3819 Used to draw a box around text using the background color.
3820 The value must be either 1 (enable) or 0 (disable).
3821 The default value of @var{box} is 0.
3824 The color to be used for drawing box around text. For the syntax of this
3825 option, check the "Color" section in the ffmpeg-utils manual.
3827 The default value of @var{boxcolor} is "white".
3830 Set the width of the border to be drawn around the text using @var{bordercolor}.
3831 The default value of @var{borderw} is 0.
3834 Set the color to be used for drawing border around text. For the syntax of this
3835 option, check the "Color" section in the ffmpeg-utils manual.
3837 The default value of @var{bordercolor} is "black".
3840 Select how the @var{text} is expanded. Can be either @code{none},
3841 @code{strftime} (deprecated) or
3842 @code{normal} (default). See the @ref{drawtext_expansion, Text expansion} section
3846 If true, check and fix text coords to avoid clipping.
3849 The color to be used for drawing fonts. For the syntax of this option, check
3850 the "Color" section in the ffmpeg-utils manual.
3852 The default value of @var{fontcolor} is "black".
3854 @item fontcolor_expr
3855 String which is expanded the same way as @var{text} to obtain dynamic
3856 @var{fontcolor} value. By default this option has empty value and is not
3857 processed. When this option is set, it overrides @var{fontcolor} option.
3860 The font family to be used for drawing text. By default Sans.
3863 The font file to be used for drawing text. The path must be included.
3864 This parameter is mandatory if the fontconfig support is disabled.
3867 The font size to be used for drawing text.
3868 The default value of @var{fontsize} is 16.
3871 If set to 1, attempt to shape the text (for example, reverse the order of
3872 right-to-left text and join Arabic characters) before drawing it.
3873 Otherwise, just draw the text exactly as given.
3874 By default 1 (if supported).
3877 The flags to be used for loading the fonts.
3879 The flags map the corresponding flags supported by libfreetype, and are
3880 a combination of the following values:
3887 @item vertical_layout
3888 @item force_autohint
3891 @item ignore_global_advance_width
3893 @item ignore_transform
3899 Default value is "default".
3901 For more information consult the documentation for the FT_LOAD_*
3905 The color to be used for drawing a shadow behind the drawn text. For the
3906 syntax of this option, check the "Color" section in the ffmpeg-utils manual.
3908 The default value of @var{shadowcolor} is "black".
3912 The x and y offsets for the text shadow position with respect to the
3913 position of the text. They can be either positive or negative
3914 values. The default value for both is "0".
3917 The starting frame number for the n/frame_num variable. The default value
3921 The size in number of spaces to use for rendering the tab.
3925 Set the initial timecode representation in "hh:mm:ss[:;.]ff"
3926 format. It can be used with or without text parameter. @var{timecode_rate}
3927 option must be specified.
3929 @item timecode_rate, rate, r
3930 Set the timecode frame rate (timecode only).
3933 The text string to be drawn. The text must be a sequence of UTF-8
3935 This parameter is mandatory if no file is specified with the parameter
3939 A text file containing text to be drawn. The text must be a sequence
3940 of UTF-8 encoded characters.
3942 This parameter is mandatory if no text string is specified with the
3943 parameter @var{text}.
3945 If both @var{text} and @var{textfile} are specified, an error is thrown.
3948 If set to 1, the @var{textfile} will be reloaded before each frame.
3949 Be sure to update it atomically, or it may be read partially, or even fail.
3953 The expressions which specify the offsets where text will be drawn
3954 within the video frame. They are relative to the top/left border of the
3957 The default value of @var{x} and @var{y} is "0".
3959 See below for the list of accepted constants and functions.
3962 The parameters for @var{x} and @var{y} are expressions containing the
3963 following constants and functions:
3967 input display aspect ratio, it is the same as (@var{w} / @var{h}) * @var{sar}
3971 horizontal and vertical chroma subsample values. For example for the
3972 pixel format "yuv422p" @var{hsub} is 2 and @var{vsub} is 1.
3975 the height of each text line
3983 @item max_glyph_a, ascent
3984 the maximum distance from the baseline to the highest/upper grid
3985 coordinate used to place a glyph outline point, for all the rendered
3987 It is a positive value, due to the grid's orientation with the Y axis
3990 @item max_glyph_d, descent
3991 the maximum distance from the baseline to the lowest grid coordinate
3992 used to place a glyph outline point, for all the rendered glyphs.
3993 This is a negative value, due to the grid's orientation, with the Y axis
3997 maximum glyph height, that is the maximum height for all the glyphs
3998 contained in the rendered text, it is equivalent to @var{ascent} -
4002 maximum glyph width, that is the maximum width for all the glyphs
4003 contained in the rendered text
4006 the number of input frame, starting from 0
4008 @item rand(min, max)
4009 return a random number included between @var{min} and @var{max}
4012 The input sample aspect ratio.
4015 timestamp expressed in seconds, NAN if the input timestamp is unknown
4018 the height of the rendered text
4021 the width of the rendered text
4025 the x and y offset coordinates where the text is drawn.
4027 These parameters allow the @var{x} and @var{y} expressions to refer
4028 each other, so you can for example specify @code{y=x/dar}.
4031 @anchor{drawtext_expansion}
4032 @subsection Text expansion
4034 If @option{expansion} is set to @code{strftime},
4035 the filter recognizes strftime() sequences in the provided text and
4036 expands them accordingly. Check the documentation of strftime(). This
4037 feature is deprecated.
4039 If @option{expansion} is set to @code{none}, the text is printed verbatim.
4041 If @option{expansion} is set to @code{normal} (which is the default),
4042 the following expansion mechanism is used.
4044 The backslash character '\', followed by any character, always expands to
4045 the second character.
4047 Sequence of the form @code{%@{...@}} are expanded. The text between the
4048 braces is a function name, possibly followed by arguments separated by ':'.
4049 If the arguments contain special characters or delimiters (':' or '@}'),
4050 they should be escaped.
4052 Note that they probably must also be escaped as the value for the
4053 @option{text} option in the filter argument string and as the filter
4054 argument in the filtergraph description, and possibly also for the shell,
4055 that makes up to four levels of escaping; using a text file avoids these
4058 The following functions are available:
4063 The expression evaluation result.
4065 It must take one argument specifying the expression to be evaluated,
4066 which accepts the same constants and functions as the @var{x} and
4067 @var{y} values. Note that not all constants should be used, for
4068 example the text size is not known when evaluating the expression, so
4069 the constants @var{text_w} and @var{text_h} will have an undefined
4072 @item expr_int_format, eif
4073 Evaluate the expression's value and output as formatted integer.
4075 The first argument is the expression to be evaluated, just as for the @var{expr} function.
4076 The second argument specifies the output format. Allowed values are 'x', 'X', 'd' and
4077 'u'. They are treated exactly as in the printf function.
4078 The third parameter is optional and sets the number of positions taken by the output.
4079 It can be used to add padding with zeros from the left.
4082 The time at which the filter is running, expressed in UTC.
4083 It can accept an argument: a strftime() format string.
4086 The time at which the filter is running, expressed in the local time zone.
4087 It can accept an argument: a strftime() format string.
4090 Frame metadata. It must take one argument specifying metadata key.
4093 The frame number, starting from 0.
4096 A 1 character description of the current picture type.
4099 The timestamp of the current frame.
4100 It can take up to two arguments.
4102 The first argument is the format of the timestamp; it defaults to @code{flt}
4103 for seconds as a decimal number with microsecond accuracy; @code{hms} stands
4104 for a formatted @var{[-]HH:MM:SS.mmm} timestamp with millisecond accuracy.
4106 The second argument is an offset added to the timestamp.
4110 @subsection Examples
4114 Draw "Test Text" with font FreeSerif, using the default values for the
4115 optional parameters.
4118 drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text'"
4122 Draw 'Test Text' with font FreeSerif of size 24 at position x=100
4123 and y=50 (counting from the top-left corner of the screen), text is
4124 yellow with a red box around it. Both the text and the box have an
4128 drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text':\
4129 x=100: y=50: fontsize=24: fontcolor=yellow@@0.2: box=1: boxcolor=red@@0.2"
4132 Note that the double quotes are not necessary if spaces are not used
4133 within the parameter list.
4136 Show the text at the center of the video frame:
4138 drawtext="fontsize=30:fontfile=FreeSerif.ttf:text='hello world':x=(w-text_w)/2:y=(h-text_h-line_h)/2"
4142 Show a text line sliding from right to left in the last row of the video
4143 frame. The file @file{LONG_LINE} is assumed to contain a single line
4146 drawtext="fontsize=15:fontfile=FreeSerif.ttf:text=LONG_LINE:y=h-line_h:x=-50*t"
4150 Show the content of file @file{CREDITS} off the bottom of the frame and scroll up.
4152 drawtext="fontsize=20:fontfile=FreeSerif.ttf:textfile=CREDITS:y=h-20*t"
4156 Draw a single green letter "g", at the center of the input video.
4157 The glyph baseline is placed at half screen height.
4159 drawtext="fontsize=60:fontfile=FreeSerif.ttf:fontcolor=green:text=g:x=(w-max_glyph_w)/2:y=h/2-ascent"
4163 Show text for 1 second every 3 seconds:
4165 drawtext="fontfile=FreeSerif.ttf:fontcolor=white:x=100:y=x/dar:enable=lt(mod(t\,3)\,1):text='blink'"
4169 Use fontconfig to set the font. Note that the colons need to be escaped.
4171 drawtext='fontfile=Linux Libertine O-40\:style=Semibold:text=FFmpeg'
4175 Print the date of a real-time encoding (see strftime(3)):
4177 drawtext='fontfile=FreeSans.ttf:text=%@{localtime\:%a %b %d %Y@}'
4181 Show text fading in and out (appearing/disappearing):
4184 DS=1.0 # display start
4185 DE=10.0 # display end
4186 FID=1.5 # fade in duration
4187 FOD=5 # fade out duration
4188 ffplay -f lavfi "color,drawtext=text=TEST:fontsize=50:fontfile=FreeSerif.ttf:fontcolor_expr=ff0000%@{eif\\\\: clip(255*(1*between(t\\, $DS + $FID\\, $DE - $FOD) + ((t - $DS)/$FID)*between(t\\, $DS\\, $DS + $FID) + (-(t - $DE)/$FOD)*between(t\\, $DE - $FOD\\, $DE) )\\, 0\\, 255) \\\\: x\\\\: 2 @}"
4193 For more information about libfreetype, check:
4194 @url{http://www.freetype.org/}.
4196 For more information about fontconfig, check:
4197 @url{http://freedesktop.org/software/fontconfig/fontconfig-user.html}.
4199 For more information about libfribidi, check:
4200 @url{http://fribidi.org/}.
4204 Detect and draw edges. The filter uses the Canny Edge Detection algorithm.
4206 The filter accepts the following options:
4211 Set low and high threshold values used by the Canny thresholding
4214 The high threshold selects the "strong" edge pixels, which are then
4215 connected through 8-connectivity with the "weak" edge pixels selected
4216 by the low threshold.
4218 @var{low} and @var{high} threshold values must be chosen in the range
4219 [0,1], and @var{low} should be lesser or equal to @var{high}.
4221 Default value for @var{low} is @code{20/255}, and default value for @var{high}
4225 Define the drawing mode.
4229 Draw white/gray wires on black background.
4232 Mix the colors to create a paint/cartoon effect.
4235 Default value is @var{wires}.
4238 @subsection Examples
4242 Standard edge detection with custom values for the hysteresis thresholding:
4244 edgedetect=low=0.1:high=0.4
4248 Painting effect without thresholding:
4250 edgedetect=mode=colormix:high=0
4254 @section extractplanes
4256 Extract color channel components from input video stream into
4257 separate grayscale video streams.
4259 The filter accepts the following option:
4263 Set plane(s) to extract.
4265 Available values for planes are:
4276 Choosing planes not available in the input will result in an error.
4277 That means you cannot select @code{r}, @code{g}, @code{b} planes
4278 with @code{y}, @code{u}, @code{v} planes at same time.
4281 @subsection Examples
4285 Extract luma, u and v color channel component from input video frame
4286 into 3 grayscale outputs:
4288 ffmpeg -i video.avi -filter_complex 'extractplanes=y+u+v[y][u][v]' -map '[y]' y.avi -map '[u]' u.avi -map '[v]' v.avi
4294 Apply a posterize effect using the ELBG (Enhanced LBG) algorithm.
4296 For each input image, the filter will compute the optimal mapping from
4297 the input to the output given the codebook length, that is the number
4298 of distinct output colors.
4300 This filter accepts the following options.
4303 @item codebook_length, l
4304 Set codebook length. The value must be a positive integer, and
4305 represents the number of distinct output colors. Default value is 256.
4308 Set the maximum number of iterations to apply for computing the optimal
4309 mapping. The higher the value the better the result and the higher the
4310 computation time. Default value is 1.
4313 Set a random seed, must be an integer included between 0 and
4314 UINT32_MAX. If not specified, or if explicitly set to -1, the filter
4315 will try to use a good random seed on a best effort basis.
4320 Apply a fade-in/out effect to the input video.
4322 It accepts the following parameters:
4326 The effect type can be either "in" for a fade-in, or "out" for a fade-out
4328 Default is @code{in}.
4330 @item start_frame, s
4331 Specify the number of the frame to start applying the fade
4332 effect at. Default is 0.
4335 The number of frames that the fade effect lasts. At the end of the
4336 fade-in effect, the output video will have the same intensity as the input video.
4337 At the end of the fade-out transition, the output video will be filled with the
4338 selected @option{color}.
4342 If set to 1, fade only alpha channel, if one exists on the input.
4345 @item start_time, st
4346 Specify the timestamp (in seconds) of the frame to start to apply the fade
4347 effect. If both start_frame and start_time are specified, the fade will start at
4348 whichever comes last. Default is 0.
4351 The number of seconds for which the fade effect has to last. At the end of the
4352 fade-in effect the output video will have the same intensity as the input video,
4353 at the end of the fade-out transition the output video will be filled with the
4354 selected @option{color}.
4355 If both duration and nb_frames are specified, duration is used. Default is 0.
4358 Specify the color of the fade. Default is "black".
4361 @subsection Examples
4365 Fade in the first 30 frames of video:
4370 The command above is equivalent to:
4376 Fade out the last 45 frames of a 200-frame video:
4379 fade=type=out:start_frame=155:nb_frames=45
4383 Fade in the first 25 frames and fade out the last 25 frames of a 1000-frame video:
4385 fade=in:0:25, fade=out:975:25
4389 Make the first 5 frames yellow, then fade in from frame 5-24:
4391 fade=in:5:20:color=yellow
4395 Fade in alpha over first 25 frames of video:
4397 fade=in:0:25:alpha=1
4401 Make the first 5.5 seconds black, then fade in for 0.5 seconds:
4403 fade=t=in:st=5.5:d=0.5
4410 Extract a single field from an interlaced image using stride
4411 arithmetic to avoid wasting CPU time. The output frames are marked as
4414 The filter accepts the following options:
4418 Specify whether to extract the top (if the value is @code{0} or
4419 @code{top}) or the bottom field (if the value is @code{1} or
4425 Field matching filter for inverse telecine. It is meant to reconstruct the
4426 progressive frames from a telecined stream. The filter does not drop duplicated
4427 frames, so to achieve a complete inverse telecine @code{fieldmatch} needs to be
4428 followed by a decimation filter such as @ref{decimate} in the filtergraph.
4430 The separation of the field matching and the decimation is notably motivated by
4431 the possibility of inserting a de-interlacing filter fallback between the two.
4432 If the source has mixed telecined and real interlaced content,
4433 @code{fieldmatch} will not be able to match fields for the interlaced parts.
4434 But these remaining combed frames will be marked as interlaced, and thus can be
4435 de-interlaced by a later filter such as @ref{yadif} before decimation.
4437 In addition to the various configuration options, @code{fieldmatch} can take an
4438 optional second stream, activated through the @option{ppsrc} option. If
4439 enabled, the frames reconstruction will be based on the fields and frames from
4440 this second stream. This allows the first input to be pre-processed in order to
4441 help the various algorithms of the filter, while keeping the output lossless
4442 (assuming the fields are matched properly). Typically, a field-aware denoiser,
4443 or brightness/contrast adjustments can help.
4445 Note that this filter uses the same algorithms as TIVTC/TFM (AviSynth project)
4446 and VIVTC/VFM (VapourSynth project). The later is a light clone of TFM from
4447 which @code{fieldmatch} is based on. While the semantic and usage are very
4448 close, some behaviour and options names can differ.
4450 The @ref{decimate} filter currently only works for constant frame rate input.
4451 Do not use @code{fieldmatch} and @ref{decimate} if your input has mixed
4452 telecined and progressive content with changing framerate.
4454 The filter accepts the following options:
4458 Specify the assumed field order of the input stream. Available values are:
4462 Auto detect parity (use FFmpeg's internal parity value).
4464 Assume bottom field first.
4466 Assume top field first.
4469 Note that it is sometimes recommended not to trust the parity announced by the
4472 Default value is @var{auto}.
4475 Set the matching mode or strategy to use. @option{pc} mode is the safest in the
4476 sense that it won't risk creating jerkiness due to duplicate frames when
4477 possible, but if there are bad edits or blended fields it will end up
4478 outputting combed frames when a good match might actually exist. On the other
4479 hand, @option{pcn_ub} mode is the most risky in terms of creating jerkiness,
4480 but will almost always find a good frame if there is one. The other values are
4481 all somewhere in between @option{pc} and @option{pcn_ub} in terms of risking
4482 jerkiness and creating duplicate frames versus finding good matches in sections
4483 with bad edits, orphaned fields, blended fields, etc.
4485 More details about p/c/n/u/b are available in @ref{p/c/n/u/b meaning} section.
4487 Available values are:
4491 2-way matching (p/c)
4493 2-way matching, and trying 3rd match if still combed (p/c + n)
4495 2-way matching, and trying 3rd match (same order) if still combed (p/c + u)
4497 2-way matching, trying 3rd match if still combed, and trying 4th/5th matches if
4498 still combed (p/c + n + u/b)
4500 3-way matching (p/c/n)
4502 3-way matching, and trying 4th/5th matches if all 3 of the original matches are
4503 detected as combed (p/c/n + u/b)
4506 The parenthesis at the end indicate the matches that would be used for that
4507 mode assuming @option{order}=@var{tff} (and @option{field} on @var{auto} or
4510 In terms of speed @option{pc} mode is by far the fastest and @option{pcn_ub} is
4513 Default value is @var{pc_n}.
4516 Mark the main input stream as a pre-processed input, and enable the secondary
4517 input stream as the clean source to pick the fields from. See the filter
4518 introduction for more details. It is similar to the @option{clip2} feature from
4521 Default value is @code{0} (disabled).
4524 Set the field to match from. It is recommended to set this to the same value as
4525 @option{order} unless you experience matching failures with that setting. In
4526 certain circumstances changing the field that is used to match from can have a
4527 large impact on matching performance. Available values are:
4531 Automatic (same value as @option{order}).
4533 Match from the bottom field.
4535 Match from the top field.
4538 Default value is @var{auto}.
4541 Set whether or not chroma is included during the match comparisons. In most
4542 cases it is recommended to leave this enabled. You should set this to @code{0}
4543 only if your clip has bad chroma problems such as heavy rainbowing or other
4544 artifacts. Setting this to @code{0} could also be used to speed things up at
4545 the cost of some accuracy.
4547 Default value is @code{1}.
4551 These define an exclusion band which excludes the lines between @option{y0} and
4552 @option{y1} from being included in the field matching decision. An exclusion
4553 band can be used to ignore subtitles, a logo, or other things that may
4554 interfere with the matching. @option{y0} sets the starting scan line and
4555 @option{y1} sets the ending line; all lines in between @option{y0} and
4556 @option{y1} (including @option{y0} and @option{y1}) will be ignored. Setting
4557 @option{y0} and @option{y1} to the same value will disable the feature.
4558 @option{y0} and @option{y1} defaults to @code{0}.
4561 Set the scene change detection threshold as a percentage of maximum change on
4562 the luma plane. Good values are in the @code{[8.0, 14.0]} range. Scene change
4563 detection is only relevant in case @option{combmatch}=@var{sc}. The range for
4564 @option{scthresh} is @code{[0.0, 100.0]}.
4566 Default value is @code{12.0}.
4569 When @option{combatch} is not @var{none}, @code{fieldmatch} will take into
4570 account the combed scores of matches when deciding what match to use as the
4571 final match. Available values are:
4575 No final matching based on combed scores.
4577 Combed scores are only used when a scene change is detected.
4579 Use combed scores all the time.
4582 Default is @var{sc}.
4585 Force @code{fieldmatch} to calculate the combed metrics for certain matches and
4586 print them. This setting is known as @option{micout} in TFM/VFM vocabulary.
4587 Available values are:
4591 No forced calculation.
4593 Force p/c/n calculations.
4595 Force p/c/n/u/b calculations.
4598 Default value is @var{none}.
4601 This is the area combing threshold used for combed frame detection. This
4602 essentially controls how "strong" or "visible" combing must be to be detected.
4603 Larger values mean combing must be more visible and smaller values mean combing
4604 can be less visible or strong and still be detected. Valid settings are from
4605 @code{-1} (every pixel will be detected as combed) to @code{255} (no pixel will
4606 be detected as combed). This is basically a pixel difference value. A good
4607 range is @code{[8, 12]}.
4609 Default value is @code{9}.
4612 Sets whether or not chroma is considered in the combed frame decision. Only
4613 disable this if your source has chroma problems (rainbowing, etc.) that are
4614 causing problems for the combed frame detection with chroma enabled. Actually,
4615 using @option{chroma}=@var{0} is usually more reliable, except for the case
4616 where there is chroma only combing in the source.
4618 Default value is @code{0}.
4622 Respectively set the x-axis and y-axis size of the window used during combed
4623 frame detection. This has to do with the size of the area in which
4624 @option{combpel} pixels are required to be detected as combed for a frame to be
4625 declared combed. See the @option{combpel} parameter description for more info.
4626 Possible values are any number that is a power of 2 starting at 4 and going up
4629 Default value is @code{16}.
4632 The number of combed pixels inside any of the @option{blocky} by
4633 @option{blockx} size blocks on the frame for the frame to be detected as
4634 combed. While @option{cthresh} controls how "visible" the combing must be, this
4635 setting controls "how much" combing there must be in any localized area (a
4636 window defined by the @option{blockx} and @option{blocky} settings) on the
4637 frame. Minimum value is @code{0} and maximum is @code{blocky x blockx} (at
4638 which point no frames will ever be detected as combed). This setting is known
4639 as @option{MI} in TFM/VFM vocabulary.
4641 Default value is @code{80}.
4644 @anchor{p/c/n/u/b meaning}
4645 @subsection p/c/n/u/b meaning
4647 @subsubsection p/c/n
4649 We assume the following telecined stream:
4652 Top fields: 1 2 2 3 4
4653 Bottom fields: 1 2 3 4 4
4656 The numbers correspond to the progressive frame the fields relate to. Here, the
4657 first two frames are progressive, the 3rd and 4th are combed, and so on.
4659 When @code{fieldmatch} is configured to run a matching from bottom
4660 (@option{field}=@var{bottom}) this is how this input stream get transformed:
4665 B 1 2 3 4 4 <-- matching reference
4674 As a result of the field matching, we can see that some frames get duplicated.
4675 To perform a complete inverse telecine, you need to rely on a decimation filter
4676 after this operation. See for instance the @ref{decimate} filter.
4678 The same operation now matching from top fields (@option{field}=@var{top})
4683 T 1 2 2 3 4 <-- matching reference
4693 In these examples, we can see what @var{p}, @var{c} and @var{n} mean;
4694 basically, they refer to the frame and field of the opposite parity:
4697 @item @var{p} matches the field of the opposite parity in the previous frame
4698 @item @var{c} matches the field of the opposite parity in the current frame
4699 @item @var{n} matches the field of the opposite parity in the next frame
4704 The @var{u} and @var{b} matching are a bit special in the sense that they match
4705 from the opposite parity flag. In the following examples, we assume that we are
4706 currently matching the 2nd frame (Top:2, bottom:2). According to the match, a
4707 'x' is placed above and below each matched fields.
4709 With bottom matching (@option{field}=@var{bottom}):
4714 Top 1 2 2 1 2 2 1 2 2 1 2 2 1 2 2
4715 Bottom 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
4723 With top matching (@option{field}=@var{top}):
4728 Top 1 2 2 1 2 2 1 2 2 1 2 2 1 2 2
4729 Bottom 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
4737 @subsection Examples
4739 Simple IVTC of a top field first telecined stream:
4741 fieldmatch=order=tff:combmatch=none, decimate
4744 Advanced IVTC, with fallback on @ref{yadif} for still combed frames:
4746 fieldmatch=order=tff:combmatch=full, yadif=deint=interlaced, decimate
4751 Transform the field order of the input video.
4753 It accepts the following parameters:
4758 The output field order. Valid values are @var{tff} for top field first or @var{bff}
4759 for bottom field first.
4762 The default value is @samp{tff}.
4764 The transformation is done by shifting the picture content up or down
4765 by one line, and filling the remaining line with appropriate picture content.
4766 This method is consistent with most broadcast field order converters.
4768 If the input video is not flagged as being interlaced, or it is already
4769 flagged as being of the required output field order, then this filter does
4770 not alter the incoming video.
4772 It is very useful when converting to or from PAL DV material,
4773 which is bottom field first.
4777 ffmpeg -i in.vob -vf "fieldorder=bff" out.dv
4782 Buffer input images and send them when they are requested.
4784 It is mainly useful when auto-inserted by the libavfilter
4787 It does not take parameters.
4792 Convert the input video to one of the specified pixel formats.
4793 Libavfilter will try to pick one that is suitable as input to
4796 It accepts the following parameters:
4800 A '|'-separated list of pixel format names, such as
4801 "pix_fmts=yuv420p|monow|rgb24".
4805 @subsection Examples
4809 Convert the input video to the @var{yuv420p} format
4811 format=pix_fmts=yuv420p
4814 Convert the input video to any of the formats in the list
4816 format=pix_fmts=yuv420p|yuv444p|yuv410p
4823 Convert the video to specified constant frame rate by duplicating or dropping
4824 frames as necessary.
4826 It accepts the following parameters:
4830 The desired output frame rate. The default is @code{25}.
4835 Possible values are:
4838 zero round towards 0
4842 round towards -infinity
4844 round towards +infinity
4848 The default is @code{near}.
4851 Assume the first PTS should be the given value, in seconds. This allows for
4852 padding/trimming at the start of stream. By default, no assumption is made
4853 about the first frame's expected PTS, so no padding or trimming is done.
4854 For example, this could be set to 0 to pad the beginning with duplicates of
4855 the first frame if a video stream starts after the audio stream or to trim any
4856 frames with a negative PTS.
4860 Alternatively, the options can be specified as a flat string:
4861 @var{fps}[:@var{round}].
4863 See also the @ref{setpts} filter.
4865 @subsection Examples
4869 A typical usage in order to set the fps to 25:
4875 Sets the fps to 24, using abbreviation and rounding method to round to nearest:
4877 fps=fps=film:round=near
4883 Pack two different video streams into a stereoscopic video, setting proper
4884 metadata on supported codecs. The two views should have the same size and
4885 framerate and processing will stop when the shorter video ends. Please note
4886 that you may conveniently adjust view properties with the @ref{scale} and
4889 It accepts the following parameters:
4893 The desired packing format. Supported values are:
4898 The views are next to each other (default).
4901 The views are on top of each other.
4904 The views are packed by line.
4907 The views are packed by column.
4910 The views are temporally interleaved.
4919 # Convert left and right views into a frame-sequential video
4920 ffmpeg -i LEFT -i RIGHT -filter_complex framepack=frameseq OUTPUT
4922 # Convert views into a side-by-side video with the same output resolution as the input
4923 ffmpeg -i LEFT -i RIGHT -filter_complex [0:v]scale=w=iw/2[left],[1:v]scale=w=iw/2[right],[left][right]framepack=sbs OUTPUT
4928 Select one frame every N-th frame.
4930 This filter accepts the following option:
4933 Select frame after every @code{step} frames.
4934 Allowed values are positive integers higher than 0. Default value is @code{1}.
4940 Apply a frei0r effect to the input video.
4942 To enable the compilation of this filter, you need to install the frei0r
4943 header and configure FFmpeg with @code{--enable-frei0r}.
4945 It accepts the following parameters:
4950 The name of the frei0r effect to load. If the environment variable
4951 @env{FREI0R_PATH} is defined, the frei0r effect is searched for in each of the
4952 directories specified by the colon-separated list in @env{FREIOR_PATH}.
4953 Otherwise, the standard frei0r paths are searched, in this order:
4954 @file{HOME/.frei0r-1/lib/}, @file{/usr/local/lib/frei0r-1/},
4955 @file{/usr/lib/frei0r-1/}.
4958 A '|'-separated list of parameters to pass to the frei0r effect.
4962 A frei0r effect parameter can be a boolean (its value is either
4963 "y" or "n"), a double, a color (specified as
4964 @var{R}/@var{G}/@var{B}, where @var{R}, @var{G}, and @var{B} are floating point
4965 numbers between 0.0 and 1.0, inclusive) or by a color description specified in the "Color"
4966 section in the ffmpeg-utils manual), a position (specified as @var{X}/@var{Y}, where
4967 @var{X} and @var{Y} are floating point numbers) and/or a string.
4969 The number and types of parameters depend on the loaded effect. If an
4970 effect parameter is not specified, the default value is set.
4972 @subsection Examples
4976 Apply the distort0r effect, setting the first two double parameters:
4978 frei0r=filter_name=distort0r:filter_params=0.5|0.01
4982 Apply the colordistance effect, taking a color as the first parameter:
4984 frei0r=colordistance:0.2/0.3/0.4
4985 frei0r=colordistance:violet
4986 frei0r=colordistance:0x112233
4990 Apply the perspective effect, specifying the top left and top right image
4993 frei0r=perspective:0.2/0.2|0.8/0.2
4997 For more information, see
4998 @url{http://frei0r.dyne.org}
5002 The filter accepts the following options:
5006 Set the luminance expression.
5008 Set the chrominance blue expression.
5010 Set the chrominance red expression.
5012 Set the alpha expression.
5014 Set the red expression.
5016 Set the green expression.
5018 Set the blue expression.
5021 The colorspace is selected according to the specified options. If one
5022 of the @option{lum_expr}, @option{cb_expr}, or @option{cr_expr}
5023 options is specified, the filter will automatically select a YCbCr
5024 colorspace. If one of the @option{red_expr}, @option{green_expr}, or
5025 @option{blue_expr} options is specified, it will select an RGB
5028 If one of the chrominance expression is not defined, it falls back on the other
5029 one. If no alpha expression is specified it will evaluate to opaque value.
5030 If none of chrominance expressions are specified, they will evaluate
5031 to the luminance expression.
5033 The expressions can use the following variables and functions:
5037 The sequential number of the filtered frame, starting from @code{0}.
5041 The coordinates of the current sample.
5045 The width and height of the image.
5049 Width and height scale depending on the currently filtered plane. It is the
5050 ratio between the corresponding luma plane number of pixels and the current
5051 plane ones. E.g. for YUV4:2:0 the values are @code{1,1} for the luma plane, and
5052 @code{0.5,0.5} for chroma planes.
5055 Time of the current frame, expressed in seconds.
5058 Return the value of the pixel at location (@var{x},@var{y}) of the current
5062 Return the value of the pixel at location (@var{x},@var{y}) of the luminance
5066 Return the value of the pixel at location (@var{x},@var{y}) of the
5067 blue-difference chroma plane. Return 0 if there is no such plane.
5070 Return the value of the pixel at location (@var{x},@var{y}) of the
5071 red-difference chroma plane. Return 0 if there is no such plane.
5076 Return the value of the pixel at location (@var{x},@var{y}) of the
5077 red/green/blue component. Return 0 if there is no such component.
5080 Return the value of the pixel at location (@var{x},@var{y}) of the alpha
5081 plane. Return 0 if there is no such plane.
5084 For functions, if @var{x} and @var{y} are outside the area, the value will be
5085 automatically clipped to the closer edge.
5087 @subsection Examples
5091 Flip the image horizontally:
5097 Generate a bidimensional sine wave, with angle @code{PI/3} and a
5098 wavelength of 100 pixels:
5100 geq=128 + 100*sin(2*(PI/100)*(cos(PI/3)*(X-50*T) + sin(PI/3)*Y)):128:128
5104 Generate a fancy enigmatic moving light:
5106 nullsrc=s=256x256,geq=random(1)/hypot(X-cos(N*0.07)*W/2-W/2\,Y-sin(N*0.09)*H/2-H/2)^2*1000000*sin(N*0.02):128:128
5110 Generate a quick emboss effect:
5112 format=gray,geq=lum_expr='(p(X,Y)+(256-p(X-4,Y-4)))/2'
5116 Modify RGB components depending on pixel position:
5118 geq=r='X/W*r(X,Y)':g='(1-X/W)*g(X,Y)':b='(H-Y)/H*b(X,Y)'
5122 Create a radial gradient that is the same size as the input (also see
5123 the @ref{vignette} filter):
5125 geq=lum=255*gauss((X/W-0.5)*3)*gauss((Y/H-0.5)*3)/gauss(0)/gauss(0),format=gray
5129 Create a linear gradient to use as a mask fo