1 @chapter Filtering Introduction
2 @c man begin FILTERING INTRODUCTION
4 Filtering in FFmpeg is enabled through the libavfilter library.
6 In libavfilter, a filter can have multiple inputs and multiple
8 To illustrate the sorts of things that are possible, we consider the
13 input --> split ---------------------> overlay --> output
16 +-----> crop --> vflip -------+
19 This filtergraph splits the input stream in two streams, then sends one
20 stream through the crop filter and the vflip filter, before merging it
21 back with the other stream by overlaying it on top. You can use the
22 following command to achieve this:
25 ffmpeg -i INPUT -vf "split [main][tmp]; [tmp] crop=iw:ih/2:0:0, vflip [flip]; [main][flip] overlay=0:H/2" OUTPUT
28 The result will be that the top half of the video is mirrored
29 onto the bottom half of the output video.
31 Filters in the same linear chain are separated by commas, and distinct
32 linear chains of filters are separated by semicolons. In our example,
33 @var{crop,vflip} are in one linear chain, @var{split} and
34 @var{overlay} are separately in another. The points where the linear
35 chains join are labelled by names enclosed in square brackets. In the
36 example, the split filter generates two outputs that are associated to
37 the labels @var{[main]} and @var{[tmp]}.
39 The stream sent to the second output of @var{split}, labelled as
40 @var{[tmp]}, is processed through the @var{crop} filter, which crops
41 away the lower half part of the video, and then vertically flipped. The
42 @var{overlay} filter takes in input the first unchanged output of the
43 split filter (which was labelled as @var{[main]}), and overlay on its
44 lower half the output generated by the @var{crop,vflip} filterchain.
46 Some filters take in input a list of parameters: they are specified
47 after the filter name and an equal sign, and are separated from each other
50 There exist so-called @var{source filters} that do not have an
51 audio/video input, and @var{sink filters} that will not have audio/video
54 @c man end FILTERING INTRODUCTION
57 @c man begin GRAPH2DOT
59 The @file{graph2dot} program included in the FFmpeg @file{tools}
60 directory can be used to parse a filtergraph description and issue a
61 corresponding textual representation in the dot language.
68 to see how to use @file{graph2dot}.
70 You can then pass the dot description to the @file{dot} program (from
71 the graphviz suite of programs) and obtain a graphical representation
74 For example the sequence of commands:
76 echo @var{GRAPH_DESCRIPTION} | \
77 tools/graph2dot -o graph.tmp && \
78 dot -Tpng graph.tmp -o graph.png && \
82 can be used to create and display an image representing the graph
83 described by the @var{GRAPH_DESCRIPTION} string. Note that this string must be
84 a complete self-contained graph, with its inputs and outputs explicitly defined.
85 For example if your command line is of the form:
87 ffmpeg -i infile -vf scale=640:360 outfile
89 your @var{GRAPH_DESCRIPTION} string will need to be of the form:
91 nullsrc,scale=640:360,nullsink
93 you may also need to set the @var{nullsrc} parameters and add a @var{format}
94 filter in order to simulate a specific input file.
98 @chapter Filtergraph description
99 @c man begin FILTERGRAPH DESCRIPTION
101 A filtergraph is a directed graph of connected filters. It can contain
102 cycles, and there can be multiple links between a pair of
103 filters. Each link has one input pad on one side connecting it to one
104 filter from which it takes its input, and one output pad on the other
105 side connecting it to one filter accepting its output.
107 Each filter in a filtergraph is an instance of a filter class
108 registered in the application, which defines the features and the
109 number of input and output pads of the filter.
111 A filter with no input pads is called a "source", and a filter with no
112 output pads is called a "sink".
114 @anchor{Filtergraph syntax}
115 @section Filtergraph syntax
117 A filtergraph has a textual representation, which is recognized by the
118 @option{-filter}/@option{-vf}/@option{-af} and
119 @option{-filter_complex} options in @command{ffmpeg} and
120 @option{-vf}/@option{-af} in @command{ffplay}, and by the
121 @code{avfilter_graph_parse_ptr()} function defined in
122 @file{libavfilter/avfilter.h}.
124 A filterchain consists of a sequence of connected filters, each one
125 connected to the previous one in the sequence. A filterchain is
126 represented by a list of ","-separated filter descriptions.
128 A filtergraph consists of a sequence of filterchains. A sequence of
129 filterchains is represented by a list of ";"-separated filterchain
132 A filter is represented by a string of the form:
133 [@var{in_link_1}]...[@var{in_link_N}]@var{filter_name}@@@var{id}=@var{arguments}[@var{out_link_1}]...[@var{out_link_M}]
135 @var{filter_name} is the name of the filter class of which the
136 described filter is an instance of, and has to be the name of one of
137 the filter classes registered in the program optionally followed by "@@@var{id}".
138 The name of the filter class is optionally followed by a string
141 @var{arguments} is a string which contains the parameters used to
142 initialize the filter instance. It may have one of two forms:
146 A ':'-separated list of @var{key=value} pairs.
149 A ':'-separated list of @var{value}. In this case, the keys are assumed to be
150 the option names in the order they are declared. E.g. the @code{fade} filter
151 declares three options in this order -- @option{type}, @option{start_frame} and
152 @option{nb_frames}. Then the parameter list @var{in:0:30} means that the value
153 @var{in} is assigned to the option @option{type}, @var{0} to
154 @option{start_frame} and @var{30} to @option{nb_frames}.
157 A ':'-separated list of mixed direct @var{value} and long @var{key=value}
158 pairs. The direct @var{value} must precede the @var{key=value} pairs, and
159 follow the same constraints order of the previous point. The following
160 @var{key=value} pairs can be set in any preferred order.
164 If the option value itself is a list of items (e.g. the @code{format} filter
165 takes a list of pixel formats), the items in the list are usually separated by
168 The list of arguments can be quoted using the character @samp{'} as initial
169 and ending mark, and the character @samp{\} for escaping the characters
170 within the quoted text; otherwise the argument string is considered
171 terminated when the next special character (belonging to the set
172 @samp{[]=;,}) is encountered.
174 The name and arguments of the filter are optionally preceded and
175 followed by a list of link labels.
176 A link label allows one to name a link and associate it to a filter output
177 or input pad. The preceding labels @var{in_link_1}
178 ... @var{in_link_N}, are associated to the filter input pads,
179 the following labels @var{out_link_1} ... @var{out_link_M}, are
180 associated to the output pads.
182 When two link labels with the same name are found in the
183 filtergraph, a link between the corresponding input and output pad is
186 If an output pad is not labelled, it is linked by default to the first
187 unlabelled input pad of the next filter in the filterchain.
188 For example in the filterchain
190 nullsrc, split[L1], [L2]overlay, nullsink
192 the split filter instance has two output pads, and the overlay filter
193 instance two input pads. The first output pad of split is labelled
194 "L1", the first input pad of overlay is labelled "L2", and the second
195 output pad of split is linked to the second input pad of overlay,
196 which are both unlabelled.
198 In a filter description, if the input label of the first filter is not
199 specified, "in" is assumed; if the output label of the last filter is not
200 specified, "out" is assumed.
202 In a complete filterchain all the unlabelled filter input and output
203 pads must be connected. A filtergraph is considered valid if all the
204 filter input and output pads of all the filterchains are connected.
206 Libavfilter will automatically insert @ref{scale} filters where format
207 conversion is required. It is possible to specify swscale flags
208 for those automatically inserted scalers by prepending
209 @code{sws_flags=@var{flags};}
210 to the filtergraph description.
212 Here is a BNF description of the filtergraph syntax:
214 @var{NAME} ::= sequence of alphanumeric characters and '_'
215 @var{FILTER_NAME} ::= @var{NAME}["@@"@var{NAME}]
216 @var{LINKLABEL} ::= "[" @var{NAME} "]"
217 @var{LINKLABELS} ::= @var{LINKLABEL} [@var{LINKLABELS}]
218 @var{FILTER_ARGUMENTS} ::= sequence of chars (possibly quoted)
219 @var{FILTER} ::= [@var{LINKLABELS}] @var{FILTER_NAME} ["=" @var{FILTER_ARGUMENTS}] [@var{LINKLABELS}]
220 @var{FILTERCHAIN} ::= @var{FILTER} [,@var{FILTERCHAIN}]
221 @var{FILTERGRAPH} ::= [sws_flags=@var{flags};] @var{FILTERCHAIN} [;@var{FILTERGRAPH}]
224 @anchor{filtergraph escaping}
225 @section Notes on filtergraph escaping
227 Filtergraph description composition entails several levels of
228 escaping. See @ref{quoting_and_escaping,,the "Quoting and escaping"
229 section in the ffmpeg-utils(1) manual,ffmpeg-utils} for more
230 information about the employed escaping procedure.
232 A first level escaping affects the content of each filter option
233 value, which may contain the special character @code{:} used to
234 separate values, or one of the escaping characters @code{\'}.
236 A second level escaping affects the whole filter description, which
237 may contain the escaping characters @code{\'} or the special
238 characters @code{[],;} used by the filtergraph description.
240 Finally, when you specify a filtergraph on a shell commandline, you
241 need to perform a third level escaping for the shell special
242 characters contained within it.
244 For example, consider the following string to be embedded in
245 the @ref{drawtext} filter description @option{text} value:
247 this is a 'string': may contain one, or more, special characters
250 This string contains the @code{'} special escaping character, and the
251 @code{:} special character, so it needs to be escaped in this way:
253 text=this is a \'string\'\: may contain one, or more, special characters
256 A second level of escaping is required when embedding the filter
257 description in a filtergraph description, in order to escape all the
258 filtergraph special characters. Thus the example above becomes:
260 drawtext=text=this is a \\\'string\\\'\\: may contain one\, or more\, special characters
262 (note that in addition to the @code{\'} escaping special characters,
263 also @code{,} needs to be escaped).
265 Finally an additional level of escaping is needed when writing the
266 filtergraph description in a shell command, which depends on the
267 escaping rules of the adopted shell. For example, assuming that
268 @code{\} is special and needs to be escaped with another @code{\}, the
269 previous string will finally result in:
271 -vf "drawtext=text=this is a \\\\\\'string\\\\\\'\\\\: may contain one\\, or more\\, special characters"
274 @chapter Timeline editing
276 Some filters support a generic @option{enable} option. For the filters
277 supporting timeline editing, this option can be set to an expression which is
278 evaluated before sending a frame to the filter. If the evaluation is non-zero,
279 the filter will be enabled, otherwise the frame will be sent unchanged to the
280 next filter in the filtergraph.
282 The expression accepts the following values:
285 timestamp expressed in seconds, NAN if the input timestamp is unknown
288 sequential number of the input frame, starting from 0
291 the position in the file of the input frame, NAN if unknown
295 width and height of the input frame if video
298 Additionally, these filters support an @option{enable} command that can be used
299 to re-define the expression.
301 Like any other filtering option, the @option{enable} option follows the same
304 For example, to enable a blur filter (@ref{smartblur}) from 10 seconds to 3
305 minutes, and a @ref{curves} filter starting at 3 seconds:
307 smartblur = enable='between(t,10,3*60)',
308 curves = enable='gte(t,3)' : preset=cross_process
311 See @code{ffmpeg -filters} to view which filters have timeline support.
313 @c man end FILTERGRAPH DESCRIPTION
316 @chapter Options for filters with several inputs (framesync)
317 @c man begin OPTIONS FOR FILTERS WITH SEVERAL INPUTS
319 Some filters with several inputs support a common set of options.
320 These options can only be set by name, not with the short notation.
324 The action to take when EOF is encountered on the secondary input; it accepts
325 one of the following values:
329 Repeat the last frame (the default).
333 Pass the main input through.
337 If set to 1, force the output to terminate when the shortest input
338 terminates. Default value is 0.
341 If set to 1, force the filter to extend the last frame of secondary streams
342 until the end of the primary stream. A value of 0 disables this behavior.
346 @c man end OPTIONS FOR FILTERS WITH SEVERAL INPUTS
348 @chapter Audio Filters
349 @c man begin AUDIO FILTERS
351 When you configure your FFmpeg build, you can disable any of the
352 existing filters using @code{--disable-filters}.
353 The configure output will show the audio filters included in your
356 Below is a description of the currently available audio filters.
360 A compressor is mainly used to reduce the dynamic range of a signal.
361 Especially modern music is mostly compressed at a high ratio to
362 improve the overall loudness. It's done to get the highest attention
363 of a listener, "fatten" the sound and bring more "power" to the track.
364 If a signal is compressed too much it may sound dull or "dead"
365 afterwards or it may start to "pump" (which could be a powerful effect
366 but can also destroy a track completely).
367 The right compression is the key to reach a professional sound and is
368 the high art of mixing and mastering. Because of its complex settings
369 it may take a long time to get the right feeling for this kind of effect.
371 Compression is done by detecting the volume above a chosen level
372 @code{threshold} and dividing it by the factor set with @code{ratio}.
373 So if you set the threshold to -12dB and your signal reaches -6dB a ratio
374 of 2:1 will result in a signal at -9dB. Because an exact manipulation of
375 the signal would cause distortion of the waveform the reduction can be
376 levelled over the time. This is done by setting "Attack" and "Release".
377 @code{attack} determines how long the signal has to rise above the threshold
378 before any reduction will occur and @code{release} sets the time the signal
379 has to fall below the threshold to reduce the reduction again. Shorter signals
380 than the chosen attack time will be left untouched.
381 The overall reduction of the signal can be made up afterwards with the
382 @code{makeup} setting. So compressing the peaks of a signal about 6dB and
383 raising the makeup to this level results in a signal twice as loud than the
384 source. To gain a softer entry in the compression the @code{knee} flattens the
385 hard edge at the threshold in the range of the chosen decibels.
387 The filter accepts the following options:
391 Set input gain. Default is 1. Range is between 0.015625 and 64.
394 Set mode of compressor operation. Can be @code{upward} or @code{downward}.
395 Default is @code{downward}.
398 If a signal of stream rises above this level it will affect the gain
400 By default it is 0.125. Range is between 0.00097563 and 1.
403 Set a ratio by which the signal is reduced. 1:2 means that if the level
404 rose 4dB above the threshold, it will be only 2dB above after the reduction.
405 Default is 2. Range is between 1 and 20.
408 Amount of milliseconds the signal has to rise above the threshold before gain
409 reduction starts. Default is 20. Range is between 0.01 and 2000.
412 Amount of milliseconds the signal has to fall below the threshold before
413 reduction is decreased again. Default is 250. Range is between 0.01 and 9000.
416 Set the amount by how much signal will be amplified after processing.
417 Default is 1. Range is from 1 to 64.
420 Curve the sharp knee around the threshold to enter gain reduction more softly.
421 Default is 2.82843. Range is between 1 and 8.
424 Choose if the @code{average} level between all channels of input stream
425 or the louder(@code{maximum}) channel of input stream affects the
426 reduction. Default is @code{average}.
429 Should the exact signal be taken in case of @code{peak} or an RMS one in case
430 of @code{rms}. Default is @code{rms} which is mostly smoother.
433 How much to use compressed signal in output. Default is 1.
434 Range is between 0 and 1.
438 Simple audio dynamic range compression/expansion filter.
440 The filter accepts the following options:
444 Set contrast. Default is 33. Allowed range is between 0 and 100.
449 Copy the input audio source unchanged to the output. This is mainly useful for
454 Apply cross fade from one input audio stream to another input audio stream.
455 The cross fade is applied for specified duration near the end of first stream.
457 The filter accepts the following options:
461 Specify the number of samples for which the cross fade effect has to last.
462 At the end of the cross fade effect the first input audio will be completely
463 silent. Default is 44100.
466 Specify the duration of the cross fade effect. See
467 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
468 for the accepted syntax.
469 By default the duration is determined by @var{nb_samples}.
470 If set this option is used instead of @var{nb_samples}.
473 Should first stream end overlap with second stream start. Default is enabled.
476 Set curve for cross fade transition for first stream.
479 Set curve for cross fade transition for second stream.
481 For description of available curve types see @ref{afade} filter description.
488 Cross fade from one input to another:
490 ffmpeg -i first.flac -i second.flac -filter_complex acrossfade=d=10:c1=exp:c2=exp output.flac
494 Cross fade from one input to another but without overlapping:
496 ffmpeg -i first.flac -i second.flac -filter_complex acrossfade=d=10:o=0:c1=exp:c2=exp output.flac
501 Split audio stream into several bands.
503 This filter splits audio stream into two or more frequency ranges.
504 Summing all streams back will give flat output.
506 The filter accepts the following options:
510 Set split frequencies. Those must be positive and increasing.
513 Set filter order, can be @var{2nd}, @var{4th} or @var{8th}.
514 Default is @var{4th}.
519 Reduce audio bit resolution.
521 This filter is bit crusher with enhanced functionality. A bit crusher
522 is used to audibly reduce number of bits an audio signal is sampled
523 with. This doesn't change the bit depth at all, it just produces the
524 effect. Material reduced in bit depth sounds more harsh and "digital".
525 This filter is able to even round to continuous values instead of discrete
527 Additionally it has a D/C offset which results in different crushing of
528 the lower and the upper half of the signal.
529 An Anti-Aliasing setting is able to produce "softer" crushing sounds.
531 Another feature of this filter is the logarithmic mode.
532 This setting switches from linear distances between bits to logarithmic ones.
533 The result is a much more "natural" sounding crusher which doesn't gate low
534 signals for example. The human ear has a logarithmic perception,
535 so this kind of crushing is much more pleasant.
536 Logarithmic crushing is also able to get anti-aliased.
538 The filter accepts the following options:
554 Can be linear: @code{lin} or logarithmic: @code{log}.
563 Set sample reduction.
566 Enable LFO. By default disabled.
577 Delay audio filtering until a given wallclock timestamp. See the @ref{cue}
581 Remove impulsive noise from input audio.
583 Samples detected as impulsive noise are replaced by interpolated samples using
584 autoregressive modelling.
588 Set window size, in milliseconds. Allowed range is from @code{10} to
589 @code{100}. Default value is @code{55} milliseconds.
590 This sets size of window which will be processed at once.
593 Set window overlap, in percentage of window size. Allowed range is from
594 @code{50} to @code{95}. Default value is @code{75} percent.
595 Setting this to a very high value increases impulsive noise removal but makes
596 whole process much slower.
599 Set autoregression order, in percentage of window size. Allowed range is from
600 @code{0} to @code{25}. Default value is @code{2} percent. This option also
601 controls quality of interpolated samples using neighbour good samples.
604 Set threshold value. Allowed range is from @code{1} to @code{100}.
605 Default value is @code{2}.
606 This controls the strength of impulsive noise which is going to be removed.
607 The lower value, the more samples will be detected as impulsive noise.
610 Set burst fusion, in percentage of window size. Allowed range is @code{0} to
611 @code{10}. Default value is @code{2}.
612 If any two samples detected as noise are spaced less than this value then any
613 sample between those two samples will be also detected as noise.
618 It accepts the following values:
621 Select overlap-add method. Even not interpolated samples are slightly
622 changed with this method.
625 Select overlap-save method. Not interpolated samples remain unchanged.
628 Default value is @code{a}.
632 Remove clipped samples from input audio.
634 Samples detected as clipped are replaced by interpolated samples using
635 autoregressive modelling.
639 Set window size, in milliseconds. Allowed range is from @code{10} to @code{100}.
640 Default value is @code{55} milliseconds.
641 This sets size of window which will be processed at once.
644 Set window overlap, in percentage of window size. Allowed range is from @code{50}
645 to @code{95}. Default value is @code{75} percent.
648 Set autoregression order, in percentage of window size. Allowed range is from
649 @code{0} to @code{25}. Default value is @code{8} percent. This option also controls
650 quality of interpolated samples using neighbour good samples.
653 Set threshold value. Allowed range is from @code{1} to @code{100}.
654 Default value is @code{10}. Higher values make clip detection less aggressive.
657 Set size of histogram used to detect clips. Allowed range is from @code{100} to @code{9999}.
658 Default value is @code{1000}. Higher values make clip detection less aggressive.
663 It accepts the following values:
666 Select overlap-add method. Even not interpolated samples are slightly changed
670 Select overlap-save method. Not interpolated samples remain unchanged.
673 Default value is @code{a}.
678 Delay one or more audio channels.
680 Samples in delayed channel are filled with silence.
682 The filter accepts the following option:
686 Set list of delays in milliseconds for each channel separated by '|'.
687 Unused delays will be silently ignored. If number of given delays is
688 smaller than number of channels all remaining channels will not be delayed.
689 If you want to delay exact number of samples, append 'S' to number.
690 If you want instead to delay in seconds, append 's' to number.
697 Delay first channel by 1.5 seconds, the third channel by 0.5 seconds and leave
698 the second channel (and any other channels that may be present) unchanged.
704 Delay second channel by 500 samples, the third channel by 700 samples and leave
705 the first channel (and any other channels that may be present) unchanged.
711 @section aderivative, aintegral
713 Compute derivative/integral of audio stream.
715 Applying both filters one after another produces original audio.
719 Apply echoing to the input audio.
721 Echoes are reflected sound and can occur naturally amongst mountains
722 (and sometimes large buildings) when talking or shouting; digital echo
723 effects emulate this behaviour and are often used to help fill out the
724 sound of a single instrument or vocal. The time difference between the
725 original signal and the reflection is the @code{delay}, and the
726 loudness of the reflected signal is the @code{decay}.
727 Multiple echoes can have different delays and decays.
729 A description of the accepted parameters follows.
733 Set input gain of reflected signal. Default is @code{0.6}.
736 Set output gain of reflected signal. Default is @code{0.3}.
739 Set list of time intervals in milliseconds between original signal and reflections
740 separated by '|'. Allowed range for each @code{delay} is @code{(0 - 90000.0]}.
741 Default is @code{1000}.
744 Set list of loudness of reflected signals separated by '|'.
745 Allowed range for each @code{decay} is @code{(0 - 1.0]}.
746 Default is @code{0.5}.
753 Make it sound as if there are twice as many instruments as are actually playing:
755 aecho=0.8:0.88:60:0.4
759 If delay is very short, then it sounds like a (metallic) robot playing music:
765 A longer delay will sound like an open air concert in the mountains:
767 aecho=0.8:0.9:1000:0.3
771 Same as above but with one more mountain:
773 aecho=0.8:0.9:1000|1800:0.3|0.25
778 Audio emphasis filter creates or restores material directly taken from LPs or
779 emphased CDs with different filter curves. E.g. to store music on vinyl the
780 signal has to be altered by a filter first to even out the disadvantages of
781 this recording medium.
782 Once the material is played back the inverse filter has to be applied to
783 restore the distortion of the frequency response.
785 The filter accepts the following options:
795 Set filter mode. For restoring material use @code{reproduction} mode, otherwise
796 use @code{production} mode. Default is @code{reproduction} mode.
799 Set filter type. Selects medium. Can be one of the following:
811 select Compact Disc (CD).
817 select 50µs (FM-KF).
819 select 75µs (FM-KF).
825 Modify an audio signal according to the specified expressions.
827 This filter accepts one or more expressions (one for each channel),
828 which are evaluated and used to modify a corresponding audio signal.
830 It accepts the following parameters:
834 Set the '|'-separated expressions list for each separate channel. If
835 the number of input channels is greater than the number of
836 expressions, the last specified expression is used for the remaining
839 @item channel_layout, c
840 Set output channel layout. If not specified, the channel layout is
841 specified by the number of expressions. If set to @samp{same}, it will
842 use by default the same input channel layout.
845 Each expression in @var{exprs} can contain the following constants and functions:
849 channel number of the current expression
852 number of the evaluated sample, starting from 0
858 time of the evaluated sample expressed in seconds
861 @item nb_out_channels
862 input and output number of channels
865 the value of input channel with number @var{CH}
868 Note: this filter is slow. For faster processing you should use a
877 aeval=val(ch)/2:c=same
881 Invert phase of the second channel:
890 Apply fade-in/out effect to input audio.
892 A description of the accepted parameters follows.
896 Specify the effect type, can be either @code{in} for fade-in, or
897 @code{out} for a fade-out effect. Default is @code{in}.
899 @item start_sample, ss
900 Specify the number of the start sample for starting to apply the fade
901 effect. Default is 0.
904 Specify the number of samples for which the fade effect has to last. At
905 the end of the fade-in effect the output audio will have the same
906 volume as the input audio, at the end of the fade-out transition
907 the output audio will be silence. Default is 44100.
910 Specify the start time of the fade effect. Default is 0.
911 The value must be specified as a time duration; see
912 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
913 for the accepted syntax.
914 If set this option is used instead of @var{start_sample}.
917 Specify the duration of the fade effect. See
918 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
919 for the accepted syntax.
920 At the end of the fade-in effect the output audio will have the same
921 volume as the input audio, at the end of the fade-out transition
922 the output audio will be silence.
923 By default the duration is determined by @var{nb_samples}.
924 If set this option is used instead of @var{nb_samples}.
927 Set curve for fade transition.
929 It accepts the following values:
932 select triangular, linear slope (default)
934 select quarter of sine wave
936 select half of sine wave
938 select exponential sine wave
942 select inverted parabola
956 select inverted quarter of sine wave
958 select inverted half of sine wave
960 select double-exponential seat
962 select double-exponential sigmoid
964 select logistic sigmoid
974 Fade in first 15 seconds of audio:
980 Fade out last 25 seconds of a 900 seconds audio:
982 afade=t=out:st=875:d=25
987 Denoise audio samples with FFT.
989 A description of the accepted parameters follows.
993 Set the noise reduction in dB, allowed range is 0.01 to 97.
994 Default value is 12 dB.
997 Set the noise floor in dB, allowed range is -80 to -20.
998 Default value is -50 dB.
1003 It accepts the following values:
1012 Select shellac noise.
1015 Select custom noise, defined in @code{bn} option.
1017 Default value is white noise.
1021 Set custom band noise for every one of 15 bands.
1022 Bands are separated by ' ' or '|'.
1025 Set the residual floor in dB, allowed range is -80 to -20.
1026 Default value is -38 dB.
1029 Enable noise tracking. By default is disabled.
1030 With this enabled, noise floor is automatically adjusted.
1033 Enable residual tracking. By default is disabled.
1036 Set the output mode.
1038 It accepts the following values:
1041 Pass input unchanged.
1044 Pass noise filtered out.
1049 Default value is @var{o}.
1053 @subsection Commands
1055 This filter supports the following commands:
1057 @item sample_noise, sn
1058 Start or stop measuring noise profile.
1059 Syntax for the command is : "start" or "stop" string.
1060 After measuring noise profile is stopped it will be
1061 automatically applied in filtering.
1063 @item noise_reduction, nr
1064 Change noise reduction. Argument is single float number.
1065 Syntax for the command is : "@var{noise_reduction}"
1067 @item noise_floor, nf
1068 Change noise floor. Argument is single float number.
1069 Syntax for the command is : "@var{noise_floor}"
1071 @item output_mode, om
1072 Change output mode operation.
1073 Syntax for the command is : "i", "o" or "n" string.
1077 Apply arbitrary expressions to samples in frequency domain.
1081 Set frequency domain real expression for each separate channel separated
1082 by '|'. Default is "re".
1083 If the number of input channels is greater than the number of
1084 expressions, the last specified expression is used for the remaining
1088 Set frequency domain imaginary expression for each separate channel
1089 separated by '|'. Default is "im".
1091 Each expression in @var{real} and @var{imag} can contain the following
1092 constants and functions:
1099 current frequency bin number
1102 number of available bins
1105 channel number of the current expression
1114 current real part of frequency bin of current channel
1117 current imaginary part of frequency bin of current channel
1120 Return the value of real part of frequency bin at location (@var{bin},@var{channel})
1123 Return the value of imaginary part of frequency bin at location (@var{bin},@var{channel})
1127 Set window size. Allowed range is from 16 to 131072.
1128 Default is @code{4096}
1131 Set window function. Default is @code{hann}.
1134 Set window overlap. If set to 1, the recommended overlap for selected
1135 window function will be picked. Default is @code{0.75}.
1138 @subsection Examples
1142 Leave almost only low frequencies in audio:
1144 afftfilt="'real=re * (1-clip((b/nb)*b,0,1))':imag='im * (1-clip((b/nb)*b,0,1))'"
1151 Apply an arbitrary Frequency Impulse Response filter.
1153 This filter is designed for applying long FIR filters,
1154 up to 60 seconds long.
1156 It can be used as component for digital crossover filters,
1157 room equalization, cross talk cancellation, wavefield synthesis,
1158 auralization, ambiophonics, ambisonics and spatialization.
1160 This filter uses the second stream as FIR coefficients.
1161 If the second stream holds a single channel, it will be used
1162 for all input channels in the first stream, otherwise
1163 the number of channels in the second stream must be same as
1164 the number of channels in the first stream.
1166 It accepts the following parameters:
1170 Set dry gain. This sets input gain.
1173 Set wet gain. This sets final output gain.
1176 Set Impulse Response filter length. Default is 1, which means whole IR is processed.
1179 Enable applying gain measured from power of IR.
1181 Set which approach to use for auto gain measurement.
1185 Do not apply any gain.
1188 select peak gain, very conservative approach. This is default value.
1191 select DC gain, limited application.
1194 select gain to noise approach, this is most popular one.
1198 Set gain to be applied to IR coefficients before filtering.
1199 Allowed range is 0 to 1. This gain is applied after any gain applied with @var{gtype} option.
1202 Set format of IR stream. Can be @code{mono} or @code{input}.
1203 Default is @code{input}.
1206 Set max allowed Impulse Response filter duration in seconds. Default is 30 seconds.
1207 Allowed range is 0.1 to 60 seconds.
1210 Show IR frequency response, magnitude(magenta), phase(green) and group delay(yellow) in additional video stream.
1211 By default it is disabled.
1214 Set for which IR channel to display frequency response. By default is first channel
1215 displayed. This option is used only when @var{response} is enabled.
1218 Set video stream size. This option is used only when @var{response} is enabled.
1221 Set video stream frame rate. This option is used only when @var{response} is enabled.
1224 Set minimal partition size used for convolution. Default is @var{8192}.
1225 Allowed range is from @var{8} to @var{32768}.
1226 Lower values decreases latency at cost of higher CPU usage.
1229 Set maximal partition size used for convolution. Default is @var{8192}.
1230 Allowed range is from @var{8} to @var{32768}.
1231 Lower values may increase CPU usage.
1234 @subsection Examples
1238 Apply reverb to stream using mono IR file as second input, complete command using ffmpeg:
1240 ffmpeg -i input.wav -i middle_tunnel_1way_mono.wav -lavfi afir output.wav
1247 Set output format constraints for the input audio. The framework will
1248 negotiate the most appropriate format to minimize conversions.
1250 It accepts the following parameters:
1254 A '|'-separated list of requested sample formats.
1257 A '|'-separated list of requested sample rates.
1259 @item channel_layouts
1260 A '|'-separated list of requested channel layouts.
1262 See @ref{channel layout syntax,,the Channel Layout section in the ffmpeg-utils(1) manual,ffmpeg-utils}
1263 for the required syntax.
1266 If a parameter is omitted, all values are allowed.
1268 Force the output to either unsigned 8-bit or signed 16-bit stereo
1270 aformat=sample_fmts=u8|s16:channel_layouts=stereo
1275 A gate is mainly used to reduce lower parts of a signal. This kind of signal
1276 processing reduces disturbing noise between useful signals.
1278 Gating is done by detecting the volume below a chosen level @var{threshold}
1279 and dividing it by the factor set with @var{ratio}. The bottom of the noise
1280 floor is set via @var{range}. Because an exact manipulation of the signal
1281 would cause distortion of the waveform the reduction can be levelled over
1282 time. This is done by setting @var{attack} and @var{release}.
1284 @var{attack} determines how long the signal has to fall below the threshold
1285 before any reduction will occur and @var{release} sets the time the signal
1286 has to rise above the threshold to reduce the reduction again.
1287 Shorter signals than the chosen attack time will be left untouched.
1291 Set input level before filtering.
1292 Default is 1. Allowed range is from 0.015625 to 64.
1295 Set the mode of operation. Can be @code{upward} or @code{downward}.
1296 Default is @code{downward}. If set to @code{upward} mode, higher parts of signal
1297 will be amplified, expanding dynamic range in upward direction.
1298 Otherwise, in case of @code{downward} lower parts of signal will be reduced.
1301 Set the level of gain reduction when the signal is below the threshold.
1302 Default is 0.06125. Allowed range is from 0 to 1.
1303 Setting this to 0 disables reduction and then filter behaves like expander.
1306 If a signal rises above this level the gain reduction is released.
1307 Default is 0.125. Allowed range is from 0 to 1.
1310 Set a ratio by which the signal is reduced.
1311 Default is 2. Allowed range is from 1 to 9000.
1314 Amount of milliseconds the signal has to rise above the threshold before gain
1316 Default is 20 milliseconds. Allowed range is from 0.01 to 9000.
1319 Amount of milliseconds the signal has to fall below the threshold before the
1320 reduction is increased again. Default is 250 milliseconds.
1321 Allowed range is from 0.01 to 9000.
1324 Set amount of amplification of signal after processing.
1325 Default is 1. Allowed range is from 1 to 64.
1328 Curve the sharp knee around the threshold to enter gain reduction more softly.
1329 Default is 2.828427125. Allowed range is from 1 to 8.
1332 Choose if exact signal should be taken for detection or an RMS like one.
1333 Default is @code{rms}. Can be @code{peak} or @code{rms}.
1336 Choose if the average level between all channels or the louder channel affects
1338 Default is @code{average}. Can be @code{average} or @code{maximum}.
1343 Apply an arbitrary Infinite Impulse Response filter.
1345 It accepts the following parameters:
1349 Set numerator/zeros coefficients.
1352 Set denominator/poles coefficients.
1364 Set coefficients format.
1370 Z-plane zeros/poles, cartesian (default)
1372 Z-plane zeros/poles, polar radians
1374 Z-plane zeros/poles, polar degrees
1378 Set kind of processing.
1379 Can be @code{d} - direct or @code{s} - serial cascading. Default is @code{s}.
1382 Set filtering precision.
1386 double-precision floating-point (default)
1388 single-precision floating-point
1396 How much to use filtered signal in output. Default is 1.
1397 Range is between 0 and 1.
1400 Show IR frequency response, magnitude(magenta), phase(green) and group delay(yellow) in additional video stream.
1401 By default it is disabled.
1404 Set for which IR channel to display frequency response. By default is first channel
1405 displayed. This option is used only when @var{response} is enabled.
1408 Set video stream size. This option is used only when @var{response} is enabled.
1411 Coefficients in @code{tf} format are separated by spaces and are in ascending
1414 Coefficients in @code{zp} format are separated by spaces and order of coefficients
1415 doesn't matter. Coefficients in @code{zp} format are complex numbers with @var{i}
1418 Different coefficients and gains can be provided for every channel, in such case
1419 use '|' to separate coefficients or gains. Last provided coefficients will be
1420 used for all remaining channels.
1422 @subsection Examples
1426 Apply 2 pole elliptic notch at around 5000Hz for 48000 Hz sample rate:
1428 aiir=k=1:z=7.957584807809675810E-1 -2.575128568908332300 3.674839853930788710 -2.57512875289799137 7.957586296317130880E-1:p=1 -2.86950072432325953 3.63022088054647218 -2.28075678147272232 6.361362326477423500E-1:f=tf:r=d
1432 Same as above but in @code{zp} format:
1434 aiir=k=0.79575848078096756:z=0.80918701+0.58773007i 0.80918701-0.58773007i 0.80884700+0.58784055i 0.80884700-0.58784055i:p=0.63892345+0.59951235i 0.63892345-0.59951235i 0.79582691+0.44198673i 0.79582691-0.44198673i:f=zp:r=s
1440 The limiter prevents an input signal from rising over a desired threshold.
1441 This limiter uses lookahead technology to prevent your signal from distorting.
1442 It means that there is a small delay after the signal is processed. Keep in mind
1443 that the delay it produces is the attack time you set.
1445 The filter accepts the following options:
1449 Set input gain. Default is 1.
1452 Set output gain. Default is 1.
1455 Don't let signals above this level pass the limiter. Default is 1.
1458 The limiter will reach its attenuation level in this amount of time in
1459 milliseconds. Default is 5 milliseconds.
1462 Come back from limiting to attenuation 1.0 in this amount of milliseconds.
1463 Default is 50 milliseconds.
1466 When gain reduction is always needed ASC takes care of releasing to an
1467 average reduction level rather than reaching a reduction of 0 in the release
1471 Select how much the release time is affected by ASC, 0 means nearly no changes
1472 in release time while 1 produces higher release times.
1475 Auto level output signal. Default is enabled.
1476 This normalizes audio back to 0dB if enabled.
1479 Depending on picked setting it is recommended to upsample input 2x or 4x times
1480 with @ref{aresample} before applying this filter.
1484 Apply a two-pole all-pass filter with central frequency (in Hz)
1485 @var{frequency}, and filter-width @var{width}.
1486 An all-pass filter changes the audio's frequency to phase relationship
1487 without changing its frequency to amplitude relationship.
1489 The filter accepts the following options:
1493 Set frequency in Hz.
1496 Set method to specify band-width of filter.
1511 Specify the band-width of a filter in width_type units.
1514 How much to use filtered signal in output. Default is 1.
1515 Range is between 0 and 1.
1518 Specify which channels to filter, by default all available are filtered.
1521 @subsection Commands
1523 This filter supports the following commands:
1526 Change allpass frequency.
1527 Syntax for the command is : "@var{frequency}"
1530 Change allpass width_type.
1531 Syntax for the command is : "@var{width_type}"
1534 Change allpass width.
1535 Syntax for the command is : "@var{width}"
1539 Syntax for the command is : "@var{mix}"
1546 The filter accepts the following options:
1550 Set the number of loops. Setting this value to -1 will result in infinite loops.
1554 Set maximal number of samples. Default is 0.
1557 Set first sample of loop. Default is 0.
1563 Merge two or more audio streams into a single multi-channel stream.
1565 The filter accepts the following options:
1570 Set the number of inputs. Default is 2.
1574 If the channel layouts of the inputs are disjoint, and therefore compatible,
1575 the channel layout of the output will be set accordingly and the channels
1576 will be reordered as necessary. If the channel layouts of the inputs are not
1577 disjoint, the output will have all the channels of the first input then all
1578 the channels of the second input, in that order, and the channel layout of
1579 the output will be the default value corresponding to the total number of
1582 For example, if the first input is in 2.1 (FL+FR+LF) and the second input
1583 is FC+BL+BR, then the output will be in 5.1, with the channels in the
1584 following order: a1, a2, b1, a3, b2, b3 (a1 is the first channel of the
1585 first input, b1 is the first channel of the second input).
1587 On the other hand, if both input are in stereo, the output channels will be
1588 in the default order: a1, a2, b1, b2, and the channel layout will be
1589 arbitrarily set to 4.0, which may or may not be the expected value.
1591 All inputs must have the same sample rate, and format.
1593 If inputs do not have the same duration, the output will stop with the
1596 @subsection Examples
1600 Merge two mono files into a stereo stream:
1602 amovie=left.wav [l] ; amovie=right.mp3 [r] ; [l] [r] amerge
1606 Multiple merges assuming 1 video stream and 6 audio streams in @file{input.mkv}:
1608 ffmpeg -i input.mkv -filter_complex "[0:1][0:2][0:3][0:4][0:5][0:6] amerge=inputs=6" -c:a pcm_s16le output.mkv
1614 Mixes multiple audio inputs into a single output.
1616 Note that this filter only supports float samples (the @var{amerge}
1617 and @var{pan} audio filters support many formats). If the @var{amix}
1618 input has integer samples then @ref{aresample} will be automatically
1619 inserted to perform the conversion to float samples.
1623 ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex amix=inputs=3:duration=first:dropout_transition=3 OUTPUT
1625 will mix 3 input audio streams to a single output with the same duration as the
1626 first input and a dropout transition time of 3 seconds.
1628 It accepts the following parameters:
1632 The number of inputs. If unspecified, it defaults to 2.
1635 How to determine the end-of-stream.
1639 The duration of the longest input. (default)
1642 The duration of the shortest input.
1645 The duration of the first input.
1649 @item dropout_transition
1650 The transition time, in seconds, for volume renormalization when an input
1651 stream ends. The default value is 2 seconds.
1654 Specify weight of each input audio stream as sequence.
1655 Each weight is separated by space. By default all inputs have same weight.
1660 Multiply first audio stream with second audio stream and store result
1661 in output audio stream. Multiplication is done by multiplying each
1662 sample from first stream with sample at same position from second stream.
1664 With this element-wise multiplication one can create amplitude fades and
1665 amplitude modulations.
1667 @section anequalizer
1669 High-order parametric multiband equalizer for each channel.
1671 It accepts the following parameters:
1675 This option string is in format:
1676 "c@var{chn} f=@var{cf} w=@var{w} g=@var{g} t=@var{f} | ..."
1677 Each equalizer band is separated by '|'.
1681 Set channel number to which equalization will be applied.
1682 If input doesn't have that channel the entry is ignored.
1685 Set central frequency for band.
1686 If input doesn't have that frequency the entry is ignored.
1689 Set band width in hertz.
1692 Set band gain in dB.
1695 Set filter type for band, optional, can be:
1699 Butterworth, this is default.
1710 With this option activated frequency response of anequalizer is displayed
1714 Set video stream size. Only useful if curves option is activated.
1717 Set max gain that will be displayed. Only useful if curves option is activated.
1718 Setting this to a reasonable value makes it possible to display gain which is derived from
1719 neighbour bands which are too close to each other and thus produce higher gain
1720 when both are activated.
1723 Set frequency scale used to draw frequency response in video output.
1724 Can be linear or logarithmic. Default is logarithmic.
1727 Set color for each channel curve which is going to be displayed in video stream.
1728 This is list of color names separated by space or by '|'.
1729 Unrecognised or missing colors will be replaced by white color.
1732 @subsection Examples
1736 Lower gain by 10 of central frequency 200Hz and width 100 Hz
1737 for first 2 channels using Chebyshev type 1 filter:
1739 anequalizer=c0 f=200 w=100 g=-10 t=1|c1 f=200 w=100 g=-10 t=1
1743 @subsection Commands
1745 This filter supports the following commands:
1748 Alter existing filter parameters.
1749 Syntax for the commands is : "@var{fN}|f=@var{freq}|w=@var{width}|g=@var{gain}"
1751 @var{fN} is existing filter number, starting from 0, if no such filter is available
1753 @var{freq} set new frequency parameter.
1754 @var{width} set new width parameter in herz.
1755 @var{gain} set new gain parameter in dB.
1757 Full filter invocation with asendcmd may look like this:
1758 asendcmd=c='4.0 anequalizer change 0|f=200|w=50|g=1',anequalizer=...
1763 Reduce broadband noise in audio samples using Non-Local Means algorithm.
1765 Each sample is adjusted by looking for other samples with similar contexts. This
1766 context similarity is defined by comparing their surrounding patches of size
1767 @option{p}. Patches are searched in an area of @option{r} around the sample.
1769 The filter accepts the following options:
1773 Set denoising strength. Allowed range is from 0.00001 to 10. Default value is 0.00001.
1776 Set patch radius duration. Allowed range is from 1 to 100 milliseconds.
1777 Default value is 2 milliseconds.
1780 Set research radius duration. Allowed range is from 2 to 300 milliseconds.
1781 Default value is 6 milliseconds.
1784 Set the output mode.
1786 It accepts the following values:
1789 Pass input unchanged.
1792 Pass noise filtered out.
1797 Default value is @var{o}.
1801 Set smooth factor. Default value is @var{11}. Allowed range is from @var{1} to @var{15}.
1804 @subsection Commands
1806 This filter supports the following commands:
1809 Change denoise strength. Argument is single float number.
1810 Syntax for the command is : "@var{s}"
1814 Syntax for the command is : "i", "o" or "n" string.
1819 Pass the audio source unchanged to the output.
1823 Pad the end of an audio stream with silence.
1825 This can be used together with @command{ffmpeg} @option{-shortest} to
1826 extend audio streams to the same length as the video stream.
1828 A description of the accepted options follows.
1832 Set silence packet size. Default value is 4096.
1835 Set the number of samples of silence to add to the end. After the
1836 value is reached, the stream is terminated. This option is mutually
1837 exclusive with @option{whole_len}.
1840 Set the minimum total number of samples in the output audio stream. If
1841 the value is longer than the input audio length, silence is added to
1842 the end, until the value is reached. This option is mutually exclusive
1843 with @option{pad_len}.
1846 Specify the duration of samples of silence to add. See
1847 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
1848 for the accepted syntax. Used only if set to non-zero value.
1851 Specify the minimum total duration in the output audio stream. See
1852 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
1853 for the accepted syntax. Used only if set to non-zero value. If the value is longer than
1854 the input audio length, silence is added to the end, until the value is reached.
1855 This option is mutually exclusive with @option{pad_dur}
1858 If neither the @option{pad_len} nor the @option{whole_len} nor @option{pad_dur}
1859 nor @option{whole_dur} option is set, the filter will add silence to the end of
1860 the input stream indefinitely.
1862 @subsection Examples
1866 Add 1024 samples of silence to the end of the input:
1872 Make sure the audio output will contain at least 10000 samples, pad
1873 the input with silence if required:
1875 apad=whole_len=10000
1879 Use @command{ffmpeg} to pad the audio input with silence, so that the
1880 video stream will always result the shortest and will be converted
1881 until the end in the output file when using the @option{shortest}
1884 ffmpeg -i VIDEO -i AUDIO -filter_complex "[1:0]apad" -shortest OUTPUT
1889 Add a phasing effect to the input audio.
1891 A phaser filter creates series of peaks and troughs in the frequency spectrum.
1892 The position of the peaks and troughs are modulated so that they vary over time, creating a sweeping effect.
1894 A description of the accepted parameters follows.
1898 Set input gain. Default is 0.4.
1901 Set output gain. Default is 0.74
1904 Set delay in milliseconds. Default is 3.0.
1907 Set decay. Default is 0.4.
1910 Set modulation speed in Hz. Default is 0.5.
1913 Set modulation type. Default is triangular.
1915 It accepts the following values:
1924 Audio pulsator is something between an autopanner and a tremolo.
1925 But it can produce funny stereo effects as well. Pulsator changes the volume
1926 of the left and right channel based on a LFO (low frequency oscillator) with
1927 different waveforms and shifted phases.
1928 This filter have the ability to define an offset between left and right
1929 channel. An offset of 0 means that both LFO shapes match each other.
1930 The left and right channel are altered equally - a conventional tremolo.
1931 An offset of 50% means that the shape of the right channel is exactly shifted
1932 in phase (or moved backwards about half of the frequency) - pulsator acts as
1933 an autopanner. At 1 both curves match again. Every setting in between moves the
1934 phase shift gapless between all stages and produces some "bypassing" sounds with
1935 sine and triangle waveforms. The more you set the offset near 1 (starting from
1936 the 0.5) the faster the signal passes from the left to the right speaker.
1938 The filter accepts the following options:
1942 Set input gain. By default it is 1. Range is [0.015625 - 64].
1945 Set output gain. By default it is 1. Range is [0.015625 - 64].
1948 Set waveform shape the LFO will use. Can be one of: sine, triangle, square,
1949 sawup or sawdown. Default is sine.
1952 Set modulation. Define how much of original signal is affected by the LFO.
1955 Set left channel offset. Default is 0. Allowed range is [0 - 1].
1958 Set right channel offset. Default is 0.5. Allowed range is [0 - 1].
1961 Set pulse width. Default is 1. Allowed range is [0 - 2].
1964 Set possible timing mode. Can be one of: bpm, ms or hz. Default is hz.
1967 Set bpm. Default is 120. Allowed range is [30 - 300]. Only used if timing
1971 Set ms. Default is 500. Allowed range is [10 - 2000]. Only used if timing
1975 Set frequency in Hz. Default is 2. Allowed range is [0.01 - 100]. Only used
1976 if timing is set to hz.
1982 Resample the input audio to the specified parameters, using the
1983 libswresample library. If none are specified then the filter will
1984 automatically convert between its input and output.
1986 This filter is also able to stretch/squeeze the audio data to make it match
1987 the timestamps or to inject silence / cut out audio to make it match the
1988 timestamps, do a combination of both or do neither.
1990 The filter accepts the syntax
1991 [@var{sample_rate}:]@var{resampler_options}, where @var{sample_rate}
1992 expresses a sample rate and @var{resampler_options} is a list of
1993 @var{key}=@var{value} pairs, separated by ":". See the
1994 @ref{Resampler Options,,"Resampler Options" section in the
1995 ffmpeg-resampler(1) manual,ffmpeg-resampler}
1996 for the complete list of supported options.
1998 @subsection Examples
2002 Resample the input audio to 44100Hz:
2008 Stretch/squeeze samples to the given timestamps, with a maximum of 1000
2009 samples per second compensation:
2011 aresample=async=1000
2017 Reverse an audio clip.
2019 Warning: This filter requires memory to buffer the entire clip, so trimming
2022 @subsection Examples
2026 Take the first 5 seconds of a clip, and reverse it.
2028 atrim=end=5,areverse
2032 @section asetnsamples
2034 Set the number of samples per each output audio frame.
2036 The last output packet may contain a different number of samples, as
2037 the filter will flush all the remaining samples when the input audio
2040 The filter accepts the following options:
2044 @item nb_out_samples, n
2045 Set the number of frames per each output audio frame. The number is
2046 intended as the number of samples @emph{per each channel}.
2047 Default value is 1024.
2050 If set to 1, the filter will pad the last audio frame with zeroes, so
2051 that the last frame will contain the same number of samples as the
2052 previous ones. Default value is 1.
2055 For example, to set the number of per-frame samples to 1234 and
2056 disable padding for the last frame, use:
2058 asetnsamples=n=1234:p=0
2063 Set the sample rate without altering the PCM data.
2064 This will result in a change of speed and pitch.
2066 The filter accepts the following options:
2069 @item sample_rate, r
2070 Set the output sample rate. Default is 44100 Hz.
2075 Show a line containing various information for each input audio frame.
2076 The input audio is not modified.
2078 The shown line contains a sequence of key/value pairs of the form
2079 @var{key}:@var{value}.
2081 The following values are shown in the output:
2085 The (sequential) number of the input frame, starting from 0.
2088 The presentation timestamp of the input frame, in time base units; the time base
2089 depends on the filter input pad, and is usually 1/@var{sample_rate}.
2092 The presentation timestamp of the input frame in seconds.
2095 position of the frame in the input stream, -1 if this information in
2096 unavailable and/or meaningless (for example in case of synthetic audio)
2105 The sample rate for the audio frame.
2108 The number of samples (per channel) in the frame.
2111 The Adler-32 checksum (printed in hexadecimal) of the audio data. For planar
2112 audio, the data is treated as if all the planes were concatenated.
2114 @item plane_checksums
2115 A list of Adler-32 checksums for each data plane.
2119 Apply audio soft clipping.
2121 Soft clipping is a type of distortion effect where the amplitude of a signal is saturated
2122 along a smooth curve, rather than the abrupt shape of hard-clipping.
2124 This filter accepts the following options:
2128 Set type of soft-clipping.
2130 It accepts the following values:
2142 Set additional parameter which controls sigmoid function.
2146 Automatic Speech Recognition
2148 This filter uses PocketSphinx for speech recognition. To enable
2149 compilation of this filter, you need to configure FFmpeg with
2150 @code{--enable-pocketsphinx}.
2152 It accepts the following options:
2156 Set sampling rate of input audio. Defaults is @code{16000}.
2157 This need to match speech models, otherwise one will get poor results.
2160 Set dictionary containing acoustic model files.
2163 Set pronunciation dictionary.
2166 Set language model file.
2169 Set language model set.
2172 Set which language model to use.
2175 Set output for log messages.
2178 The filter exports recognized speech as the frame metadata @code{lavfi.asr.text}.
2183 Display time domain statistical information about the audio channels.
2184 Statistics are calculated and displayed for each audio channel and,
2185 where applicable, an overall figure is also given.
2187 It accepts the following option:
2190 Short window length in seconds, used for peak and trough RMS measurement.
2191 Default is @code{0.05} (50 milliseconds). Allowed range is @code{[0.01 - 10]}.
2195 Set metadata injection. All the metadata keys are prefixed with @code{lavfi.astats.X},
2196 where @code{X} is channel number starting from 1 or string @code{Overall}. Default is
2199 Available keys for each channel are:
2241 For example full key look like this @code{lavfi.astats.1.DC_offset} or
2242 this @code{lavfi.astats.Overall.Peak_count}.
2244 For description what each key means read below.
2247 Set number of frame after which stats are going to be recalculated.
2248 Default is disabled.
2250 @item measure_perchannel
2251 Select the entries which need to be measured per channel. The metadata keys can
2252 be used as flags, default is @option{all} which measures everything.
2253 @option{none} disables all per channel measurement.
2255 @item measure_overall
2256 Select the entries which need to be measured overall. The metadata keys can
2257 be used as flags, default is @option{all} which measures everything.
2258 @option{none} disables all overall measurement.
2262 A description of each shown parameter follows:
2266 Mean amplitude displacement from zero.
2269 Minimal sample level.
2272 Maximal sample level.
2274 @item Min difference
2275 Minimal difference between two consecutive samples.
2277 @item Max difference
2278 Maximal difference between two consecutive samples.
2280 @item Mean difference
2281 Mean difference between two consecutive samples.
2282 The average of each difference between two consecutive samples.
2284 @item RMS difference
2285 Root Mean Square difference between two consecutive samples.
2289 Standard peak and RMS level measured in dBFS.
2293 Peak and trough values for RMS level measured over a short window.
2296 Standard ratio of peak to RMS level (note: not in dB).
2299 Flatness (i.e. consecutive samples with the same value) of the signal at its peak levels
2300 (i.e. either @var{Min level} or @var{Max level}).
2303 Number of occasions (not the number of samples) that the signal attained either
2304 @var{Min level} or @var{Max level}.
2307 Overall bit depth of audio. Number of bits used for each sample.
2310 Measured dynamic range of audio in dB.
2312 @item Zero crossings
2313 Number of points where the waveform crosses the zero level axis.
2315 @item Zero crossings rate
2316 Rate of Zero crossings and number of audio samples.
2323 The filter accepts exactly one parameter, the audio tempo. If not
2324 specified then the filter will assume nominal 1.0 tempo. Tempo must
2325 be in the [0.5, 100.0] range.
2327 Note that tempo greater than 2 will skip some samples rather than
2328 blend them in. If for any reason this is a concern it is always
2329 possible to daisy-chain several instances of atempo to achieve the
2330 desired product tempo.
2332 @subsection Examples
2336 Slow down audio to 80% tempo:
2342 To speed up audio to 300% tempo:
2348 To speed up audio to 300% tempo by daisy-chaining two atempo instances:
2350 atempo=sqrt(3),atempo=sqrt(3)
2356 Trim the input so that the output contains one continuous subpart of the input.
2358 It accepts the following parameters:
2361 Timestamp (in seconds) of the start of the section to keep. I.e. the audio
2362 sample with the timestamp @var{start} will be the first sample in the output.
2365 Specify time of the first audio sample that will be dropped, i.e. the
2366 audio sample immediately preceding the one with the timestamp @var{end} will be
2367 the last sample in the output.
2370 Same as @var{start}, except this option sets the start timestamp in samples
2374 Same as @var{end}, except this option sets the end timestamp in samples instead
2378 The maximum duration of the output in seconds.
2381 The number of the first sample that should be output.
2384 The number of the first sample that should be dropped.
2387 @option{start}, @option{end}, and @option{duration} are expressed as time
2388 duration specifications; see
2389 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}.
2391 Note that the first two sets of the start/end options and the @option{duration}
2392 option look at the frame timestamp, while the _sample options simply count the
2393 samples that pass through the filter. So start/end_pts and start/end_sample will
2394 give different results when the timestamps are wrong, inexact or do not start at
2395 zero. Also note that this filter does not modify the timestamps. If you wish
2396 to have the output timestamps start at zero, insert the asetpts filter after the
2399 If multiple start or end options are set, this filter tries to be greedy and
2400 keep all samples that match at least one of the specified constraints. To keep
2401 only the part that matches all the constraints at once, chain multiple atrim
2404 The defaults are such that all the input is kept. So it is possible to set e.g.
2405 just the end values to keep everything before the specified time.
2410 Drop everything except the second minute of input:
2412 ffmpeg -i INPUT -af atrim=60:120
2416 Keep only the first 1000 samples:
2418 ffmpeg -i INPUT -af atrim=end_sample=1000
2425 Apply a two-pole Butterworth band-pass filter with central
2426 frequency @var{frequency}, and (3dB-point) band-width width.
2427 The @var{csg} option selects a constant skirt gain (peak gain = Q)
2428 instead of the default: constant 0dB peak gain.
2429 The filter roll off at 6dB per octave (20dB per decade).
2431 The filter accepts the following options:
2435 Set the filter's central frequency. Default is @code{3000}.
2438 Constant skirt gain if set to 1. Defaults to 0.
2441 Set method to specify band-width of filter.
2456 Specify the band-width of a filter in width_type units.
2459 How much to use filtered signal in output. Default is 1.
2460 Range is between 0 and 1.
2463 Specify which channels to filter, by default all available are filtered.
2466 @subsection Commands
2468 This filter supports the following commands:
2471 Change bandpass frequency.
2472 Syntax for the command is : "@var{frequency}"
2475 Change bandpass width_type.
2476 Syntax for the command is : "@var{width_type}"
2479 Change bandpass width.
2480 Syntax for the command is : "@var{width}"
2483 Change bandpass mix.
2484 Syntax for the command is : "@var{mix}"
2489 Apply a two-pole Butterworth band-reject filter with central
2490 frequency @var{frequency}, and (3dB-point) band-width @var{width}.
2491 The filter roll off at 6dB per octave (20dB per decade).
2493 The filter accepts the following options:
2497 Set the filter's central frequency. Default is @code{3000}.
2500 Set method to specify band-width of filter.
2515 Specify the band-width of a filter in width_type units.
2518 How much to use filtered signal in output. Default is 1.
2519 Range is between 0 and 1.
2522 Specify which channels to filter, by default all available are filtered.
2525 @subsection Commands
2527 This filter supports the following commands:
2530 Change bandreject frequency.
2531 Syntax for the command is : "@var{frequency}"
2534 Change bandreject width_type.
2535 Syntax for the command is : "@var{width_type}"
2538 Change bandreject width.
2539 Syntax for the command is : "@var{width}"
2542 Change bandreject mix.
2543 Syntax for the command is : "@var{mix}"
2546 @section bass, lowshelf
2548 Boost or cut the bass (lower) frequencies of the audio using a two-pole
2549 shelving filter with a response similar to that of a standard
2550 hi-fi's tone-controls. This is also known as shelving equalisation (EQ).
2552 The filter accepts the following options:
2556 Give the gain at 0 Hz. Its useful range is about -20
2557 (for a large cut) to +20 (for a large boost).
2558 Beware of clipping when using a positive gain.
2561 Set the filter's central frequency and so can be used
2562 to extend or reduce the frequency range to be boosted or cut.
2563 The default value is @code{100} Hz.
2566 Set method to specify band-width of filter.
2581 Determine how steep is the filter's shelf transition.
2584 How much to use filtered signal in output. Default is 1.
2585 Range is between 0 and 1.
2588 Specify which channels to filter, by default all available are filtered.
2591 @subsection Commands
2593 This filter supports the following commands:
2596 Change bass frequency.
2597 Syntax for the command is : "@var{frequency}"
2600 Change bass width_type.
2601 Syntax for the command is : "@var{width_type}"
2605 Syntax for the command is : "@var{width}"
2609 Syntax for the command is : "@var{gain}"
2613 Syntax for the command is : "@var{mix}"
2618 Apply a biquad IIR filter with the given coefficients.
2619 Where @var{b0}, @var{b1}, @var{b2} and @var{a0}, @var{a1}, @var{a2}
2620 are the numerator and denominator coefficients respectively.
2621 and @var{channels}, @var{c} specify which channels to filter, by default all
2622 available are filtered.
2624 @subsection Commands
2626 This filter supports the following commands:
2634 Change biquad parameter.
2635 Syntax for the command is : "@var{value}"
2638 How much to use filtered signal in output. Default is 1.
2639 Range is between 0 and 1.
2643 Bauer stereo to binaural transformation, which improves headphone listening of
2644 stereo audio records.
2646 To enable compilation of this filter you need to configure FFmpeg with
2647 @code{--enable-libbs2b}.
2649 It accepts the following parameters:
2653 Pre-defined crossfeed level.
2657 Default level (fcut=700, feed=50).
2660 Chu Moy circuit (fcut=700, feed=60).
2663 Jan Meier circuit (fcut=650, feed=95).
2668 Cut frequency (in Hz).
2677 Remap input channels to new locations.
2679 It accepts the following parameters:
2682 Map channels from input to output. The argument is a '|'-separated list of
2683 mappings, each in the @code{@var{in_channel}-@var{out_channel}} or
2684 @var{in_channel} form. @var{in_channel} can be either the name of the input
2685 channel (e.g. FL for front left) or its index in the input channel layout.
2686 @var{out_channel} is the name of the output channel or its index in the output
2687 channel layout. If @var{out_channel} is not given then it is implicitly an
2688 index, starting with zero and increasing by one for each mapping.
2690 @item channel_layout
2691 The channel layout of the output stream.
2694 If no mapping is present, the filter will implicitly map input channels to
2695 output channels, preserving indices.
2697 @subsection Examples
2701 For example, assuming a 5.1+downmix input MOV file,
2703 ffmpeg -i in.mov -filter 'channelmap=map=DL-FL|DR-FR' out.wav
2705 will create an output WAV file tagged as stereo from the downmix channels of
2709 To fix a 5.1 WAV improperly encoded in AAC's native channel order
2711 ffmpeg -i in.wav -filter 'channelmap=1|2|0|5|3|4:5.1' out.wav
2715 @section channelsplit
2717 Split each channel from an input audio stream into a separate output stream.
2719 It accepts the following parameters:
2721 @item channel_layout
2722 The channel layout of the input stream. The default is "stereo".
2724 A channel layout describing the channels to be extracted as separate output streams
2725 or "all" to extract each input channel as a separate stream. The default is "all".
2727 Choosing channels not present in channel layout in the input will result in an error.
2730 @subsection Examples
2734 For example, assuming a stereo input MP3 file,
2736 ffmpeg -i in.mp3 -filter_complex channelsplit out.mkv
2738 will create an output Matroska file with two audio streams, one containing only
2739 the left channel and the other the right channel.
2742 Split a 5.1 WAV file into per-channel files:
2744 ffmpeg -i in.wav -filter_complex
2745 'channelsplit=channel_layout=5.1[FL][FR][FC][LFE][SL][SR]'
2746 -map '[FL]' front_left.wav -map '[FR]' front_right.wav -map '[FC]'
2747 front_center.wav -map '[LFE]' lfe.wav -map '[SL]' side_left.wav -map '[SR]'
2752 Extract only LFE from a 5.1 WAV file:
2754 ffmpeg -i in.wav -filter_complex 'channelsplit=channel_layout=5.1:channels=LFE[LFE]'
2755 -map '[LFE]' lfe.wav
2760 Add a chorus effect to the audio.
2762 Can make a single vocal sound like a chorus, but can also be applied to instrumentation.
2764 Chorus resembles an echo effect with a short delay, but whereas with echo the delay is
2765 constant, with chorus, it is varied using using sinusoidal or triangular modulation.
2766 The modulation depth defines the range the modulated delay is played before or after
2767 the delay. Hence the delayed sound will sound slower or faster, that is the delayed
2768 sound tuned around the original one, like in a chorus where some vocals are slightly
2771 It accepts the following parameters:
2774 Set input gain. Default is 0.4.
2777 Set output gain. Default is 0.4.
2780 Set delays. A typical delay is around 40ms to 60ms.
2792 @subsection Examples
2798 chorus=0.7:0.9:55:0.4:0.25:2
2804 chorus=0.6:0.9:50|60:0.4|0.32:0.25|0.4:2|1.3
2808 Fuller sounding chorus with three delays:
2810 chorus=0.5:0.9:50|60|40:0.4|0.32|0.3:0.25|0.4|0.3:2|2.3|1.3
2815 Compress or expand the audio's dynamic range.
2817 It accepts the following parameters:
2823 A list of times in seconds for each channel over which the instantaneous level
2824 of the input signal is averaged to determine its volume. @var{attacks} refers to
2825 increase of volume and @var{decays} refers to decrease of volume. For most
2826 situations, the attack time (response to the audio getting louder) should be
2827 shorter than the decay time, because the human ear is more sensitive to sudden
2828 loud audio than sudden soft audio. A typical value for attack is 0.3 seconds and
2829 a typical value for decay is 0.8 seconds.
2830 If specified number of attacks & decays is lower than number of channels, the last
2831 set attack/decay will be used for all remaining channels.
2834 A list of points for the transfer function, specified in dB relative to the
2835 maximum possible signal amplitude. Each key points list must be defined using
2836 the following syntax: @code{x0/y0|x1/y1|x2/y2|....} or
2837 @code{x0/y0 x1/y1 x2/y2 ....}
2839 The input values must be in strictly increasing order but the transfer function
2840 does not have to be monotonically rising. The point @code{0/0} is assumed but
2841 may be overridden (by @code{0/out-dBn}). Typical values for the transfer
2842 function are @code{-70/-70|-60/-20|1/0}.
2845 Set the curve radius in dB for all joints. It defaults to 0.01.
2848 Set the additional gain in dB to be applied at all points on the transfer
2849 function. This allows for easy adjustment of the overall gain.
2853 Set an initial volume, in dB, to be assumed for each channel when filtering
2854 starts. This permits the user to supply a nominal level initially, so that, for
2855 example, a very large gain is not applied to initial signal levels before the
2856 companding has begun to operate. A typical value for audio which is initially
2857 quiet is -90 dB. It defaults to 0.
2860 Set a delay, in seconds. The input audio is analyzed immediately, but audio is
2861 delayed before being fed to the volume adjuster. Specifying a delay
2862 approximately equal to the attack/decay times allows the filter to effectively
2863 operate in predictive rather than reactive mode. It defaults to 0.
2867 @subsection Examples
2871 Make music with both quiet and loud passages suitable for listening to in a
2874 compand=.3|.3:1|1:-90/-60|-60/-40|-40/-30|-20/-20:6:0:-90:0.2
2877 Another example for audio with whisper and explosion parts:
2879 compand=0|0:1|1:-90/-900|-70/-70|-30/-9|0/-3:6:0:0:0
2883 A noise gate for when the noise is at a lower level than the signal:
2885 compand=.1|.1:.2|.2:-900/-900|-50.1/-900|-50/-50:.01:0:-90:.1
2889 Here is another noise gate, this time for when the noise is at a higher level
2890 than the signal (making it, in some ways, similar to squelch):
2892 compand=.1|.1:.1|.1:-45.1/-45.1|-45/-900|0/-900:.01:45:-90:.1
2896 2:1 compression starting at -6dB:
2898 compand=points=-80/-80|-6/-6|0/-3.8|20/3.5
2902 2:1 compression starting at -9dB:
2904 compand=points=-80/-80|-9/-9|0/-5.3|20/2.9
2908 2:1 compression starting at -12dB:
2910 compand=points=-80/-80|-12/-12|0/-6.8|20/1.9
2914 2:1 compression starting at -18dB:
2916 compand=points=-80/-80|-18/-18|0/-9.8|20/0.7
2920 3:1 compression starting at -15dB:
2922 compand=points=-80/-80|-15/-15|0/-10.8|20/-5.2
2928 compand=points=-80/-105|-62/-80|-15.4/-15.4|0/-12|20/-7.6
2934 compand=attacks=0:points=-80/-169|-54/-80|-49.5/-64.6|-41.1/-41.1|-25.8/-15|-10.8/-4.5|0/0|20/8.3
2938 Hard limiter at -6dB:
2940 compand=attacks=0:points=-80/-80|-6/-6|20/-6
2944 Hard limiter at -12dB:
2946 compand=attacks=0:points=-80/-80|-12/-12|20/-12
2950 Hard noise gate at -35 dB:
2952 compand=attacks=0:points=-80/-115|-35.1/-80|-35/-35|20/20
2958 compand=attacks=0:points=-80/-80|-12.4/-12.4|-6/-8|0/-6.8|20/-2.8
2962 @section compensationdelay
2964 Compensation Delay Line is a metric based delay to compensate differing
2965 positions of microphones or speakers.
2967 For example, you have recorded guitar with two microphones placed in
2968 different locations. Because the front of sound wave has fixed speed in
2969 normal conditions, the phasing of microphones can vary and depends on
2970 their location and interposition. The best sound mix can be achieved when
2971 these microphones are in phase (synchronized). Note that a distance of
2972 ~30 cm between microphones makes one microphone capture the signal in
2973 antiphase to the other microphone. That makes the final mix sound moody.
2974 This filter helps to solve phasing problems by adding different delays
2975 to each microphone track and make them synchronized.
2977 The best result can be reached when you take one track as base and
2978 synchronize other tracks one by one with it.
2979 Remember that synchronization/delay tolerance depends on sample rate, too.
2980 Higher sample rates will give more tolerance.
2982 The filter accepts the following parameters:
2986 Set millimeters distance. This is compensation distance for fine tuning.
2990 Set cm distance. This is compensation distance for tightening distance setup.
2994 Set meters distance. This is compensation distance for hard distance setup.
2998 Set dry amount. Amount of unprocessed (dry) signal.
3002 Set wet amount. Amount of processed (wet) signal.
3006 Set temperature in degrees Celsius. This is the temperature of the environment.
3011 Apply headphone crossfeed filter.
3013 Crossfeed is the process of blending the left and right channels of stereo
3015 It is mainly used to reduce extreme stereo separation of low frequencies.
3017 The intent is to produce more speaker like sound to the listener.
3019 The filter accepts the following options:
3023 Set strength of crossfeed. Default is 0.2. Allowed range is from 0 to 1.
3024 This sets gain of low shelf filter for side part of stereo image.
3025 Default is -6dB. Max allowed is -30db when strength is set to 1.
3028 Set soundstage wideness. Default is 0.5. Allowed range is from 0 to 1.
3029 This sets cut off frequency of low shelf filter. Default is cut off near
3030 1550 Hz. With range set to 1 cut off frequency is set to 2100 Hz.
3033 Set input gain. Default is 0.9.
3036 Set output gain. Default is 1.
3039 @section crystalizer
3040 Simple algorithm to expand audio dynamic range.
3042 The filter accepts the following options:
3046 Sets the intensity of effect (default: 2.0). Must be in range between 0.0
3047 (unchanged sound) to 10.0 (maximum effect).
3050 Enable clipping. By default is enabled.
3054 Apply a DC shift to the audio.
3056 This can be useful to remove a DC offset (caused perhaps by a hardware problem
3057 in the recording chain) from the audio. The effect of a DC offset is reduced
3058 headroom and hence volume. The @ref{astats} filter can be used to determine if
3059 a signal has a DC offset.
3063 Set the DC shift, allowed range is [-1, 1]. It indicates the amount to shift
3067 Optional. It should have a value much less than 1 (e.g. 0.05 or 0.02) and is
3068 used to prevent clipping.
3073 Apply de-essing to the audio samples.
3077 Set intensity for triggering de-essing. Allowed range is from 0 to 1.
3081 Set amount of ducking on treble part of sound. Allowed range is from 0 to 1.
3085 How much of original frequency content to keep when de-essing. Allowed range is from 0 to 1.
3089 Set the output mode.
3091 It accepts the following values:
3094 Pass input unchanged.
3097 Pass ess filtered out.
3102 Default value is @var{o}.
3108 Measure audio dynamic range.
3110 DR values of 14 and higher is found in very dynamic material. DR of 8 to 13
3111 is found in transition material. And anything less that 8 have very poor dynamics
3112 and is very compressed.
3114 The filter accepts the following options:
3118 Set window length in seconds used to split audio into segments of equal length.
3119 Default is 3 seconds.
3123 Dynamic Audio Normalizer.
3125 This filter applies a certain amount of gain to the input audio in order
3126 to bring its peak magnitude to a target level (e.g. 0 dBFS). However, in
3127 contrast to more "simple" normalization algorithms, the Dynamic Audio
3128 Normalizer *dynamically* re-adjusts the gain factor to the input audio.
3129 This allows for applying extra gain to the "quiet" sections of the audio
3130 while avoiding distortions or clipping the "loud" sections. In other words:
3131 The Dynamic Audio Normalizer will "even out" the volume of quiet and loud
3132 sections, in the sense that the volume of each section is brought to the
3133 same target level. Note, however, that the Dynamic Audio Normalizer achieves
3134 this goal *without* applying "dynamic range compressing". It will retain 100%
3135 of the dynamic range *within* each section of the audio file.
3139 Set the frame length in milliseconds. In range from 10 to 8000 milliseconds.
3140 Default is 500 milliseconds.
3141 The Dynamic Audio Normalizer processes the input audio in small chunks,
3142 referred to as frames. This is required, because a peak magnitude has no
3143 meaning for just a single sample value. Instead, we need to determine the
3144 peak magnitude for a contiguous sequence of sample values. While a "standard"
3145 normalizer would simply use the peak magnitude of the complete file, the
3146 Dynamic Audio Normalizer determines the peak magnitude individually for each
3147 frame. The length of a frame is specified in milliseconds. By default, the
3148 Dynamic Audio Normalizer uses a frame length of 500 milliseconds, which has
3149 been found to give good results with most files.
3150 Note that the exact frame length, in number of samples, will be determined
3151 automatically, based on the sampling rate of the individual input audio file.
3154 Set the Gaussian filter window size. In range from 3 to 301, must be odd
3155 number. Default is 31.
3156 Probably the most important parameter of the Dynamic Audio Normalizer is the
3157 @code{window size} of the Gaussian smoothing filter. The filter's window size
3158 is specified in frames, centered around the current frame. For the sake of
3159 simplicity, this must be an odd number. Consequently, the default value of 31
3160 takes into account the current frame, as well as the 15 preceding frames and
3161 the 15 subsequent frames. Using a larger window results in a stronger
3162 smoothing effect and thus in less gain variation, i.e. slower gain
3163 adaptation. Conversely, using a smaller window results in a weaker smoothing
3164 effect and thus in more gain variation, i.e. faster gain adaptation.
3165 In other words, the more you increase this value, the more the Dynamic Audio
3166 Normalizer will behave like a "traditional" normalization filter. On the
3167 contrary, the more you decrease this value, the more the Dynamic Audio
3168 Normalizer will behave like a dynamic range compressor.
3171 Set the target peak value. This specifies the highest permissible magnitude
3172 level for the normalized audio input. This filter will try to approach the
3173 target peak magnitude as closely as possible, but at the same time it also
3174 makes sure that the normalized signal will never exceed the peak magnitude.
3175 A frame's maximum local gain factor is imposed directly by the target peak
3176 magnitude. The default value is 0.95 and thus leaves a headroom of 5%*.
3177 It is not recommended to go above this value.
3180 Set the maximum gain factor. In range from 1.0 to 100.0. Default is 10.0.
3181 The Dynamic Audio Normalizer determines the maximum possible (local) gain
3182 factor for each input frame, i.e. the maximum gain factor that does not
3183 result in clipping or distortion. The maximum gain factor is determined by
3184 the frame's highest magnitude sample. However, the Dynamic Audio Normalizer
3185 additionally bounds the frame's maximum gain factor by a predetermined
3186 (global) maximum gain factor. This is done in order to avoid excessive gain
3187 factors in "silent" or almost silent frames. By default, the maximum gain
3188 factor is 10.0, For most inputs the default value should be sufficient and
3189 it usually is not recommended to increase this value. Though, for input
3190 with an extremely low overall volume level, it may be necessary to allow even
3191 higher gain factors. Note, however, that the Dynamic Audio Normalizer does
3192 not simply apply a "hard" threshold (i.e. cut off values above the threshold).
3193 Instead, a "sigmoid" threshold function will be applied. This way, the
3194 gain factors will smoothly approach the threshold value, but never exceed that
3198 Set the target RMS. In range from 0.0 to 1.0. Default is 0.0 - disabled.
3199 By default, the Dynamic Audio Normalizer performs "peak" normalization.
3200 This means that the maximum local gain factor for each frame is defined
3201 (only) by the frame's highest magnitude sample. This way, the samples can
3202 be amplified as much as possible without exceeding the maximum signal
3203 level, i.e. without clipping. Optionally, however, the Dynamic Audio
3204 Normalizer can also take into account the frame's root mean square,
3205 abbreviated RMS. In electrical engineering, the RMS is commonly used to
3206 determine the power of a time-varying signal. It is therefore considered
3207 that the RMS is a better approximation of the "perceived loudness" than
3208 just looking at the signal's peak magnitude. Consequently, by adjusting all
3209 frames to a constant RMS value, a uniform "perceived loudness" can be
3210 established. If a target RMS value has been specified, a frame's local gain
3211 factor is defined as the factor that would result in exactly that RMS value.
3212 Note, however, that the maximum local gain factor is still restricted by the
3213 frame's highest magnitude sample, in order to prevent clipping.
3216 Enable channels coupling. By default is enabled.
3217 By default, the Dynamic Audio Normalizer will amplify all channels by the same
3218 amount. This means the same gain factor will be applied to all channels, i.e.
3219 the maximum possible gain factor is determined by the "loudest" channel.
3220 However, in some recordings, it may happen that the volume of the different
3221 channels is uneven, e.g. one channel may be "quieter" than the other one(s).
3222 In this case, this option can be used to disable the channel coupling. This way,
3223 the gain factor will be determined independently for each channel, depending
3224 only on the individual channel's highest magnitude sample. This allows for
3225 harmonizing the volume of the different channels.
3228 Enable DC bias correction. By default is disabled.
3229 An audio signal (in the time domain) is a sequence of sample values.
3230 In the Dynamic Audio Normalizer these sample values are represented in the
3231 -1.0 to 1.0 range, regardless of the original input format. Normally, the
3232 audio signal, or "waveform", should be centered around the zero point.
3233 That means if we calculate the mean value of all samples in a file, or in a
3234 single frame, then the result should be 0.0 or at least very close to that
3235 value. If, however, there is a significant deviation of the mean value from
3236 0.0, in either positive or negative direction, this is referred to as a
3237 DC bias or DC offset. Since a DC bias is clearly undesirable, the Dynamic
3238 Audio Normalizer provides optional DC bias correction.
3239 With DC bias correction enabled, the Dynamic Audio Normalizer will determine
3240 the mean value, or "DC correction" offset, of each input frame and subtract
3241 that value from all of the frame's sample values which ensures those samples
3242 are centered around 0.0 again. Also, in order to avoid "gaps" at the frame
3243 boundaries, the DC correction offset values will be interpolated smoothly
3244 between neighbouring frames.
3246 @item altboundary, b
3247 Enable alternative boundary mode. By default is disabled.
3248 The Dynamic Audio Normalizer takes into account a certain neighbourhood
3249 around each frame. This includes the preceding frames as well as the
3250 subsequent frames. However, for the "boundary" frames, located at the very
3251 beginning and at the very end of the audio file, not all neighbouring
3252 frames are available. In particular, for the first few frames in the audio
3253 file, the preceding frames are not known. And, similarly, for the last few
3254 frames in the audio file, the subsequent frames are not known. Thus, the
3255 question arises which gain factors should be assumed for the missing frames
3256 in the "boundary" region. The Dynamic Audio Normalizer implements two modes
3257 to deal with this situation. The default boundary mode assumes a gain factor
3258 of exactly 1.0 for the missing frames, resulting in a smooth "fade in" and
3259 "fade out" at the beginning and at the end of the input, respectively.
3262 Set the compress factor. In range from 0.0 to 30.0. Default is 0.0.
3263 By default, the Dynamic Audio Normalizer does not apply "traditional"
3264 compression. This means that signal peaks will not be pruned and thus the
3265 full dynamic range will be retained within each local neighbourhood. However,
3266 in some cases it may be desirable to combine the Dynamic Audio Normalizer's
3267 normalization algorithm with a more "traditional" compression.
3268 For this purpose, the Dynamic Audio Normalizer provides an optional compression
3269 (thresholding) function. If (and only if) the compression feature is enabled,
3270 all input frames will be processed by a soft knee thresholding function prior
3271 to the actual normalization process. Put simply, the thresholding function is
3272 going to prune all samples whose magnitude exceeds a certain threshold value.
3273 However, the Dynamic Audio Normalizer does not simply apply a fixed threshold
3274 value. Instead, the threshold value will be adjusted for each individual
3276 In general, smaller parameters result in stronger compression, and vice versa.
3277 Values below 3.0 are not recommended, because audible distortion may appear.
3282 Make audio easier to listen to on headphones.
3284 This filter adds `cues' to 44.1kHz stereo (i.e. audio CD format) audio
3285 so that when listened to on headphones the stereo image is moved from
3286 inside your head (standard for headphones) to outside and in front of
3287 the listener (standard for speakers).
3293 Apply a two-pole peaking equalisation (EQ) filter. With this
3294 filter, the signal-level at and around a selected frequency can
3295 be increased or decreased, whilst (unlike bandpass and bandreject
3296 filters) that at all other frequencies is unchanged.
3298 In order to produce complex equalisation curves, this filter can
3299 be given several times, each with a different central frequency.
3301 The filter accepts the following options:
3305 Set the filter's central frequency in Hz.
3308 Set method to specify band-width of filter.
3323 Specify the band-width of a filter in width_type units.
3326 Set the required gain or attenuation in dB.
3327 Beware of clipping when using a positive gain.
3330 How much to use filtered signal in output. Default is 1.
3331 Range is between 0 and 1.
3334 Specify which channels to filter, by default all available are filtered.
3337 @subsection Examples
3340 Attenuate 10 dB at 1000 Hz, with a bandwidth of 200 Hz:
3342 equalizer=f=1000:t=h:width=200:g=-10
3346 Apply 2 dB gain at 1000 Hz with Q 1 and attenuate 5 dB at 100 Hz with Q 2:
3348 equalizer=f=1000:t=q:w=1:g=2,equalizer=f=100:t=q:w=2:g=-5
3352 @subsection Commands
3354 This filter supports the following commands:
3357 Change equalizer frequency.
3358 Syntax for the command is : "@var{frequency}"
3361 Change equalizer width_type.
3362 Syntax for the command is : "@var{width_type}"
3365 Change equalizer width.
3366 Syntax for the command is : "@var{width}"
3369 Change equalizer gain.
3370 Syntax for the command is : "@var{gain}"
3373 Change equalizer mix.
3374 Syntax for the command is : "@var{mix}"
3377 @section extrastereo
3379 Linearly increases the difference between left and right channels which
3380 adds some sort of "live" effect to playback.
3382 The filter accepts the following options:
3386 Sets the difference coefficient (default: 2.5). 0.0 means mono sound
3387 (average of both channels), with 1.0 sound will be unchanged, with
3388 -1.0 left and right channels will be swapped.
3391 Enable clipping. By default is enabled.
3394 @section firequalizer
3395 Apply FIR Equalization using arbitrary frequency response.
3397 The filter accepts the following option:
3401 Set gain curve equation (in dB). The expression can contain variables:
3404 the evaluated frequency
3408 channel number, set to 0 when multichannels evaluation is disabled
3410 channel id, see libavutil/channel_layout.h, set to the first channel id when
3411 multichannels evaluation is disabled
3415 channel_layout, see libavutil/channel_layout.h
3420 @item gain_interpolate(f)
3421 interpolate gain on frequency f based on gain_entry
3422 @item cubic_interpolate(f)
3423 same as gain_interpolate, but smoother
3425 This option is also available as command. Default is @code{gain_interpolate(f)}.
3428 Set gain entry for gain_interpolate function. The expression can
3432 store gain entry at frequency f with value g
3434 This option is also available as command.
3437 Set filter delay in seconds. Higher value means more accurate.
3438 Default is @code{0.01}.
3441 Set filter accuracy in Hz. Lower value means more accurate.
3442 Default is @code{5}.
3445 Set window function. Acceptable values are:
3448 rectangular window, useful when gain curve is already smooth
3450 hann window (default)
3456 3-terms continuous 1st derivative nuttall window
3458 minimum 3-terms discontinuous nuttall window
3460 4-terms continuous 1st derivative nuttall window
3462 minimum 4-terms discontinuous nuttall (blackman-nuttall) window
3464 blackman-harris window
3470 If enabled, use fixed number of audio samples. This improves speed when
3471 filtering with large delay. Default is disabled.
3474 Enable multichannels evaluation on gain. Default is disabled.
3477 Enable zero phase mode by subtracting timestamp to compensate delay.
3478 Default is disabled.
3481 Set scale used by gain. Acceptable values are:
3484 linear frequency, linear gain
3486 linear frequency, logarithmic (in dB) gain (default)
3488 logarithmic (in octave scale where 20 Hz is 0) frequency, linear gain
3490 logarithmic frequency, logarithmic gain
3494 Set file for dumping, suitable for gnuplot.
3497 Set scale for dumpfile. Acceptable values are same with scale option.
3501 Enable 2-channel convolution using complex FFT. This improves speed significantly.
3502 Default is disabled.
3505 Enable minimum phase impulse response. Default is disabled.
3508 @subsection Examples
3513 firequalizer=gain='if(lt(f,1000), 0, -INF)'
3516 lowpass at 1000 Hz with gain_entry:
3518 firequalizer=gain_entry='entry(1000,0); entry(1001, -INF)'
3521 custom equalization:
3523 firequalizer=gain_entry='entry(100,0); entry(400, -4); entry(1000, -6); entry(2000, 0)'
3526 higher delay with zero phase to compensate delay:
3528 firequalizer=delay=0.1:fixed=on:zero_phase=on
3531 lowpass on left channel, highpass on right channel:
3533 firequalizer=gain='if(eq(chid,1), gain_interpolate(f), if(eq(chid,2), gain_interpolate(1e6+f), 0))'
3534 :gain_entry='entry(1000, 0); entry(1001,-INF); entry(1e6+1000,0)':multi=on
3539 Apply a flanging effect to the audio.
3541 The filter accepts the following options:
3545 Set base delay in milliseconds. Range from 0 to 30. Default value is 0.
3548 Set added sweep delay in milliseconds. Range from 0 to 10. Default value is 2.
3551 Set percentage regeneration (delayed signal feedback). Range from -95 to 95.
3555 Set percentage of delayed signal mixed with original. Range from 0 to 100.
3556 Default value is 71.
3559 Set sweeps per second (Hz). Range from 0.1 to 10. Default value is 0.5.
3562 Set swept wave shape, can be @var{triangular} or @var{sinusoidal}.
3563 Default value is @var{sinusoidal}.
3566 Set swept wave percentage-shift for multi channel. Range from 0 to 100.
3567 Default value is 25.
3570 Set delay-line interpolation, @var{linear} or @var{quadratic}.
3571 Default is @var{linear}.
3575 Apply Haas effect to audio.
3577 Note that this makes most sense to apply on mono signals.
3578 With this filter applied to mono signals it give some directionality and
3579 stretches its stereo image.
3581 The filter accepts the following options:
3585 Set input level. By default is @var{1}, or 0dB
3588 Set output level. By default is @var{1}, or 0dB.
3591 Set gain applied to side part of signal. By default is @var{1}.
3594 Set kind of middle source. Can be one of the following:
3604 Pick middle part signal of stereo image.
3607 Pick side part signal of stereo image.
3611 Change middle phase. By default is disabled.
3614 Set left channel delay. By default is @var{2.05} milliseconds.
3617 Set left channel balance. By default is @var{-1}.
3620 Set left channel gain. By default is @var{1}.
3623 Change left phase. By default is disabled.
3626 Set right channel delay. By defaults is @var{2.12} milliseconds.
3629 Set right channel balance. By default is @var{1}.
3632 Set right channel gain. By default is @var{1}.
3635 Change right phase. By default is enabled.
3640 Decodes High Definition Compatible Digital (HDCD) data. A 16-bit PCM stream with
3641 embedded HDCD codes is expanded into a 20-bit PCM stream.
3643 The filter supports the Peak Extend and Low-level Gain Adjustment features
3644 of HDCD, and detects the Transient Filter flag.
3647 ffmpeg -i HDCD16.flac -af hdcd OUT24.flac
3650 When using the filter with wav, note the default encoding for wav is 16-bit,
3651 so the resulting 20-bit stream will be truncated back to 16-bit. Use something
3652 like @command{-acodec pcm_s24le} after the filter to get 24-bit PCM output.
3654 ffmpeg -i HDCD16.wav -af hdcd OUT16.wav
3655 ffmpeg -i HDCD16.wav -af hdcd -c:a pcm_s24le OUT24.wav
3658 The filter accepts the following options:
3661 @item disable_autoconvert
3662 Disable any automatic format conversion or resampling in the filter graph.
3664 @item process_stereo
3665 Process the stereo channels together. If target_gain does not match between
3666 channels, consider it invalid and use the last valid target_gain.
3669 Set the code detect timer period in ms.
3672 Always extend peaks above -3dBFS even if PE isn't signaled.
3675 Replace audio with a solid tone and adjust the amplitude to signal some
3676 specific aspect of the decoding process. The output file can be loaded in
3677 an audio editor alongside the original to aid analysis.
3679 @code{analyze_mode=pe:force_pe=true} can be used to see all samples above the PE level.
3686 Gain adjustment level at each sample
3688 Samples where peak extend occurs
3690 Samples where the code detect timer is active
3692 Samples where the target gain does not match between channels
3698 Apply head-related transfer functions (HRTFs) to create virtual
3699 loudspeakers around the user for binaural listening via headphones.
3700 The HRIRs are provided via additional streams, for each channel
3701 one stereo input stream is needed.
3703 The filter accepts the following options:
3707 Set mapping of input streams for convolution.
3708 The argument is a '|'-separated list of channel names in order as they
3709 are given as additional stream inputs for filter.
3710 This also specify number of input streams. Number of input streams
3711 must be not less than number of channels in first stream plus one.
3714 Set gain applied to audio. Value is in dB. Default is 0.
3717 Set processing type. Can be @var{time} or @var{freq}. @var{time} is
3718 processing audio in time domain which is slow.
3719 @var{freq} is processing audio in frequency domain which is fast.
3720 Default is @var{freq}.
3723 Set custom gain for LFE channels. Value is in dB. Default is 0.
3726 Set size of frame in number of samples which will be processed at once.
3727 Default value is @var{1024}. Allowed range is from 1024 to 96000.
3730 Set format of hrir stream.
3731 Default value is @var{stereo}. Alternative value is @var{multich}.
3732 If value is set to @var{stereo}, number of additional streams should
3733 be greater or equal to number of input channels in first input stream.
3734 Also each additional stream should have stereo number of channels.
3735 If value is set to @var{multich}, number of additional streams should
3736 be exactly one. Also number of input channels of additional stream
3737 should be equal or greater than twice number of channels of first input
3741 @subsection Examples
3745 Full example using wav files as coefficients with amovie filters for 7.1 downmix,
3746 each amovie filter use stereo file with IR coefficients as input.
3747 The files give coefficients for each position of virtual loudspeaker:
3750 -filter_complex "amovie=azi_270_ele_0_DFC.wav[sr];amovie=azi_90_ele_0_DFC.wav[sl];amovie=azi_225_ele_0_DFC.wav[br];amovie=azi_135_ele_0_DFC.wav[bl];amovie=azi_0_ele_0_DFC.wav,asplit[fc][lfe];amovie=azi_35_ele_0_DFC.wav[fl];amovie=azi_325_ele_0_DFC.wav[fr];[0:a][fl][fr][fc][lfe][bl][br][sl][sr]headphone=FL|FR|FC|LFE|BL|BR|SL|SR"
3755 Full example using wav files as coefficients with amovie filters for 7.1 downmix,
3756 but now in @var{multich} @var{hrir} format.
3758 ffmpeg -i input.wav -filter_complex "amovie=minp.wav[hrirs];[0:a][hrirs]headphone=map=FL|FR|FC|LFE|BL|BR|SL|SR:hrir=multich"
3765 Apply a high-pass filter with 3dB point frequency.
3766 The filter can be either single-pole, or double-pole (the default).
3767 The filter roll off at 6dB per pole per octave (20dB per pole per decade).
3769 The filter accepts the following options:
3773 Set frequency in Hz. Default is 3000.
3776 Set number of poles. Default is 2.
3779 Set method to specify band-width of filter.
3794 Specify the band-width of a filter in width_type units.
3795 Applies only to double-pole filter.
3796 The default is 0.707q and gives a Butterworth response.
3799 How much to use filtered signal in output. Default is 1.
3800 Range is between 0 and 1.
3803 Specify which channels to filter, by default all available are filtered.
3806 @subsection Commands
3808 This filter supports the following commands:
3811 Change highpass frequency.
3812 Syntax for the command is : "@var{frequency}"
3815 Change highpass width_type.
3816 Syntax for the command is : "@var{width_type}"
3819 Change highpass width.
3820 Syntax for the command is : "@var{width}"
3823 Change highpass mix.
3824 Syntax for the command is : "@var{mix}"
3829 Join multiple input streams into one multi-channel stream.
3831 It accepts the following parameters:
3835 The number of input streams. It defaults to 2.
3837 @item channel_layout
3838 The desired output channel layout. It defaults to stereo.
3841 Map channels from inputs to output. The argument is a '|'-separated list of
3842 mappings, each in the @code{@var{input_idx}.@var{in_channel}-@var{out_channel}}
3843 form. @var{input_idx} is the 0-based index of the input stream. @var{in_channel}
3844 can be either the name of the input channel (e.g. FL for front left) or its
3845 index in the specified input stream. @var{out_channel} is the name of the output
3849 The filter will attempt to guess the mappings when they are not specified
3850 explicitly. It does so by first trying to find an unused matching input channel
3851 and if that fails it picks the first unused input channel.
3853 Join 3 inputs (with properly set channel layouts):
3855 ffmpeg -i INPUT1 -i INPUT2 -i INPUT3 -filter_complex join=inputs=3 OUTPUT
3858 Build a 5.1 output from 6 single-channel streams:
3860 ffmpeg -i fl -i fr -i fc -i sl -i sr -i lfe -filter_complex
3861 'join=inputs=6:channel_layout=5.1:map=0.0-FL|1.0-FR|2.0-FC|3.0-SL|4.0-SR|5.0-LFE'
3867 Load a LADSPA (Linux Audio Developer's Simple Plugin API) plugin.
3869 To enable compilation of this filter you need to configure FFmpeg with
3870 @code{--enable-ladspa}.
3874 Specifies the name of LADSPA plugin library to load. If the environment
3875 variable @env{LADSPA_PATH} is defined, the LADSPA plugin is searched in
3876 each one of the directories specified by the colon separated list in
3877 @env{LADSPA_PATH}, otherwise in the standard LADSPA paths, which are in
3878 this order: @file{HOME/.ladspa/lib/}, @file{/usr/local/lib/ladspa/},
3879 @file{/usr/lib/ladspa/}.
3882 Specifies the plugin within the library. Some libraries contain only
3883 one plugin, but others contain many of them. If this is not set filter
3884 will list all available plugins within the specified library.
3887 Set the '|' separated list of controls which are zero or more floating point
3888 values that determine the behavior of the loaded plugin (for example delay,
3890 Controls need to be defined using the following syntax:
3891 c0=@var{value0}|c1=@var{value1}|c2=@var{value2}|..., where
3892 @var{valuei} is the value set on the @var{i}-th control.
3893 Alternatively they can be also defined using the following syntax:
3894 @var{value0}|@var{value1}|@var{value2}|..., where
3895 @var{valuei} is the value set on the @var{i}-th control.
3896 If @option{controls} is set to @code{help}, all available controls and
3897 their valid ranges are printed.
3899 @item sample_rate, s
3900 Specify the sample rate, default to 44100. Only used if plugin have
3904 Set the number of samples per channel per each output frame, default
3905 is 1024. Only used if plugin have zero inputs.
3908 Set the minimum duration of the sourced audio. See
3909 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
3910 for the accepted syntax.
3911 Note that the resulting duration may be greater than the specified duration,
3912 as the generated audio is always cut at the end of a complete frame.
3913 If not specified, or the expressed duration is negative, the audio is
3914 supposed to be generated forever.
3915 Only used if plugin have zero inputs.
3919 @subsection Examples
3923 List all available plugins within amp (LADSPA example plugin) library:
3929 List all available controls and their valid ranges for @code{vcf_notch}
3930 plugin from @code{VCF} library:
3932 ladspa=f=vcf:p=vcf_notch:c=help
3936 Simulate low quality audio equipment using @code{Computer Music Toolkit} (CMT)
3939 ladspa=file=cmt:plugin=lofi:controls=c0=22|c1=12|c2=12
3943 Add reverberation to the audio using TAP-plugins
3944 (Tom's Audio Processing plugins):
3946 ladspa=file=tap_reverb:tap_reverb
3950 Generate white noise, with 0.2 amplitude:
3952 ladspa=file=cmt:noise_source_white:c=c0=.2
3956 Generate 20 bpm clicks using plugin @code{C* Click - Metronome} from the
3957 @code{C* Audio Plugin Suite} (CAPS) library:
3959 ladspa=file=caps:Click:c=c1=20'
3963 Apply @code{C* Eq10X2 - Stereo 10-band equaliser} effect:
3965 ladspa=caps:Eq10X2:c=c0=-48|c9=-24|c3=12|c4=2
3969 Increase volume by 20dB using fast lookahead limiter from Steve Harris
3970 @code{SWH Plugins} collection:
3972 ladspa=fast_lookahead_limiter_1913:fastLookaheadLimiter:20|0|2
3976 Attenuate low frequencies using Multiband EQ from Steve Harris
3977 @code{SWH Plugins} collection:
3979 ladspa=mbeq_1197:mbeq:-24|-24|-24|0|0|0|0|0|0|0|0|0|0|0|0
3983 Reduce stereo image using @code{Narrower} from the @code{C* Audio Plugin Suite}
3986 ladspa=caps:Narrower
3990 Another white noise, now using @code{C* Audio Plugin Suite} (CAPS) library:
3992 ladspa=caps:White:.2
3996 Some fractal noise, using @code{C* Audio Plugin Suite} (CAPS) library:
3998 ladspa=caps:Fractal:c=c1=1
4002 Dynamic volume normalization using @code{VLevel} plugin:
4004 ladspa=vlevel-ladspa:vlevel_mono
4008 @subsection Commands
4010 This filter supports the following commands:
4013 Modify the @var{N}-th control value.
4015 If the specified value is not valid, it is ignored and prior one is kept.
4020 EBU R128 loudness normalization. Includes both dynamic and linear normalization modes.
4021 Support for both single pass (livestreams, files) and double pass (files) modes.
4022 This algorithm can target IL, LRA, and maximum true peak. To accurately detect true peaks,
4023 the audio stream will be upsampled to 192 kHz unless the normalization mode is linear.
4024 Use the @code{-ar} option or @code{aresample} filter to explicitly set an output sample rate.
4026 The filter accepts the following options:
4030 Set integrated loudness target.
4031 Range is -70.0 - -5.0. Default value is -24.0.
4034 Set loudness range target.
4035 Range is 1.0 - 20.0. Default value is 7.0.
4038 Set maximum true peak.
4039 Range is -9.0 - +0.0. Default value is -2.0.
4041 @item measured_I, measured_i
4042 Measured IL of input file.
4043 Range is -99.0 - +0.0.
4045 @item measured_LRA, measured_lra
4046 Measured LRA of input file.
4047 Range is 0.0 - 99.0.
4049 @item measured_TP, measured_tp
4050 Measured true peak of input file.
4051 Range is -99.0 - +99.0.
4053 @item measured_thresh
4054 Measured threshold of input file.
4055 Range is -99.0 - +0.0.
4058 Set offset gain. Gain is applied before the true-peak limiter.
4059 Range is -99.0 - +99.0. Default is +0.0.
4062 Normalize linearly if possible.
4063 measured_I, measured_LRA, measured_TP, and measured_thresh must also
4064 to be specified in order to use this mode.
4065 Options are true or false. Default is true.
4068 Treat mono input files as "dual-mono". If a mono file is intended for playback
4069 on a stereo system, its EBU R128 measurement will be perceptually incorrect.
4070 If set to @code{true}, this option will compensate for this effect.
4071 Multi-channel input files are not affected by this option.
4072 Options are true or false. Default is false.
4075 Set print format for stats. Options are summary, json, or none.
4076 Default value is none.
4081 Apply a low-pass filter with 3dB point frequency.
4082 The filter can be either single-pole or double-pole (the default).
4083 The filter roll off at 6dB per pole per octave (20dB per pole per decade).
4085 The filter accepts the following options:
4089 Set frequency in Hz. Default is 500.
4092 Set number of poles. Default is 2.
4095 Set method to specify band-width of filter.
4110 Specify the band-width of a filter in width_type units.
4111 Applies only to double-pole filter.
4112 The default is 0.707q and gives a Butterworth response.
4115 How much to use filtered signal in output. Default is 1.
4116 Range is between 0 and 1.
4119 Specify which channels to filter, by default all available are filtered.
4122 @subsection Examples
4125 Lowpass only LFE channel, it LFE is not present it does nothing:
4131 @subsection Commands
4133 This filter supports the following commands:
4136 Change lowpass frequency.
4137 Syntax for the command is : "@var{frequency}"
4140 Change lowpass width_type.
4141 Syntax for the command is : "@var{width_type}"
4144 Change lowpass width.
4145 Syntax for the command is : "@var{width}"
4149 Syntax for the command is : "@var{mix}"
4154 Load a LV2 (LADSPA Version 2) plugin.
4156 To enable compilation of this filter you need to configure FFmpeg with
4157 @code{--enable-lv2}.
4161 Specifies the plugin URI. You may need to escape ':'.
4164 Set the '|' separated list of controls which are zero or more floating point
4165 values that determine the behavior of the loaded plugin (for example delay,
4167 If @option{controls} is set to @code{help}, all available controls and
4168 their valid ranges are printed.
4170 @item sample_rate, s
4171 Specify the sample rate, default to 44100. Only used if plugin have
4175 Set the number of samples per channel per each output frame, default
4176 is 1024. Only used if plugin have zero inputs.
4179 Set the minimum duration of the sourced audio. See
4180 @ref{time duration syntax,,the Time duration section in the ffmpeg-utils(1) manual,ffmpeg-utils}
4181 for the accepted syntax.
4182 Note that the resulting duration may be greater than the specified duration,
4183 as the generated audio is always cut at the end of a complete frame.
4184 If not specified, or the expressed duration is negative, the audio is
4185 supposed to be generated forever.
4186 Only used if plugin have zero inputs.
4189 @subsection Examples
4193 Apply bass enhancer plugin from Calf:
4195 lv2=p=http\\\\://calf.sourceforge.net/plugins/BassEnhancer:c=amount=2
4199 Apply vinyl plugin from Calf:
4201 lv2=p=http\\\\://calf.sourceforge.net/plugins/Vinyl:c=drone=0.2|aging=0.5
4205 Apply bit crusher plugin from ArtyFX:
4207 lv2=p=http\\\\://www.openavproductions.com/artyfx#bitta:c=crush=0.3
4212 Multiband Compress or expand the audio's dynamic range.
4214 The input audio is divided into bands using 4th order Linkwitz-Riley IIRs.
4215 This is akin to the crossover of a loudspeaker, and results in flat frequency
4216 response when absent compander action.
4218 It accepts the following parameters:
4222 This option syntax is:
4223 attack,decay,[attack,decay..] soft-knee points crossover_frequency [delay [initial_volume [gain]]] | attack,decay ...
4224 For explanation of each item refer to compand filter documentation.
4230 Mix channels with specific gain levels. The filter accepts the output
4231 channel layout followed by a set of channels definitions.
4233 This filter is also designed to efficiently remap the channels of an audio
4236 The filter accepts parameters of the form:
4237 "@var{l}|@var{outdef}|@var{outdef}|..."
4241 output channel layout or number of channels
4244 output channel specification, of the form:
4245 "@var{out_name}=[@var{gain}*]@var{in_name}[(+-)[@var{gain}*]@var{in_name}...]"
4248 output channel to define, either a channel name (FL, FR, etc.) or a channel
4249 number (c0, c1, etc.)
4252 multiplicative coefficient for the channel, 1 leaving the volume unchanged
4255 input channel to use, see out_name for details; it is not possible to mix
4256 named and numbered input channels
4259 If the `=' in a channel specification is replaced by `<', then the gains for
4260 that specification will be renormalized so that the total is 1, thus
4261 avoiding clipping noise.
4263 @subsection Mixing examples
4265 For example, if you want to down-mix from stereo to mono, but with a bigger
4266 factor for the left channel:
4268 pan=1c|c0=0.9*c0+0.1*c1
4271 A customized down-mix to stereo that works automatically for 3-, 4-, 5- and
4272 7-channels surround:
4274 pan=stereo| FL < FL + 0.5*FC + 0.6*BL + 0.6*SL | FR < FR + 0.5*FC + 0.6*BR + 0.6*SR
4277 Note that @command{ffmpeg} integrates a default down-mix (and up-mix) system
4278 that should be preferred (see "-ac" option) unless you have very specific
4281 @subsection Remapping examples
4283 The channel remapping will be effective if, and only if:
4286 @item gain coefficients are zeroes or ones,
4287 @item only one input per channel output,
4290 If all these conditions are satisfied, the filter will notify the user ("Pure
4291 channel mapping detected"), and use an optimized and lossless method to do the
4294 For example, if you have a 5.1 source and want a stereo audio stream by
4295 dropping the extra channels:
4297 pan="stereo| c0=FL | c1=FR"
4300 Given the same source, you can also switch front left and front right channels
4301 and keep the input channel layout:
4303 pan="5.1| c0=c1 | c1=c0 | c2=c2 | c3=c3 | c4=c4 | c5=c5"
4306 If the input is a stereo audio stream, you can mute the front left channel (and
4307 still keep the stereo channel layout) with:
4312 Still with a stereo audio stream input, you can copy the right channel in both
4313 front left and right:
4315 pan="stereo| c0=FR | c1=FR"
4320 ReplayGain scanner filter. This filter takes an audio stream as an input and
4321 outputs it unchanged.
4322 At end of filtering it displays @code{track_gain} and @code{track_peak}.
4326 Convert the audio sample format, sample rate and channel layout. It is
4327 not meant to be used directly.
4330 Apply time-stretching and pitch-shifting with librubberband.
4332 To enable compilation of this filter, you need to configure FFmpeg with
4333 @code{--enable-librubberband}.
4335 The filter accepts the following options:
4339 Set tempo scale factor.
4342 Set pitch scale factor.
4345 Set transients detector.
4346 Possible values are:
4355 Possible values are:
4364 Possible values are:
4371 Set processing window size.
4372 Possible values are:
4381 Possible values are:
4388 Enable formant preservation when shift pitching.
4389 Possible values are:
4397 Possible values are:
4406 Possible values are:
4413 @section sidechaincompress
4415 This filter acts like normal compressor but has the ability to compress
4416 detected signal using second input signal.
4417 It needs two input streams and returns one output stream.
4418 First input stream will be processed depending on second stream signal.
4419 The filtered signal then can be filtered with other filters in later stages of
4420 processing. See @ref{pan} and @ref{amerge} filter.
4422 The filter accepts the following options:
4426 Set input gain. Default is 1. Range is between 0.015625 and 64.
4429 Set mode of compressor operation. Can be @code{upward} or @code{downward}.
4430 Default is @code{downward}.
4433 If a signal of second stream raises above this level it will affect the gain
4434 reduction of first stream.
4435 By default is 0.125. Range is between 0.00097563 and 1.
4438 Set a ratio about which the signal is reduced. 1:2 means that if the level
4439 raised 4dB above the threshold, it will be only 2dB above after the reduction.
4440 Default is 2. Range is between 1 and 20.
4443 Amount of milliseconds the signal has to rise above the threshold before gain
4444 reduction starts. Default is 20. Range is between 0.01 and 2000.
4447 Amount of milliseconds the signal has to fall below the threshold before
4448 reduction is decreased again. Default is 250. Range is between 0.01 and 9000.
4451 Set the amount by how much signal will be amplified after processing.
4452 Default is 1. Range is from 1 to 64.
4455 Curve the sharp knee around the threshold to enter gain reduction more softly.
4456 Default is 2.82843. Range is between 1 and 8.
4459 Choose if the @code{average} level between all channels of side-chain stream
4460 or the louder(@code{maximum}) channel of side-chain stream affects the
4461 reduction. Default is @code{average}.
4464 Should the exact signal be taken in case of @code{peak} or an RMS one in case
4465 of @code{rms}. Default is @code{rms} which is mainly smoother.
4468 Set sidechain gain. Default is 1. Range is between 0.015625 and 64.
4471 How much to use compressed signal in output. Default is 1.
4472 Range is between 0 and 1.
4475 @subsection Examples
4479 Full ffmpeg example taking 2 audio inputs, 1st input to be compressed
4480 depending on the signal of 2nd input and later compressed signal to be
4481 merged with 2nd input:
4483 ffmpeg -i main.flac -i sidechain.flac -filter_complex "[1:a]asplit=2[sc][mix];[0:a][sc]sidechaincompress[compr];[compr][mix]amerge"
4487 @section sidechaingate
4489 A sidechain gate acts like a normal (wideband) gate but has the ability to
4490 filter the detected signal before sending it to the gain reduction stage.
4491 Normally a gate uses the full range signal to detect a level above the
4493 For example: If you cut all lower frequencies from your sidechain signal
4494 the gate will decrease the volume of your track only if not enough highs
4495 appear. With this technique you are able to reduce the resonation of a
4496 natural drum or remove "rumbling" of muted strokes from a heavily distorted
4498 It needs two input streams and returns one output stream.
4499 First input stream will be processed depending on second stream signal.
4501 The filter accepts the following options:
4505 Set input level before filtering.
4506 Default is 1. Allowed range is from 0.015625 to 64.
4509 Set the mode of operation. Can be @code{upward} or @code{downward}.
4510 Default is @code{downward}. If set to @code{upward} mode, higher parts of signal
4511 will be amplified, expanding dynamic range in upward direction.
4512 Otherwise, in case of @code{downward} lower parts of signal will be reduced.
4515 Set the level of gain reduction when the signal is below the threshold.
4516 Default is 0.06125. Allowed range is from 0 to 1.
4517 Setting this to 0 disables reduction and then filter behaves like expander.
4520 If a signal rises above this level the gain reduction is released.
4521 Default is 0.125. Allowed range is from 0 to 1.
4524 Set a ratio about which the signal is reduced.
4525 Default is 2. Allowed range is from 1 to 9000.
4528 Amount of milliseconds the signal has to rise above the threshold before gain
4530 Default is 20 milliseconds. Allowed range is from 0.01 to 9000.
4533 Amount of milliseconds the signal has to fall below the threshold before the
4534 reduction is increased again. Default is 250 milliseconds.
4535 Allowed range is from 0.01 to 9000.
4538 Set amount of amplification of signal after processing.
4539 Default is 1. Allowed range is from 1 to 64.
4542 Curve the sharp knee around the threshold to enter gain reduction more softly.
4543 Default is 2.828427125. Allowed range is from 1 to 8.
4546 Choose if exact signal should be taken for detection or an RMS like one.
4547 Default is rms. Can be peak or rms.
4550 Choose if the average level between all channels or the louder channel affects
4552 Default is average. Can be average or maximum.
4555 Set sidechain gain. Default is 1. Range is from 0.015625 to 64.
4558 @section silencedetect
4560 Detect silence in an audio stream.
4562 This filter logs a message when it detects that the input audio volume is less
4563 or equal to a noise tolerance value for a duration greater or equal to the
4564 minimum detected noise duration.
4566 The printed times and duration are expressed in seconds.
4568 The filter accepts the following options:
4572 Set noise tolerance. Can be specified in dB (in case "dB" is appended to the
4573 specified value) or amplitude ratio. Default is -60dB, or 0.001.
4576 Set silence duration until notification (default is 2 seconds).
4579 Process each channel separately, instead of combined. By default is disabled.
4582 @subsection Examples
4586 Detect 5 seconds of silence with -50dB noise tolerance:
4588 silencedetect=n=-50dB:d=5
4592 Complete example with @command{ffmpeg} to detect silence with 0.0001 noise
4593 tolerance in @file{silence.mp3}:
4595 ffmpeg -i silence.mp3 -af silencedetect=noise=0.0001 -f null -
4599 @section silenceremove
4601 Remove silence from the beginning, middle or end of the audio.
4603 The filter accepts the following options:
4607 This value is used to indicate if audio should be trimmed at beginning of
4608 the audio. A value of zero indicates no silence should be trimmed from the
4609 beginning. When specifying a non-zero value, it trims audio up until it
4610 finds non-silence. Normally, when trimming silence from beginning of audio
4611 the @var{start_periods} will be @code{1} but it can be increased to higher
4612 values to trim all audio up to specific count of non-silence periods.
4613 Default value is @code{0}.
4615 @item start_duration
4616 Specify the amount of time that non-silence must be detected before it stops
4617 trimming audio. By increasing the duration, bursts of noises can be treated
4618 as silence and trimmed off. Default value is @code{0}.
4620 @item start_threshold
4621 This indicates what sample value should be treated as silence. For digital
4622 audio, a value of @code{0} may be fine but for audio recorded from analog,
4623 you may wish to increase the value to account for background noise.
4624 Can be specified in dB (in case "dB" is appended to the specified value)
4625 or amplitude ratio. Default value is @code{0}.
4628 Specify max duration of silence at beginning that will be kept after
4629 trimming. Default is 0, which is equal to trimming all samples detected
4633 Specify mode of detection of silence end in start of multi-channel audio.
4634 Can be @var{any} or @var{all}. Default is @var{any}.
4635 With @var{any}, any sample that is detected as non-silence will cause
4636 stopped trimming of silence.
4637 With @var{all}, only if all channels are detected as non-silence will cause
4638 stopped trimming of silence.
4641 Set the count for trimming silence from the end of audio.
4642 To remove silence from the middle of a file, specify a @var{stop_periods}
4643 that is negative. This value is then treated as a positive value and is
4644 used to indicate the effect should restart processing as specified by
4645 @var{start_periods}, making it suitable for removing periods of silence
4646 in the middle of the audio.
4647 Default value is @code{0}.
4650 Specify a duration of silence that must exist before audio is not copied any
4651 more. By specifying a higher duration, silence that is wanted can be left in
4653 Default value is @code{0}.
4655 @item stop_threshold
4656 This is the same as @option{start_threshold} but for trimming silence from
4658 Can be specified in dB (in case "dB" is appended to the specified value)
4659 or amplitude ratio. Default value is @code{0}.
4662 Specify max duration of silence at end that will be kept after
4663 trimming. Default is 0, which is equal to trimming all samples detected
4667 Specify mode of detection of silence start in end of multi-channel audio.
4668 Can be @var{any} or @var{all}. Default is @var{any}.
4669 With @var{any}, any sample that is detected as non-silence will cause
4670 stopped trimming of silence.
4671 With @var{all}, only if all channels are detected as non-silence will cause
4672 stopped trimming of silence.
4675 Set how is silence detected. Can be @code{rms} or @code{peak}. Second is faster
4676 and works better with digital silence which is exactly 0.
4677 Default value is @code{rms}.
4680 Set duration in number of seconds used to calculate size of window in number
4681 of samples for detecting silence.
4682 Default value is @code{0.02}. Allowed range is from @code{0} to @code{10}.
4685 @subsection Examples
4689 The following example shows how this filter can be used to start a recording
4690 that does not contain the delay at the start which usually occurs between
4691 pressing the record button and the start of the performance:
4693 silenceremove=start_periods=1:start_duration=5:start_threshold=0.02
4697 Trim all silence encountered from beginning to end where there is more than 1
4698 second of silence in audio:
4700 silenceremove=stop_periods=-1:stop_duration=1:stop_threshold=-90dB
4704 Trim all digital silence samples, using peak detection, from beginning to end
4705 where there is more than 0 samples of digital silence in audio and digital
4706 silence is detected in all channels at same positions in stream:
4708 silenceremove=window=0:detection=peak:stop_mode=all:start_mode=all:stop_periods=-1:stop_threshold=0
4714 SOFAlizer uses head-related transfer functions (HRTFs) to create virtual
4715 loudspeakers around the user for binaural listening via headphones (audio
4716 formats up to 9 channels supported).
4717 The HRTFs are stored in SOFA files (see @url{http://www.sofacoustics.org/} for a database).
4718 SOFAlizer is developed at the Acoustics Research Institute (ARI) of the
4719 Austrian Academy of Sciences.
4721 To enable compilation of this filter you need to configure FFmpeg with
4722 @code{--enable-libmysofa}.
4724 The filter accepts the following options:
4728 Set the SOFA file used for rendering.
4731 Set gain applied to audio. Value is in dB. Default is 0.
4734 Set rotation of virtual loudspeakers in deg. Default is 0.
4737 Set elevation of virtual speakers in deg. Default is 0.
4740 Set distance in meters between loudspeakers and the listener with near-field
4741 HRTFs. Default is 1.
4744 Set processing type. Can be @var{time} or @var{freq}. @var{time} is
4745 processing audio in time domain which is slow.
4746 @var{freq} is processing audio in frequency domain which is fast.
4747 Default is @var{freq}.
4750 Set custom positions of virtual loudspeakers. Syntax for this option is:
4751 <CH> <AZIM> <ELEV>[|<CH> <AZIM> <ELEV>|...].
4752 Each virtual loudspeaker is described with short channel name following with
4753 azimuth and elevation in degrees.
4754 Each virtual loudspeaker description is separated by '|'.
4755 For example to override front left and front right channel positions use:
4756 'speakers=FL 45 15|FR 345 15'.
4757 Descriptions with unrecognised channel names are ignored.
4760 Set custom gain for LFE channels. Value is in dB. Default is 0.
4763 Set custom frame size in number of samples. Default is 1024.
4764 Allowed range is from 1024 to 96000. Only used if option @samp{type}
4765 is set to @var{freq}.
4768 Should all IRs be normalized upon importing SOFA file.
4769 By default is enabled.
4772 Should nearest IRs be interpolated with neighbor IRs if exact position
4773 does not match. By default is disabled.
4776 Minphase all IRs upon loading of SOFA file. By default is disabled.
4779 Set neighbor search angle step. Only used if option @var{interpolate} is enabled.
4782 Set neighbor search radius step. Only used if option @var{interpolate} is enabled.
4785 @subsection Examples
4789 Using ClubFritz6 sofa file:
4791 sofalizer=sofa=/path/to/ClubFritz6.sofa:type=freq:radius=1
4795 Using ClubFritz12 sofa file and bigger radius with small rotation:
4797 sofalizer=sofa=/path/to/ClubFritz12.sofa:type=freq:radius=2:rotation=5
4801 Similar as above but with custom speaker positions for front left, front right, back left and back right
4802 and also with custom gain:
4804 "sofalizer=sofa=/path/to/ClubFritz6.sofa:type=freq:radius=2:speakers=FL 45|FR 315|BL 135|BR 225:gain=28"
4808 @section stereotools
4810 This filter has some handy utilities to manage stereo signals, for converting
4811 M/S stereo recordings to L/R signal while having control over the parameters
4812 or spreading the stereo image of master track.
4814 The filter accepts the following options:
4818 Set input level before filtering for both channels. Defaults is 1.
4819 Allowed range is from 0.015625 to 64.
4822 Set output level after filtering for both channels. Defaults is 1.
4823 Allowed range is from 0.015625 to 64.
4826 Set input balance between both channels. Default is 0.
4827 Allowed range is from -1 to 1.
4830 Set output balance between both channels. Default is 0.
4831 Allowed range is from -1 to 1.
4834 Enable softclipping. Results in analog distortion instead of harsh digital 0dB
4835 clipping. Disabled by default.
4838 Mute the left channel. Disabled by default.
4841 Mute the right channel. Disabled by default.
4844 Change the phase of the left channel. Disabled by default.
4847 Change the phase of the right channel. Disabled by default.
4850 Set stereo mode. Available values are:
4854 Left/Right to Left/Right, this is default.
4857 Left/Right to Mid/Side.
4860 Mid/Side to Left/Right.
4863 Left/Right to Left/Left.
4866 Left/Right to Right/Right.
4869 Left/Right to Left + Right.
4872 Left/Right to Right/Left.
4875 Mid/Side to Left/Left.
4878 Mid/Side to Right/Right.
4882 Set level of side signal. Default is 1.
4883 Allowed range is from 0.015625 to 64.
4886 Set balance of side signal. Default is 0.
4887 Allowed range is from -1 to 1.
4890 Set level of the middle signal. Default is 1.
4891 Allowed range is from 0.015625 to 64.
4894 Set middle signal pan. Default is 0. Allowed range is from -1 to 1.
4897 Set stereo base between mono and inversed channels. Default is 0.
4898 Allowed range is from -1 to 1.
4901 Set delay in milliseconds how much to delay left from right channel and
4902 vice versa. Default is 0. Allowed range is from -20 to 20.
4905 Set S/C level. Default is 1. Allowed range is from 1 to 100.
4908 Set the stereo phase in degrees. Default is 0. Allowed range is from 0 to 360.
4910 @item bmode_in, bmode_out
4911 Set balance mode for balance_in/balance_out option.
4913 Can be one of the following:
4917 Classic balance mode. Attenuate one channel at time.
4918 Gain is raised up to 1.
4921 Similar as classic mode above but gain is raised up to 2.
4924 Equal power distribution, from -6dB to +6dB range.
4928 @subsection Examples
4932 Apply karaoke like effect:
4934 stereotools=mlev=0.015625
4938 Convert M/S signal to L/R:
4940 "stereotools=mode=ms>lr"
4944 @section stereowiden
4946 This filter enhance the stereo effect by suppressing signal common to both
4947 channels and by delaying the signal of left into right and vice versa,
4948 thereby widening the stereo effect.
4950 The filter accepts the following options:
4954 Time in milliseconds of the delay of left signal into right and vice versa.
4955 Default is 20 milliseconds.
4958 Amount of gain in delayed signal into right and vice versa. Gives a delay
4959 effect of left signal in right output and vice versa which gives widening
4960 effect. Default is 0.3.
4963 Cross feed of left into right with inverted phase. This helps in suppressing
4964 the mono. If the value is 1 it will cancel all the signal common to both
4965 channels. Default is 0.3.
4968 Set level of input signal of original channel. Default is 0.8.
4971 @section superequalizer
4972 Apply 18 band equalizer.
4974 The filter accepts the following options:
4981 Set 131Hz band gain.
4983 Set 185Hz band gain.
4985 Set 262Hz band gain.
4987 Set 370Hz band gain.
4989 Set 523Hz band gain.
4991 Set 740Hz band gain.
4993 Set 1047Hz band gain.
4995 Set 1480Hz band gain.
4997 Set 2093Hz band gain.
4999 Set 2960Hz band gain.
5001 Set 4186Hz band gain.
5003 Set 5920Hz band gain.
5005 Set 8372Hz band gain.
5007 Set 11840Hz band gain.
5009 Set 16744Hz band gain.
5011 Set 20000Hz band gain.
5015 Apply audio surround upmix filter.
5017 This filter allows to produce multichannel output from audio stream.
5019 The filter accepts the following options:
5023 Set output channel layout. By default, this is @var{5.1}.
5025 See @ref{channel layout syntax,,the Channel Layout section in the ffmpeg-utils(1) manual,ffmpeg-utils}
5026 for the required syntax.
5029 Set input channel layout. By default, this is @var{stereo}.
5031 See @ref{channel layout syntax,,the Channel Layout section in the ffmpeg-utils(1) manual,ffmpeg-utils}
5032 for the required syntax.
5035 Set input volume level. By default, this is @var{1}.
5038 Set output volume level. By default, this is @var{1}.
5041 Enable LFE channel output if output channel layout has it. By default, this is enabled.
5044 Set LFE low cut off frequency. By default, this is @var{128} Hz.
5047 Set LFE high cut off frequency. By default, this is @var{256} Hz.
5050 Set LFE mode, can be @var{add} or @var{sub}. Default is @var{add}.
5051 In @var{add} mode, LFE channel is created from input audio and added to output.
5052 In @var{sub} mode, LFE channel is created from input audio and added to output but
5053 also all non-LFE output channels are subtracted with output LFE channel.
5056 Set angle of stereo surround transform, Allowed range is from @var{0} to @var{360}.
5057 Default is @var{90}.
5060 Set front center input volume. By default, this is @var{1}.
5063 Set front center output volume. By default, this is @var{1}.
5066 Set front left input volume. By default, this is @var{1}.
5069 Set front left output volume. By default, this is @var{1}.
5072 Set front right input volume. By default, this is @var{1}.
5075 Set front right output volume. By default, this is @var{1}.
5078 Set side left input volume. By default, this is @var{1}.
5081 Set side left output volume. By default, this is @var{1}.
5084 Set side right input volume. By default, this is @var{1}.
5087 Set side right output volume. By default, this is @var{1}.
5090 Set back left input volume. By default, this is @var{1}.
5093 Set back left output volume. By default, this is @var{1}.