ffmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2026-02-16 03:20:25 +00:00

Author	SHA1	Message	Date
Andreas Rheinhardt	2be1b2ea96	avcodec/x86/hpeldsp: Actually use constants in registers Forgotten in `36f92206bb`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-02 12:05:52 +01:00
Andreas Rheinhardt	a677b38298	avcodec/x86/vp3dsp: Remove remnants of MMX Forgotten in `eefec06634`. Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-02 12:01:52 +01:00
Andreas Rheinhardt	d355749ca6	avcodec/x86/hevc/add_res: Avoid unnecessary modification Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-02 09:46:15 +01:00
Andreas Rheinhardt	f4d9fb0bd0	avcodec/x86/hevc/add_res: Reduce number of registers used This makes these functions use only volatile registers (even on Win64). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-02 09:46:15 +01:00
Andreas Rheinhardt	23efbb5e2e	avcodec/x86/hevc/add_res: Remove AVX add_residual functions The AVX and SSE2 functions are identical except for the VEX encodings used since `e9abef437f` and `8b8492452d`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-02 09:46:15 +01:00
James Almer	7770c0bf0d	avcodec/exr: don't remove Float2HalfTables tables alongside the deprecated gamma code It's used by other parts of the module that will fail to build otherwise after the aforementioned removal. Signed-off-by: James Almer <jamrial@gmail.com>	2025-11-02 00:21:18 -03:00
Andreas Rheinhardt	fa959bb135	avcodec/parsers: Silence deprecation warnings Slipped through because Clang (in contrast to GCC) does not warn about this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 18:48:17 +01:00
Andreas Rheinhardt	25968dbb05	avcodec/parser_internal: Rename PASSTHROUGH macro to avoid name conflict wingdi.h defines its own PASSTHROUGH and it is included implicitly by the VC-1 parser (which is mpegvideo-based and therefore includes a lot of stuff). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 18:41:27 +01:00
Andreas Rheinhardt	8a322c956f	avcodec/avcodec: Schedule parser API to use enum AVCodecID for codec ids Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 16:57:03 +01:00
Andreas Rheinhardt	7c43cc4cb7	avcodec/parser_internal: Remove prefix from parser_{init,parse,close} Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 16:57:03 +01:00
Andreas Rheinhardt	e0b0ca8111	avcodec/avcodec: Schedule moving private fields of AVCodecParser out of avcodec.h AVCodecParser has several fields which are not really meant to be accessed by users, but it has no public-private demarkation line, so these fields are technically public and can therefore not simply be made private like `20f9727018` did for AVCodec.* This commit therefore deprecates these fields and schedules them to become private. All parsers have already been switched to FFCodecParser, which (for now) is a union of AVCodecParser and an unnamed clone of AVCodecParser (new fields can be added at the end of this clone). *: This is also the reason why split has never been removed despite not being set for several years now. Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 16:57:02 +01:00
Andreas Rheinhardt	e9fe30ccd1	avcodec/parsers: Add macro to set list of codec ids The current code relies on AV_CODEC_ID_NONE being zero, so that unused codec ids are set to their proper value. This commit adds a macro to set unset ids to AV_CODEC_ID_NONE. (The actual rationale for this macro is to simplify the transition to making the private fields that are currently public in avcodec.h really private.) Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 16:57:02 +01:00
Andreas Rheinhardt	12f7a7724d	avcodec: Remove unnecessary parser.h inclusions It only contains declarations for some auxiliary functions for parsing that parsers that only work with complete packets don't need. Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 16:57:02 +01:00
James Almer	9c266084fe	avcodec/vc1_parser: stop splitting ENDOFSEQ markers into separate packets The decode API can handle outputting delayed frames without relying on the parser splitting off the ENDOFSEQ marker. Signed-off-by: James Almer <jamrial@gmail.com>	2025-11-01 12:23:19 -03:00
James Almer	9ab6a94195	avcodec/vc1dec: look for ENDOFSEQ markers at the end of a packet's payload This is in preparation for the parser no longer splitting them into their own packets. Signed-off-by: James Almer <jamrial@gmail.com>	2025-11-01 12:22:49 -03:00
Andreas Rheinhardt	fc2c030fb2	avcodec/vc1dec: Deduplicate cleanup code Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 15:47:45 +01:00
Andreas Rheinhardt	a44f7b6c4f	avcodec/vc1dec: Don't initialize write-only intra_scantable VC-1 does not use it. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 15:47:45 +01:00
Andreas Rheinhardt	b91081274f	avcodec/x86/h264_qpel: Add and use ff_{avg,put}_pixels16x16_l2_sse2() This avoids mmx (the size 16 h264qpel dsp now no longer uses any mmx) and improves performance, particularly for the avg case: Old benchmarks: avg_h264_qpel_16_mc01_8_c: 780.0 ( 1.00x) avg_h264_qpel_16_mc01_8_sse2: 91.2 ( 8.55x) avg_h264_qpel_16_mc03_8_c: 804.0 ( 1.00x) avg_h264_qpel_16_mc03_8_sse2: 91.2 ( 8.82x) put_h264_qpel_16_mc01_8_c: 779.5 ( 1.00x) put_h264_qpel_16_mc01_8_sse2: 82.8 ( 9.41x) put_h264_qpel_16_mc03_8_c: 770.1 ( 1.00x) put_h264_qpel_16_mc03_8_sse2: 82.5 ( 9.33x) New benchmarks: avg_h264_qpel_16_mc01_8_c: 783.9 ( 1.00x) avg_h264_qpel_16_mc01_8_sse2: 84.1 ( 9.32x) avg_h264_qpel_16_mc03_8_c: 797.4 ( 1.00x) avg_h264_qpel_16_mc03_8_sse2: 83.9 ( 9.51x) put_h264_qpel_16_mc01_8_c: 767.4 ( 1.00x) put_h264_qpel_16_mc01_8_sse2: 80.5 ( 9.53x) put_h264_qpel_16_mc03_8_c: 779.9 ( 1.00x) put_h264_qpel_16_mc03_8_sse2: 80.3 ( 9.71x) (qpeldsp will use these functions when it gets ported to SSE2; then the mmxext functions will be removed as well.) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 15:17:05 +01:00
Andreas Rheinhardt	529a048c31	avcodec/x86/qpel: Add specializations for put_l2 functions These functions are currently always called with height either being equal to the block size or block size+1. height is a compile-time constant at every callsite. This makes it possible to split this function into two to avoid the check inside the function for whether height is odd or even. The corresponding avg function is only used with height == block size, so that it does not have a height parameter at all. Removing the parameter from the put_l2 functions as well therefore simplifies the C code. The new functions increase the size of .text from qpel{dsp}.o by 32B here, yet they save 464B of C code here. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 15:17:05 +01:00
Andreas Rheinhardt	5715eb1274	avcodec/x86/{h264_qpel,qpeldsp_init}: Move shared decls into header Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 15:17:05 +01:00
Andreas Rheinhardt	b03b45cf22	avcodec/x86/h264_chromamc: Remove MMX(EXT) funcs overridden by SSSE3 SSSE3 is already quite old (introduced 2006 for Intel, 2011 for AMD), so that the overwhelming majority of our users (particularly those that actually update their FFmpeg) will be using the SSSE3 versions. This commit therefore removes the MMX(EXT) functions overridden by them (which don't abide by the ABI) to get closer to a removal of emms_c. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 13:34:23 +01:00
Andreas Rheinhardt	cb054ee89b	avcodec/x86/h264_chromamc: Add SSSE3 RV40 chroma motion compensation functions The only difference between it and the H.264/VC-1 versions is the bias constant which depends on the shift parameters for RV40. This value ends up in a register and therefore one can reuse the H.264 code by setting the registers for RV40 and then jumping into the relevant H.264 function, making the four new functions cheap (just 256 bytes in total). This approach uses one jump more for the no-filter case and one jump less in the one-dimensional case than an approach using separate functions. avg_chroma_mc4_c: 167.5 ( 1.00x) avg_chroma_mc4_mmxext: 48.1 ( 3.48x) avg_chroma_mc4_ssse3: 31.1 ( 5.39x) avg_chroma_mc8_c: 325.5 ( 1.00x) avg_chroma_mc8_mmxext: 103.2 ( 3.15x) avg_chroma_mc8_ssse3: 33.5 ( 9.71x) put_chroma_mc4_c: 137.4 ( 1.00x) put_chroma_mc4_mmx: 44.5 ( 3.09x) put_chroma_mc4_ssse3: 28.4 ( 4.83x) put_chroma_mc8_c: 271.4 ( 1.00x) put_chroma_mc8_mmx: 99.9 ( 2.72x) put_chroma_mc8_ssse3: 30.6 ( 8.86x) Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 13:34:00 +01:00
Andreas Rheinhardt	c6efe1abda	avcodec/h264chroma: Move mc1 function to mpegvideo_dec.c It is only used by mpegvideo decoders (for lowres). It is also only used for bitdepth == 8, so don't build the bitdepth == 16 function at all any more. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-01 13:31:57 +01:00
James Almer	6879c8ee5d	avcodec/avcodec: add helpers to convert between packet and frame side data They will be used in following commits. Signed-off-by: James Almer <jamrial@gmail.com>	2025-10-30 10:54:01 -03:00
Andreas Rheinhardt	0242cb36a5	avcodec/rv34dsp: Reduce size of qpel functions arrays Only size 16 and 8 are used (and set). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-10-30 12:17:25 +01:00
Zhao Zhili	82c495fd15	avcodec/hevc: fix false alarm when build with enable-small profile_name is always NULL with --enable-small, which leading to a warning message "Unknown profile bitstream".	2025-10-30 09:26:17 +00:00
Andreas Rheinhardt	0f105b96a3	avcodec/x86/hevc/idct: Port ff_hevc_idct_4x4_dc_{8,10,12}_mmxext to SSE2 Practically no change in benchmarks (and in codesize). hevc_idct_4x4_dc_8_c: 7.8 ( 1.00x) hevc_idct_4x4_dc_8_mmxext: 6.9 ( 1.14x) hevc_idct_4x4_dc_8_sse2: 6.8 ( 1.15x) hevc_idct_4x4_dc_10_c: 7.9 ( 1.00x) hevc_idct_4x4_dc_10_mmxext: 6.9 ( 1.16x) hevc_idct_4x4_dc_10_sse2: 6.8 ( 1.16x) hevc_idct_4x4_dc_12_c: 7.8 ( 1.00x) hevc_idct_4x4_dc_12_mmxext: 7.0 ( 1.13x) hevc_idct_4x4_dc_12_sse2: 6.8 ( 1.15x) Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-10-30 08:56:45 +01:00
Michael Niedermayer	909af3a571	avcodec/g723_1enc: Make min_err 64bit This is intending to fix the case described in https://lists.ffmpeg.org/archives/list/ffmpeg-devel@ffmpeg.org/thread/AAZ7GJPPUJI5SCVTDGJ6QL7UUEP56WOM/ Where FCBParam optim is used uninitialized a min_err of 1<<30, allows the struct to be never initilialized as all err (which is int32_t) can be larger than min_err. By increasing min_err above the int32_t range this is no longer possible Untested, as i do not have the testcase Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>i	2025-10-30 03:41:24 +01:00
Michael Niedermayer	d8ffec5bf9	avcodec/vlc: Clear val8/16 in vlc_multi_gen() by av_mallocz() Fixes: use of uninitialized memory Fixes: 427814450/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MAGICYUV_DEC_fuzzer-646512196065689 Fixes: 445961558/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_UTVIDEO_DEC_fuzzer-5515158672965632 the multi vlc code will otherwise return uninitialized data. Now one can argue that this data should not be used, but on errors this data can remain ... Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2025-10-30 02:08:14 +01:00
Lynne	afd927e0ca	prores_raw: skip frames if the discard flag is set All other decoders do this.	2025-10-28 23:10:15 +01:00
Andreas Rheinhardt	0941646182	avcodec/fflcms2: Don't access inexistent array elements This would happen if one of the extended transfer characteristics is in use (currently only AVCOL_TRC_V_LOG). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-10-28 21:40:50 +01:00
Lynne	e85947576c	ffv1enc_vulkan: limit probability caching to RADV only Nvidia's drivers recently broke this.	2025-10-28 20:46:25 +01:00
Lynne	bcfb4b2e3e	prores_raw_parser: set color params based on vendor Panasonic cameras record ProRes RAW in V-Log/V-Gamut internally. Atomos is a brand of recorders which can record uncompressed non-debayered RAW over HDMI. All known cameras output linear RAW over HDMI, so mark the transfer function as linear in that case.	2025-10-28 20:46:21 +01:00
Lynne	aeb9b19ebc	lavu: add support for Panasonic V-Log	2025-10-28 20:46:21 +01:00
James Almer	530ca627a3	avcodec/mlpdec: don't depend on context channel layout when setting substream masks If avctx->ch_layout is unset (as it's allowed and even expeced by the AV_CODEC_CAP_CHANNEL_CONF flag), the code setting substream masks will fail for stereo and mono layouts unless a downmix channel was requested. Fix this by deriving the mask with coded values only. Fixes issue #20764. Signed-off-by: James Almer <jamrial@gmail.com>	2025-10-28 13:08:02 +00:00
xin.guo	0a4bd6cc23	avcodec/h264_vulkan: Fix param error in set_sps Signed-off-by: xin.guo <xin.guo@sophgo.com>	2025-10-28 06:12:36 +00:00
averne	909d71322a	vulkan/prores: output LSB-padded data For consistency with existing Vulkan-based hwaccels	2025-10-28 06:12:14 +00:00
Lynne	70be2e2ae2	lavc/hwaccels: properly order list	2025-10-28 07:11:26 +01:00
Lynne	51843adfe5	vulkan/rangecoder: ifdef out encode and decode chunks There's little code sharing between them.	2025-10-28 07:11:26 +01:00
Lynne	3cd678506c	vulkan_decode: align images to the subsampling Normally, the Vulkan drivers handle this. But Vulkan decided "nah". This requires API users to crop out odd-numbered images with subsampling.	2025-10-28 07:11:26 +01:00
Lynne	5b388f2838	hwcontext_vulkan: remove unsupported/broken pixel formats We have no use for 14-bit pixel formats for now, so remove support for gray14, which was broken due to the LSB padding issue. Similarly YUVA at 10/12 bit was broken for the same reason.	2025-10-27 22:59:41 -03:00
Lynne	98ee3f6718	hwcontext_vulkan: fix planar 10 and 12-bit RGB formats using the new MSB formats	2025-10-27 22:59:41 -03:00
Lynne	41ecb203c5	hwcontext_vulkan: fix 3-plane 444 10 and 12-bit formats using the new MSB formats We previously tried to fudge this somehow, but the pixel formats are simply broken and we cannot use them without declaring them as MSB.	2025-10-27 22:59:41 -03:00
Lynne	471acedec2	hwcontext_vulkan: fix grayscale 10 and 12-bit formats using the new MSB formats	2025-10-27 22:59:41 -03:00
Kacper Michajłow	a6ccaa2eea	avcodec/d3d12va_encode_h264: remove unused variables Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2025-10-27 15:39:39 +01:00
Kacper Michajłow	d57de83352	avcodec/d3d12va_encode: fix format specifier for HRESULT Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2025-10-27 15:39:39 +01:00
David Rosca	a0a16f2ea4	cbs_vp9: Always update loop filter and segmentation from current frame Fixes decoding vp90-2-09-aq2, vp90-2-15-segkey_adpq, vp90-2-15-segkey and vp90-2-22-svc_1280x720_1 with Vulkan hwaccel. Fixes: `26a2a76346` ("cbs_vp9: Fix VP9 passthrough")	2025-10-27 13:44:03 +00:00
Andreas Rheinhardt	d01608e022	avcodec/proresdec: Remove unused hwaccel_last_picture_private ProRes is an intra-only codec. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-10-25 22:34:30 +02:00
averne	23df9d4172	avcodec/prores: add parser Introduce a basic parser for ProRes frame headers. This avoid having to decode an entire frame to extract codec information.	2025-10-25 19:56:44 +00:00
averne	98412edfed	lavc: add a ProRes Vulkan hwaccel Add a shader-based Apple ProRes decoder. It supports all codec features for profiles up to the 4444 XQ profile, ie.: - 4:2:2 and 4:4:4 chroma subsampling - 10- and 12-bit component depth - Interlacing - Alpha The implementation consists in two shaders: the VLD kernel does entropy decoding for color/alpha, and the IDCT kernel performs the inverse transform on color components. Benchmarks for a 4k yuv422p10 sample: - AMD Radeon 6700XT: 178 fps - Intel i7 Tiger Lake: 37 fps - NVidia Orin Nano: 70 fps	2025-10-25 19:54:13 +00:00

1 2 3 4 5 ...

52983 commits