ffmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2026-06-04 22:50:24 +00:00

Author	SHA1	Message	Date
Diego de Souza	7ac3d83e7a	avcodec/nvdec: fix dimension rounding for monochrome/444 formats frames_ctx->width/height were unconditionally rounded to even, causing odd-dimension monochrome/444 clips to be reported with incorrect surface pool dimensions. Round only for 4:2:0 and 4:2:2; for monochrome/444 use avctx->coded_width/coded_height unchanged, matching the dimensions set by the software codec layer. Patch by: Aniket Dhok <adhok@nvidia.com> Signed-off-by: Diego de Souza <ddesouza@nvidia.com>	2026-05-18 20:47:15 +00:00
James Almer	d6a22cda38	avcodec/decode: add a hwaccel specific post_process callback to FrameDecodeData Leave the existing one for non decoder-specific, post processing usage. With this, scenarios like nvdec decoding can work algonside lcevc enhancement application. Signed-off-by: James Almer <jamrial@gmail.com>	2026-03-28 20:14:13 +00:00
Diego de Souza	6ef0ef51dc	avcodec/nvdec: fix surface pool limits and unsafe_output lifetime Cap ulNumDecodeSurfaces to 32 and ulNumOutputSurfaces to 64 to prevent cuvidCreateDecoder from failing with CUDA_ERROR_INVALID_VALUE when initial_pool_size exceeds the hardware limits. Also cap the decoder index pool (dpb_size) to 32 so that indices handed out via av_refstruct_pool_get stay within the valid range for cuvidDecodePicture's CurrPicIdx. When unsafe_output is enabled, stop holding idx_ref in the unmap callback. Since cuvidMapVideoFrame copies decoded data into an independent output mapping slot, the decode surface index can safely be reused as soon as the DPB releases it, without waiting for the downstream consumer to release the mapped frame. This decouples the decode surface index lifetime (max 32) from the output mapping slot lifetime (max 64), eliminating the "No decoder surfaces left" error that occurred when downstream components like nvenc held too many frames. Signed-off-by: Diego de Souza <ddesouza@nvidia.com>	2026-03-16 18:18:12 +00:00
Timo Rothenpieler	3ce348063c	avcodec/nvdec: switch to proper pixfmts on next major bump	2025-07-11 17:49:58 +02:00
Timo Rothenpieler	bf5f3f1f2e	avcodec/nvdec: fix 10bit output pixel formats Fixes #11655	2025-07-04 17:20:57 +02:00
Andreas Rheinhardt	b306683d12	avutil/frame: Port AVFrame.private_ref to RefStruct API This is possible without deprecation period, because said field is documented as only for our libav* libraries and not the general public. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2025-03-28 14:33:08 -03:00
Diego de Souza	30e6effff9	avcodec/nvdec: add 4:2:2 decoding and 10-bit support This commit adds support for 4:2:2 decoding for HEVC and H.264 on NVIDIA Blackwell GPUs. Additionally, it supports 10-bit decoding for H.264 on Blackwell GPUs. Signed-off-by: Diego de Souza <ddesouza@nvidia.com> Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2025-02-02 20:01:56 +01:00
Anton Khirnov	56ba57b672	lavc/refstruct: move to lavu and make public It is highly versatile and generally useful.	2024-12-15 14:03:47 +01:00
Andreas Rheinhardt	790f793844	avutil/common: Don't auto-include mem.h There are lots of files that don't need it: The number of object files that actually need it went down from 2011 to 884 here. Keep it for external users in order to not cause breakages. Also improve the other headers a bit while just at it. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-31 00:08:43 +01:00
Timo Rothenpieler	e99c273fec	avcodec/nvdec: reset bitstream_len/nb_slices when resetting bitstream pointer	2024-03-30 00:12:23 +01:00
Andreas Rheinhardt	9ae40f282d	avcodec/nvdec: Constify bitstream pointee Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-08 14:00:37 +01:00
James Almer	7f92014aca	avcodec/nvdec: don't free NVDECContext->bitstream Ensure all hwaccels that allocate a buffer use NVDECContext->bitstream_internal instead. Otherwise, if FFHWAccel->end_frame() isn't called before FFHWAccel->uninit(), an attempt to free a stale pointer to memory not owned by the hwaccel could take place. Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-07 11:31:33 -03:00
Andreas Rheinhardt	e01e30ede1	avcodec/nvdec: Use RefStruct-pool API for decoder pool It involves less allocations, in particular no allocations after the entry has been created. Therefore creating a new reference from an existing one can't fail and therefore need not be checked. It also avoids indirections and casts. Also note that nvdec_decoder_frame_init() (the callback to initialize new entries from the pool) does not use atomics to read and replace the number of entries currently used by the pool. This relies on nvdec (like most other hwaccels) not being run in a truely frame-threaded way. Tested-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:13:01 +01:00
Andreas Rheinhardt	c7fb4d0eb6	avcodec/nvdec: Use RefStruct API for decoder_ref Avoids allocations and error checks as well as the boilerplate code for creating an AVBuffer with a custom free callback. Also increases type safety. Reviewed-by: Anton Khirnov <anton@khirnov.net> Tested-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:36:21 +02:00
Timo Rothenpieler	7e8b539389	avcodec/nvdec: make explicit copy of frames unless user requested otherwise	2022-12-10 00:52:34 +01:00
Andreas Rheinhardt	56973eb687	avcodec/nvdec: Use av_buffer_replace() where appropriate Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-08-08 11:42:28 +02:00
Andreas Rheinhardt	d3730acca3	avcodec/nvdec: Check av_buffer_ref() It (unfortunately) involves an allocation and can therefore fail. Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-08-08 11:41:52 +02:00
Martin Storsjö	a78f136f3f	configure: Use a separate config_components.h header for $ALL_COMPONENTS This avoids unnecessary rebuilds of most source files if only the list of enabled components has changed, but not the other properties of the build, set in config.h. Signed-off-by: Martin Storsjö <martin@martin.st>	2022-03-16 14:12:49 +02:00
Andreas Rheinhardt	ef6a9e5e31	avutil/buffer: Switch AVBuffer API to size_t Announced in `14040a1d91`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 10:43:13 -03:00
James Almer	d8a18c8fc2	avcodec: use the buffer_size_t typedef where required Signed-off-by: James Almer <jamrial@gmail.com>	2021-03-10 20:26:36 -03:00
Philip Langdale	67bb11b5f6	avcodec/nvdec: Add support for decoding monochrome av1 The nvidia hardware explicitly supports decoding monochrome content, presumably for the AVIF alpha channel. Supporting this requires an adjustment in av1dec and explicit monochrome detection in nvdec. I'm not sure why the monochrome path in av1dec did what it did - it seems non-functional - YUV440P doesn't seem a logical pix_fmt for monochrome and conditioning on chroma sub-sampling doesn't make sense. So I changed it. I've tested 8bit content, but I haven't found a way to create a 10bit sample, so that path is untested for now.	2020-12-06 14:59:24 -08:00
Timo Rothenpieler	ac5b45abab	avcodec/nvdec: add av1 hwaccel Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org> Co-authored-by: James Almer <jamrial@gmail.com>	2020-11-11 18:36:09 +01:00
Timo Rothenpieler	72982f8cb5	avcodec/nvdec: add support for separate reference frame Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2020-11-11 18:36:09 +01:00
Timo Rothenpieler	767f53533a	nvdec: attach real hw_frames to post-processed frames	2020-03-28 17:58:54 +01:00
Philip Langdale	83c7ac2e47	avcodec/nvdec: Explicitly mark codecs that support 444 output formats With the introduction of HEVC 444 support, we technically have two codecs that can handle 444 - HEVC and MJPEG. In the case of MJPEG, it can decode, but can only output one of the semi-planar formats. That means we need additional logic to decide whether to use a 444 output format or not.	2019-02-16 08:47:36 -08:00
Philip Langdale	e06ccfbe1d	avcodec/nvdec: Add support for decoding HEVC 4:4:4 content The latest generation video decoder on the Turing chips supports decoding HEVC 4:4:4. Supporting this is relatively straight-forward; we need to account for the different chroma format and pick the right output and sw formats at the right times. There was one bug which was the hard-coded assumption that the first chroma plane would be half-height; I fixed this to use the actual shift value on the plane. We also need to pass the SPS and PPS range extension flags.	2019-02-16 08:47:36 -08:00
Philip Langdale	19d3d0c057	avutil/hwcontext_cuda: Define and use common CHECK_CU() We have a pattern of wrapping CUDA calls to print errors and normalise return values that is used in a couple of places. To avoid duplication and increase consistency, let's put the wrapper implementation in a shared place and use it everywhere. Affects: * avcodec/cuviddec * avcodec/nvdec * avcodec/nvenc * avfilter/vf_scale_cuda * avfilter/vf_scale_npp * avfilter/vf_thumbnail_cuda * avfilter/vf_transpose_npp * avfilter/vf_yadif_cuda	2018-11-14 17:39:42 -08:00
Philip Langdale	1b41115ef7	avcodec/nvdec: Increase frame pool size to help deinterlacing With the cuda yadif filter in use, the number of mapped decoder frames could increase by two, as the filter holds on to additional frames.	2018-11-02 11:27:13 -07:00
Philip Langdale	2d0ee127be	avcodec/nvdec: Push the context before destroying the decoder This has no visible effects but I happened to run under the cuda memcheck tool and it called it out as an error.	2018-10-24 10:43:41 -07:00
Timo Rothenpieler	880236e898	avcodec/nvdec: pass CUstream in vpp parameters	2018-05-10 00:34:22 +02:00
Timo Rothenpieler	baabd3c2ad	avcodec/nvdec: avoid needless copy of output frame Replaces the data pointers with the mapped cuvid ones. Adds buffer_refs to the frame to ensure the needed contexts stay alive and the cuvid idx stays allocated. Adds another buffer_ref to unmap the frame when it's unreferenced itself.	2018-05-10 00:34:21 +02:00
Philip Langdale	cd98f20b4a	avcodec/nvdec: Implement mjpeg nvdec hwaccel	2018-02-21 23:38:42 +00:00
Jacob Trimble	2fdc9f7c49	avcodec/nvdec: Fix capability check with old drivers. Copied the check from cuviddec.c (*_cuvid decoders) to allow the capability check to be optional for older drivers. Signed-off-by: Jacob Trimble <modmaker@google.com> Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2017-12-08 17:56:38 +01:00
Philip Langdale	1da9851e34	avcodec/nvdec: Implement vp8 hwaccel	2017-11-26 14:55:01 -08:00
Philip Langdale	4186a77f26	avcodec/nvdec: Round up odd width/height values nvdec will not produce odd width/height output, and while this is basically never an issue with most codecs, due to internal alignment requirements, you can get odd sized jpegs. If an odd-sized jpeg is encountered, nvdec will actually round down internally and produce output that is slightly smaller. This isn't the end of the world, as long as you know the output size doesn't match the original image resolution. However, with an hwaccel, we don't know. The decoder controls the reported output size and the hwaccel cannot change it. I was able to trigger an error in mpv where it tries to copy the output surface as part of rendering and triggers a cuda error because cuda knows the output frame is smaller than expected. To fix this, we can round up the configured width/height passed to nvdec so that the frames are always at least as large as the decoder's reported size, and data can be copied out safely. In this particular jpeg case, you end up with a blank (green) line at the bottom due to nvdec refusing to decode the last line, but the behaviour matches cuviddec, so it's as good as you're going to get.	2017-11-24 12:19:31 -08:00
Mark Thompson	1dc483a6f2	compat/cuda: Pass a logging context to load functions Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org>	2017-11-20 15:47:05 +00:00
Philip Langdale	6b77a10e43	avcodec: Implement mpeg4 nvdec hwaccel This was predictably nightmarish, given how ridiculous mpeg4 is. I had to stare at the cuvid parser output for a long time to work out what each field was supposed to be, and even then, I still don't fully understand some of them. Particularly: vop_coded: If I'm reading the decoder correctly, this flag will always be 1 as the decoder will not pass the hwaccel any frame where it is not 1. divx_flags: There's obviously no documentation on what the possible flags are. I simply observed that this is '0' for a normal bitstream and '5' for packed b-frames. gmc_enabled: I had a number of guesses as to what this mapped to. I picked the condition I did based on when the cuvid parser was setting flag. Also note that as with the vdpau hwaccel, the decoder needs to consume the entire frame and not the slice.	2017-11-20 07:21:41 -08:00
Philip Langdale	8bca292c30	avcodec: Implement mpeg1 nvdec hwaccel Once I remembered that there's a separate decoder type for mpeg1, even though params struct is shared with mpeg2, everything worked.	2017-11-20 07:03:26 -08:00
Philip Langdale	4c7b023d56	avcodec: Refactor common nvdec hwaccel logic The 'simple' hwaccels (not h.264 and hevc) all use the same bitstream management and reference lookup logic so let's refactor all that into common functions. I verified that casting a signed int -1 to unsigned char produces 255 according to the C language specification.	2017-11-20 07:03:26 -08:00
Philip Langdale	7c9f739d86	avcodec: Implement mpeg2 nvdec hwaccel This is mostly straight-forward. The weird part is that it should just work for mpeg1, but I see corruption in my test cases, so I'm going to try and fix that separately.	2017-11-18 08:13:50 -08:00
Philip Langdale	912ceba61b	avcodec: Implement vc1 nvdec hwaccel This hwaccel is interesting because it also works for wmv3/9 content, which is not supported by the nvidia parser used by cuviddec.	2017-11-14 19:40:01 -08:00
Timo Rothenpieler	8bcf5840ea	avcodec/nvdec: fix return value on error	2017-11-13 20:33:10 +01:00
Timo Rothenpieler	538de4354d	avcodec/nvdec: warn about thread count if applicable	2017-11-13 20:33:10 +01:00
Timo Rothenpieler	f3f73f0893	avcodec: implement vp9 nvdec hwaccel	2017-11-13 20:33:10 +01:00
Timo Rothenpieler	3f6294a53d	avcodec/nvdec: add support for 12 bit formats	2017-11-12 15:46:39 +01:00
Timo Rothenpieler	c60bc02bf4	avcodec/nvdec: check hardware capabilities	2017-11-12 15:46:39 +01:00
Timo Rothenpieler	3e0e163458	avcodec/nvdec: don't add thread buffer twice This is already added to the initial pool size in ff_decode_get_hw_frames_ctx, so adding it here again increases the amount of surfaces needlessly.	2017-11-12 15:46:39 +01:00
wm4	7546964f96	nvdec: add frames_params support	2017-11-11 20:33:45 -03:00
James Almer	2760454945	avcodec/nvdec: fix copyright headers Fixes fate-source. Signed-off-by: James Almer <jamrial@gmail.com>	2017-11-10 21:06:58 -03:00
James Almer	1178babaca	Merge commit '`b90fdb2c71`' * commit '`b90fdb2c71`': hevcdec: add a CUVID hwaccel Adapted for ffmpeg by Timo Rothenpieler. Merged-by: James Almer <jamrial@gmail.com>	2017-11-10 20:43:15 -03:00

1 2

51 commits