ffmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2025-12-08 06:09:50 +00:00

Author	SHA1	Message	Date
Lynne	680d969a30	vulkan_decode: port to the new queue family API	2024-08-11 05:13:16 +02:00
Lynne	1c05661ec4	vulkan_decode: add \n to error message	2024-08-11 05:13:15 +02:00
Lynne	ca591e6b50	vulkan_decode: force layered_dpb to 0 when dedicated_dpb is 0 layered_dpb only makes sense when dedicated_dpb is set to 1. For some mysterious reason, some Nvidia drivers stopped indicating SEPARATE_REFRENCES, but kept the COINCIDE flag, which broke the code.	2024-08-11 05:13:14 +02:00
Lynne	6757cdb535	vulkan_video: remove NIH pooled buffer implementation The code predates ff_vk_get_pooled_buffer().	2024-08-11 05:13:10 +02:00
Lynne	db09f1a5d8	vulkan_av1: add workaround for NVIDIA drivers tested on broken CTS The first release of the CTS for AV1 decoding had incorrect offsets for the OrderHints values. The CTS will be fixed, and eventually, the drivers will be updated to the proper spec-conforming behaviour, but we still need to add a workaround as this will take months. Only NVIDIA use these values at all, so limit the workaround to only NVIDIA. Also, other vendors don't tend to provide accurate CTS information.	2024-04-15 02:40:02 +02:00
Andreas Rheinhardt	790f793844	avutil/common: Don't auto-include mem.h There are lots of files that don't need it: The number of object files that actually need it went down from 2011 to 884 here. Keep it for external users in order to not cause breakages. Also improve the other headers a bit while just at it. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-31 00:08:43 +01:00
Lynne	ecdc94b97f	vulkan_av1: port to the new stable API Co-Authored-by: Dave Airlie <airlied@redhat.com>	2024-03-25 08:54:40 +01:00
Andreas Rheinhardt	ccb432c1fe	avcodec/vulkan_decode: Remove always-false check These fields are set for all Vulkan decoding hwaccels; they would be useless if it were different. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-07 09:00:47 +01:00
Andreas Rheinhardt	f9d35e78fe	avcodec/vulkan_decode: Un-sparse extensions table Only three of the 226 (== AV_CODEC_ID_AV1) entries have been used. Unsparsing this table is especially important given that this array lives in .data.rel.ro. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-07 09:00:39 +01:00
Andreas Rheinhardt	f7b227bec3	avcodec/vulkan_video: Merge dec part of FFVkCodecMap and extension props All the fields of FFVkCodecMap are either decoder-only or encoder-only (with the latter being unused and unset for now). Yet there is already a per-decoder struct containing static information about these decoders, namely VkExtensionProperties. This commit merges the decoder-parts of FFVkCodecMap with the VkExtensionProperties into a common structure. Given that FFVkCodecMap is now unused, it is removed. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-07 09:00:30 +01:00
Andreas Rheinhardt	e429b0fdb7	avutil/vulkan: Don't autoinclude vulkan_loader.h Only include it where necessary. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-03 22:55:26 +01:00
Andreas Rheinhardt	cb15b7b29e	avcodec/vulkan_video: Don't use sparse table ff_vk_codec_map currently is an array indexed by AVCodecID; it has AV_CODEC_ID_FIRST_AUDIO (= 65536) entries, but uses only three of them; only 24B of 1MiB were actually used This commit fixes this by adding an AVCodecID field to the table and making it non-sparse. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-03-03 17:17:13 +01:00
Sam James	2f24f10d9c	libavcodec: fix -Wint-conversion in vulkan FIx warnings (soon to be errors in GCC 14, already so in Clang 15): ``` src/libavcodec/vulkan_av1.c: In function ‘vk_av1_create_params’: src/libavcodec/vulkan_av1.c:183:43: error: initialization of ‘long long unsigned int’ from ‘void *’ makes integer from pointer without a cast [-Wint-conversion] 183 \| .videoSessionParametersTemplate = NULL, \| ^~~~ src/libavcodec/vulkan_av1.c:183:43: note: (near initialization for ‘(anonymous).videoSessionParametersTemplate’) ``` Use Vulkan's VK_NULL_HANDLE instead of bare NULL. Fix Trac ticket #10724. Was reported downstream in Gentoo at https://bugs.gentoo.org/919067. Signed-off-by: Sam James <sam@gentoo.org>	2024-01-06 22:38:55 +01:00
Lynne	70864e6adb	vulkan_decode: correct flipped condition in image layout Changed by the previous commit. Caused validation issues on hardware with !reuse_dpb_dst but not layered_dpb.	2023-10-25 22:01:21 +02:00
Lynne	0b3616231d	vulkan_decode: fix another validation issue Surprising no one, the insane usage rule has a catch.	2023-10-25 20:51:55 +02:00
Lynne	467e411839	vulkan_decode: fix pedantic validation issue "Validation Error: [ VUID-VkImageViewCreateInfo-imageViewType-04974 ] Object 0: handle = 0x9f9b41000000003c, type = VK_OBJECT_TYPE_IMAGE; \| MessageID = 0xc120e150 \| vkCreateImageView(): Using pCreateInfo->viewType VK_IMAGE_VIEW_TYPE_2D and the subresourceRange.layerCount VK_REMAINING_ARRAY_LAYERS=(17) and must 1 (try looking into VK_IMAGE_VIEW_TYPE_*_ARRAY). The Vulkan spec states: If viewType is VK_IMAGE_VIEW_TYPE_1D, VK_IMAGE_VIEW_TYPE_2D, or VK_IMAGE_VIEW_TYPE_3D; and subresourceRange.layerCount is VK_REMAINING_ARRAY_LAYERS, then the remaining number of layers must be 1"	2023-10-25 20:51:54 +02:00
Lynne	9ee4f47c94	vulkan_decode: use coded_width/height instead of the non-coded width and height Partially fixes https://streams.videolan.org/issues/19938/20000_20180305-15.04.59.ts The is coded as 1920x1080, meant to be rendered at 1440x1080 with cropping, or 1680x1080 before cropping. Currently, the created DPB is 1440x1080, which results in the image being decoded incorrectly, as the decoder overwrites output memory. This commit fixes this.	2023-10-25 20:51:05 +02:00
Andreas Rheinhardt	6695c0af0e	avcodec/vulkan_decode: Use RefStruct API for shared_ref Avoids allocations, error checks and indirections. Also increases type-safety. Reviewed-by: Lynne <dev@lynne.ee> Tested-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:35:50 +02:00
Lynne	9310ffc809	vulkan_decode: don't call get_proc_addr on every frame's destruction The issue is that we cannot rely on any context existing when we free frames. The Vulkan functions are loaded in each context separately, so until now, we've just been loading them on every frame's destruction. Rather than do this, just save the function pointers we need in each frame. The function pointers are guaranteed to not change and exist.	2023-09-15 17:35:22 +02:00
Lynne	552a5fa496	vulkan_hevc: switch from a buffer pool to a malloc and simplify Simpler and more robust now that contexts are not shared between threads.	2023-09-15 17:35:19 +02:00
Andreas Rheinhardt	c1b6235d41	avcodec/vulkan_decode: Factor creating session params out, fix leak All Vulkan HWAccels share the same boilerplate code for creating session params and this includes a common bug: In case actually creating the video session parameters fails, the buffer destined to hold them leaks; in case of HEVC this is also true if get_data_set_buf() fails. This commit factors this code out and fixes the leak. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-09-15 02:38:22 +02:00
Lynne	398467f519	vulkan_decode: convert max level from vulkan to av for comparisons	2023-09-08 06:56:43 +02:00
Andreas Rheinhardt	8238bc0b5e	avcodec/defs: Add AV_PROFILE_* defines, deprecate FF_PROFILE_* defines These defines are also used in other contexts than just AVCodecContext ones, e.g. in libavformat. Furthermore, given that these defines are public, the AV-prefix is the right one, so deprecate (and not just move) the FF-macros. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-09-07 00:39:02 +02:00
Kacper Michajłow	9f66286f0b	avcodec/vulkan_decode: print also codec header name Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2023-08-24 22:51:36 +02:00
Kacper Michajłow	9d0da996f0	avcodec/vulkan_decode: fix struct type for h265_profile Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2023-08-24 22:51:25 +02:00
Lynne	c06ad641ec	lavc/vulkan_decode: use a single execution pool per thread The spec says command buffer pools must be externally synchronized objects. This still lets us pool some, just not as much.	2023-07-21 20:04:15 +02:00
Lynne	4ff303a7b8	vulkan_decode: simplify and make session parameter generation more robust This commit scraps a bool to signal to recreate the session parameters, but instead destroys them, forcing them to be recreated. As this can happen between start_frame and end_frame, do this at both places.	2023-06-22 18:17:54 +02:00
Lynne	ba8a803236	vulkan_decode: clean up slice handling Move the slice offsets buffer to the thread decode context. It isn't part of the resources for frame decoding, the driver has to process and finish with it at submission time. That way, it doesn't need to be alloc'd + freed on every frame.	2023-06-22 18:17:54 +02:00
Lynne	237c400727	vulkan_decode: remove unused fields	2023-06-22 18:17:53 +02:00
Lynne	d9af84426b	vulkan_decode: fix small memory leak This requires using the new AVHWFramesContext.opaque field, as otherwise, the profile attached to the decoder will be freed before the frames context, rendering the frames context useless.	2023-06-22 18:17:53 +02:00
Lynne	13ff3aa9e7	vulkan_decode: use the hwfc->user_opaque field to store the profile	2023-06-22 18:17:47 +02:00
Lynne	ca818ab51c	vulkan_h264: filter out constrained/inter flags from the profile index As the comment says, Vulkan signals all the constrant_set flags, and does not want them OR'd onto the profile IDC. So just unset them.	2023-06-15 22:00:42 +02:00
Lynne	24c4307b80	vulkan_decode: halve execution pool size Determined experimentally, on various videos and hardware. On Intel, using less resources in-flight is around 15% faster, with similar results on Nvidia hardware.	2023-06-07 23:59:17 +02:00
Lynne	9f9534f5b6	vulkan_decode: fix typo when setting AV1 capabilities All pNext chained structs in Vulkan are defined as void *, so it doesn't help catch this.	2023-05-29 23:26:10 +02:00
Lynne	e71cd18049	vulkan_decode: do not align the image dimensions According to Dave Airlie: > <airlied> but I think ignoring it should be fine, I can't see any > other way to get the imaeg extents correct for other usage > <Lynne> what width/height should be used for the images? > the final presentable dimensions, or the coded dimensions? > <airlied> if you don't want noise I think the presentable dims > <airlied> the driver should round up the allocations internally, > but if you are going to sample from the images then w/h have to be > the bounds of the image you want > <airlied> since otherwise there's no way to stop the sampler from > going outside the edges Apparently, the alignment values are informative, rather than mandatory, but the spec's wording makes it sound as if they're mandatory.	2023-05-29 05:12:27 +02:00
James Almer	fe103ee61f	avcodec/vulkan_dec: use PRId64 specifier for an int64_t Fixes warnings on x86-32 and Windows. Signed-off-by: James Almer <jamrial@gmail.com>	2023-05-28 23:18:53 -03:00
Lynne	bae92361ed	vulkan_decode: check if yuv_sampler exists before freeing it This prevents multiple NULL accesses - if yuv_sampler exists, then everything required for it to be destroyed also exists.	2023-05-29 03:23:06 +02:00
Lynne	77478f6793	av1dec: add Vulkan hwaccel	2023-05-29 00:42:00 +02:00
Lynne	1e8fefff93	libavcodec: add Vulkan common video decoding code	2023-05-29 00:41:57 +02:00

39 commits