ffmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2026-02-16 20:40:24 +00:00

Author	SHA1	Message	Date
Michael Niedermayer	6194cb87cb	avcodec/alsdec: Clear shift_value (the exact issue is unreproducable but the use of uninitialized data is reproducable) Should fix: signed integer overflow: -2147483648 - 127 cannot be represented in type 'int' Should fix: 69881/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_ALS_fuzzer-4751301204836352 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-23 23:21:14 +02:00
Michael Niedermayer	5d9544cfb0	avcodec/hevc/hevcdec: Do not allow slices to depend on failed slices An alternative would be to leave the context unchanged on failure of hls_slice_header() Fixes: out of array access Fixes: NULL pointer dereference Fixes: 69584/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_HEVC_fuzzer-5931086299856896 Fixes: 69724/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_HEVC_fuzzer-5104066422702080 Fixes: 70422/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_HEVC_fuzzer-5908731129298944 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-23 23:21:13 +02:00
Martin Storsjö	4acb9b7d10	aarch64: vvc: Fix unnecessary extra spaces Signed-off-by: Martin Storsjö <martin@martin.st>	2024-07-23 16:04:28 +03:00
Martin Storsjö	99598629e8	aarch64: vvc: Consistently use # for immediate constants Signed-off-by: Martin Storsjö <martin@martin.st>	2024-07-23 15:24:37 +03:00
Martin Storsjö	400843151d	aarch64: vvc: Fix compilation of alf.S with MSVC 2022 17.7 and older Use the "ldur" instruction explicitly, instead of having the assembler implicitly convert "ldr" instructions to "ldur". This fixes build errors like these: libavcodec\aarch64\vvc\alf.o.asm(1023) : error A2518: operand 2: Memory offset must be aligned ldr q22, [x3, #24] libavcodec\aarch64\vvc\alf.o.asm(1024) : error A2518: operand 2: Memory offset must be aligned ldr q24, [x2, #24] libavcodec\aarch64\vvc\alf.o.asm(1393) : error A2518: operand 2: Memory offset must be aligned ldr q22, [x3, #24] libavcodec\aarch64\vvc\alf.o.asm(1394) : error A2518: operand 2: Memory offset must be aligned ldr q24, [x2, #24] Signed-off-by: Martin Storsjö <martin@martin.st>	2024-07-23 15:24:33 +03:00
aaron	f44353cfb6	avcodec/adpcm: Mono ADPCM for EA WVE Files Reviewed-by: Peter Ross <pross@xvid.org>	2024-07-23 06:40:30 +10:00
Zhao Zhili	2d4ef304c9	avcodec/vvc: Add aarch64 neon optimization for ALF vvc_alf_filter_chroma_4x4_8_c: 3.0 vvc_alf_filter_chroma_4x4_8_neon: 1.0 vvc_alf_filter_chroma_4x4_10_c: 2.7 vvc_alf_filter_chroma_4x4_10_neon: 1.0 vvc_alf_filter_chroma_4x4_12_c: 2.7 vvc_alf_filter_chroma_4x4_12_neon: 1.0 vvc_alf_filter_chroma_8x8_8_c: 10.2 vvc_alf_filter_chroma_8x8_8_neon: 3.0 vvc_alf_filter_chroma_8x8_10_c: 10.0 vvc_alf_filter_chroma_8x8_10_neon: 2.5 vvc_alf_filter_chroma_8x8_12_c: 10.0 vvc_alf_filter_chroma_8x8_12_neon: 2.5 vvc_alf_filter_chroma_16x16_8_c: 41.7 vvc_alf_filter_chroma_16x16_8_neon: 11.2 vvc_alf_filter_chroma_16x16_10_c: 39.0 vvc_alf_filter_chroma_16x16_10_neon: 10.0 vvc_alf_filter_chroma_16x16_12_c: 40.2 vvc_alf_filter_chroma_16x16_12_neon: 10.2 vvc_alf_filter_chroma_32x32_8_c: 162.0 vvc_alf_filter_chroma_32x32_8_neon: 45.0 vvc_alf_filter_chroma_32x32_10_c: 155.5 vvc_alf_filter_chroma_32x32_10_neon: 39.5 vvc_alf_filter_chroma_32x32_12_c: 155.5 vvc_alf_filter_chroma_32x32_12_neon: 40.0 vvc_alf_filter_chroma_64x64_8_c: 646.0 vvc_alf_filter_chroma_64x64_8_neon: 175.5 vvc_alf_filter_chroma_64x64_10_c: 708.2 vvc_alf_filter_chroma_64x64_10_neon: 166.7 vvc_alf_filter_chroma_64x64_12_c: 619.2 vvc_alf_filter_chroma_64x64_12_neon: 157.2 vvc_alf_filter_chroma_128x128_8_c: 2611.5 vvc_alf_filter_chroma_128x128_8_neon: 698.2 vvc_alf_filter_chroma_128x128_10_c: 2470.0 vvc_alf_filter_chroma_128x128_10_neon: 616.0 vvc_alf_filter_chroma_128x128_12_c: 2531.5 vvc_alf_filter_chroma_128x128_12_neon: 620.2 vvc_alf_filter_luma_8x8_8_c: 25.2 vvc_alf_filter_luma_8x8_8_neon: 4.2 vvc_alf_filter_luma_8x8_10_c: 18.5 vvc_alf_filter_luma_8x8_10_neon: 4.0 vvc_alf_filter_luma_8x8_12_c: 19.0 vvc_alf_filter_luma_8x8_12_neon: 4.0 vvc_alf_filter_luma_16x16_8_c: 106.5 vvc_alf_filter_luma_16x16_8_neon: 16.2 vvc_alf_filter_luma_16x16_10_c: 75.2 vvc_alf_filter_luma_16x16_10_neon: 14.7 vvc_alf_filter_luma_16x16_12_c: 79.7 vvc_alf_filter_luma_16x16_12_neon: 14.7 vvc_alf_filter_luma_32x32_8_c: 400.5 vvc_alf_filter_luma_32x32_8_neon: 63.2 vvc_alf_filter_luma_32x32_10_c: 299.2 vvc_alf_filter_luma_32x32_10_neon: 57.7 vvc_alf_filter_luma_32x32_12_c: 299.2 vvc_alf_filter_luma_32x32_12_neon: 57.7 vvc_alf_filter_luma_64x64_8_c: 1602.5 vvc_alf_filter_luma_64x64_8_neon: 251.7 vvc_alf_filter_luma_64x64_10_c: 1197.0 vvc_alf_filter_luma_64x64_10_neon: 235.5 vvc_alf_filter_luma_64x64_12_c: 1220.2 vvc_alf_filter_luma_64x64_12_neon: 235.7 vvc_alf_filter_luma_128x128_8_c: 6570.2 vvc_alf_filter_luma_128x128_8_neon: 1007.7 vvc_alf_filter_luma_128x128_10_c: 4822.7 vvc_alf_filter_luma_128x128_10_neon: 936.2 vvc_alf_filter_luma_128x128_12_c: 4791.2 vvc_alf_filter_luma_128x128_12_neon: 938.5 Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-22 21:09:56 +08:00
Rémi Denis-Courmont	9135dffd17	lavc/h264dsp: reduce spills in R-V V idct_add16	2024-07-21 22:39:45 +03:00
Rémi Denis-Courmont	245f76ad74	lavc/h264dsp: reuse the R-V V IDCT DC add functions This reuses the DC bypass functions from the multiple IDCT functions, to leverage vector code. As an added bonus, the caller functions can now rely on the callee functions to preserve their parameters, thus cutting down on stack spills.	2024-07-21 22:39:45 +03:00
Rémi Denis-Courmont	0a5b5bae89	lavc/h264dsp: correct VL and LMUL in idct_dc_add T-Head C908 (cycles): h264_idct4_dc_add_8bpp_c: 94.7 h264_idct4_dc_add_8bpp_rvv_i32: 55.0 (before) h264_idct4_dc_add_8bpp_rvv_i32: 34.5 (after) h264_idct4_dc_add_9bpp_c: 94.7 h264_idct4_dc_add_9bpp_rvv_i32: 43.5 (before) h264_idct4_dc_add_9bpp_rvv_i32: 38.2 (after) h264_idct4_dc_add_10bpp_c: 94.7 h264_idct4_dc_add_10bpp_rvv_i32: 43.5 (before) h264_idct4_dc_add_10bpp_rvv_i32: 38.2 (after) h264_idct4_dc_add_12bpp_c: 94.7 h264_idct4_dc_add_12bpp_rvv_i32: 43.7 (before) h264_idct4_dc_add_12bpp_rvv_i32: 38.5 (after) h264_idct4_dc_add_14bpp_c: 94.7 h264_idct4_dc_add_14bpp_rvv_i32: 43.7 (before) h264_idct4_dc_add_14bpp_rvv_i32: 38.5 (after)	2024-07-21 22:39:45 +03:00
J. Dekker	c9dc2ad09b	lavc/h264dsp: move R-V V idct_dc_add No functional changes. This just moves the assembler so that it can be referenced by other functions in h264idct_rvv.S with local jumps. Edited-by: Rémi Denis-Courmont <remi@remlab.net>	2024-07-21 22:39:45 +03:00
Rémi Denis-Courmont	d15169c51f	lavc/h264dsp: factor some mostly identical R-V V code	2024-07-21 22:39:45 +03:00
Mark Thompson	7110a36ba0	cbs_av1: Reject thirty-two zero bits in uvlc code The spec allows at least thirty-two zero bits followed by a one to mean 2^32-1, with no constraint on the number of zeroes. The libaom reference decoder does not match this, instead reading thirty-two zeroes but not the following one to mean 2^32-1. These two interpretations are incompatible and other implementations may follow one or the other. Therefore reject thirty-two zeroes because the intended behaviour is not clear. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-21 15:29:25 +02:00
Michael Niedermayer	3faadbe2a2	avcodec/pnmdec: Use 64bit for input size check Fixes: out of array read Fixes: poc3 Reported-by: VulDB CNA Team Found-by: CookedMelon Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-21 15:29:24 +02:00
Tong Wu	5c8523cef1	lavc/hw_base_encode: correct the timestamp when input_order = decode_delay Fixed the command line: ffmpeg -hwaccel vaapi -pix_fmt nv12 -s:v widthxheight -i input.yuv -vf "hwupload" -c:v hevc_vaapi -bf 10 -b_depth 3 -vframes 3 -f null - Signed-off-by: Tong Wu <wutong1208@outlook.com>	2024-07-20 11:21:36 +02:00
Leo Izen	90e28331c7	avcodec/png: more informative error message for invalid sBIT size If the sBIT chunk size is invalid, we should print a more informative error message rather than return an error and print nothing. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2024-07-18 21:20:38 -04:00
Leo Izen	4225f51c62	avcodec/pngdec: avoid erroring with sBIT on indexed-color images Indexed color images use three colors for sBIT, but the function ff_png_get_nb_channels returns 1 in this case. We should avoid erroring out on valid files in this scenario. Regression since `84b454935f`. Signed-off-by: Leo Izen <leo.izen@gmail.com> Reported-by: Ramiro Polla <ramiro.polla@gmail.com> Reviewed-by: Marton Balint <cus@passwd.hu>	2024-07-18 21:16:18 -04:00
Rémi Denis-Courmont	483fd732ab	lavc/h264dsp: R-V V high-depth idct_add{,intra}16, idct8_add4 As with 8-bit, this tends to be faster, but results are all over the place due to the variable distribution of non-zero coefficients.	2024-07-18 20:37:09 +03:00
J. Dekker	fa5a605542	avcodec/riscv: add h264 dc idct rvv checkasm: bench runs 131072 (1 << 17) h264_idct4_add_dc_8bpp_c: 1.5 h264_idct4_add_dc_8bpp_rvv_i64: 0.7 h264_idct4_add_dc_9bpp_c: 1.5 h264_idct4_add_dc_9bpp_rvv_i64: 0.7 h264_idct4_add_dc_10bpp_c: 1.5 h264_idct4_add_dc_10bpp_rvv_i64: 0.7 h264_idct4_add_dc_12bpp_c: 1.2 h264_idct4_add_dc_12bpp_rvv_i64: 0.7 h264_idct4_add_dc_14bpp_c: 1.2 h264_idct4_add_dc_14bpp_rvv_i64: 0.7 h264_idct8_add_dc_8bpp_c: 5.2 h264_idct8_add_dc_8bpp_rvv_i64: 1.5 h264_idct8_add_dc_9bpp_c: 5.5 h264_idct8_add_dc_9bpp_rvv_i64: 1.2 h264_idct8_add_dc_10bpp_c: 5.5 h264_idct8_add_dc_10bpp_rvv_i64: 1.2 h264_idct8_add_dc_12bpp_c: 4.2 h264_idct8_add_dc_12bpp_rvv_i64: 1.2 h264_idct8_add_dc_14bpp_c: 4.2 h264_idct8_add_dc_14bpp_rvv_i64: 1.2 Signed-off-by: J. Dekker <jdek@itanimul.li>	2024-07-18 02:47:30 +02:00
Zhao Zhili	b3aeef3bf9	avcodec/vvc: Remove write-only assignments in alf_filter_chroma	2024-07-17 21:23:41 +08:00
Zhao Zhili	8bac9d4a21	avcodec/vvc: Remove NOP condition check in alf_filter_luma If (y + i == vb_above) or (y + i == vb_below), the if body has no operation.	2024-07-17 21:23:41 +08:00
Michael Niedermayer	0993ef675f	avcodec/mpeg12enc: Use av_rescale() in vbv_buffer_size computation Fixes: signed integer overflow: 20 * 2314885530818453759 cannot be represented in type 'long' Fixes: 69098/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MPEG2VIDEO_fuzzer-6107989688778752 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 19:03:50 +02:00
Michael Niedermayer	69e90491f1	avcodec/utvideoenc: Use unsigned shift to build flags Fixes: left shift of 255 by 24 places cannot be represented in type 'int' Fixes: 69083/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_UTVIDEO_fuzzer-5608202363273216 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 19:03:50 +02:00
Michael Niedermayer	a84fbd7471	avcodec/j2kenc: Merge dwt_norm into lambda This moves computations out of a loop This may help with UB in vsynth-jpeg2000-yuva444p16 Fixes: signed integer overflow: 31665934879948800 9998 cannot be represented in type 'long' Fixes: 69024/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_JPEG2000_fuzzer-5949662967169024 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 19:03:50 +02:00
Michael Niedermayer	664fbfb9ac	avcodec/mscc: move frame allocates to later Fixes: Timeout Fixes: 66964/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_SRGC_fuzzer-5413170363564032 Fixes: 69373/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MSCC_fuzzer-5239787748392960 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 18:43:15 +02:00
Michael Niedermayer	7147c3c911	avcodec/ratecontrol: Handle wanted bits overflow Fixes: 5.92611e+20 is outside the range of representable values of type 'unsigned long' Fixes: 68984/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_SNOW_fuzzer-5155755073273856 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 18:43:14 +02:00
Michael Niedermayer	af99358353	avcodec/vc2enc: Fix overflows with storing large values Fixes: left shift of 1431634944 by 2 places cannot be represented in type 'int' Fixes: left shift of 1073741824 by 1 places cannot be represented in type 'int' Fixes: 69061/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_VC2_fuzzer-6325700826038272 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-16 18:43:14 +02:00
Rémi Denis-Courmont	3002310b70	lavc/h264dsp: R-V V high-depth add_pixels8 T-Head C908 (cycles); h264_add_pixels8_9bpp_c: 270.5 h264_add_pixels8_9bpp_rvv_i32: 164.2 h264_add_pixels8_10bpp_c: 270.5 h264_add_pixels8_10bpp_rvv_i32: 164.2 h264_add_pixels8_12bpp_c: 270.5 h264_add_pixels8_12bpp_rvv_i32: 164.2 h264_add_pixels8_14bpp_c: 270.5 h264_add_pixels8_14bpp_rvv_i32: 164.2	2024-07-16 17:25:40 +03:00
Rémi Denis-Courmont	7744c08240	lavc/h264dsp: R-V V add_pixels4 and 8-bit add_pixels8 T-Head C908 (cycles): h264_add_pixels4_8bpp_c: 93.5 h264_add_pixels4_8bpp_rvv_i32: 39.5 h264_add_pixels4_9bpp_c: 87.5 h264_add_pixels4_9bpp_rvv_i64: 50.5 h264_add_pixels4_10bpp_c: 87.5 h264_add_pixels4_10bpp_rvv_i64: 50.5 h264_add_pixels4_12bpp_c: 87.5 h264_add_pixels4_12bpp_rvv_i64: 50.5 h264_add_pixels4_14bpp_c: 87.5 h264_add_pixels4_14bpp_rvv_i64: 50.5 h264_add_pixels8_8bpp_c: 265.2 h264_add_pixels8_8bpp_rvv_i64: 84.5	2024-07-16 17:25:40 +03:00
Zhao Zhili	635f7c0f6c	avcodec/videotoolboxenc: Put ExtraSEI inside BufNode directly Reduce error path and simplify the code. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-16 19:53:53 +08:00
Zhao Zhili	2fca8e400e	avcodec/videotoolboxenc: Fix concurrent access to CVPixelBufferRef For a frame comes from AV_HWDEVICE_TYPE_VIDEOTOOLBOX, it's CVPixelBufferRef is maintained by a pool. CVPixelBufferRef returned to the pool when frame buffer reference reached to zero. However, VTCompressionSessionEncodeFrame also hold a reference to the CVPixelBufferRef. So a new frame get from av_hwframe_get_buffer may access a CVPixelBufferRef which still used by the encoder. It's only after vtenc_output_callback that we can make sure CVPixelBufferRef has been released by the encoder. The issue can be tested with sample from trac #10884. ffmpeg -hwaccel videotoolbox \ -hwaccel_output_format videotoolbox_vld \ -i input.mp4 \ -c:v hevc_videotoolbox \ -profile:v main \ -b:v 3M \ -vf scale_vt=w=iw/2:h=ih/2:color_matrix=bt709:color_primaries=bt709:color_transfer=bt709 \ -c:a copy \ -tag:v hvc1 \ output.mp4 Withtout the patch, there are some out of order images in output.mp4. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-16 19:53:53 +08:00
Zhao Zhili	0e338c4114	avcodec/videotoolboxenc: Use BufNode as sourceFrameRefcon ExtraSEI is used as the sourceFrameRefcon of VTCompressionSessionEncodeFrame. It cannot hold other information which is necessary to fix another issue in the following patch. This patch also fixed leak of ExtraSEI on the error path of vtenc_output_callback. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-16 19:53:53 +08:00
Zhao Zhili	4a3625859b	avcodec/videotoolboxenc: Remove unused variable Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-16 19:53:53 +08:00
Zhao Zhili	2eae57c862	avcodec/videotoolboxenc: Don't ignore ENOMEM Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-07-16 19:53:53 +08:00
James Almer	27eb55a9c9	avcodec/cbs_h265: add support for 3D Reference Displays Information SEI Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-15 16:39:44 -03:00
James Almer	64807ccc91	avcodec/cbs_h265: add support for PPS Multilayer extension fields Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-15 16:39:44 -03:00
James Almer	25138fa0f3	avcodec/cbs_h265: reindent after the previous commit Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-15 16:39:44 -03:00
James Almer	41211edc1b	avcodec/cbs_h265: add support for SPS Multilayer extension fields Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-15 16:39:44 -03:00
James Almer	5fe13aeb65	avcodec/cbs_h265: fix range of sps_max_sub_layers_minus1 The VPS referenced by the SPS must always be present as the max value for sps_max_sub_layers_minus1 is vps_max_sub_layers_minus1. This replaces a buggy custom range check for the aforementioned field. Also, add the missing conformance check for sps_temporal_id_nesting_flag while at it. Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-15 16:39:44 -03:00
Fei Wang	246600974f	lavc/vaapi_{decode, av1}: Fix memory leak in fail codepath Signed-off-by: Fei Wang <fei.w.wang@intel.com>	2024-07-15 10:25:43 +08:00
Michael Niedermayer	9c8881cb35	avcodec/mpegvideo_enc: Do not duplicate pictures on shifting Fixes: out of array access Fixes: 69098/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MPEG2VIDEO_fuzzer-6107989688778752 Fixes: 69599/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MPEG4_fuzzer-4848626296225792.fuzz Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-15 01:59:41 +02:00
Michael Niedermayer	66d6b8033b	avcodec/tiff: Check value on positive signed targets Fixes: CID1604593 Overflowed constant Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-15 01:59:40 +02:00
Michael Niedermayer	8f74c313f1	avcodec/vvc/ctu: Simplify code at the end of pred_mode_decode() This simplification assumes that the code is correct Fixes: CID1560036 Logically dead code Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-15 01:59:37 +02:00
Rémi Denis-Courmont	c654e37254	lavc/h264dsp: R-V V high-depth h264_idct8_add Unlike the 8-bit version, we need two iterations to process this within 128-bit vectors. This adds some extra complexity for pointer arithmetic and counting down which is unnecessary in the 8-bit variant. Accordingly the gain relative to C are just slight better than half as good with 128-bit vectors as with 256-bit ones. T-Head C908 (2 iterations): h264_idct8_add_9bpp_c: 17.5 h264_idct8_add_9bpp_rvv_i32: 10.0 h264_idct8_add_10bpp_c: 17.5 h264_idct8_add_10bpp_rvv_i32: 9.7 h264_idct8_add_12bpp_c: 17.7 h264_idct8_add_12bpp_rvv_i32: 9.7 h264_idct8_add_14bpp_c: 17.7 h264_idct8_add_14bpp_rvv_i32: 9.7 SpacemiT X60 (single iteration): h264_idct8_add_9bpp_c: 15.2 h264_idct8_add_9bpp_rvv_i32: 5.0 h264_idct8_add_10bpp_c: 15.2 h264_idct8_add_10bpp_rvv_i32: 5.0 h264_idct8_add_12bpp_c: 14.7 h264_idct8_add_12bpp_rvv_i32: 5.0 h264_idct8_add_14bpp_c: 14.7 h264_idct8_add_14bpp_rvv_i32: 4.7	2024-07-14 21:06:50 +03:00
Rémi Denis-Courmont	8b3d997bed	lavc/h264dsp: remove MMI 8-bit 4:2:2 chroma DC dequant The function is exactly identical to the C reference, only with the constant propagated and the loop unrolled manually.	2024-07-14 11:39:35 +03:00
Rémi Denis-Courmont	a194131cb6	lavc/h264dsp: remove MMI 8-bit chroma DC dequant The function is exactly identical to the C reference, only with the constant propagated manually. It does not optimise anything.	2024-07-14 11:39:35 +03:00
Rémi Denis-Courmont	4e0e872881	lavc/h264dsp: R-V V high-depth h264_idct_add T-Head C908 (cycles): h264_idct4_add_9bpp_c: 248.2 h264_idct4_add_9bpp_rvv_i32: 128.7 h264_idct4_add_10bpp_c: 256.7 h264_idct4_add_10bpp_rvv_i32: 128.7 h264_idct4_add_12bpp_c: 252.5 h264_idct4_add_12bpp_rvv_i32: 129.7 h264_idct4_add_14bpp_c: 258.0 h264_idct4_add_14bpp_rvv_i32: 129.7	2024-07-14 11:39:35 +03:00
James Almer	d059ea5663	avcodec/bsf/showinfo: print packet data checksum Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: James Almer <jamrial@gmail.com>	2024-07-13 23:48:34 -03:00
Michael Niedermayer	9af348bd1a	avcodec/flac_parser: Assert that we do not overrun the link_penalty array Helps: CID1454676 Out-of-bounds read Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-12 22:49:33 +02:00
Michael Niedermayer	ed34b0c54e	avcodec/osq: avoid signed overflow in downsample path Fixes: signed integer overflow: 865309950 * 256 cannot be represented in type 'int' Fixes: 69191/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6310214413385728 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-07-12 22:45:58 +02:00

1 2 3 4 5 ...

50686 commits