ffmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2026-02-10 12:09:53 +00:00

Author	SHA1	Message	Date
Lynne	bbe95f7353	x86: replace explicit REP_RETs with RETs From x86inc: > On AMD cpus <=K10, an ordinary ret is slow if it immediately follows either > a branch or a branch target. So switch to a 2-byte form of ret in that case. > We can automatically detect "follows a branch", but not a branch target. > (SSSE3 is a sufficient condition to know that your cpu doesn't have this problem.) x86inc can automatically determine whether to use REP_RET rather than REP in most of these cases, so impact is minimal. Additionally, a few REP_RETs were used unnecessary, despite the return being nowhere near a branch. The only CPUs affected were AMD K10s, made between 2007 and 2011, 16 years ago and 12 years ago, respectively. In the future, everyone involved with x86inc should consider dropping REP_RETs altogether.	2023-02-01 04:23:55 +01:00
Paul B Mahol	37a503ac87	avcodec/x86/audiodsp: add scalarproduct avx2	2022-09-13 17:43:16 +02:00
Andreas Rheinhardt	3d716d38ab	avcodec/x86/audiodsp_init: Remove obsolete MMX(EXT) functions x64 always has MMX, MMXEXT, SSE and SSE2 and this means that some functions for MMX, MMXEXT and 3dnow are always overridden by other functions (unless one e.g. explicitly disables SSE2) for x64. So given that the only systems that benefit from these functions are truely ancient 32bit x86s they are removed. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-06-22 13:30:13 +02:00
James Almer	2904db9045	Merge commit '`994c4bc107`' * commit '`994c4bc107`': x86util: Port all macros to cpuflags See `d5f8a642f6` Merged-by: James Almer <jamrial@gmail.com>	2017-10-21 12:15:57 -03:00
James Almer	29db87af52	Merge commit '`6be7944ee2`' * commit '`6be7944ee2`': x86: Add missing colons after assembly labels Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 18:05:27 -03:00
James Almer	aee046a895	x86/audiodsp: remove an unnecessary movss	2017-03-22 00:14:56 -03:00
Clément Bœsch	83cd80d10a	Merge commit '`12004a9a7f`' * commit '`12004a9a7f`': audiodsp/x86: yasmify vector_clipf_sse audiodsp: reorder arguments for vector_clipf Merged the version from Libav after a discussion with James Almer on IRC: 19:22 <ubitux> jamrial: opinion on `12004a9a7f`? 19:23 <ubitux> it was apparently yasmified differently 19:23 <ubitux> (it depends on the previous commit arg shuffle) 19:24 <ubitux> i don't see the magic movsxdifnidn in your port btw 19:24 <ubitux> it's a port from `1d36defe94` 19:25 <jamrial> seems better thanks to said arg shuffle 19:25 <jamrial> the loop is the same, but init is simpler 19:25 <jamrial> probably worth merging 19:25 <ubitux> OK 19:25 <ubitux> thanks 19:26 <jamrial> curious they didn't make len ptrdiff_t after the previous bunch of commits, heh 19:26 <ubitux> yeah indeed Both commits are merged at the same time to prevent a conflict with our existing yasmified ff_vector_clipf_sse. Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 22:35:07 +01:00
Clément Bœsch	43a4c729d4	Merge commit '`75d98e30af`' * commit '`75d98e30af`': audiodsp/x86: clear the high bits of the order parameter on 64bit Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 18:44:00 +01:00
Clément Bœsch	072fad7cf5	Merge commit '`1d6c76e11f`' * commit '`1d6c76e11f`': audiodsp/x86: fix ff_vector_clip_int32_sse2 No functionnal changes, only cosmetics. This issue was fixed in `9a9e2f1c8a`. Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 18:42:37 +01:00
Diego Biurrun	994c4bc107	x86util: Port all macros to cpuflags Also do some small cosmetic changes: Drop pointless _MMX suffix from ABSD2 macro name, drop pointless check for MMX support, we always assume MMX is available in our SIMD code, fix spelling.	2017-03-14 17:23:32 +01:00
Diego Biurrun	6be7944ee2	x86: Add missing colons after assembly labels This fixes many warnings of the sort warning: label alone on a line without a colon might be in error	2016-10-17 16:31:26 +02:00
Anton Khirnov	12004a9a7f	audiodsp/x86: yasmify vector_clipf_sse	2016-09-22 09:47:52 +02:00
Anton Khirnov	75d98e30af	audiodsp/x86: clear the high bits of the order parameter on 64bit Also change shl to add, since it can be faster on some CPUs. CC: libav-stable@libav.org	2016-09-19 19:18:07 +02:00
Anton Khirnov	1d6c76e11f	audiodsp/x86: fix ff_vector_clip_int32_sse2 This version, which is the only one doing two processing cycles per loop iteration, computes the load/store indices incorrectly for the second cycle. CC: libav-stable@libav.org	2016-09-19 19:18:07 +02:00
Henrik Gramner	ab43beefab	x86inc: Drop SECTION_TEXT macro The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`. Signed-off-by: Anton Khirnov <anton@khirnov.net>	2015-08-11 11:12:01 +02:00
Henrik Gramner	f0b7882ceb	x86inc: Drop SECTION_TEXT macro The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`.	2015-08-04 20:13:09 +02:00
James Almer	6ec3dc97fc	x86/audiodsp: move asm code out of dsputil Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-06-22 19:53:09 +02:00
Michael Niedermayer	99497b4683	Merge commit '`9a9e2f1c8a`' * commit '`9a9e2f1c8a`': dsputil: Split audio operations off into a separate context Conflicts: configure libavcodec/takdec.c libavcodec/x86/Makefile libavcodec/x86/dsputil.asm libavcodec/x86/dsputil_init.c libavcodec/x86/dsputil_mmx.c libavcodec/x86/dsputil_x86.h Merged-by: Michael Niedermayer <michaelni@gmx.at>	2014-06-22 17:58:28 +02:00
Diego Biurrun	9a9e2f1c8a	dsputil: Split audio operations off into a separate context	2014-06-22 06:20:15 -07:00

19 commits