| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | // Copyright 2015 The Go Authors. All rights reserved. | 
					
						
							|  |  |  | // Use of this source code is governed by a BSD-style | 
					
						
							|  |  |  | // license that can be found in the LICENSE file. | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | package gc | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | import ( | 
					
						
							| 
									
										
										
										
											2018-06-14 18:20:03 +03:00
										 |  |  | 	"bufio" | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 	"bytes" | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	"encoding/binary" | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 	"fmt" | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 	"html" | 
					
						
							| 
									
										
										
										
											2015-07-22 13:13:53 -07:00
										 |  |  | 	"os" | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | 	"sort" | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-28 13:49:20 -07:00
										 |  |  | 	"cmd/compile/internal/ssa" | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	"cmd/compile/internal/types" | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	"cmd/internal/obj" | 
					
						
							| 
									
										
										
										
											2016-12-06 17:08:06 -08:00
										 |  |  | 	"cmd/internal/src" | 
					
						
							| 
									
										
										
										
											2016-04-06 12:01:40 -07:00
										 |  |  | 	"cmd/internal/sys" | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | ) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-28 13:46:30 -08:00
										 |  |  | var ssaConfig *ssa.Config | 
					
						
							| 
									
										
											  
											
												cmd/compile: add initial backend concurrency support
This CL adds initial support for concurrent backend compilation.
BACKGROUND
The compiler currently consists (very roughly) of the following phases:
1. Initialization.
2. Lexing and parsing into the cmd/compile/internal/syntax AST.
3. Translation into the cmd/compile/internal/gc AST.
4. Some gc AST passes: typechecking, escape analysis, inlining,
   closure handling, expression evaluation ordering (order.go),
   and some lowering and optimization (walk.go).
5. Translation into the cmd/compile/internal/ssa SSA form.
6. Optimization and lowering of SSA form.
7. Translation from SSA form to assembler instructions.
8. Translation from assembler instructions to machine code.
9. Writing lots of output: machine code, DWARF symbols,
   type and reflection info, export data.
Phase 2 was already concurrent as of Go 1.8.
Phase 3 is planned for eventual removal;
we hope to go straight from syntax AST to SSA.
Phases 5–8 are per-function; this CL adds support for
processing multiple functions concurrently.
The slowest phases in the compiler are 5 and 6,
so this offers the opportunity for some good speed-ups.
Unfortunately, it's not quite that straightforward.
In the current compiler, the latter parts of phase 4
(order, walk) are done function-at-a-time as needed.
Making order and walk concurrency-safe proved hard,
and they're not particularly slow, so there wasn't much reward.
To enable phases 5–8 to be done concurrently,
when concurrent backend compilation is requested,
we complete phase 4 for all functions
before starting later phases for any functions.
Also, in reality, we automatically generate new
functions in phase 9, such as method wrappers
and equality and has routines.
Those new functions then go through phases 4–8.
This CL disables concurrent backend compilation
after the first, big, user-provided batch of
functions has been compiled.
This is done to keep things simple,
and because the autogenerated functions
tend to be small, few, simple, and fast to compile.
USAGE
Concurrent backend compilation still defaults to off.
To set the number of functions that may be backend-compiled
concurrently, use the compiler flag -c.
In future work, cmd/go will automatically set -c.
Furthermore, this CL has been intentionally written
so that the c=1 path has no backend concurrency whatsoever,
not even spawning any goroutines.
This helps ensure that, should problems arise
late in the development cycle,
we can simply have cmd/go set c=1 always,
and revert to the original compiler behavior.
MUTEXES
Most of the work required to make concurrent backend
compilation safe has occurred over the past month.
This CL adds a handful of mutexes to get the rest of the way there;
they are the mutexes that I didn't see a clean way to avoid.
Some of them may still be eliminable in future work.
In no particular order:
* gc.funcsymsmu. The global funcsyms slice is populated
  lazily when we need function symbols for closures.
  This occurs during gc AST to SSA translation.
  The function funcsym also does a package lookup,
  which is a source of races on types.Pkg.Syms;
  funcsymsmu also covers that package lookup.
  This mutex is low priority: it adds a single global,
  it is in an infrequently used code path, and it is low contention.
  Since funcsyms may now be added in any order,
  we must sort them to preserve reproducible builds.
* gc.largeStackFramesMu. We don't discover until after SSA compilation
  that a function's stack frame is gigantic.
  Recording that error happens basically never,
  but it does happen concurrently.
  Fix with a low priority mutex and sorting.
* obj.Link.hashmu. ctxt.hash stores the mapping from
  types.Syms (compiler symbols) to obj.LSyms (linker symbols).
  It is accessed fairly heavily through all the phases.
  This is the only heavily contended mutex.
* gc.signatlistmu. The global signatlist map is
  populated with types through several of the concurrent phases,
  including notably via ngotype during DWARF generation.
  It is low priority for removal.
* gc.typepkgmu. Looking up symbols in the types package
  happens a fair amount during backend compilation
  and DWARF generation, particularly via ngotype.
  This mutex helps us to avoid a broader mutex on types.Pkg.Syms.
  It has low-to-moderate contention.
* types.internedStringsmu. gc AST to SSA conversion and
  some SSA work introduce new autotmps.
  Those autotmps have their names interned to reduce allocations.
  That interning requires protecting types.internedStrings.
  The autotmp names are heavily re-used, and the mutex
  overhead and contention here are low, so it is probably
  a worthwhile performance optimization to keep this mutex.
TESTING
I have been testing this code locally by running
'go install -race cmd/compile'
and then doing
'go build -a -gcflags=-c=128 std cmd'
for all architectures and a variety of compiler flags.
This obviously needs to be made part of the builders,
but it is too expensive to make part of all.bash.
I have filed #19962 for this.
REPRODUCIBLE BUILDS
This version of the compiler generates reproducible builds.
Testing reproducible builds also needs automation, however,
and is also too expensive for all.bash.
This is #19961.
Also of note is that some of the compiler flags used by 'toolstash -cmp'
are currently incompatible with concurrent backend compilation.
They still work fine with c=1.
Time will tell whether this is a problem.
NEXT STEPS
* Continue to find and fix races and bugs,
  using a combination of code inspection, fuzzing,
  and hopefully some community experimentation.
  I do not know of any outstanding races,
  but there probably are some.
* Improve testing.
* Improve performance, for many values of c.
* Integrate with cmd/go and fine tune.
* Support concurrent compilation with the -race flag.
  It is a sad irony that it does not yet work.
* Minor code cleanup that has been deferred during
  the last month due to uncertainty about the
  ultimate shape of this CL.
PERFORMANCE
Here's the buried lede, at last. :)
All benchmarks are from my 8 core 2.9 GHz Intel Core i7 darwin/amd64 laptop.
First, going from tip to this CL with c=1 has almost no impact.
name        old time/op       new time/op       delta
Template          195ms ± 3%        194ms ± 5%    ~     (p=0.370 n=30+29)
Unicode          86.6ms ± 3%       87.0ms ± 7%    ~     (p=0.958 n=29+30)
GoTypes           548ms ± 3%        555ms ± 4%  +1.35%  (p=0.001 n=30+28)
Compiler          2.51s ± 2%        2.54s ± 2%  +1.17%  (p=0.000 n=28+30)
SSA               5.16s ± 3%        5.16s ± 2%    ~     (p=0.910 n=30+29)
Flate             124ms ± 5%        124ms ± 4%    ~     (p=0.947 n=30+30)
GoParser          146ms ± 3%        146ms ± 3%    ~     (p=0.150 n=29+28)
Reflect           354ms ± 3%        352ms ± 4%    ~     (p=0.096 n=29+29)
Tar               107ms ± 5%        106ms ± 3%    ~     (p=0.370 n=30+29)
XML               200ms ± 4%        201ms ± 4%    ~     (p=0.313 n=29+28)
[Geo mean]        332ms             333ms       +0.10%
name        old user-time/op  new user-time/op  delta
Template          227ms ± 5%        225ms ± 5%    ~     (p=0.457 n=28+27)
Unicode           109ms ± 4%        109ms ± 5%    ~     (p=0.758 n=29+29)
GoTypes           713ms ± 4%        721ms ± 5%    ~     (p=0.051 n=30+29)
Compiler          3.36s ± 2%        3.38s ± 3%    ~     (p=0.146 n=30+30)
SSA               7.46s ± 3%        7.47s ± 3%    ~     (p=0.804 n=30+29)
Flate             146ms ± 7%        147ms ± 3%    ~     (p=0.833 n=29+27)
GoParser          179ms ± 5%        179ms ± 5%    ~     (p=0.866 n=30+30)
Reflect           431ms ± 4%        429ms ± 4%    ~     (p=0.593 n=29+30)
Tar               124ms ± 5%        123ms ± 5%    ~     (p=0.140 n=29+29)
XML               243ms ± 4%        242ms ± 7%    ~     (p=0.404 n=29+29)
[Geo mean]        415ms             415ms       +0.02%
name        old obj-bytes     new obj-bytes     delta
Template           382k ± 0%         382k ± 0%    ~     (all equal)
Unicode            203k ± 0%         203k ± 0%    ~     (all equal)
GoTypes           1.18M ± 0%        1.18M ± 0%    ~     (all equal)
Compiler          3.98M ± 0%        3.98M ± 0%    ~     (all equal)
SSA               8.28M ± 0%        8.28M ± 0%    ~     (all equal)
Flate              230k ± 0%         230k ± 0%    ~     (all equal)
GoParser           287k ± 0%         287k ± 0%    ~     (all equal)
Reflect           1.00M ± 0%        1.00M ± 0%    ~     (all equal)
Tar                190k ± 0%         190k ± 0%    ~     (all equal)
XML                416k ± 0%         416k ± 0%    ~     (all equal)
[Geo mean]         660k              660k       +0.00%
Comparing this CL to itself, from c=1 to c=2
improves real times 20-30%, costs 5-10% more CPU time,
and adds about 2% alloc.
The allocation increase comes from allocating more ssa.Caches.
name       old time/op       new time/op       delta
Template         202ms ± 3%        149ms ± 3%  -26.15%  (p=0.000 n=49+49)
Unicode         87.4ms ± 4%       84.2ms ± 3%   -3.68%  (p=0.000 n=48+48)
GoTypes          560ms ± 2%        398ms ± 2%  -28.96%  (p=0.000 n=49+49)
Compiler         2.46s ± 3%        1.76s ± 2%  -28.61%  (p=0.000 n=48+46)
SSA              6.17s ± 2%        4.04s ± 1%  -34.52%  (p=0.000 n=49+49)
Flate            126ms ± 3%         92ms ± 2%  -26.81%  (p=0.000 n=49+48)
GoParser         148ms ± 4%        107ms ± 2%  -27.78%  (p=0.000 n=49+48)
Reflect          361ms ± 3%        281ms ± 3%  -22.10%  (p=0.000 n=49+49)
Tar              109ms ± 4%         86ms ± 3%  -20.81%  (p=0.000 n=49+47)
XML              204ms ± 3%        144ms ± 2%  -29.53%  (p=0.000 n=48+45)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        246ms ± 4%     ~     (p=0.401 n=50+48)
Unicode          109ms ± 4%        111ms ± 4%   +1.47%  (p=0.000 n=44+50)
GoTypes          728ms ± 3%        765ms ± 3%   +5.04%  (p=0.000 n=46+50)
Compiler         3.33s ± 3%        3.41s ± 2%   +2.31%  (p=0.000 n=49+48)
SSA              8.52s ± 2%        9.11s ± 2%   +6.93%  (p=0.000 n=49+47)
Flate            149ms ± 4%        161ms ± 3%   +8.13%  (p=0.000 n=50+47)
GoParser         181ms ± 5%        192ms ± 2%   +6.40%  (p=0.000 n=49+46)
Reflect          452ms ± 9%        474ms ± 2%   +4.99%  (p=0.000 n=50+48)
Tar              126ms ± 6%        136ms ± 4%   +7.95%  (p=0.000 n=50+49)
XML              247ms ± 5%        264ms ± 3%   +6.94%  (p=0.000 n=48+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       39.3MB ± 0%   +1.48%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.2MB ± 0%   +1.19%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        114MB ± 0%   +0.69%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        447MB ± 0%   +0.95%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.26GB ± 0%   +0.89%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       25.9MB ± 1%   +2.35%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       32.2MB ± 0%   +1.59%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       78.9MB ± 0%   +0.91%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.0MB ± 0%   +1.80%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       43.4MB ± 0%   +2.35%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          379k ± 0%         378k ± 0%     ~     (p=0.421 n=5+5)
Unicode           322k ± 0%         321k ± 0%     ~     (p=0.222 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.548 n=5+5)
Compiler         4.12M ± 0%        4.11M ± 0%   -0.14%  (p=0.032 n=5+5)
SSA              9.72M ± 0%        9.72M ± 0%     ~     (p=0.421 n=5+5)
Flate             234k ± 1%         234k ± 0%     ~     (p=0.421 n=5+5)
GoParser          316k ± 1%         315k ± 0%     ~     (p=0.222 n=5+5)
Reflect           980k ± 0%         979k ± 0%     ~     (p=0.095 n=5+5)
Tar               249k ± 1%         249k ± 1%     ~     (p=0.841 n=5+5)
XML               392k ± 0%         391k ± 0%     ~     (p=0.095 n=5+5)
From c=1 to c=4, real time is down ~40%, CPU usage up 10-20%, alloc up ~5%:
name       old time/op       new time/op       delta
Template         203ms ± 3%        131ms ± 5%  -35.45%  (p=0.000 n=50+50)
Unicode         87.2ms ± 4%       84.1ms ± 2%   -3.61%  (p=0.000 n=48+47)
GoTypes          560ms ± 4%        310ms ± 2%  -44.65%  (p=0.000 n=50+49)
Compiler         2.47s ± 3%        1.41s ± 2%  -43.10%  (p=0.000 n=50+46)
SSA              6.17s ± 2%        3.20s ± 2%  -48.06%  (p=0.000 n=49+49)
Flate            126ms ± 4%         74ms ± 2%  -41.06%  (p=0.000 n=49+48)
GoParser         148ms ± 4%         89ms ± 3%  -39.97%  (p=0.000 n=49+50)
Reflect          360ms ± 3%        242ms ± 3%  -32.81%  (p=0.000 n=49+49)
Tar              108ms ± 4%         73ms ± 4%  -32.48%  (p=0.000 n=50+49)
XML              203ms ± 3%        119ms ± 3%  -41.56%  (p=0.000 n=49+48)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        287ms ± 9%  +16.98%  (p=0.000 n=50+50)
Unicode          109ms ± 4%        118ms ± 5%   +7.56%  (p=0.000 n=46+50)
GoTypes          735ms ± 4%        806ms ± 2%   +9.62%  (p=0.000 n=50+50)
Compiler         3.34s ± 4%        3.56s ± 2%   +6.78%  (p=0.000 n=49+49)
SSA              8.54s ± 3%       10.04s ± 3%  +17.55%  (p=0.000 n=50+50)
Flate            149ms ± 6%        176ms ± 3%  +17.82%  (p=0.000 n=50+48)
GoParser         181ms ± 5%        213ms ± 3%  +17.47%  (p=0.000 n=50+50)
Reflect          453ms ± 6%        499ms ± 2%  +10.11%  (p=0.000 n=50+48)
Tar              126ms ± 5%        149ms ±11%  +18.76%  (p=0.000 n=50+50)
XML              246ms ± 5%        287ms ± 4%  +16.53%  (p=0.000 n=49+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       40.4MB ± 0%   +4.21%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.9MB ± 0%   +3.68%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        116MB ± 0%   +2.71%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        455MB ± 0%   +2.75%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.27GB ± 0%   +1.84%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       26.9MB ± 1%   +6.31%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       33.2MB ± 0%   +4.61%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       80.2MB ± 0%   +2.53%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.9MB ± 0%   +5.19%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       44.6MB ± 0%   +5.20%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          380k ± 0%         379k ± 0%   -0.39%  (p=0.032 n=5+5)
Unicode           321k ± 0%         321k ± 0%     ~     (p=0.841 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.421 n=5+5)
Compiler         4.12M ± 0%        4.14M ± 0%   +0.52%  (p=0.008 n=5+5)
SSA              9.72M ± 0%        9.76M ± 0%   +0.37%  (p=0.008 n=5+5)
Flate             234k ± 1%         234k ± 1%     ~     (p=0.690 n=5+5)
GoParser          316k ± 0%         317k ± 1%     ~     (p=0.841 n=5+5)
Reflect           981k ± 0%         981k ± 0%     ~     (p=1.000 n=5+5)
Tar               250k ± 0%         249k ± 1%     ~     (p=0.151 n=5+5)
XML               393k ± 0%         392k ± 0%     ~     (p=0.056 n=5+5)
Going beyond c=4 on my machine tends to increase CPU time and allocs
without impacting real time.
The CPU time numbers matter, because when there are many concurrent
compilation processes, that will impact the overall throughput.
The numbers above are in many ways the best case scenario;
we can take full advantage of all cores.
Fortunately, the most common compilation scenario is incremental
re-compilation of a single package during a build/test cycle.
Updates #15756
Change-Id: I6725558ca2069edec0ac5b0d1683105a9fff6bea
Reviewed-on: https://go-review.googlesource.com/40693
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
Reviewed-by: Robert Griesemer <gri@golang.org>
Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2017-03-19 08:27:26 -07:00
										 |  |  | var ssaCaches []ssa.Cache | 
					
						
							| 
									
										
										
										
											2016-01-28 13:46:30 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | var ssaDump string     // early copy of $GOSSAFUNC; the func name to dump output for | 
					
						
							|  |  |  | var ssaDumpStdout bool // whether to dump to stdout | 
					
						
							| 
									
										
										
										
											2018-07-24 13:04:35 +03:00
										 |  |  | const ssaDumpFile = "ssa.html" | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-07-26 12:51:06 +03:00
										 |  |  | // ssaDumpInlined holds all inlined functions when ssaDump contains a function name. | 
					
						
							|  |  |  | var ssaDumpInlined []*Node | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: rearrange fields between ssa.Func, ssa.Cache, and ssa.Config
This makes ssa.Func, ssa.Cache, and ssa.Config fulfill
the roles laid out for them in CL 38160.
The only non-trivial change in this CL is how cached
values and blocks get IDs. Prior to this CL, their IDs were
assigned as part of resetting the cache, and only modified
IDs were reset. This required knowing how many values and
blocks were modified, which required a tight coupling between
ssa.Func and ssa.Config. To eliminate that coupling,
we now zero values and blocks during reset,
and assign their IDs when they are used.
Since unused values and blocks have ID == 0,
we can efficiently find the last used value/block,
to avoid zeroing everything.
Bulk zeroing is efficient, but not efficient enough
to obviate the need to avoid zeroing everything every time.
As a happy side-effect, ssa.Func.Free is no longer necessary.
DebugHashMatch and friends now belong in func.go.
They have been left in place for clarity and review.
I will move them in a subsequent CL.
Passes toolstash -cmp. No compiler performance impact.
No change in 'go test cmd/compile/internal/ssa' execution time.
Change-Id: I2eb7af58da067ef6a36e815a6f386cfe8634d098
Reviewed-on: https://go-review.googlesource.com/38167
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-03-15 11:15:13 -07:00
										 |  |  | func initssaconfig() { | 
					
						
							| 
									
										
										
										
											2018-02-14 14:54:59 +11:00
										 |  |  | 	types_ := ssa.NewTypes() | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	if thearch.SoftFloat { | 
					
						
							|  |  |  | 		softfloatInit() | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-18 08:11:49 -07:00
										 |  |  | 	// Generate a few pointer types that are uncommon in the frontend but common in the backend. | 
					
						
							|  |  |  | 	// Caching is disabled in the backend, so generating these here avoids allocations. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	_ = types.NewPtr(types.Types[TINTER])                             // *interface{} | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewPtr(types.Types[TSTRING]))              // **string | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewPtr(types.Idealstring))                 // **string | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewSlice(types.Types[TINTER]))             // *[]interface{} | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewPtr(types.Bytetype))                    // **byte | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewSlice(types.Bytetype))                  // *[]byte | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewSlice(types.Types[TSTRING]))            // *[]string | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewSlice(types.Idealstring))               // *[]string | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.NewPtr(types.NewPtr(types.Types[TUINT8]))) // ***uint8 | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.Types[TINT16])                             // *int16 | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.Types[TINT64])                             // *int64 | 
					
						
							|  |  |  | 	_ = types.NewPtr(types.Errortype)                                 // *error | 
					
						
							|  |  |  | 	types.NewPtrCacheEnabled = false | 
					
						
							| 
									
										
										
										
											2018-02-14 14:54:59 +11:00
										 |  |  | 	ssaConfig = ssa.NewConfig(thearch.LinkArch.Name, *types_, Ctxt, Debug['N'] == 0) | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	if thearch.LinkArch.Name == "386" { | 
					
						
							|  |  |  | 		ssaConfig.Set387(thearch.Use387) | 
					
						
							| 
									
										
										
											
												[dev.ssa] cmd/compile: enhance command line option processing for SSA
The -d compiler flag can also specify ssa phase and flag,
for example -d=ssa/generic_cse/time,ssa/generic_cse/stats
Spaces in the phase names can be specified with an
underscore.  Flags currently parsed (not necessarily
recognized by the phases yet) are:
   on, off, mem, time, debug, stats, and test
On, off and time are handled in the harness,
debug, stats, and test are interpreted by the phase itself.
The pass is now attached to the Func being compiled, and a
new method logStats(key, ...value) on *Func to encourage a
semi-standardized format for that output.  Output fields
are separated by tabs to ease digestion by awk and
spreadsheets.  For example,
	if f.pass.stats > 0 {
		f.logStat("CSE REWRITES", rewrites)
	}
Change-Id: I16db2b5af64c50ca9a47efeb51d961147a903abc
Reviewed-on: https://go-review.googlesource.com/19885
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: Todd Neal <todd@tneal.org>
											
										 
											2016-02-25 13:10:51 -05:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 	ssaConfig.SoftFloat = thearch.SoftFloat | 
					
						
							| 
									
										
										
										
											2018-08-21 10:26:45 +01:00
										 |  |  | 	ssaConfig.Race = flag_race | 
					
						
							| 
									
										
											  
											
												cmd/compile: add initial backend concurrency support
This CL adds initial support for concurrent backend compilation.
BACKGROUND
The compiler currently consists (very roughly) of the following phases:
1. Initialization.
2. Lexing and parsing into the cmd/compile/internal/syntax AST.
3. Translation into the cmd/compile/internal/gc AST.
4. Some gc AST passes: typechecking, escape analysis, inlining,
   closure handling, expression evaluation ordering (order.go),
   and some lowering and optimization (walk.go).
5. Translation into the cmd/compile/internal/ssa SSA form.
6. Optimization and lowering of SSA form.
7. Translation from SSA form to assembler instructions.
8. Translation from assembler instructions to machine code.
9. Writing lots of output: machine code, DWARF symbols,
   type and reflection info, export data.
Phase 2 was already concurrent as of Go 1.8.
Phase 3 is planned for eventual removal;
we hope to go straight from syntax AST to SSA.
Phases 5–8 are per-function; this CL adds support for
processing multiple functions concurrently.
The slowest phases in the compiler are 5 and 6,
so this offers the opportunity for some good speed-ups.
Unfortunately, it's not quite that straightforward.
In the current compiler, the latter parts of phase 4
(order, walk) are done function-at-a-time as needed.
Making order and walk concurrency-safe proved hard,
and they're not particularly slow, so there wasn't much reward.
To enable phases 5–8 to be done concurrently,
when concurrent backend compilation is requested,
we complete phase 4 for all functions
before starting later phases for any functions.
Also, in reality, we automatically generate new
functions in phase 9, such as method wrappers
and equality and has routines.
Those new functions then go through phases 4–8.
This CL disables concurrent backend compilation
after the first, big, user-provided batch of
functions has been compiled.
This is done to keep things simple,
and because the autogenerated functions
tend to be small, few, simple, and fast to compile.
USAGE
Concurrent backend compilation still defaults to off.
To set the number of functions that may be backend-compiled
concurrently, use the compiler flag -c.
In future work, cmd/go will automatically set -c.
Furthermore, this CL has been intentionally written
so that the c=1 path has no backend concurrency whatsoever,
not even spawning any goroutines.
This helps ensure that, should problems arise
late in the development cycle,
we can simply have cmd/go set c=1 always,
and revert to the original compiler behavior.
MUTEXES
Most of the work required to make concurrent backend
compilation safe has occurred over the past month.
This CL adds a handful of mutexes to get the rest of the way there;
they are the mutexes that I didn't see a clean way to avoid.
Some of them may still be eliminable in future work.
In no particular order:
* gc.funcsymsmu. The global funcsyms slice is populated
  lazily when we need function symbols for closures.
  This occurs during gc AST to SSA translation.
  The function funcsym also does a package lookup,
  which is a source of races on types.Pkg.Syms;
  funcsymsmu also covers that package lookup.
  This mutex is low priority: it adds a single global,
  it is in an infrequently used code path, and it is low contention.
  Since funcsyms may now be added in any order,
  we must sort them to preserve reproducible builds.
* gc.largeStackFramesMu. We don't discover until after SSA compilation
  that a function's stack frame is gigantic.
  Recording that error happens basically never,
  but it does happen concurrently.
  Fix with a low priority mutex and sorting.
* obj.Link.hashmu. ctxt.hash stores the mapping from
  types.Syms (compiler symbols) to obj.LSyms (linker symbols).
  It is accessed fairly heavily through all the phases.
  This is the only heavily contended mutex.
* gc.signatlistmu. The global signatlist map is
  populated with types through several of the concurrent phases,
  including notably via ngotype during DWARF generation.
  It is low priority for removal.
* gc.typepkgmu. Looking up symbols in the types package
  happens a fair amount during backend compilation
  and DWARF generation, particularly via ngotype.
  This mutex helps us to avoid a broader mutex on types.Pkg.Syms.
  It has low-to-moderate contention.
* types.internedStringsmu. gc AST to SSA conversion and
  some SSA work introduce new autotmps.
  Those autotmps have their names interned to reduce allocations.
  That interning requires protecting types.internedStrings.
  The autotmp names are heavily re-used, and the mutex
  overhead and contention here are low, so it is probably
  a worthwhile performance optimization to keep this mutex.
TESTING
I have been testing this code locally by running
'go install -race cmd/compile'
and then doing
'go build -a -gcflags=-c=128 std cmd'
for all architectures and a variety of compiler flags.
This obviously needs to be made part of the builders,
but it is too expensive to make part of all.bash.
I have filed #19962 for this.
REPRODUCIBLE BUILDS
This version of the compiler generates reproducible builds.
Testing reproducible builds also needs automation, however,
and is also too expensive for all.bash.
This is #19961.
Also of note is that some of the compiler flags used by 'toolstash -cmp'
are currently incompatible with concurrent backend compilation.
They still work fine with c=1.
Time will tell whether this is a problem.
NEXT STEPS
* Continue to find and fix races and bugs,
  using a combination of code inspection, fuzzing,
  and hopefully some community experimentation.
  I do not know of any outstanding races,
  but there probably are some.
* Improve testing.
* Improve performance, for many values of c.
* Integrate with cmd/go and fine tune.
* Support concurrent compilation with the -race flag.
  It is a sad irony that it does not yet work.
* Minor code cleanup that has been deferred during
  the last month due to uncertainty about the
  ultimate shape of this CL.
PERFORMANCE
Here's the buried lede, at last. :)
All benchmarks are from my 8 core 2.9 GHz Intel Core i7 darwin/amd64 laptop.
First, going from tip to this CL with c=1 has almost no impact.
name        old time/op       new time/op       delta
Template          195ms ± 3%        194ms ± 5%    ~     (p=0.370 n=30+29)
Unicode          86.6ms ± 3%       87.0ms ± 7%    ~     (p=0.958 n=29+30)
GoTypes           548ms ± 3%        555ms ± 4%  +1.35%  (p=0.001 n=30+28)
Compiler          2.51s ± 2%        2.54s ± 2%  +1.17%  (p=0.000 n=28+30)
SSA               5.16s ± 3%        5.16s ± 2%    ~     (p=0.910 n=30+29)
Flate             124ms ± 5%        124ms ± 4%    ~     (p=0.947 n=30+30)
GoParser          146ms ± 3%        146ms ± 3%    ~     (p=0.150 n=29+28)
Reflect           354ms ± 3%        352ms ± 4%    ~     (p=0.096 n=29+29)
Tar               107ms ± 5%        106ms ± 3%    ~     (p=0.370 n=30+29)
XML               200ms ± 4%        201ms ± 4%    ~     (p=0.313 n=29+28)
[Geo mean]        332ms             333ms       +0.10%
name        old user-time/op  new user-time/op  delta
Template          227ms ± 5%        225ms ± 5%    ~     (p=0.457 n=28+27)
Unicode           109ms ± 4%        109ms ± 5%    ~     (p=0.758 n=29+29)
GoTypes           713ms ± 4%        721ms ± 5%    ~     (p=0.051 n=30+29)
Compiler          3.36s ± 2%        3.38s ± 3%    ~     (p=0.146 n=30+30)
SSA               7.46s ± 3%        7.47s ± 3%    ~     (p=0.804 n=30+29)
Flate             146ms ± 7%        147ms ± 3%    ~     (p=0.833 n=29+27)
GoParser          179ms ± 5%        179ms ± 5%    ~     (p=0.866 n=30+30)
Reflect           431ms ± 4%        429ms ± 4%    ~     (p=0.593 n=29+30)
Tar               124ms ± 5%        123ms ± 5%    ~     (p=0.140 n=29+29)
XML               243ms ± 4%        242ms ± 7%    ~     (p=0.404 n=29+29)
[Geo mean]        415ms             415ms       +0.02%
name        old obj-bytes     new obj-bytes     delta
Template           382k ± 0%         382k ± 0%    ~     (all equal)
Unicode            203k ± 0%         203k ± 0%    ~     (all equal)
GoTypes           1.18M ± 0%        1.18M ± 0%    ~     (all equal)
Compiler          3.98M ± 0%        3.98M ± 0%    ~     (all equal)
SSA               8.28M ± 0%        8.28M ± 0%    ~     (all equal)
Flate              230k ± 0%         230k ± 0%    ~     (all equal)
GoParser           287k ± 0%         287k ± 0%    ~     (all equal)
Reflect           1.00M ± 0%        1.00M ± 0%    ~     (all equal)
Tar                190k ± 0%         190k ± 0%    ~     (all equal)
XML                416k ± 0%         416k ± 0%    ~     (all equal)
[Geo mean]         660k              660k       +0.00%
Comparing this CL to itself, from c=1 to c=2
improves real times 20-30%, costs 5-10% more CPU time,
and adds about 2% alloc.
The allocation increase comes from allocating more ssa.Caches.
name       old time/op       new time/op       delta
Template         202ms ± 3%        149ms ± 3%  -26.15%  (p=0.000 n=49+49)
Unicode         87.4ms ± 4%       84.2ms ± 3%   -3.68%  (p=0.000 n=48+48)
GoTypes          560ms ± 2%        398ms ± 2%  -28.96%  (p=0.000 n=49+49)
Compiler         2.46s ± 3%        1.76s ± 2%  -28.61%  (p=0.000 n=48+46)
SSA              6.17s ± 2%        4.04s ± 1%  -34.52%  (p=0.000 n=49+49)
Flate            126ms ± 3%         92ms ± 2%  -26.81%  (p=0.000 n=49+48)
GoParser         148ms ± 4%        107ms ± 2%  -27.78%  (p=0.000 n=49+48)
Reflect          361ms ± 3%        281ms ± 3%  -22.10%  (p=0.000 n=49+49)
Tar              109ms ± 4%         86ms ± 3%  -20.81%  (p=0.000 n=49+47)
XML              204ms ± 3%        144ms ± 2%  -29.53%  (p=0.000 n=48+45)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        246ms ± 4%     ~     (p=0.401 n=50+48)
Unicode          109ms ± 4%        111ms ± 4%   +1.47%  (p=0.000 n=44+50)
GoTypes          728ms ± 3%        765ms ± 3%   +5.04%  (p=0.000 n=46+50)
Compiler         3.33s ± 3%        3.41s ± 2%   +2.31%  (p=0.000 n=49+48)
SSA              8.52s ± 2%        9.11s ± 2%   +6.93%  (p=0.000 n=49+47)
Flate            149ms ± 4%        161ms ± 3%   +8.13%  (p=0.000 n=50+47)
GoParser         181ms ± 5%        192ms ± 2%   +6.40%  (p=0.000 n=49+46)
Reflect          452ms ± 9%        474ms ± 2%   +4.99%  (p=0.000 n=50+48)
Tar              126ms ± 6%        136ms ± 4%   +7.95%  (p=0.000 n=50+49)
XML              247ms ± 5%        264ms ± 3%   +6.94%  (p=0.000 n=48+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       39.3MB ± 0%   +1.48%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.2MB ± 0%   +1.19%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        114MB ± 0%   +0.69%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        447MB ± 0%   +0.95%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.26GB ± 0%   +0.89%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       25.9MB ± 1%   +2.35%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       32.2MB ± 0%   +1.59%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       78.9MB ± 0%   +0.91%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.0MB ± 0%   +1.80%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       43.4MB ± 0%   +2.35%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          379k ± 0%         378k ± 0%     ~     (p=0.421 n=5+5)
Unicode           322k ± 0%         321k ± 0%     ~     (p=0.222 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.548 n=5+5)
Compiler         4.12M ± 0%        4.11M ± 0%   -0.14%  (p=0.032 n=5+5)
SSA              9.72M ± 0%        9.72M ± 0%     ~     (p=0.421 n=5+5)
Flate             234k ± 1%         234k ± 0%     ~     (p=0.421 n=5+5)
GoParser          316k ± 1%         315k ± 0%     ~     (p=0.222 n=5+5)
Reflect           980k ± 0%         979k ± 0%     ~     (p=0.095 n=5+5)
Tar               249k ± 1%         249k ± 1%     ~     (p=0.841 n=5+5)
XML               392k ± 0%         391k ± 0%     ~     (p=0.095 n=5+5)
From c=1 to c=4, real time is down ~40%, CPU usage up 10-20%, alloc up ~5%:
name       old time/op       new time/op       delta
Template         203ms ± 3%        131ms ± 5%  -35.45%  (p=0.000 n=50+50)
Unicode         87.2ms ± 4%       84.1ms ± 2%   -3.61%  (p=0.000 n=48+47)
GoTypes          560ms ± 4%        310ms ± 2%  -44.65%  (p=0.000 n=50+49)
Compiler         2.47s ± 3%        1.41s ± 2%  -43.10%  (p=0.000 n=50+46)
SSA              6.17s ± 2%        3.20s ± 2%  -48.06%  (p=0.000 n=49+49)
Flate            126ms ± 4%         74ms ± 2%  -41.06%  (p=0.000 n=49+48)
GoParser         148ms ± 4%         89ms ± 3%  -39.97%  (p=0.000 n=49+50)
Reflect          360ms ± 3%        242ms ± 3%  -32.81%  (p=0.000 n=49+49)
Tar              108ms ± 4%         73ms ± 4%  -32.48%  (p=0.000 n=50+49)
XML              203ms ± 3%        119ms ± 3%  -41.56%  (p=0.000 n=49+48)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        287ms ± 9%  +16.98%  (p=0.000 n=50+50)
Unicode          109ms ± 4%        118ms ± 5%   +7.56%  (p=0.000 n=46+50)
GoTypes          735ms ± 4%        806ms ± 2%   +9.62%  (p=0.000 n=50+50)
Compiler         3.34s ± 4%        3.56s ± 2%   +6.78%  (p=0.000 n=49+49)
SSA              8.54s ± 3%       10.04s ± 3%  +17.55%  (p=0.000 n=50+50)
Flate            149ms ± 6%        176ms ± 3%  +17.82%  (p=0.000 n=50+48)
GoParser         181ms ± 5%        213ms ± 3%  +17.47%  (p=0.000 n=50+50)
Reflect          453ms ± 6%        499ms ± 2%  +10.11%  (p=0.000 n=50+48)
Tar              126ms ± 5%        149ms ±11%  +18.76%  (p=0.000 n=50+50)
XML              246ms ± 5%        287ms ± 4%  +16.53%  (p=0.000 n=49+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       40.4MB ± 0%   +4.21%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.9MB ± 0%   +3.68%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        116MB ± 0%   +2.71%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        455MB ± 0%   +2.75%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.27GB ± 0%   +1.84%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       26.9MB ± 1%   +6.31%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       33.2MB ± 0%   +4.61%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       80.2MB ± 0%   +2.53%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.9MB ± 0%   +5.19%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       44.6MB ± 0%   +5.20%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          380k ± 0%         379k ± 0%   -0.39%  (p=0.032 n=5+5)
Unicode           321k ± 0%         321k ± 0%     ~     (p=0.841 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.421 n=5+5)
Compiler         4.12M ± 0%        4.14M ± 0%   +0.52%  (p=0.008 n=5+5)
SSA              9.72M ± 0%        9.76M ± 0%   +0.37%  (p=0.008 n=5+5)
Flate             234k ± 1%         234k ± 1%     ~     (p=0.690 n=5+5)
GoParser          316k ± 0%         317k ± 1%     ~     (p=0.841 n=5+5)
Reflect           981k ± 0%         981k ± 0%     ~     (p=1.000 n=5+5)
Tar               250k ± 0%         249k ± 1%     ~     (p=0.151 n=5+5)
XML               393k ± 0%         392k ± 0%     ~     (p=0.056 n=5+5)
Going beyond c=4 on my machine tends to increase CPU time and allocs
without impacting real time.
The CPU time numbers matter, because when there are many concurrent
compilation processes, that will impact the overall throughput.
The numbers above are in many ways the best case scenario;
we can take full advantage of all cores.
Fortunately, the most common compilation scenario is incremental
re-compilation of a single package during a build/test cycle.
Updates #15756
Change-Id: I6725558ca2069edec0ac5b0d1683105a9fff6bea
Reviewed-on: https://go-review.googlesource.com/40693
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
Reviewed-by: Robert Griesemer <gri@golang.org>
Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2017-03-19 08:27:26 -07:00
										 |  |  | 	ssaCaches = make([]ssa.Cache, nBackendWorkers) | 
					
						
							| 
									
										
										
										
											2017-03-30 11:13:01 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Set up some runtime functions we'll need to call. | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	assertE2I = sysfunc("assertE2I") | 
					
						
							|  |  |  | 	assertE2I2 = sysfunc("assertE2I2") | 
					
						
							|  |  |  | 	assertI2I = sysfunc("assertI2I") | 
					
						
							|  |  |  | 	assertI2I2 = sysfunc("assertI2I2") | 
					
						
							| 
									
										
										
										
											2018-08-20 10:55:26 +02:00
										 |  |  | 	deferproc = sysfunc("deferproc") | 
					
						
							| 
									
										
										
										
											2017-08-08 16:40:07 -07:00
										 |  |  | 	Deferreturn = sysfunc("deferreturn") | 
					
						
							|  |  |  | 	Duffcopy = sysfunc("duffcopy") | 
					
						
							|  |  |  | 	Duffzero = sysfunc("duffzero") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	gcWriteBarrier = sysfunc("gcWriteBarrier") | 
					
						
							|  |  |  | 	goschedguarded = sysfunc("goschedguarded") | 
					
						
							| 
									
										
										
										
											2017-08-08 16:40:07 -07:00
										 |  |  | 	growslice = sysfunc("growslice") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	msanread = sysfunc("msanread") | 
					
						
							|  |  |  | 	msanwrite = sysfunc("msanwrite") | 
					
						
							| 
									
										
										
										
											2018-08-20 10:55:26 +02:00
										 |  |  | 	newproc = sysfunc("newproc") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	panicdivide = sysfunc("panicdivide") | 
					
						
							| 
									
										
										
										
											2017-08-08 16:40:07 -07:00
										 |  |  | 	panicdottypeE = sysfunc("panicdottypeE") | 
					
						
							|  |  |  | 	panicdottypeI = sysfunc("panicdottypeI") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	panicindex = sysfunc("panicindex") | 
					
						
							| 
									
										
										
										
											2017-08-08 16:40:07 -07:00
										 |  |  | 	panicnildottype = sysfunc("panicnildottype") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	panicslice = sysfunc("panicslice") | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	raceread = sysfunc("raceread") | 
					
						
							|  |  |  | 	racereadrange = sysfunc("racereadrange") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	racewrite = sysfunc("racewrite") | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	racewriterange = sysfunc("racewriterange") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:48:04 -07:00
										 |  |  | 	supportPopcnt = sysfunc("support_popcnt") | 
					
						
							|  |  |  | 	supportSSE41 = sysfunc("support_sse41") | 
					
						
							| 
									
										
										
										
											2017-11-03 02:05:28 +00:00
										 |  |  | 	arm64SupportAtomics = sysfunc("arm64_support_atomics") | 
					
						
							| 
									
										
										
										
											2018-04-18 08:52:40 -07:00
										 |  |  | 	typedmemclr = sysfunc("typedmemclr") | 
					
						
							|  |  |  | 	typedmemmove = sysfunc("typedmemmove") | 
					
						
							|  |  |  | 	Udiv = sysfunc("udiv") | 
					
						
							|  |  |  | 	writeBarrier = sysfunc("writeBarrier") | 
					
						
							| 
									
										
										
										
											2017-08-08 16:38:25 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// GO386=387 runtime functions | 
					
						
							| 
									
										
										
										
											2017-08-08 16:40:07 -07:00
										 |  |  | 	ControlWord64trunc = sysfunc("controlWord64trunc") | 
					
						
							|  |  |  | 	ControlWord32 = sysfunc("controlWord32") | 
					
						
							| 
									
										
										
										
											2018-03-29 00:55:53 +02:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Wasm | 
					
						
							|  |  |  | 	WasmMove = sysfunc("wasmMove") | 
					
						
							|  |  |  | 	WasmZero = sysfunc("wasmZero") | 
					
						
							|  |  |  | 	WasmDiv = sysfunc("wasmDiv") | 
					
						
							|  |  |  | 	WasmTruncS = sysfunc("wasmTruncS") | 
					
						
							|  |  |  | 	WasmTruncU = sysfunc("wasmTruncU") | 
					
						
							|  |  |  | 	SigPanic = sysfunc("sigpanic") | 
					
						
							| 
									
										
										
											
												[dev.ssa] cmd/compile: enhance command line option processing for SSA
The -d compiler flag can also specify ssa phase and flag,
for example -d=ssa/generic_cse/time,ssa/generic_cse/stats
Spaces in the phase names can be specified with an
underscore.  Flags currently parsed (not necessarily
recognized by the phases yet) are:
   on, off, mem, time, debug, stats, and test
On, off and time are handled in the harness,
debug, stats, and test are interpreted by the phase itself.
The pass is now attached to the Func being compiled, and a
new method logStats(key, ...value) on *Func to encourage a
semi-standardized format for that output.  Output fields
are separated by tabs to ease digestion by awk and
spreadsheets.  For example,
	if f.pass.stats > 0 {
		f.logStat("CSE REWRITES", rewrites)
	}
Change-Id: I16db2b5af64c50ca9a47efeb51d961147a903abc
Reviewed-on: https://go-review.googlesource.com/19885
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: Todd Neal <todd@tneal.org>
											
										 
											2016-02-25 13:10:51 -05:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: add initial backend concurrency support
This CL adds initial support for concurrent backend compilation.
BACKGROUND
The compiler currently consists (very roughly) of the following phases:
1. Initialization.
2. Lexing and parsing into the cmd/compile/internal/syntax AST.
3. Translation into the cmd/compile/internal/gc AST.
4. Some gc AST passes: typechecking, escape analysis, inlining,
   closure handling, expression evaluation ordering (order.go),
   and some lowering and optimization (walk.go).
5. Translation into the cmd/compile/internal/ssa SSA form.
6. Optimization and lowering of SSA form.
7. Translation from SSA form to assembler instructions.
8. Translation from assembler instructions to machine code.
9. Writing lots of output: machine code, DWARF symbols,
   type and reflection info, export data.
Phase 2 was already concurrent as of Go 1.8.
Phase 3 is planned for eventual removal;
we hope to go straight from syntax AST to SSA.
Phases 5–8 are per-function; this CL adds support for
processing multiple functions concurrently.
The slowest phases in the compiler are 5 and 6,
so this offers the opportunity for some good speed-ups.
Unfortunately, it's not quite that straightforward.
In the current compiler, the latter parts of phase 4
(order, walk) are done function-at-a-time as needed.
Making order and walk concurrency-safe proved hard,
and they're not particularly slow, so there wasn't much reward.
To enable phases 5–8 to be done concurrently,
when concurrent backend compilation is requested,
we complete phase 4 for all functions
before starting later phases for any functions.
Also, in reality, we automatically generate new
functions in phase 9, such as method wrappers
and equality and has routines.
Those new functions then go through phases 4–8.
This CL disables concurrent backend compilation
after the first, big, user-provided batch of
functions has been compiled.
This is done to keep things simple,
and because the autogenerated functions
tend to be small, few, simple, and fast to compile.
USAGE
Concurrent backend compilation still defaults to off.
To set the number of functions that may be backend-compiled
concurrently, use the compiler flag -c.
In future work, cmd/go will automatically set -c.
Furthermore, this CL has been intentionally written
so that the c=1 path has no backend concurrency whatsoever,
not even spawning any goroutines.
This helps ensure that, should problems arise
late in the development cycle,
we can simply have cmd/go set c=1 always,
and revert to the original compiler behavior.
MUTEXES
Most of the work required to make concurrent backend
compilation safe has occurred over the past month.
This CL adds a handful of mutexes to get the rest of the way there;
they are the mutexes that I didn't see a clean way to avoid.
Some of them may still be eliminable in future work.
In no particular order:
* gc.funcsymsmu. The global funcsyms slice is populated
  lazily when we need function symbols for closures.
  This occurs during gc AST to SSA translation.
  The function funcsym also does a package lookup,
  which is a source of races on types.Pkg.Syms;
  funcsymsmu also covers that package lookup.
  This mutex is low priority: it adds a single global,
  it is in an infrequently used code path, and it is low contention.
  Since funcsyms may now be added in any order,
  we must sort them to preserve reproducible builds.
* gc.largeStackFramesMu. We don't discover until after SSA compilation
  that a function's stack frame is gigantic.
  Recording that error happens basically never,
  but it does happen concurrently.
  Fix with a low priority mutex and sorting.
* obj.Link.hashmu. ctxt.hash stores the mapping from
  types.Syms (compiler symbols) to obj.LSyms (linker symbols).
  It is accessed fairly heavily through all the phases.
  This is the only heavily contended mutex.
* gc.signatlistmu. The global signatlist map is
  populated with types through several of the concurrent phases,
  including notably via ngotype during DWARF generation.
  It is low priority for removal.
* gc.typepkgmu. Looking up symbols in the types package
  happens a fair amount during backend compilation
  and DWARF generation, particularly via ngotype.
  This mutex helps us to avoid a broader mutex on types.Pkg.Syms.
  It has low-to-moderate contention.
* types.internedStringsmu. gc AST to SSA conversion and
  some SSA work introduce new autotmps.
  Those autotmps have their names interned to reduce allocations.
  That interning requires protecting types.internedStrings.
  The autotmp names are heavily re-used, and the mutex
  overhead and contention here are low, so it is probably
  a worthwhile performance optimization to keep this mutex.
TESTING
I have been testing this code locally by running
'go install -race cmd/compile'
and then doing
'go build -a -gcflags=-c=128 std cmd'
for all architectures and a variety of compiler flags.
This obviously needs to be made part of the builders,
but it is too expensive to make part of all.bash.
I have filed #19962 for this.
REPRODUCIBLE BUILDS
This version of the compiler generates reproducible builds.
Testing reproducible builds also needs automation, however,
and is also too expensive for all.bash.
This is #19961.
Also of note is that some of the compiler flags used by 'toolstash -cmp'
are currently incompatible with concurrent backend compilation.
They still work fine with c=1.
Time will tell whether this is a problem.
NEXT STEPS
* Continue to find and fix races and bugs,
  using a combination of code inspection, fuzzing,
  and hopefully some community experimentation.
  I do not know of any outstanding races,
  but there probably are some.
* Improve testing.
* Improve performance, for many values of c.
* Integrate with cmd/go and fine tune.
* Support concurrent compilation with the -race flag.
  It is a sad irony that it does not yet work.
* Minor code cleanup that has been deferred during
  the last month due to uncertainty about the
  ultimate shape of this CL.
PERFORMANCE
Here's the buried lede, at last. :)
All benchmarks are from my 8 core 2.9 GHz Intel Core i7 darwin/amd64 laptop.
First, going from tip to this CL with c=1 has almost no impact.
name        old time/op       new time/op       delta
Template          195ms ± 3%        194ms ± 5%    ~     (p=0.370 n=30+29)
Unicode          86.6ms ± 3%       87.0ms ± 7%    ~     (p=0.958 n=29+30)
GoTypes           548ms ± 3%        555ms ± 4%  +1.35%  (p=0.001 n=30+28)
Compiler          2.51s ± 2%        2.54s ± 2%  +1.17%  (p=0.000 n=28+30)
SSA               5.16s ± 3%        5.16s ± 2%    ~     (p=0.910 n=30+29)
Flate             124ms ± 5%        124ms ± 4%    ~     (p=0.947 n=30+30)
GoParser          146ms ± 3%        146ms ± 3%    ~     (p=0.150 n=29+28)
Reflect           354ms ± 3%        352ms ± 4%    ~     (p=0.096 n=29+29)
Tar               107ms ± 5%        106ms ± 3%    ~     (p=0.370 n=30+29)
XML               200ms ± 4%        201ms ± 4%    ~     (p=0.313 n=29+28)
[Geo mean]        332ms             333ms       +0.10%
name        old user-time/op  new user-time/op  delta
Template          227ms ± 5%        225ms ± 5%    ~     (p=0.457 n=28+27)
Unicode           109ms ± 4%        109ms ± 5%    ~     (p=0.758 n=29+29)
GoTypes           713ms ± 4%        721ms ± 5%    ~     (p=0.051 n=30+29)
Compiler          3.36s ± 2%        3.38s ± 3%    ~     (p=0.146 n=30+30)
SSA               7.46s ± 3%        7.47s ± 3%    ~     (p=0.804 n=30+29)
Flate             146ms ± 7%        147ms ± 3%    ~     (p=0.833 n=29+27)
GoParser          179ms ± 5%        179ms ± 5%    ~     (p=0.866 n=30+30)
Reflect           431ms ± 4%        429ms ± 4%    ~     (p=0.593 n=29+30)
Tar               124ms ± 5%        123ms ± 5%    ~     (p=0.140 n=29+29)
XML               243ms ± 4%        242ms ± 7%    ~     (p=0.404 n=29+29)
[Geo mean]        415ms             415ms       +0.02%
name        old obj-bytes     new obj-bytes     delta
Template           382k ± 0%         382k ± 0%    ~     (all equal)
Unicode            203k ± 0%         203k ± 0%    ~     (all equal)
GoTypes           1.18M ± 0%        1.18M ± 0%    ~     (all equal)
Compiler          3.98M ± 0%        3.98M ± 0%    ~     (all equal)
SSA               8.28M ± 0%        8.28M ± 0%    ~     (all equal)
Flate              230k ± 0%         230k ± 0%    ~     (all equal)
GoParser           287k ± 0%         287k ± 0%    ~     (all equal)
Reflect           1.00M ± 0%        1.00M ± 0%    ~     (all equal)
Tar                190k ± 0%         190k ± 0%    ~     (all equal)
XML                416k ± 0%         416k ± 0%    ~     (all equal)
[Geo mean]         660k              660k       +0.00%
Comparing this CL to itself, from c=1 to c=2
improves real times 20-30%, costs 5-10% more CPU time,
and adds about 2% alloc.
The allocation increase comes from allocating more ssa.Caches.
name       old time/op       new time/op       delta
Template         202ms ± 3%        149ms ± 3%  -26.15%  (p=0.000 n=49+49)
Unicode         87.4ms ± 4%       84.2ms ± 3%   -3.68%  (p=0.000 n=48+48)
GoTypes          560ms ± 2%        398ms ± 2%  -28.96%  (p=0.000 n=49+49)
Compiler         2.46s ± 3%        1.76s ± 2%  -28.61%  (p=0.000 n=48+46)
SSA              6.17s ± 2%        4.04s ± 1%  -34.52%  (p=0.000 n=49+49)
Flate            126ms ± 3%         92ms ± 2%  -26.81%  (p=0.000 n=49+48)
GoParser         148ms ± 4%        107ms ± 2%  -27.78%  (p=0.000 n=49+48)
Reflect          361ms ± 3%        281ms ± 3%  -22.10%  (p=0.000 n=49+49)
Tar              109ms ± 4%         86ms ± 3%  -20.81%  (p=0.000 n=49+47)
XML              204ms ± 3%        144ms ± 2%  -29.53%  (p=0.000 n=48+45)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        246ms ± 4%     ~     (p=0.401 n=50+48)
Unicode          109ms ± 4%        111ms ± 4%   +1.47%  (p=0.000 n=44+50)
GoTypes          728ms ± 3%        765ms ± 3%   +5.04%  (p=0.000 n=46+50)
Compiler         3.33s ± 3%        3.41s ± 2%   +2.31%  (p=0.000 n=49+48)
SSA              8.52s ± 2%        9.11s ± 2%   +6.93%  (p=0.000 n=49+47)
Flate            149ms ± 4%        161ms ± 3%   +8.13%  (p=0.000 n=50+47)
GoParser         181ms ± 5%        192ms ± 2%   +6.40%  (p=0.000 n=49+46)
Reflect          452ms ± 9%        474ms ± 2%   +4.99%  (p=0.000 n=50+48)
Tar              126ms ± 6%        136ms ± 4%   +7.95%  (p=0.000 n=50+49)
XML              247ms ± 5%        264ms ± 3%   +6.94%  (p=0.000 n=48+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       39.3MB ± 0%   +1.48%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.2MB ± 0%   +1.19%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        114MB ± 0%   +0.69%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        447MB ± 0%   +0.95%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.26GB ± 0%   +0.89%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       25.9MB ± 1%   +2.35%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       32.2MB ± 0%   +1.59%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       78.9MB ± 0%   +0.91%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.0MB ± 0%   +1.80%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       43.4MB ± 0%   +2.35%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          379k ± 0%         378k ± 0%     ~     (p=0.421 n=5+5)
Unicode           322k ± 0%         321k ± 0%     ~     (p=0.222 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.548 n=5+5)
Compiler         4.12M ± 0%        4.11M ± 0%   -0.14%  (p=0.032 n=5+5)
SSA              9.72M ± 0%        9.72M ± 0%     ~     (p=0.421 n=5+5)
Flate             234k ± 1%         234k ± 0%     ~     (p=0.421 n=5+5)
GoParser          316k ± 1%         315k ± 0%     ~     (p=0.222 n=5+5)
Reflect           980k ± 0%         979k ± 0%     ~     (p=0.095 n=5+5)
Tar               249k ± 1%         249k ± 1%     ~     (p=0.841 n=5+5)
XML               392k ± 0%         391k ± 0%     ~     (p=0.095 n=5+5)
From c=1 to c=4, real time is down ~40%, CPU usage up 10-20%, alloc up ~5%:
name       old time/op       new time/op       delta
Template         203ms ± 3%        131ms ± 5%  -35.45%  (p=0.000 n=50+50)
Unicode         87.2ms ± 4%       84.1ms ± 2%   -3.61%  (p=0.000 n=48+47)
GoTypes          560ms ± 4%        310ms ± 2%  -44.65%  (p=0.000 n=50+49)
Compiler         2.47s ± 3%        1.41s ± 2%  -43.10%  (p=0.000 n=50+46)
SSA              6.17s ± 2%        3.20s ± 2%  -48.06%  (p=0.000 n=49+49)
Flate            126ms ± 4%         74ms ± 2%  -41.06%  (p=0.000 n=49+48)
GoParser         148ms ± 4%         89ms ± 3%  -39.97%  (p=0.000 n=49+50)
Reflect          360ms ± 3%        242ms ± 3%  -32.81%  (p=0.000 n=49+49)
Tar              108ms ± 4%         73ms ± 4%  -32.48%  (p=0.000 n=50+49)
XML              203ms ± 3%        119ms ± 3%  -41.56%  (p=0.000 n=49+48)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        287ms ± 9%  +16.98%  (p=0.000 n=50+50)
Unicode          109ms ± 4%        118ms ± 5%   +7.56%  (p=0.000 n=46+50)
GoTypes          735ms ± 4%        806ms ± 2%   +9.62%  (p=0.000 n=50+50)
Compiler         3.34s ± 4%        3.56s ± 2%   +6.78%  (p=0.000 n=49+49)
SSA              8.54s ± 3%       10.04s ± 3%  +17.55%  (p=0.000 n=50+50)
Flate            149ms ± 6%        176ms ± 3%  +17.82%  (p=0.000 n=50+48)
GoParser         181ms ± 5%        213ms ± 3%  +17.47%  (p=0.000 n=50+50)
Reflect          453ms ± 6%        499ms ± 2%  +10.11%  (p=0.000 n=50+48)
Tar              126ms ± 5%        149ms ±11%  +18.76%  (p=0.000 n=50+50)
XML              246ms ± 5%        287ms ± 4%  +16.53%  (p=0.000 n=49+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       40.4MB ± 0%   +4.21%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.9MB ± 0%   +3.68%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        116MB ± 0%   +2.71%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        455MB ± 0%   +2.75%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.27GB ± 0%   +1.84%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       26.9MB ± 1%   +6.31%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       33.2MB ± 0%   +4.61%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       80.2MB ± 0%   +2.53%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.9MB ± 0%   +5.19%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       44.6MB ± 0%   +5.20%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          380k ± 0%         379k ± 0%   -0.39%  (p=0.032 n=5+5)
Unicode           321k ± 0%         321k ± 0%     ~     (p=0.841 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.421 n=5+5)
Compiler         4.12M ± 0%        4.14M ± 0%   +0.52%  (p=0.008 n=5+5)
SSA              9.72M ± 0%        9.76M ± 0%   +0.37%  (p=0.008 n=5+5)
Flate             234k ± 1%         234k ± 1%     ~     (p=0.690 n=5+5)
GoParser          316k ± 0%         317k ± 1%     ~     (p=0.841 n=5+5)
Reflect           981k ± 0%         981k ± 0%     ~     (p=1.000 n=5+5)
Tar               250k ± 0%         249k ± 1%     ~     (p=0.151 n=5+5)
XML               393k ± 0%         392k ± 0%     ~     (p=0.056 n=5+5)
Going beyond c=4 on my machine tends to increase CPU time and allocs
without impacting real time.
The CPU time numbers matter, because when there are many concurrent
compilation processes, that will impact the overall throughput.
The numbers above are in many ways the best case scenario;
we can take full advantage of all cores.
Fortunately, the most common compilation scenario is incremental
re-compilation of a single package during a build/test cycle.
Updates #15756
Change-Id: I6725558ca2069edec0ac5b0d1683105a9fff6bea
Reviewed-on: https://go-review.googlesource.com/40693
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
Reviewed-by: Robert Griesemer <gri@golang.org>
Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2017-03-19 08:27:26 -07:00
										 |  |  | // buildssa builds an SSA function for fn. | 
					
						
							|  |  |  | // worker indicates which of the backend workers is doing the processing. | 
					
						
							|  |  |  | func buildssa(fn *Node, worker int) *ssa.Func { | 
					
						
							| 
									
										
										
										
											2017-04-23 05:10:21 -07:00
										 |  |  | 	name := fn.funcname() | 
					
						
							| 
									
										
										
										
											2018-07-24 13:04:35 +03:00
										 |  |  | 	printssa := name == ssaDump | 
					
						
							| 
									
										
										
										
											2018-07-31 18:13:05 +03:00
										 |  |  | 	var astBuf *bytes.Buffer | 
					
						
							| 
									
										
										
										
											2015-12-11 20:41:52 -08:00
										 |  |  | 	if printssa { | 
					
						
							| 
									
										
										
										
											2018-07-31 18:13:05 +03:00
										 |  |  | 		astBuf = &bytes.Buffer{} | 
					
						
							|  |  |  | 		fdumplist(astBuf, "buildssa-enter", fn.Func.Enter) | 
					
						
							|  |  |  | 		fdumplist(astBuf, "buildssa-body", fn.Nbody) | 
					
						
							|  |  |  | 		fdumplist(astBuf, "buildssa-exit", fn.Func.Exit) | 
					
						
							|  |  |  | 		if ssaDumpStdout { | 
					
						
							|  |  |  | 			fmt.Println("generating SSA for", name) | 
					
						
							|  |  |  | 			fmt.Print(astBuf.String()) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	var s state | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 	s.pushLine(fn.Pos) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 	defer s.popLine() | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-15 22:55:21 -07:00
										 |  |  | 	s.hasdefer = fn.Func.HasDefer() | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	if fn.Func.Pragma&CgoUnsafeArgs != 0 { | 
					
						
							|  |  |  | 		s.cgoUnsafeArgs = true | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | 	fe := ssafn{ | 
					
						
							|  |  |  | 		curfn: fn, | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 		log:   printssa && ssaDumpStdout, | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-03-22 20:27:54 -07:00
										 |  |  | 	s.curfn = fn | 
					
						
							| 
									
										
										
											
												[dev.ssa] cmd/compile: enhance command line option processing for SSA
The -d compiler flag can also specify ssa phase and flag,
for example -d=ssa/generic_cse/time,ssa/generic_cse/stats
Spaces in the phase names can be specified with an
underscore.  Flags currently parsed (not necessarily
recognized by the phases yet) are:
   on, off, mem, time, debug, stats, and test
On, off and time are handled in the harness,
debug, stats, and test are interpreted by the phase itself.
The pass is now attached to the Func being compiled, and a
new method logStats(key, ...value) on *Func to encourage a
semi-standardized format for that output.  Output fields
are separated by tabs to ease digestion by awk and
spreadsheets.  For example,
	if f.pass.stats > 0 {
		f.logStat("CSE REWRITES", rewrites)
	}
Change-Id: I16db2b5af64c50ca9a47efeb51d961147a903abc
Reviewed-on: https://go-review.googlesource.com/19885
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: Todd Neal <todd@tneal.org>
											
										 
											2016-02-25 13:10:51 -05:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | 	s.f = ssa.NewFunc(&fe) | 
					
						
							| 
									
										
											  
											
												cmd/compile: rearrange fields between ssa.Func, ssa.Cache, and ssa.Config
This makes ssa.Func, ssa.Cache, and ssa.Config fulfill
the roles laid out for them in CL 38160.
The only non-trivial change in this CL is how cached
values and blocks get IDs. Prior to this CL, their IDs were
assigned as part of resetting the cache, and only modified
IDs were reset. This required knowing how many values and
blocks were modified, which required a tight coupling between
ssa.Func and ssa.Config. To eliminate that coupling,
we now zero values and blocks during reset,
and assign their IDs when they are used.
Since unused values and blocks have ID == 0,
we can efficiently find the last used value/block,
to avoid zeroing everything.
Bulk zeroing is efficient, but not efficient enough
to obviate the need to avoid zeroing everything every time.
As a happy side-effect, ssa.Func.Free is no longer necessary.
DebugHashMatch and friends now belong in func.go.
They have been left in place for clarity and review.
I will move them in a subsequent CL.
Passes toolstash -cmp. No compiler performance impact.
No change in 'go test cmd/compile/internal/ssa' execution time.
Change-Id: I2eb7af58da067ef6a36e815a6f386cfe8634d098
Reviewed-on: https://go-review.googlesource.com/38167
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-03-15 11:15:13 -07:00
										 |  |  | 	s.config = ssaConfig | 
					
						
							| 
									
										
										
										
											2018-03-23 21:56:21 +03:00
										 |  |  | 	s.f.Type = fn.Type | 
					
						
							| 
									
										
											  
											
												cmd/compile: rearrange fields between ssa.Func, ssa.Cache, and ssa.Config
This makes ssa.Func, ssa.Cache, and ssa.Config fulfill
the roles laid out for them in CL 38160.
The only non-trivial change in this CL is how cached
values and blocks get IDs. Prior to this CL, their IDs were
assigned as part of resetting the cache, and only modified
IDs were reset. This required knowing how many values and
blocks were modified, which required a tight coupling between
ssa.Func and ssa.Config. To eliminate that coupling,
we now zero values and blocks during reset,
and assign their IDs when they are used.
Since unused values and blocks have ID == 0,
we can efficiently find the last used value/block,
to avoid zeroing everything.
Bulk zeroing is efficient, but not efficient enough
to obviate the need to avoid zeroing everything every time.
As a happy side-effect, ssa.Func.Free is no longer necessary.
DebugHashMatch and friends now belong in func.go.
They have been left in place for clarity and review.
I will move them in a subsequent CL.
Passes toolstash -cmp. No compiler performance impact.
No change in 'go test cmd/compile/internal/ssa' execution time.
Change-Id: I2eb7af58da067ef6a36e815a6f386cfe8634d098
Reviewed-on: https://go-review.googlesource.com/38167
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-03-15 11:15:13 -07:00
										 |  |  | 	s.f.Config = ssaConfig | 
					
						
							| 
									
										
											  
											
												cmd/compile: add initial backend concurrency support
This CL adds initial support for concurrent backend compilation.
BACKGROUND
The compiler currently consists (very roughly) of the following phases:
1. Initialization.
2. Lexing and parsing into the cmd/compile/internal/syntax AST.
3. Translation into the cmd/compile/internal/gc AST.
4. Some gc AST passes: typechecking, escape analysis, inlining,
   closure handling, expression evaluation ordering (order.go),
   and some lowering and optimization (walk.go).
5. Translation into the cmd/compile/internal/ssa SSA form.
6. Optimization and lowering of SSA form.
7. Translation from SSA form to assembler instructions.
8. Translation from assembler instructions to machine code.
9. Writing lots of output: machine code, DWARF symbols,
   type and reflection info, export data.
Phase 2 was already concurrent as of Go 1.8.
Phase 3 is planned for eventual removal;
we hope to go straight from syntax AST to SSA.
Phases 5–8 are per-function; this CL adds support for
processing multiple functions concurrently.
The slowest phases in the compiler are 5 and 6,
so this offers the opportunity for some good speed-ups.
Unfortunately, it's not quite that straightforward.
In the current compiler, the latter parts of phase 4
(order, walk) are done function-at-a-time as needed.
Making order and walk concurrency-safe proved hard,
and they're not particularly slow, so there wasn't much reward.
To enable phases 5–8 to be done concurrently,
when concurrent backend compilation is requested,
we complete phase 4 for all functions
before starting later phases for any functions.
Also, in reality, we automatically generate new
functions in phase 9, such as method wrappers
and equality and has routines.
Those new functions then go through phases 4–8.
This CL disables concurrent backend compilation
after the first, big, user-provided batch of
functions has been compiled.
This is done to keep things simple,
and because the autogenerated functions
tend to be small, few, simple, and fast to compile.
USAGE
Concurrent backend compilation still defaults to off.
To set the number of functions that may be backend-compiled
concurrently, use the compiler flag -c.
In future work, cmd/go will automatically set -c.
Furthermore, this CL has been intentionally written
so that the c=1 path has no backend concurrency whatsoever,
not even spawning any goroutines.
This helps ensure that, should problems arise
late in the development cycle,
we can simply have cmd/go set c=1 always,
and revert to the original compiler behavior.
MUTEXES
Most of the work required to make concurrent backend
compilation safe has occurred over the past month.
This CL adds a handful of mutexes to get the rest of the way there;
they are the mutexes that I didn't see a clean way to avoid.
Some of them may still be eliminable in future work.
In no particular order:
* gc.funcsymsmu. The global funcsyms slice is populated
  lazily when we need function symbols for closures.
  This occurs during gc AST to SSA translation.
  The function funcsym also does a package lookup,
  which is a source of races on types.Pkg.Syms;
  funcsymsmu also covers that package lookup.
  This mutex is low priority: it adds a single global,
  it is in an infrequently used code path, and it is low contention.
  Since funcsyms may now be added in any order,
  we must sort them to preserve reproducible builds.
* gc.largeStackFramesMu. We don't discover until after SSA compilation
  that a function's stack frame is gigantic.
  Recording that error happens basically never,
  but it does happen concurrently.
  Fix with a low priority mutex and sorting.
* obj.Link.hashmu. ctxt.hash stores the mapping from
  types.Syms (compiler symbols) to obj.LSyms (linker symbols).
  It is accessed fairly heavily through all the phases.
  This is the only heavily contended mutex.
* gc.signatlistmu. The global signatlist map is
  populated with types through several of the concurrent phases,
  including notably via ngotype during DWARF generation.
  It is low priority for removal.
* gc.typepkgmu. Looking up symbols in the types package
  happens a fair amount during backend compilation
  and DWARF generation, particularly via ngotype.
  This mutex helps us to avoid a broader mutex on types.Pkg.Syms.
  It has low-to-moderate contention.
* types.internedStringsmu. gc AST to SSA conversion and
  some SSA work introduce new autotmps.
  Those autotmps have their names interned to reduce allocations.
  That interning requires protecting types.internedStrings.
  The autotmp names are heavily re-used, and the mutex
  overhead and contention here are low, so it is probably
  a worthwhile performance optimization to keep this mutex.
TESTING
I have been testing this code locally by running
'go install -race cmd/compile'
and then doing
'go build -a -gcflags=-c=128 std cmd'
for all architectures and a variety of compiler flags.
This obviously needs to be made part of the builders,
but it is too expensive to make part of all.bash.
I have filed #19962 for this.
REPRODUCIBLE BUILDS
This version of the compiler generates reproducible builds.
Testing reproducible builds also needs automation, however,
and is also too expensive for all.bash.
This is #19961.
Also of note is that some of the compiler flags used by 'toolstash -cmp'
are currently incompatible with concurrent backend compilation.
They still work fine with c=1.
Time will tell whether this is a problem.
NEXT STEPS
* Continue to find and fix races and bugs,
  using a combination of code inspection, fuzzing,
  and hopefully some community experimentation.
  I do not know of any outstanding races,
  but there probably are some.
* Improve testing.
* Improve performance, for many values of c.
* Integrate with cmd/go and fine tune.
* Support concurrent compilation with the -race flag.
  It is a sad irony that it does not yet work.
* Minor code cleanup that has been deferred during
  the last month due to uncertainty about the
  ultimate shape of this CL.
PERFORMANCE
Here's the buried lede, at last. :)
All benchmarks are from my 8 core 2.9 GHz Intel Core i7 darwin/amd64 laptop.
First, going from tip to this CL with c=1 has almost no impact.
name        old time/op       new time/op       delta
Template          195ms ± 3%        194ms ± 5%    ~     (p=0.370 n=30+29)
Unicode          86.6ms ± 3%       87.0ms ± 7%    ~     (p=0.958 n=29+30)
GoTypes           548ms ± 3%        555ms ± 4%  +1.35%  (p=0.001 n=30+28)
Compiler          2.51s ± 2%        2.54s ± 2%  +1.17%  (p=0.000 n=28+30)
SSA               5.16s ± 3%        5.16s ± 2%    ~     (p=0.910 n=30+29)
Flate             124ms ± 5%        124ms ± 4%    ~     (p=0.947 n=30+30)
GoParser          146ms ± 3%        146ms ± 3%    ~     (p=0.150 n=29+28)
Reflect           354ms ± 3%        352ms ± 4%    ~     (p=0.096 n=29+29)
Tar               107ms ± 5%        106ms ± 3%    ~     (p=0.370 n=30+29)
XML               200ms ± 4%        201ms ± 4%    ~     (p=0.313 n=29+28)
[Geo mean]        332ms             333ms       +0.10%
name        old user-time/op  new user-time/op  delta
Template          227ms ± 5%        225ms ± 5%    ~     (p=0.457 n=28+27)
Unicode           109ms ± 4%        109ms ± 5%    ~     (p=0.758 n=29+29)
GoTypes           713ms ± 4%        721ms ± 5%    ~     (p=0.051 n=30+29)
Compiler          3.36s ± 2%        3.38s ± 3%    ~     (p=0.146 n=30+30)
SSA               7.46s ± 3%        7.47s ± 3%    ~     (p=0.804 n=30+29)
Flate             146ms ± 7%        147ms ± 3%    ~     (p=0.833 n=29+27)
GoParser          179ms ± 5%        179ms ± 5%    ~     (p=0.866 n=30+30)
Reflect           431ms ± 4%        429ms ± 4%    ~     (p=0.593 n=29+30)
Tar               124ms ± 5%        123ms ± 5%    ~     (p=0.140 n=29+29)
XML               243ms ± 4%        242ms ± 7%    ~     (p=0.404 n=29+29)
[Geo mean]        415ms             415ms       +0.02%
name        old obj-bytes     new obj-bytes     delta
Template           382k ± 0%         382k ± 0%    ~     (all equal)
Unicode            203k ± 0%         203k ± 0%    ~     (all equal)
GoTypes           1.18M ± 0%        1.18M ± 0%    ~     (all equal)
Compiler          3.98M ± 0%        3.98M ± 0%    ~     (all equal)
SSA               8.28M ± 0%        8.28M ± 0%    ~     (all equal)
Flate              230k ± 0%         230k ± 0%    ~     (all equal)
GoParser           287k ± 0%         287k ± 0%    ~     (all equal)
Reflect           1.00M ± 0%        1.00M ± 0%    ~     (all equal)
Tar                190k ± 0%         190k ± 0%    ~     (all equal)
XML                416k ± 0%         416k ± 0%    ~     (all equal)
[Geo mean]         660k              660k       +0.00%
Comparing this CL to itself, from c=1 to c=2
improves real times 20-30%, costs 5-10% more CPU time,
and adds about 2% alloc.
The allocation increase comes from allocating more ssa.Caches.
name       old time/op       new time/op       delta
Template         202ms ± 3%        149ms ± 3%  -26.15%  (p=0.000 n=49+49)
Unicode         87.4ms ± 4%       84.2ms ± 3%   -3.68%  (p=0.000 n=48+48)
GoTypes          560ms ± 2%        398ms ± 2%  -28.96%  (p=0.000 n=49+49)
Compiler         2.46s ± 3%        1.76s ± 2%  -28.61%  (p=0.000 n=48+46)
SSA              6.17s ± 2%        4.04s ± 1%  -34.52%  (p=0.000 n=49+49)
Flate            126ms ± 3%         92ms ± 2%  -26.81%  (p=0.000 n=49+48)
GoParser         148ms ± 4%        107ms ± 2%  -27.78%  (p=0.000 n=49+48)
Reflect          361ms ± 3%        281ms ± 3%  -22.10%  (p=0.000 n=49+49)
Tar              109ms ± 4%         86ms ± 3%  -20.81%  (p=0.000 n=49+47)
XML              204ms ± 3%        144ms ± 2%  -29.53%  (p=0.000 n=48+45)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        246ms ± 4%     ~     (p=0.401 n=50+48)
Unicode          109ms ± 4%        111ms ± 4%   +1.47%  (p=0.000 n=44+50)
GoTypes          728ms ± 3%        765ms ± 3%   +5.04%  (p=0.000 n=46+50)
Compiler         3.33s ± 3%        3.41s ± 2%   +2.31%  (p=0.000 n=49+48)
SSA              8.52s ± 2%        9.11s ± 2%   +6.93%  (p=0.000 n=49+47)
Flate            149ms ± 4%        161ms ± 3%   +8.13%  (p=0.000 n=50+47)
GoParser         181ms ± 5%        192ms ± 2%   +6.40%  (p=0.000 n=49+46)
Reflect          452ms ± 9%        474ms ± 2%   +4.99%  (p=0.000 n=50+48)
Tar              126ms ± 6%        136ms ± 4%   +7.95%  (p=0.000 n=50+49)
XML              247ms ± 5%        264ms ± 3%   +6.94%  (p=0.000 n=48+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       39.3MB ± 0%   +1.48%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.2MB ± 0%   +1.19%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        114MB ± 0%   +0.69%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        447MB ± 0%   +0.95%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.26GB ± 0%   +0.89%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       25.9MB ± 1%   +2.35%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       32.2MB ± 0%   +1.59%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       78.9MB ± 0%   +0.91%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.0MB ± 0%   +1.80%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       43.4MB ± 0%   +2.35%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          379k ± 0%         378k ± 0%     ~     (p=0.421 n=5+5)
Unicode           322k ± 0%         321k ± 0%     ~     (p=0.222 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.548 n=5+5)
Compiler         4.12M ± 0%        4.11M ± 0%   -0.14%  (p=0.032 n=5+5)
SSA              9.72M ± 0%        9.72M ± 0%     ~     (p=0.421 n=5+5)
Flate             234k ± 1%         234k ± 0%     ~     (p=0.421 n=5+5)
GoParser          316k ± 1%         315k ± 0%     ~     (p=0.222 n=5+5)
Reflect           980k ± 0%         979k ± 0%     ~     (p=0.095 n=5+5)
Tar               249k ± 1%         249k ± 1%     ~     (p=0.841 n=5+5)
XML               392k ± 0%         391k ± 0%     ~     (p=0.095 n=5+5)
From c=1 to c=4, real time is down ~40%, CPU usage up 10-20%, alloc up ~5%:
name       old time/op       new time/op       delta
Template         203ms ± 3%        131ms ± 5%  -35.45%  (p=0.000 n=50+50)
Unicode         87.2ms ± 4%       84.1ms ± 2%   -3.61%  (p=0.000 n=48+47)
GoTypes          560ms ± 4%        310ms ± 2%  -44.65%  (p=0.000 n=50+49)
Compiler         2.47s ± 3%        1.41s ± 2%  -43.10%  (p=0.000 n=50+46)
SSA              6.17s ± 2%        3.20s ± 2%  -48.06%  (p=0.000 n=49+49)
Flate            126ms ± 4%         74ms ± 2%  -41.06%  (p=0.000 n=49+48)
GoParser         148ms ± 4%         89ms ± 3%  -39.97%  (p=0.000 n=49+50)
Reflect          360ms ± 3%        242ms ± 3%  -32.81%  (p=0.000 n=49+49)
Tar              108ms ± 4%         73ms ± 4%  -32.48%  (p=0.000 n=50+49)
XML              203ms ± 3%        119ms ± 3%  -41.56%  (p=0.000 n=49+48)
name       old user-time/op  new user-time/op  delta
Template         246ms ± 9%        287ms ± 9%  +16.98%  (p=0.000 n=50+50)
Unicode          109ms ± 4%        118ms ± 5%   +7.56%  (p=0.000 n=46+50)
GoTypes          735ms ± 4%        806ms ± 2%   +9.62%  (p=0.000 n=50+50)
Compiler         3.34s ± 4%        3.56s ± 2%   +6.78%  (p=0.000 n=49+49)
SSA              8.54s ± 3%       10.04s ± 3%  +17.55%  (p=0.000 n=50+50)
Flate            149ms ± 6%        176ms ± 3%  +17.82%  (p=0.000 n=50+48)
GoParser         181ms ± 5%        213ms ± 3%  +17.47%  (p=0.000 n=50+50)
Reflect          453ms ± 6%        499ms ± 2%  +10.11%  (p=0.000 n=50+48)
Tar              126ms ± 5%        149ms ±11%  +18.76%  (p=0.000 n=50+50)
XML              246ms ± 5%        287ms ± 4%  +16.53%  (p=0.000 n=49+50)
name       old alloc/op      new alloc/op      delta
Template        38.8MB ± 0%       40.4MB ± 0%   +4.21%  (p=0.008 n=5+5)
Unicode         29.8MB ± 0%       30.9MB ± 0%   +3.68%  (p=0.008 n=5+5)
GoTypes          113MB ± 0%        116MB ± 0%   +2.71%  (p=0.008 n=5+5)
Compiler         443MB ± 0%        455MB ± 0%   +2.75%  (p=0.008 n=5+5)
SSA             1.25GB ± 0%       1.27GB ± 0%   +1.84%  (p=0.008 n=5+5)
Flate           25.3MB ± 0%       26.9MB ± 1%   +6.31%  (p=0.008 n=5+5)
GoParser        31.7MB ± 0%       33.2MB ± 0%   +4.61%  (p=0.008 n=5+5)
Reflect         78.2MB ± 0%       80.2MB ± 0%   +2.53%  (p=0.008 n=5+5)
Tar             26.6MB ± 0%       27.9MB ± 0%   +5.19%  (p=0.008 n=5+5)
XML             42.4MB ± 0%       44.6MB ± 0%   +5.20%  (p=0.008 n=5+5)
name       old allocs/op     new allocs/op     delta
Template          380k ± 0%         379k ± 0%   -0.39%  (p=0.032 n=5+5)
Unicode           321k ± 0%         321k ± 0%     ~     (p=0.841 n=5+5)
GoTypes          1.14M ± 0%        1.14M ± 0%     ~     (p=0.421 n=5+5)
Compiler         4.12M ± 0%        4.14M ± 0%   +0.52%  (p=0.008 n=5+5)
SSA              9.72M ± 0%        9.76M ± 0%   +0.37%  (p=0.008 n=5+5)
Flate             234k ± 1%         234k ± 1%     ~     (p=0.690 n=5+5)
GoParser          316k ± 0%         317k ± 1%     ~     (p=0.841 n=5+5)
Reflect           981k ± 0%         981k ± 0%     ~     (p=1.000 n=5+5)
Tar               250k ± 0%         249k ± 1%     ~     (p=0.151 n=5+5)
XML               393k ± 0%         392k ± 0%     ~     (p=0.056 n=5+5)
Going beyond c=4 on my machine tends to increase CPU time and allocs
without impacting real time.
The CPU time numbers matter, because when there are many concurrent
compilation processes, that will impact the overall throughput.
The numbers above are in many ways the best case scenario;
we can take full advantage of all cores.
Fortunately, the most common compilation scenario is incremental
re-compilation of a single package during a build/test cycle.
Updates #15756
Change-Id: I6725558ca2069edec0ac5b0d1683105a9fff6bea
Reviewed-on: https://go-review.googlesource.com/40693
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
Reviewed-by: Robert Griesemer <gri@golang.org>
Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2017-03-19 08:27:26 -07:00
										 |  |  | 	s.f.Cache = &ssaCaches[worker] | 
					
						
							| 
									
										
											  
											
												cmd/compile: rearrange fields between ssa.Func, ssa.Cache, and ssa.Config
This makes ssa.Func, ssa.Cache, and ssa.Config fulfill
the roles laid out for them in CL 38160.
The only non-trivial change in this CL is how cached
values and blocks get IDs. Prior to this CL, their IDs were
assigned as part of resetting the cache, and only modified
IDs were reset. This required knowing how many values and
blocks were modified, which required a tight coupling between
ssa.Func and ssa.Config. To eliminate that coupling,
we now zero values and blocks during reset,
and assign their IDs when they are used.
Since unused values and blocks have ID == 0,
we can efficiently find the last used value/block,
to avoid zeroing everything.
Bulk zeroing is efficient, but not efficient enough
to obviate the need to avoid zeroing everything every time.
As a happy side-effect, ssa.Func.Free is no longer necessary.
DebugHashMatch and friends now belong in func.go.
They have been left in place for clarity and review.
I will move them in a subsequent CL.
Passes toolstash -cmp. No compiler performance impact.
No change in 'go test cmd/compile/internal/ssa' execution time.
Change-Id: I2eb7af58da067ef6a36e815a6f386cfe8634d098
Reviewed-on: https://go-review.googlesource.com/38167
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-03-15 11:15:13 -07:00
										 |  |  | 	s.f.Cache.Reset() | 
					
						
							|  |  |  | 	s.f.DebugTest = s.f.DebugHashMatch("GOSSAHASH", name) | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 	s.f.Name = name | 
					
						
							| 
									
										
										
										
											2016-11-10 16:03:47 -05:00
										 |  |  | 	if fn.Func.Pragma&Nosplit != 0 { | 
					
						
							|  |  |  | 		s.f.NoSplit = true | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-11-09 21:35:40 -08:00
										 |  |  | 	s.panics = map[funcLine]*ssa.Block{} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 	s.softFloat = s.config.SoftFloat | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-06-14 18:20:03 +03:00
										 |  |  | 	if printssa { | 
					
						
							| 
									
										
										
										
											2018-07-24 13:04:35 +03:00
										 |  |  | 		s.f.HTMLWriter = ssa.NewHTMLWriter(ssaDumpFile, s.f.Frontend(), name) | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 		// TODO: generate and print a mapping from nodes to values and blocks | 
					
						
							| 
									
										
										
										
											2018-07-26 12:51:06 +03:00
										 |  |  | 		dumpSourcesColumn(s.f.HTMLWriter, fn) | 
					
						
							| 
									
										
										
										
											2018-07-31 18:13:05 +03:00
										 |  |  | 		s.f.HTMLWriter.WriteAST("AST", astBuf) | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	// Allocate starting block | 
					
						
							|  |  |  | 	s.f.Entry = s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	// Allocate starting values | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	s.labels = map[string]*ssaLabel{} | 
					
						
							|  |  |  | 	s.labeledNodes = map[*Node]*ssaLabel{} | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	s.fwdVars = map[*Node]*ssa.Value{} | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	s.startmem = s.entryNewValue0(ssa.OpInitMem, types.TypeMem) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	s.sp = s.entryNewValue0(ssa.OpSP, types.Types[TUINTPTR]) // TODO: use generic pointer type (unsafe.Pointer?) instead | 
					
						
							|  |  |  | 	s.sb = s.entryNewValue0(ssa.OpSB, types.Types[TUINTPTR]) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-11 16:40:05 -04:00
										 |  |  | 	s.startBlock(s.f.Entry) | 
					
						
							|  |  |  | 	s.vars[&memVar] = s.startmem | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	// Generate addresses of local declarations | 
					
						
							|  |  |  | 	s.decladdrs = map[*Node]*ssa.Value{} | 
					
						
							| 
									
										
										
										
											2016-02-29 13:31:48 -08:00
										 |  |  | 	for _, n := range fn.Func.Dcl { | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		switch n.Class() { | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		case PPARAM, PPARAMOUT: | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 			s.decladdrs[n] = s.entryNewValue2A(ssa.OpLocalAddr, types.NewPtr(n.Type), n, s.sp, s.startmem) | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 			if n.Class() == PPARAMOUT && s.canSSA(n) { | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 				// Save ssa-able PPARAMOUT variables so we can | 
					
						
							|  |  |  | 				// store them back to the stack at the end of | 
					
						
							|  |  |  | 				// the function. | 
					
						
							|  |  |  | 				s.returns = append(s.returns, n) | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		case PAUTO: | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 			// processed at each use, to prevent Addr coming | 
					
						
							|  |  |  | 			// before the decl. | 
					
						
							| 
									
										
											  
											
												cmd/compile: fix liveness computation for heap-escaped parameters
The liveness computation of parameters generally was never
correct, but forcing all parameters to be live throughout the
function covered up that problem. The new SSA back end is
too clever: even though it currently keeps the parameter values live
throughout the function, it may find optimizations that mean
the current values are not written back to the original parameter
stack slots immediately or ever (for example if a parameter is set
to nil, SSA constant propagation may replace all later uses of the
parameter with a constant nil, eliminating the need to write the nil
value back to the stack slot), so the liveness code must now
track the actual operations on the stack slots, exposing these
problems.
One small problem in the handling of arguments is that nodarg
can return ONAME PPARAM nodes with adjusted offsets, so that
there are actually multiple *Node pointers for the same parameter
in the instruction stream. This might be possible to correct, but
not in this CL. For now, we fix this by using n.Orig instead of n
when considering PPARAM and PPARAMOUT nodes.
The major problem in the handling of arguments is general
confusion in the liveness code about the meaning of PPARAM|PHEAP
and PPARAMOUT|PHEAP nodes, especially as contrasted with PAUTO|PHEAP.
The difference between these two is that when a local variable "moves"
to the heap, it's really just allocated there to start with; in contrast,
when an argument moves to the heap, the actual data has to be copied
there from the stack at the beginning of the function, and when a
result "moves" to the heap the value in the heap has to be copied
back to the stack when the function returns
This general confusion is also present in the SSA back end.
The PHEAP bit worked decently when I first introduced it 7 years ago (!)
in 391425ae. The back end did nothing sophisticated, and in particular
there was no analysis at all: no escape analysis, no liveness analysis,
and certainly no SSA back end. But the complications caused in the
various downstream consumers suggest that this should be a detail
kept mainly in the front end.
This CL therefore eliminates both the PHEAP bit and even the idea of
"heap variables" from the back ends.
First, it replaces the PPARAM|PHEAP, PPARAMOUT|PHEAP, and PAUTO|PHEAP
variable classes with the single PAUTOHEAP, a pseudo-class indicating
a variable maintained on the heap and available by indirecting a
local variable kept on the stack (a plain PAUTO).
Second, walkexpr replaces all references to PAUTOHEAP variables
with indirections of the corresponding PAUTO variable.
The back ends and the liveness code now just see plain indirected
variables. This may actually produce better code, but the real goal
here is to eliminate these little-used and somewhat suspect code
paths in the back end analyses.
The OPARAM node type goes away too.
A followup CL will do the same to PPARAMREF. I'm not sure that
the back ends (SSA in particular) are handling those right either,
and with the framework established in this CL that change is trivial
and the result clearly more correct.
Fixes #15747.
Change-Id: I2770b1ce3cbc93981bfc7166be66a9da12013d74
Reviewed-on: https://go-review.googlesource.com/23393
Reviewed-by: Keith Randall <khr@golang.org>
Run-TryBot: Russ Cox <rsc@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2016-05-25 01:33:24 -04:00
										 |  |  | 		case PAUTOHEAP: | 
					
						
							|  |  |  | 			// moved to heap - already handled by frontend | 
					
						
							| 
									
										
										
										
											2015-09-06 13:42:26 -07:00
										 |  |  | 		case PFUNC: | 
					
						
							|  |  |  | 			// local function - already handled by frontend | 
					
						
							| 
									
										
										
										
											2015-07-01 20:37:25 +01:00
										 |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2017-11-10 18:35:30 +00:00
										 |  |  | 			s.Fatalf("local variable with class %v unimplemented", n.Class()) | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-15 13:12:53 -08:00
										 |  |  | 	// Populate SSAable arguments. | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	for _, n := range fn.Func.Dcl { | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if n.Class() == PPARAM && s.canSSA(n) { | 
					
						
							| 
									
										
										
										
											2017-02-15 13:12:53 -08:00
										 |  |  | 			s.vars[n] = s.newValue0A(ssa.OpArg, n.Type, n) | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	// Convert the AST-based IR to the SSA-based IR | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | 	s.stmtList(fn.Func.Enter) | 
					
						
							|  |  |  | 	s.stmtList(fn.Nbody) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	// fallthrough to exit | 
					
						
							| 
									
										
										
										
											2015-09-08 16:04:37 -07:00
										 |  |  | 	if s.curBlock != nil { | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 		s.pushLine(fn.Func.Endlineno) | 
					
						
							|  |  |  | 		s.exit() | 
					
						
							|  |  |  | 		s.popLine() | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	for _, b := range s.f.Blocks { | 
					
						
							|  |  |  | 		if b.Pos != src.NoXPos { | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 			s.updateUnsetPredPos(b) | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	s.insertPhis() | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-17 16:47:43 +00:00
										 |  |  | 	// Main call to ssa package to compile function | 
					
						
							|  |  |  | 	ssa.Compile(s.f) | 
					
						
							| 
									
										
										
										
											2015-12-11 20:41:52 -08:00
										 |  |  | 	return s.f | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-07-26 12:51:06 +03:00
										 |  |  | func dumpSourcesColumn(writer *ssa.HTMLWriter, fn *Node) { | 
					
						
							|  |  |  | 	// Read sources of target function fn. | 
					
						
							|  |  |  | 	fname := Ctxt.PosTable.Pos(fn.Pos).Filename() | 
					
						
							|  |  |  | 	targetFn, err := readFuncLines(fname, fn.Pos.Line(), fn.Func.Endlineno.Line()) | 
					
						
							|  |  |  | 	if err != nil { | 
					
						
							|  |  |  | 		writer.Logger.Logf("cannot read sources for function %v: %v", fn, err) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Read sources of inlined functions. | 
					
						
							|  |  |  | 	var inlFns []*ssa.FuncLines | 
					
						
							|  |  |  | 	for _, fi := range ssaDumpInlined { | 
					
						
							|  |  |  | 		var elno src.XPos | 
					
						
							|  |  |  | 		if fi.Name.Defn == nil { | 
					
						
							|  |  |  | 			// Endlineno is filled from exported data. | 
					
						
							|  |  |  | 			elno = fi.Func.Endlineno | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			elno = fi.Name.Defn.Func.Endlineno | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		fname := Ctxt.PosTable.Pos(fi.Pos).Filename() | 
					
						
							|  |  |  | 		fnLines, err := readFuncLines(fname, fi.Pos.Line(), elno.Line()) | 
					
						
							|  |  |  | 		if err != nil { | 
					
						
							|  |  |  | 			writer.Logger.Logf("cannot read sources for function %v: %v", fi, err) | 
					
						
							|  |  |  | 			continue | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		inlFns = append(inlFns, fnLines) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	sort.Sort(ssa.ByTopo(inlFns)) | 
					
						
							|  |  |  | 	if targetFn != nil { | 
					
						
							|  |  |  | 		inlFns = append([]*ssa.FuncLines{targetFn}, inlFns...) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	writer.WriteSources("sources", inlFns) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func readFuncLines(file string, start, end uint) (*ssa.FuncLines, error) { | 
					
						
							|  |  |  | 	f, err := os.Open(os.ExpandEnv(file)) | 
					
						
							|  |  |  | 	if err != nil { | 
					
						
							|  |  |  | 		return nil, err | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	defer f.Close() | 
					
						
							|  |  |  | 	var lines []string | 
					
						
							|  |  |  | 	ln := uint(1) | 
					
						
							|  |  |  | 	scanner := bufio.NewScanner(f) | 
					
						
							|  |  |  | 	for scanner.Scan() && ln <= end { | 
					
						
							|  |  |  | 		if ln >= start { | 
					
						
							|  |  |  | 			lines = append(lines, scanner.Text()) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		ln++ | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return &ssa.FuncLines{Filename: file, StartLineno: start, Lines: lines}, nil | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | // updateUnsetPredPos propagates the earliest-value position information for b | 
					
						
							|  |  |  | // towards all of b's predecessors that need a position, and recurs on that | 
					
						
							|  |  |  | // predecessor if its position is updated. B should have a non-empty position. | 
					
						
							|  |  |  | func (s *state) updateUnsetPredPos(b *ssa.Block) { | 
					
						
							|  |  |  | 	if b.Pos == src.NoXPos { | 
					
						
							|  |  |  | 		s.Fatalf("Block %s should have a position", b) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	bestPos := src.NoXPos | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	for _, e := range b.Preds { | 
					
						
							|  |  |  | 		p := e.Block() | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 		if !p.LackingPos() { | 
					
						
							|  |  |  | 			continue | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if bestPos == src.NoXPos { | 
					
						
							|  |  |  | 			bestPos = b.Pos | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 			for _, v := range b.Values { | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 				if v.LackingPos() { | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 					continue | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				if v.Pos != src.NoXPos { | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 					// Assume values are still in roughly textual order; | 
					
						
							|  |  |  | 					// TODO: could also seek minimum position? | 
					
						
							|  |  |  | 					bestPos = v.Pos | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 					break | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 		p.Pos = bestPos | 
					
						
							|  |  |  | 		s.updateUnsetPredPos(p) // We do not expect long chains of these, thus recursion is okay. | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | type state struct { | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	// configuration (arch) information | 
					
						
							|  |  |  | 	config *ssa.Config | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// function we're building | 
					
						
							|  |  |  | 	f *ssa.Func | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-22 20:27:54 -07:00
										 |  |  | 	// Node for function | 
					
						
							|  |  |  | 	curfn *Node | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 	// labels and labeled control flow nodes (OFOR, OFORUNTIL, OSWITCH, OSELECT) in f | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	labels       map[string]*ssaLabel | 
					
						
							|  |  |  | 	labeledNodes map[*Node]*ssaLabel | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// unlabeled break and continue statement tracking | 
					
						
							|  |  |  | 	breakTo    *ssa.Block // current target for plain break statement | 
					
						
							|  |  |  | 	continueTo *ssa.Block // current target for plain continue statement | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// current location where we're interpreting the AST | 
					
						
							|  |  |  | 	curBlock *ssa.Block | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	// variable assignments in the current block (map from variable symbol to ssa value) | 
					
						
							|  |  |  | 	// *Node is the unique identifier (an ONAME Node) for the variable. | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	// TODO: keep a single varnum map, then make all of these maps slices instead? | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	vars map[*Node]*ssa.Value | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	// fwdVars are variables that are used before they are defined in the current block. | 
					
						
							|  |  |  | 	// This map exists just to coalesce multiple references into a single FwdRef op. | 
					
						
							|  |  |  | 	// *Node is the unique identifier (an ONAME Node) for the variable. | 
					
						
							|  |  |  | 	fwdVars map[*Node]*ssa.Value | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// all defined variables at the end of each block. Indexed by block ID. | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	defvars []map[*Node]*ssa.Value | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 	// addresses of PPARAM and PPARAMOUT variables. | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	decladdrs map[*Node]*ssa.Value | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// starting values. Memory, stack pointer, and globals pointer | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	startmem *ssa.Value | 
					
						
							|  |  |  | 	sp       *ssa.Value | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	sb       *ssa.Value | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// line number stack. The current line number is top of stack | 
					
						
							| 
									
										
										
										
											2016-12-15 17:17:01 -08:00
										 |  |  | 	line []src.XPos | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	// the last line number processed; it may have been popped | 
					
						
							|  |  |  | 	lastPos src.XPos | 
					
						
							| 
									
										
										
										
											2015-11-09 21:35:40 -08:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// list of panic calls by function name and line number. | 
					
						
							|  |  |  | 	// Used to deduplicate panic calls. | 
					
						
							|  |  |  | 	panics map[funcLine]*ssa.Block | 
					
						
							| 
									
										
										
										
											2016-01-14 16:02:23 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: fix liveness computation for heap-escaped parameters
The liveness computation of parameters generally was never
correct, but forcing all parameters to be live throughout the
function covered up that problem. The new SSA back end is
too clever: even though it currently keeps the parameter values live
throughout the function, it may find optimizations that mean
the current values are not written back to the original parameter
stack slots immediately or ever (for example if a parameter is set
to nil, SSA constant propagation may replace all later uses of the
parameter with a constant nil, eliminating the need to write the nil
value back to the stack slot), so the liveness code must now
track the actual operations on the stack slots, exposing these
problems.
One small problem in the handling of arguments is that nodarg
can return ONAME PPARAM nodes with adjusted offsets, so that
there are actually multiple *Node pointers for the same parameter
in the instruction stream. This might be possible to correct, but
not in this CL. For now, we fix this by using n.Orig instead of n
when considering PPARAM and PPARAMOUT nodes.
The major problem in the handling of arguments is general
confusion in the liveness code about the meaning of PPARAM|PHEAP
and PPARAMOUT|PHEAP nodes, especially as contrasted with PAUTO|PHEAP.
The difference between these two is that when a local variable "moves"
to the heap, it's really just allocated there to start with; in contrast,
when an argument moves to the heap, the actual data has to be copied
there from the stack at the beginning of the function, and when a
result "moves" to the heap the value in the heap has to be copied
back to the stack when the function returns
This general confusion is also present in the SSA back end.
The PHEAP bit worked decently when I first introduced it 7 years ago (!)
in 391425ae. The back end did nothing sophisticated, and in particular
there was no analysis at all: no escape analysis, no liveness analysis,
and certainly no SSA back end. But the complications caused in the
various downstream consumers suggest that this should be a detail
kept mainly in the front end.
This CL therefore eliminates both the PHEAP bit and even the idea of
"heap variables" from the back ends.
First, it replaces the PPARAM|PHEAP, PPARAMOUT|PHEAP, and PAUTO|PHEAP
variable classes with the single PAUTOHEAP, a pseudo-class indicating
a variable maintained on the heap and available by indirecting a
local variable kept on the stack (a plain PAUTO).
Second, walkexpr replaces all references to PAUTOHEAP variables
with indirections of the corresponding PAUTO variable.
The back ends and the liveness code now just see plain indirected
variables. This may actually produce better code, but the real goal
here is to eliminate these little-used and somewhat suspect code
paths in the back end analyses.
The OPARAM node type goes away too.
A followup CL will do the same to PPARAMREF. I'm not sure that
the back ends (SSA in particular) are handling those right either,
and with the framework established in this CL that change is trivial
and the result clearly more correct.
Fixes #15747.
Change-Id: I2770b1ce3cbc93981bfc7166be66a9da12013d74
Reviewed-on: https://go-review.googlesource.com/23393
Reviewed-by: Keith Randall <khr@golang.org>
Run-TryBot: Russ Cox <rsc@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2016-05-25 01:33:24 -04:00
										 |  |  | 	// list of PPARAMOUT (return) variables. | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	returns []*Node | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	cgoUnsafeArgs bool | 
					
						
							| 
									
										
										
										
											2017-03-15 22:55:21 -07:00
										 |  |  | 	hasdefer      bool // whether the function contains a defer statement | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 	softFloat     bool | 
					
						
							| 
									
										
										
										
											2015-11-09 21:35:40 -08:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | type funcLine struct { | 
					
						
							| 
									
										
										
										
											2017-02-06 13:40:19 -08:00
										 |  |  | 	f    *obj.LSym | 
					
						
							| 
									
										
										
										
											2017-10-02 15:47:41 -07:00
										 |  |  | 	base *src.PosBase | 
					
						
							| 
									
										
										
										
											2017-05-11 11:41:02 -07:00
										 |  |  | 	line uint | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | type ssaLabel struct { | 
					
						
							|  |  |  | 	target         *ssa.Block // block identified by this label | 
					
						
							|  |  |  | 	breakTarget    *ssa.Block // block to break to in control flow node identified by this label | 
					
						
							|  |  |  | 	continueTarget *ssa.Block // block to continue to in control flow node identified by this label | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // label returns the label associated with sym, creating it if necessary. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) label(sym *types.Sym) *ssaLabel { | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	lab := s.labels[sym.Name] | 
					
						
							|  |  |  | 	if lab == nil { | 
					
						
							|  |  |  | 		lab = new(ssaLabel) | 
					
						
							|  |  |  | 		s.labels[sym.Name] = lab | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return lab | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-16 22:42:10 -07:00
										 |  |  | func (s *state) Logf(msg string, args ...interface{}) { s.f.Logf(msg, args...) } | 
					
						
							|  |  |  | func (s *state) Log() bool                            { return s.f.Log() } | 
					
						
							|  |  |  | func (s *state) Fatalf(msg string, args ...interface{}) { | 
					
						
							|  |  |  | 	s.f.Frontend().Fatalf(s.peekPos(), msg, args...) | 
					
						
							| 
									
										
										
										
											2016-12-06 17:08:06 -08:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2017-03-16 22:42:10 -07:00
										 |  |  | func (s *state) Warnl(pos src.XPos, msg string, args ...interface{}) { s.f.Warnl(pos, msg, args...) } | 
					
						
							|  |  |  | func (s *state) Debug_checknil() bool                                { return s.f.Frontend().Debug_checknil() } | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | var ( | 
					
						
							|  |  |  | 	// dummy node for the memory variable | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	memVar = Node{Op: ONAME, Sym: &types.Sym{Name: "mem"}} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// dummy nodes for temporary variables | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	ptrVar    = Node{Op: ONAME, Sym: &types.Sym{Name: "ptr"}} | 
					
						
							|  |  |  | 	lenVar    = Node{Op: ONAME, Sym: &types.Sym{Name: "len"}} | 
					
						
							|  |  |  | 	newlenVar = Node{Op: ONAME, Sym: &types.Sym{Name: "newlen"}} | 
					
						
							|  |  |  | 	capVar    = Node{Op: ONAME, Sym: &types.Sym{Name: "cap"}} | 
					
						
							|  |  |  | 	typVar    = Node{Op: ONAME, Sym: &types.Sym{Name: "typ"}} | 
					
						
							|  |  |  | 	okVar     = Node{Op: ONAME, Sym: &types.Sym{Name: "ok"}} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | ) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | // startBlock sets the current block we're generating code in to b. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) startBlock(b *ssa.Block) { | 
					
						
							|  |  |  | 	if s.curBlock != nil { | 
					
						
							| 
									
										
										
										
											2015-06-24 14:03:39 -07:00
										 |  |  | 		s.Fatalf("starting block %v when block %v has not ended", b, s.curBlock) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	s.curBlock = b | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	s.vars = map[*Node]*ssa.Value{} | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	for n := range s.fwdVars { | 
					
						
							|  |  |  | 		delete(s.fwdVars, n) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // endBlock marks the end of generating code for the current block. | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | // Returns the (former) current block. Returns nil if there is no current | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | // block, i.e. if no code flows to the current execution point. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) endBlock() *ssa.Block { | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	b := s.curBlock | 
					
						
							|  |  |  | 	if b == nil { | 
					
						
							|  |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	for len(s.defvars) <= int(b.ID) { | 
					
						
							|  |  |  | 		s.defvars = append(s.defvars, nil) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	s.defvars[b.ID] = s.vars | 
					
						
							|  |  |  | 	s.curBlock = nil | 
					
						
							|  |  |  | 	s.vars = nil | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 	if b.LackingPos() { | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 		// Empty plain blocks get the line of their successor (handled after all blocks created), | 
					
						
							|  |  |  | 		// except for increment blocks in For statements (handled in ssa conversion of OFOR), | 
					
						
							|  |  |  | 		// and for blocks ending in GOTO/BREAK/CONTINUE. | 
					
						
							|  |  |  | 		b.Pos = src.NoXPos | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		b.Pos = s.lastPos | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	return b | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | // pushLine pushes a line number on the line number stack. | 
					
						
							| 
									
										
										
										
											2016-12-15 17:17:01 -08:00
										 |  |  | func (s *state) pushLine(line src.XPos) { | 
					
						
							| 
									
										
										
										
											2016-12-07 16:02:42 -08:00
										 |  |  | 	if !line.IsKnown() { | 
					
						
							| 
									
										
										
										
											2016-06-30 06:36:31 -04:00
										 |  |  | 		// the frontend may emit node with line number missing, | 
					
						
							|  |  |  | 		// use the parent line number in this case. | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 		line = s.peekPos() | 
					
						
							| 
									
										
										
										
											2016-06-30 06:36:31 -04:00
										 |  |  | 		if Debug['K'] != 0 { | 
					
						
							| 
									
										
										
										
											2016-12-07 16:02:42 -08:00
										 |  |  | 			Warn("buildssa: unknown position (line 0)") | 
					
						
							| 
									
										
										
										
											2016-06-30 06:36:31 -04:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	} else { | 
					
						
							|  |  |  | 		s.lastPos = line | 
					
						
							| 
									
										
										
										
											2016-06-30 06:36:31 -04:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 	s.line = append(s.line, line) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // popLine pops the top of the line number stack. | 
					
						
							|  |  |  | func (s *state) popLine() { | 
					
						
							|  |  |  | 	s.line = s.line[:len(s.line)-1] | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | // peekPos peeks the top of the line number stack. | 
					
						
							| 
									
										
										
										
											2016-12-15 17:17:01 -08:00
										 |  |  | func (s *state) peekPos() src.XPos { | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 	return s.line[len(s.line)-1] | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-11 21:29:25 -07:00
										 |  |  | // newValue0 adds a new value with no arguments to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue0(op ssa.Op, t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue0(s.peekPos(), op, t) | 
					
						
							| 
									
										
										
										
											2015-06-11 21:29:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // newValue0A adds a new value with no arguments and an aux value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue0A(op ssa.Op, t *types.Type, aux interface{}) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue0A(s.peekPos(), op, t, aux) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-03 18:24:22 -05:00
										 |  |  | // newValue0I adds a new value with no arguments and an auxint value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue0I(op ssa.Op, t *types.Type, auxint int64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue0I(s.peekPos(), op, t, auxint) | 
					
						
							| 
									
										
										
										
											2015-09-03 18:24:22 -05:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | // newValue1 adds a new value with one argument to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue1(op ssa.Op, t *types.Type, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue1(s.peekPos(), op, t, arg) | 
					
						
							| 
									
										
										
										
											2015-06-11 21:29:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // newValue1A adds a new value with one argument and an aux value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue1A(op ssa.Op, t *types.Type, aux interface{}, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue1A(s.peekPos(), op, t, aux, arg) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | // newValue1Apos adds a new value with one argument and an aux value to the current block. | 
					
						
							|  |  |  | // isStmt determines whether the created values may be a statement or not | 
					
						
							|  |  |  | // (i.e., false means never, yes means maybe). | 
					
						
							|  |  |  | func (s *state) newValue1Apos(op ssa.Op, t *types.Type, aux interface{}, arg *ssa.Value, isStmt bool) *ssa.Value { | 
					
						
							|  |  |  | 	if isStmt { | 
					
						
							|  |  |  | 		return s.curBlock.NewValue1A(s.peekPos(), op, t, aux, arg) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.curBlock.NewValue1A(s.peekPos().WithNotStmt(), op, t, aux, arg) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-15 21:33:49 -07:00
										 |  |  | // newValue1I adds a new value with one argument and an auxint value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue1I(op ssa.Op, t *types.Type, aux int64, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue1I(s.peekPos(), op, t, aux, arg) | 
					
						
							| 
									
										
										
										
											2015-07-15 21:33:49 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | // newValue2 adds a new value with two arguments to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue2(op ssa.Op, t *types.Type, arg0, arg1 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue2(s.peekPos(), op, t, arg0, arg1) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | // newValue2Apos adds a new value with two arguments and an aux value to the current block. | 
					
						
							|  |  |  | // isStmt determines whether the created values may be a statement or not | 
					
						
							|  |  |  | // (i.e., false means never, yes means maybe). | 
					
						
							|  |  |  | func (s *state) newValue2Apos(op ssa.Op, t *types.Type, aux interface{}, arg0, arg1 *ssa.Value, isStmt bool) *ssa.Value { | 
					
						
							|  |  |  | 	if isStmt { | 
					
						
							|  |  |  | 		return s.curBlock.NewValue2A(s.peekPos(), op, t, aux, arg0, arg1) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.curBlock.NewValue2A(s.peekPos().WithNotStmt(), op, t, aux, arg0, arg1) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-27 15:45:20 +01:00
										 |  |  | // newValue2I adds a new value with two arguments and an auxint value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue2I(op ssa.Op, t *types.Type, aux int64, arg0, arg1 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue2I(s.peekPos(), op, t, aux, arg0, arg1) | 
					
						
							| 
									
										
										
										
											2015-06-27 15:45:20 +01:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | // newValue3 adds a new value with three arguments to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue3(op ssa.Op, t *types.Type, arg0, arg1, arg2 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue3(s.peekPos(), op, t, arg0, arg1, arg2) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-14 21:47:20 -07:00
										 |  |  | // newValue3I adds a new value with three arguments and an auxint value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue3I(op ssa.Op, t *types.Type, aux int64, arg0, arg1, arg2 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue3I(s.peekPos(), op, t, aux, arg0, arg1, arg2) | 
					
						
							| 
									
										
										
										
											2015-08-14 21:47:20 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-13 21:51:08 -04:00
										 |  |  | // newValue3A adds a new value with three arguments and an aux value to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue3A(op ssa.Op, t *types.Type, aux interface{}, arg0, arg1, arg2 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-03-13 21:51:08 -04:00
										 |  |  | 	return s.curBlock.NewValue3A(s.peekPos(), op, t, aux, arg0, arg1, arg2) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | // newValue3Apos adds a new value with three arguments and an aux value to the current block. | 
					
						
							|  |  |  | // isStmt determines whether the created values may be a statement or not | 
					
						
							|  |  |  | // (i.e., false means never, yes means maybe). | 
					
						
							|  |  |  | func (s *state) newValue3Apos(op ssa.Op, t *types.Type, aux interface{}, arg0, arg1, arg2 *ssa.Value, isStmt bool) *ssa.Value { | 
					
						
							|  |  |  | 	if isStmt { | 
					
						
							|  |  |  | 		return s.curBlock.NewValue3A(s.peekPos(), op, t, aux, arg0, arg1, arg2) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.curBlock.NewValue3A(s.peekPos().WithNotStmt(), op, t, aux, arg0, arg1, arg2) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-08-25 16:02:57 -07:00
										 |  |  | // newValue4 adds a new value with four arguments to the current block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) newValue4(op ssa.Op, t *types.Type, arg0, arg1, arg2, arg3 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	return s.curBlock.NewValue4(s.peekPos(), op, t, arg0, arg1, arg2, arg3) | 
					
						
							| 
									
										
										
										
											2016-08-25 16:02:57 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-03 18:24:22 -05:00
										 |  |  | // entryNewValue0 adds a new value with no arguments to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue0(op ssa.Op, t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-05-15 13:49:30 -04:00
										 |  |  | 	return s.f.Entry.NewValue0(src.NoXPos, op, t) | 
					
						
							| 
									
										
										
										
											2015-06-11 21:29:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-03 18:24:22 -05:00
										 |  |  | // entryNewValue0A adds a new value with no arguments and an aux value to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue0A(op ssa.Op, t *types.Type, aux interface{}) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-11-30 15:11:34 -05:00
										 |  |  | 	return s.f.Entry.NewValue0A(src.NoXPos, op, t, aux) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // entryNewValue1 adds a new value with one argument to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue1(op ssa.Op, t *types.Type, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-11-30 15:11:34 -05:00
										 |  |  | 	return s.f.Entry.NewValue1(src.NoXPos, op, t, arg) | 
					
						
							| 
									
										
										
										
											2015-06-11 21:29:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // entryNewValue1 adds a new value with one argument and an auxint value to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue1I(op ssa.Op, t *types.Type, auxint int64, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-11-30 15:11:34 -05:00
										 |  |  | 	return s.f.Entry.NewValue1I(src.NoXPos, op, t, auxint, arg) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | // entryNewValue1A adds a new value with one argument and an aux value to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue1A(op ssa.Op, t *types.Type, aux interface{}, arg *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-11-30 15:11:34 -05:00
										 |  |  | 	return s.f.Entry.NewValue1A(src.NoXPos, op, t, aux, arg) | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | // entryNewValue2 adds a new value with two arguments to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) entryNewValue2(op ssa.Op, t *types.Type, arg0, arg1 *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-11-30 15:11:34 -05:00
										 |  |  | 	return s.f.Entry.NewValue2(src.NoXPos, op, t, arg0, arg1) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | // entryNewValue2A adds a new value with two arguments and an aux value to the entry block. | 
					
						
							|  |  |  | func (s *state) entryNewValue2A(op ssa.Op, t *types.Type, aux interface{}, arg0, arg1 *ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 	return s.f.Entry.NewValue2A(src.NoXPos, op, t, aux, arg0, arg1) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-08 16:52:25 -07:00
										 |  |  | // const* routines add a new const value to the entry block. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constSlice(t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstSlice(t) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | } | 
					
						
							|  |  |  | func (s *state) constInterface(t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstInterface(t) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | func (s *state) constNil(t *types.Type) *ssa.Value { return s.f.ConstNil(t) } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constEmptyString(t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstEmptyString(t) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2015-09-08 16:52:25 -07:00
										 |  |  | func (s *state) constBool(c bool) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstBool(types.Types[TBOOL], c) | 
					
						
							| 
									
										
										
										
											2015-09-08 16:52:25 -07:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constInt8(t *types.Type, c int8) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstInt8(t, c) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constInt16(t *types.Type, c int16) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstInt16(t, c) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constInt32(t *types.Type, c int32) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstInt32(t, c) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constInt64(t *types.Type, c int64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstInt64(t, c) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constFloat32(t *types.Type, c float64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstFloat32(t, c) | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constFloat64(t *types.Type, c float64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstFloat64(t, c) | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constInt(t *types.Type, c int64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 	if s.config.PtrSize == 8 { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 		return s.constInt64(t, c) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if int64(int32(c)) != c { | 
					
						
							|  |  |  | 		s.Fatalf("integer constant too big %d", c) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.constInt32(t, int32(c)) | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | } | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) constOffPtrSP(t *types.Type, c int64) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-03 15:52:30 +01:00
										 |  |  | 	return s.f.ConstOffPtrSP(t, c, s.sp) | 
					
						
							| 
									
										
										
										
											2017-03-08 12:50:00 -08:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | // newValueOrSfCall* are wrappers around newValue*, which may create a call to a | 
					
						
							|  |  |  | // soft-float runtime function instead (when emitting soft-float code). | 
					
						
							|  |  |  | func (s *state) newValueOrSfCall1(op ssa.Op, t *types.Type, arg *ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 	if s.softFloat { | 
					
						
							|  |  |  | 		if c, ok := s.sfcall(op, arg); ok { | 
					
						
							|  |  |  | 			return c | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.newValue1(op, t, arg) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | func (s *state) newValueOrSfCall2(op ssa.Op, t *types.Type, arg0, arg1 *ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 	if s.softFloat { | 
					
						
							|  |  |  | 		if c, ok := s.sfcall(op, arg0, arg1); ok { | 
					
						
							|  |  |  | 			return c | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.newValue2(op, t, arg0, arg1) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | func (s *state) instrument(t *types.Type, addr *ssa.Value, wr bool) { | 
					
						
							|  |  |  | 	if !s.curfn.Func.InstrumentBody() { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	w := t.Size() | 
					
						
							|  |  |  | 	if w == 0 { | 
					
						
							|  |  |  | 		return // can't race on zero-sized things | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	if ssa.IsSanitizerSafeAddr(addr) { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	var fn *obj.LSym | 
					
						
							|  |  |  | 	needWidth := false | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	if flag_msan { | 
					
						
							|  |  |  | 		fn = msanread | 
					
						
							|  |  |  | 		if wr { | 
					
						
							|  |  |  | 			fn = msanwrite | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		needWidth = true | 
					
						
							|  |  |  | 	} else if flag_race && t.NumComponents(types.CountBlankFields) > 1 { | 
					
						
							|  |  |  | 		// for composite objects we have to write every address | 
					
						
							|  |  |  | 		// because a write might happen to any subobject. | 
					
						
							|  |  |  | 		// composites with only one element don't have subobjects, though. | 
					
						
							|  |  |  | 		fn = racereadrange | 
					
						
							|  |  |  | 		if wr { | 
					
						
							|  |  |  | 			fn = racewriterange | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		needWidth = true | 
					
						
							|  |  |  | 	} else if flag_race { | 
					
						
							|  |  |  | 		// for non-composite objects we can write just the start | 
					
						
							|  |  |  | 		// address, as any write must write the first byte. | 
					
						
							|  |  |  | 		fn = raceread | 
					
						
							|  |  |  | 		if wr { | 
					
						
							|  |  |  | 			fn = racewrite | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		panic("unreachable") | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	args := []*ssa.Value{addr} | 
					
						
							|  |  |  | 	if needWidth { | 
					
						
							|  |  |  | 		args = append(args, s.constInt(types.Types[TUINTPTR], w)) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	s.rtcall(fn, true, nil, args...) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | func (s *state) load(t *types.Type, src *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	s.instrument(t, src, false) | 
					
						
							|  |  |  | 	return s.rawLoad(t, src) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (s *state) rawLoad(t *types.Type, src *ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 	return s.newValue2(ssa.OpLoad, t, src, s.mem()) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (s *state) store(t *types.Type, dst, val *ssa.Value) { | 
					
						
							|  |  |  | 	s.vars[&memVar] = s.newValue3A(ssa.OpStore, types.TypeMem, t, dst, val, s.mem()) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (s *state) zero(t *types.Type, dst *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	s.instrument(t, dst, true) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 	store := s.newValue2I(ssa.OpZero, types.TypeMem, t.Size(), dst, s.mem()) | 
					
						
							|  |  |  | 	store.Aux = t | 
					
						
							|  |  |  | 	s.vars[&memVar] = store | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (s *state) move(t *types.Type, dst, src *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	s.instrument(t, src, false) | 
					
						
							|  |  |  | 	s.instrument(t, dst, true) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 	store := s.newValue3I(ssa.OpMove, types.TypeMem, t.Size(), dst, src, s.mem()) | 
					
						
							|  |  |  | 	store.Aux = t | 
					
						
							|  |  |  | 	s.vars[&memVar] = store | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | // stmtList converts the statement list n to SSA and adds it to s. | 
					
						
							| 
									
										
										
										
											2016-03-08 10:26:20 -08:00
										 |  |  | func (s *state) stmtList(l Nodes) { | 
					
						
							| 
									
										
										
										
											2016-03-08 15:10:26 -08:00
										 |  |  | 	for _, n := range l.Slice() { | 
					
						
							|  |  |  | 		s.stmt(n) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | // stmt converts the statement n to SSA and adds it to s. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) stmt(n *Node) { | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	if !(n.Op == OVARKILL || n.Op == OVARLIVE) { | 
					
						
							|  |  |  | 		// OVARKILL and OVARLIVE are invisible to the programmer, so we don't use their line numbers to avoid confusion in debugging. | 
					
						
							|  |  |  | 		s.pushLine(n.Pos) | 
					
						
							|  |  |  | 		defer s.popLine() | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 11:05:03 -07:00
										 |  |  | 	// If s.curBlock is nil, and n isn't a label (which might have an associated goto somewhere), | 
					
						
							|  |  |  | 	// then this code is dead. Stop here. | 
					
						
							|  |  |  | 	if s.curBlock == nil && n.Op != OLABEL { | 
					
						
							|  |  |  | 		return | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	s.stmtList(n.Ninit) | 
					
						
							|  |  |  | 	switch n.Op { | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case OBLOCK: | 
					
						
							|  |  |  | 		s.stmtList(n.List) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-16 13:25:36 -06:00
										 |  |  | 	// No-ops | 
					
						
							| 
									
										
										
										
											2015-08-28 21:19:40 -05:00
										 |  |  | 	case OEMPTY, ODCLCONST, ODCLTYPE, OFALL: | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-16 13:25:36 -06:00
										 |  |  | 	// Expression statements | 
					
						
							| 
									
										
										
										
											2016-08-23 16:49:28 -07:00
										 |  |  | 	case OCALLFUNC: | 
					
						
							|  |  |  | 		if isIntrinsicCall(n) { | 
					
						
							|  |  |  | 			s.intrinsicCall(n) | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		fallthrough | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case OCALLMETH, OCALLINTER: | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		s.call(n, callNormal) | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if n.Op == OCALLFUNC && n.Left.Op == ONAME && n.Left.Class() == PFUNC { | 
					
						
							| 
									
										
										
										
											2016-09-11 08:29:04 -07:00
										 |  |  | 			if fn := n.Left.Sym.Name; compiling_runtime && fn == "throw" || | 
					
						
							| 
									
										
										
										
											2018-04-26 18:30:11 +02:00
										 |  |  | 				n.Left.Sym.Pkg == Runtimepkg && (fn == "throwinit" || fn == "gopanic" || fn == "panicwrap" || fn == "block" || fn == "panicmakeslicelen" || fn == "panicmakeslicecap") { | 
					
						
							| 
									
										
										
										
											2016-09-11 08:29:04 -07:00
										 |  |  | 				m := s.mem() | 
					
						
							|  |  |  | 				b := s.endBlock() | 
					
						
							|  |  |  | 				b.Kind = ssa.BlockExit | 
					
						
							|  |  |  | 				b.SetControl(m) | 
					
						
							|  |  |  | 				// TODO: never rewrite OPANIC to OCALLFUNC in the | 
					
						
							|  |  |  | 				// first place. Need to wait until all backends | 
					
						
							|  |  |  | 				// go through SSA. | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-02-06 22:35:34 -08:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case ODEFER: | 
					
						
							|  |  |  | 		s.call(n.Left, callDefer) | 
					
						
							|  |  |  | 	case OPROC: | 
					
						
							|  |  |  | 		s.call(n.Left, callGo) | 
					
						
							| 
									
										
										
										
											2015-07-16 13:25:36 -06:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	case OAS2DOTTYPE: | 
					
						
							| 
									
										
										
										
											2016-03-08 15:10:26 -08:00
										 |  |  | 		res, resok := s.dottype(n.Rlist.First(), true) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		deref := false | 
					
						
							|  |  |  | 		if !canSSAType(n.Rlist.First().Type) { | 
					
						
							|  |  |  | 			if res.Op != ssa.OpLoad { | 
					
						
							|  |  |  | 				s.Fatalf("dottype of non-load") | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			mem := s.mem() | 
					
						
							|  |  |  | 			if mem.Op == ssa.OpVarKill { | 
					
						
							|  |  |  | 				mem = mem.Args[0] | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			if res.Args[1] != mem { | 
					
						
							|  |  |  | 				s.Fatalf("memory no longer live from 2-result dottype load") | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			deref = true | 
					
						
							|  |  |  | 			res = res.Args[0] | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 		s.assign(n.List.First(), res, deref, 0) | 
					
						
							|  |  |  | 		s.assign(n.List.Second(), resok, false, 0) | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 		return | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-06 15:43:47 -04:00
										 |  |  | 	case OAS2FUNC: | 
					
						
							|  |  |  | 		// We come here only when it is an intrinsic call returning two values. | 
					
						
							|  |  |  | 		if !isIntrinsicCall(n.Rlist.First()) { | 
					
						
							|  |  |  | 			s.Fatalf("non-intrinsic AS2FUNC not expanded %v", n.Rlist.First()) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		v := s.intrinsicCall(n.Rlist.First()) | 
					
						
							|  |  |  | 		v1 := s.newValue1(ssa.OpSelect0, n.List.First().Type, v) | 
					
						
							|  |  |  | 		v2 := s.newValue1(ssa.OpSelect1, n.List.Second().Type, v) | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 		s.assign(n.List.First(), v1, false, 0) | 
					
						
							|  |  |  | 		s.assign(n.List.Second(), v2, false, 0) | 
					
						
							| 
									
										
										
										
											2016-10-06 15:43:47 -04:00
										 |  |  | 		return | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case ODCL: | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if n.Left.Class() == PAUTOHEAP { | 
					
						
							| 
									
										
											  
											
												cmd/compile: fix liveness computation for heap-escaped parameters
The liveness computation of parameters generally was never
correct, but forcing all parameters to be live throughout the
function covered up that problem. The new SSA back end is
too clever: even though it currently keeps the parameter values live
throughout the function, it may find optimizations that mean
the current values are not written back to the original parameter
stack slots immediately or ever (for example if a parameter is set
to nil, SSA constant propagation may replace all later uses of the
parameter with a constant nil, eliminating the need to write the nil
value back to the stack slot), so the liveness code must now
track the actual operations on the stack slots, exposing these
problems.
One small problem in the handling of arguments is that nodarg
can return ONAME PPARAM nodes with adjusted offsets, so that
there are actually multiple *Node pointers for the same parameter
in the instruction stream. This might be possible to correct, but
not in this CL. For now, we fix this by using n.Orig instead of n
when considering PPARAM and PPARAMOUT nodes.
The major problem in the handling of arguments is general
confusion in the liveness code about the meaning of PPARAM|PHEAP
and PPARAMOUT|PHEAP nodes, especially as contrasted with PAUTO|PHEAP.
The difference between these two is that when a local variable "moves"
to the heap, it's really just allocated there to start with; in contrast,
when an argument moves to the heap, the actual data has to be copied
there from the stack at the beginning of the function, and when a
result "moves" to the heap the value in the heap has to be copied
back to the stack when the function returns
This general confusion is also present in the SSA back end.
The PHEAP bit worked decently when I first introduced it 7 years ago (!)
in 391425ae. The back end did nothing sophisticated, and in particular
there was no analysis at all: no escape analysis, no liveness analysis,
and certainly no SSA back end. But the complications caused in the
various downstream consumers suggest that this should be a detail
kept mainly in the front end.
This CL therefore eliminates both the PHEAP bit and even the idea of
"heap variables" from the back ends.
First, it replaces the PPARAM|PHEAP, PPARAMOUT|PHEAP, and PAUTO|PHEAP
variable classes with the single PAUTOHEAP, a pseudo-class indicating
a variable maintained on the heap and available by indirecting a
local variable kept on the stack (a plain PAUTO).
Second, walkexpr replaces all references to PAUTOHEAP variables
with indirections of the corresponding PAUTO variable.
The back ends and the liveness code now just see plain indirected
variables. This may actually produce better code, but the real goal
here is to eliminate these little-used and somewhat suspect code
paths in the back end analyses.
The OPARAM node type goes away too.
A followup CL will do the same to PPARAMREF. I'm not sure that
the back ends (SSA in particular) are handling those right either,
and with the framework established in this CL that change is trivial
and the result clearly more correct.
Fixes #15747.
Change-Id: I2770b1ce3cbc93981bfc7166be66a9da12013d74
Reviewed-on: https://go-review.googlesource.com/23393
Reviewed-by: Keith Randall <khr@golang.org>
Run-TryBot: Russ Cox <rsc@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2016-05-25 01:33:24 -04:00
										 |  |  | 			Fatalf("DCL %v", n) | 
					
						
							| 
									
										
										
										
											2015-06-12 14:23:29 +01:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	case OLABEL: | 
					
						
							|  |  |  | 		sym := n.Left.Sym | 
					
						
							|  |  |  | 		lab := s.label(sym) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// Associate label with its control flow node, if any | 
					
						
							| 
									
										
										
										
											2017-03-14 11:05:03 -07:00
										 |  |  | 		if ctl := n.labeledControl(); ctl != nil { | 
					
						
							|  |  |  | 			s.labeledNodes[ctl] = lab | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		// The label might already have a target block via a goto. | 
					
						
							|  |  |  | 		if lab.target == nil { | 
					
						
							|  |  |  | 			lab.target = s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2015-06-12 16:24:33 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-03 16:15:35 -08:00
										 |  |  | 		// Go to that label. | 
					
						
							|  |  |  | 		// (We pretend "label:" is preceded by "goto label", unless the predecessor is unreachable.) | 
					
						
							|  |  |  | 		if s.curBlock != nil { | 
					
						
							|  |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.AddEdgeTo(lab.target) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 		s.startBlock(lab.target) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case OGOTO: | 
					
						
							|  |  |  | 		sym := n.Left.Sym | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		lab := s.label(sym) | 
					
						
							|  |  |  | 		if lab.target == nil { | 
					
						
							|  |  |  | 			lab.target = s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 		b := s.endBlock() | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		b.Pos = s.lastPos.WithIsStmt() // Do this even if b is an empty block. | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 		b.AddEdgeTo(lab.target) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-12-19 10:30:44 -08:00
										 |  |  | 	case OAS: | 
					
						
							| 
									
										
										
										
											2016-04-01 11:05:30 -07:00
										 |  |  | 		if n.Left == n.Right && n.Left.Op == ONAME { | 
					
						
							|  |  |  | 			// An x=x assignment. No point in doing anything | 
					
						
							|  |  |  | 			// here. In addition, skipping this assignment | 
					
						
							|  |  |  | 			// prevents generating: | 
					
						
							|  |  |  | 			//   VARDEF x | 
					
						
							|  |  |  | 			//   COPY x -> x | 
					
						
							|  |  |  | 			// which is bad because x is incorrectly considered | 
					
						
							|  |  |  | 			// dead before the vardef. See issue #14904. | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		// Evaluate RHS. | 
					
						
							|  |  |  | 		rhs := n.Right | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 		if rhs != nil { | 
					
						
							|  |  |  | 			switch rhs.Op { | 
					
						
							| 
									
										
										
										
											2016-06-19 07:20:28 -07:00
										 |  |  | 			case OSTRUCTLIT, OARRAYLIT, OSLICELIT: | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 				// All literals with nonzero fields have already been | 
					
						
							|  |  |  | 				// rewritten during walk. Any that remain are just T{} | 
					
						
							|  |  |  | 				// or equivalents. Use the zero value. | 
					
						
							| 
									
										
										
										
											2018-04-08 13:39:10 +01:00
										 |  |  | 				if !isZero(rhs) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 					Fatalf("literal with nonzero value in SSA: %v", rhs) | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				rhs = nil | 
					
						
							|  |  |  | 			case OAPPEND: | 
					
						
							| 
									
										
										
										
											2017-08-10 23:41:17 +09:00
										 |  |  | 				// Check whether we're writing the result of an append back to the same slice. | 
					
						
							|  |  |  | 				// If so, we handle it specially to avoid write barriers on the fast | 
					
						
							|  |  |  | 				// (non-growth) path. | 
					
						
							| 
									
										
										
										
											2017-12-04 13:30:13 -08:00
										 |  |  | 				if !samesafeexpr(n.Left, rhs.List.First()) || Debug['N'] != 0 { | 
					
						
							| 
									
										
										
										
											2017-08-10 23:41:17 +09:00
										 |  |  | 					break | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 				// If the slice can be SSA'd, it'll be on the stack, | 
					
						
							|  |  |  | 				// so there will be no write barriers, | 
					
						
							|  |  |  | 				// so there's no need to attempt to prevent them. | 
					
						
							| 
									
										
										
										
											2017-08-10 23:41:17 +09:00
										 |  |  | 				if s.canSSA(n.Left) { | 
					
						
							|  |  |  | 					if Debug_append > 0 { // replicating old diagnostic message | 
					
						
							|  |  |  | 						Warnl(n.Pos, "append: len-only update (in local slice)") | 
					
						
							| 
									
										
										
										
											2016-10-19 11:47:52 -04:00
										 |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-08-10 23:41:17 +09:00
										 |  |  | 					break | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				if Debug_append > 0 { | 
					
						
							|  |  |  | 					Warnl(n.Pos, "append: len-only update") | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-08-10 23:41:17 +09:00
										 |  |  | 				s.append(rhs, true) | 
					
						
							|  |  |  | 				return | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-04-27 09:27:52 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-04-08 13:39:10 +01:00
										 |  |  | 		if n.Left.isBlank() { | 
					
						
							| 
									
										
										
										
											2017-04-27 09:27:52 -07:00
										 |  |  | 			// _ = rhs | 
					
						
							|  |  |  | 			// Just evaluate rhs for side-effects. | 
					
						
							|  |  |  | 			if rhs != nil { | 
					
						
							|  |  |  | 				s.expr(rhs) | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		var t *types.Type | 
					
						
							|  |  |  | 		if n.Right != nil { | 
					
						
							|  |  |  | 			t = n.Right.Type | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			t = n.Left.Type | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		var r *ssa.Value | 
					
						
							|  |  |  | 		deref := !canSSAType(t) | 
					
						
							|  |  |  | 		if deref { | 
					
						
							|  |  |  | 			if rhs == nil { | 
					
						
							|  |  |  | 				r = nil // Signal assign to use OpZero. | 
					
						
							|  |  |  | 			} else { | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 				r = s.addr(rhs, false) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			if rhs == nil { | 
					
						
							|  |  |  | 				r = s.zeroVal(t) | 
					
						
							| 
									
										
										
										
											2015-09-18 22:12:38 -07:00
										 |  |  | 			} else { | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 				r = s.expr(rhs) | 
					
						
							| 
									
										
										
										
											2015-09-18 22:12:38 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-08-29 14:54:45 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 		var skip skipMask | 
					
						
							|  |  |  | 		if rhs != nil && (rhs.Op == OSLICE || rhs.Op == OSLICE3 || rhs.Op == OSLICESTR) && samesafeexpr(rhs.Left, n.Left) { | 
					
						
							|  |  |  | 			// We're assigning a slicing operation back to its source. | 
					
						
							|  |  |  | 			// Don't write back fields we aren't changing. See issue #14855. | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 			i, j, k := rhs.SliceBounds() | 
					
						
							| 
									
										
										
										
											2016-04-01 14:51:02 -07:00
										 |  |  | 			if i != nil && (i.Op == OLITERAL && i.Val().Ctype() == CTINT && i.Int64() == 0) { | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 				// [0:...] is the same as [:...] | 
					
						
							|  |  |  | 				i = nil | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			// TODO: detect defaults for len/cap also. | 
					
						
							|  |  |  | 			// Currently doesn't really work because (*p)[:len(*p)] appears here as: | 
					
						
							|  |  |  | 			//    tmp = len(*p) | 
					
						
							|  |  |  | 			//    (*p)[:tmp] | 
					
						
							|  |  |  | 			//if j != nil && (j.Op == OLEN && samesafeexpr(j.Left, n.Left)) { | 
					
						
							|  |  |  | 			//      j = nil | 
					
						
							|  |  |  | 			//} | 
					
						
							|  |  |  | 			//if k != nil && (k.Op == OCAP && samesafeexpr(k.Left, n.Left)) { | 
					
						
							|  |  |  | 			//      k = nil | 
					
						
							|  |  |  | 			//} | 
					
						
							|  |  |  | 			if i == nil { | 
					
						
							|  |  |  | 				skip |= skipPtr | 
					
						
							|  |  |  | 				if j == nil { | 
					
						
							|  |  |  | 					skip |= skipLen | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				if k == nil { | 
					
						
							|  |  |  | 					skip |= skipCap | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 		s.assign(n.Left, r, deref, skip) | 
					
						
							| 
									
										
										
										
											2015-06-12 14:23:29 +01:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case OIF: | 
					
						
							|  |  |  | 		bThen := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		var bElse *ssa.Block | 
					
						
							| 
									
										
										
										
											2017-04-25 08:46:00 -07:00
										 |  |  | 		var likely int8 | 
					
						
							|  |  |  | 		if n.Likely() { | 
					
						
							|  |  |  | 			likely = 1 | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-03-08 15:10:26 -08:00
										 |  |  | 		if n.Rlist.Len() != 0 { | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 			bElse = s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2017-04-25 08:46:00 -07:00
										 |  |  | 			s.condBranch(n.Left, bThen, bElse, likely) | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 		} else { | 
					
						
							| 
									
										
										
										
											2017-04-25 08:46:00 -07:00
										 |  |  | 			s.condBranch(n.Left, bThen, bEnd, likely) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		s.startBlock(bThen) | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | 		s.stmtList(n.Nbody) | 
					
						
							| 
									
										
										
										
											2015-07-20 18:42:45 -07:00
										 |  |  | 		if b := s.endBlock(); b != nil { | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 			b.AddEdgeTo(bEnd) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-08 15:10:26 -08:00
										 |  |  | 		if n.Rlist.Len() != 0 { | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 			s.startBlock(bElse) | 
					
						
							| 
									
										
										
										
											2015-06-11 10:20:39 -07:00
										 |  |  | 			s.stmtList(n.Rlist) | 
					
						
							| 
									
										
										
										
											2015-07-20 18:42:45 -07:00
										 |  |  | 			if b := s.endBlock(); b != nil { | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 				b.AddEdgeTo(bEnd) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		s.startBlock(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case ORETURN: | 
					
						
							|  |  |  | 		s.stmtList(n.List) | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 		b := s.exit() | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		b.Pos = s.lastPos.WithIsStmt() | 
					
						
							| 
									
										
										
										
											2017-10-29 11:53:18 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-08 21:28:44 -07:00
										 |  |  | 	case ORETJMP: | 
					
						
							|  |  |  | 		s.stmtList(n.List) | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		b := s.exit() | 
					
						
							|  |  |  | 		b.Kind = ssa.BlockRetJmp // override BlockRet | 
					
						
							| 
									
										
										
										
											2017-10-04 11:42:37 -07:00
										 |  |  | 		b.Aux = n.Sym.Linksym() | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	case OCONTINUE, OBREAK: | 
					
						
							|  |  |  | 		var to *ssa.Block | 
					
						
							|  |  |  | 		if n.Left == nil { | 
					
						
							|  |  |  | 			// plain break/continue | 
					
						
							| 
									
										
										
										
											2017-03-14 11:05:03 -07:00
										 |  |  | 			switch n.Op { | 
					
						
							|  |  |  | 			case OCONTINUE: | 
					
						
							|  |  |  | 				to = s.continueTo | 
					
						
							|  |  |  | 			case OBREAK: | 
					
						
							|  |  |  | 				to = s.breakTo | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			// labeled break/continue; look up the target | 
					
						
							|  |  |  | 			sym := n.Left.Sym | 
					
						
							|  |  |  | 			lab := s.label(sym) | 
					
						
							|  |  |  | 			switch n.Op { | 
					
						
							|  |  |  | 			case OCONTINUE: | 
					
						
							|  |  |  | 				to = lab.continueTarget | 
					
						
							|  |  |  | 			case OBREAK: | 
					
						
							|  |  |  | 				to = lab.breakTarget | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 11:05:03 -07:00
										 |  |  | 		b := s.endBlock() | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		b.Pos = s.lastPos.WithIsStmt() // Do this even if b is an empty block. | 
					
						
							| 
									
										
										
										
											2017-03-14 11:05:03 -07:00
										 |  |  | 		b.AddEdgeTo(to) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 	case OFOR, OFORUNTIL: | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		// OFOR: for Ninit; Left; Right { Nbody } | 
					
						
							| 
									
										
											  
											
												cmd/compile: don't produce a past-the-end pointer in range loops
Currently, range loops over slices and arrays are compiled roughly
like:
for i, x := range s { b }
  ⇓
for i, _n, _p := 0, len(s), &s[0]; i < _n; i, _p = i+1, _p + unsafe.Sizeof(s[0]) { b }
  ⇓
i, _n, _p := 0, len(s), &s[0]
goto cond
body:
{ b }
i, _p = i+1, _p + unsafe.Sizeof(s[0])
cond:
if i < _n { goto body } else { goto end }
end:
The problem with this lowering is that _p may temporarily point past
the end of the allocation the moment before the loop terminates. Right
now this isn't a problem because there's never a safe-point during
this brief moment.
We're about to introduce safe-points everywhere, so this bad pointer
is going to be a problem. We could mark the increment as an unsafe
block, but this inhibits reordering opportunities and could result in
infrequent safe-points if the body is short.
Instead, this CL fixes this by changing how we compile range loops to
never produce this past-the-end pointer. It changes the lowering to
roughly:
i, _n, _p := 0, len(s), &s[0]
if i < _n { goto body } else { goto end }
top:
_p += unsafe.Sizeof(s[0])
body:
{ b }
i++
if i < _n { goto top } else { goto end }
end:
Notably, the increment is split into two parts: we increment the index
before checking the condition, but increment the pointer only *after*
the condition check has succeeded.
The implementation builds on the OFORUNTIL construct that was
introduced during the loop preemption experiments, since OFORUNTIL
places the increment and condition after the loop body. To support the
extra "late increment" step, we further define OFORUNTIL's "List"
field to contain the late increment statements. This makes all of this
a relatively small change.
This depends on the improvements to the prove pass in CL 102603. With
the current lowering, bounds-check elimination knows that i < _n in
the body because the body block is dominated by the cond block. In the
new lowering, deriving this fact requires detecting that i < _n on
*both* paths into body and hence is true in body. CL 102603 made prove
able to detect this.
The code size effect of this is minimal. The cmd/go binary on
linux/amd64 increases by 0.17%. Performance-wise, this actually
appears to be a net win, though it's mostly noise:
name                      old time/op    new time/op    delta
BinaryTree17-12              2.80s ± 0%     2.61s ± 1%  -6.88%  (p=0.000 n=20+18)
Fannkuch11-12                2.41s ± 0%     2.42s ± 0%  +0.05%  (p=0.005 n=20+20)
FmtFprintfEmpty-12          41.6ns ± 5%    41.4ns ± 6%    ~     (p=0.765 n=20+19)
FmtFprintfString-12         69.4ns ± 3%    69.3ns ± 1%    ~     (p=0.084 n=19+17)
FmtFprintfInt-12            76.1ns ± 1%    77.3ns ± 1%  +1.57%  (p=0.000 n=19+19)
FmtFprintfIntInt-12          122ns ± 2%     123ns ± 3%  +0.95%  (p=0.015 n=20+20)
FmtFprintfPrefixedInt-12     153ns ± 2%     151ns ± 3%  -1.27%  (p=0.013 n=20+20)
FmtFprintfFloat-12           215ns ± 0%     216ns ± 0%  +0.47%  (p=0.000 n=20+16)
FmtManyArgs-12               486ns ± 1%     498ns ± 0%  +2.40%  (p=0.000 n=20+17)
GobDecode-12                6.43ms ± 0%    6.50ms ± 0%  +1.08%  (p=0.000 n=18+19)
GobEncode-12                5.43ms ± 1%    5.47ms ± 0%  +0.76%  (p=0.000 n=20+20)
Gzip-12                      218ms ± 1%     218ms ± 1%    ~     (p=0.883 n=20+20)
Gunzip-12                   38.8ms ± 0%    38.9ms ± 0%    ~     (p=0.644 n=19+19)
HTTPClientServer-12         76.2µs ± 1%    76.4µs ± 2%    ~     (p=0.218 n=20+20)
JSONEncode-12               12.2ms ± 0%    12.3ms ± 1%  +0.45%  (p=0.000 n=19+19)
JSONDecode-12               54.2ms ± 1%    53.3ms ± 0%  -1.67%  (p=0.000 n=20+20)
Mandelbrot200-12            3.71ms ± 0%    3.71ms ± 0%    ~     (p=0.143 n=19+20)
GoParse-12                  3.22ms ± 0%    3.19ms ± 1%  -0.72%  (p=0.000 n=20+20)
RegexpMatchEasy0_32-12      76.7ns ± 1%    75.8ns ± 1%  -1.19%  (p=0.000 n=20+17)
RegexpMatchEasy0_1K-12       245ns ± 1%     243ns ± 0%  -0.72%  (p=0.000 n=18+17)
RegexpMatchEasy1_32-12      71.9ns ± 0%    71.7ns ± 1%  -0.39%  (p=0.006 n=12+18)
RegexpMatchEasy1_1K-12       358ns ± 1%     354ns ± 1%  -1.13%  (p=0.000 n=20+19)
RegexpMatchMedium_32-12      105ns ± 2%     105ns ± 1%  -0.63%  (p=0.007 n=19+20)
RegexpMatchMedium_1K-12     31.9µs ± 1%    31.9µs ± 1%    ~     (p=1.000 n=17+17)
RegexpMatchHard_32-12       1.51µs ± 1%    1.52µs ± 2%  +0.46%  (p=0.042 n=18+18)
RegexpMatchHard_1K-12       45.3µs ± 1%    45.5µs ± 2%  +0.44%  (p=0.029 n=18+19)
Revcomp-12                   388ms ± 1%     385ms ± 0%  -0.57%  (p=0.000 n=19+18)
Template-12                 63.0ms ± 1%    63.3ms ± 0%  +0.50%  (p=0.000 n=19+20)
TimeParse-12                 309ns ± 1%     307ns ± 0%  -0.62%  (p=0.000 n=20+20)
TimeFormat-12                328ns ± 0%     333ns ± 0%  +1.35%  (p=0.000 n=19+19)
[Geo mean]                  47.0µs         46.9µs       -0.20%
(https://perf.golang.org/search?q=upload:20180326.1)
For #10958.
For #24543.
Change-Id: Icbd52e711fdbe7938a1fea3e6baca1104b53ac3a
Reviewed-on: https://go-review.googlesource.com/102604
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2018-03-22 12:04:51 -04:00
										 |  |  | 		// cond (Left); body (Nbody); incr (Right) | 
					
						
							|  |  |  | 		// | 
					
						
							|  |  |  | 		// OFORUNTIL: for Ninit; Left; Right; List { Nbody } | 
					
						
							|  |  |  | 		// => body: { Nbody }; incr: Right; if Left { lateincr: List; goto body }; end: | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		bCond := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		bBody := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		bIncr := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 		// first, jump to condition test (OFOR) or body (OFORUNTIL) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		b := s.endBlock() | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 		if n.Op == OFOR { | 
					
						
							|  |  |  | 			b.AddEdgeTo(bCond) | 
					
						
							|  |  |  | 			// generate code to test condition | 
					
						
							|  |  |  | 			s.startBlock(bCond) | 
					
						
							|  |  |  | 			if n.Left != nil { | 
					
						
							|  |  |  | 				s.condBranch(n.Left, bBody, bEnd, 1) | 
					
						
							|  |  |  | 			} else { | 
					
						
							|  |  |  | 				b := s.endBlock() | 
					
						
							|  |  |  | 				b.Kind = ssa.BlockPlain | 
					
						
							|  |  |  | 				b.AddEdgeTo(bBody) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		} else { | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 			b.AddEdgeTo(bBody) | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 		// set up for continue/break in body | 
					
						
							|  |  |  | 		prevContinue := s.continueTo | 
					
						
							|  |  |  | 		prevBreak := s.breakTo | 
					
						
							|  |  |  | 		s.continueTo = bIncr | 
					
						
							|  |  |  | 		s.breakTo = bEnd | 
					
						
							|  |  |  | 		lab := s.labeledNodes[n] | 
					
						
							|  |  |  | 		if lab != nil { | 
					
						
							|  |  |  | 			// labeled for loop | 
					
						
							|  |  |  | 			lab.continueTarget = bIncr | 
					
						
							|  |  |  | 			lab.breakTarget = bEnd | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		// generate body | 
					
						
							|  |  |  | 		s.startBlock(bBody) | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | 		s.stmtList(n.Nbody) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		// tear down continue/break | 
					
						
							|  |  |  | 		s.continueTo = prevContinue | 
					
						
							|  |  |  | 		s.breakTo = prevBreak | 
					
						
							|  |  |  | 		if lab != nil { | 
					
						
							|  |  |  | 			lab.continueTarget = nil | 
					
						
							|  |  |  | 			lab.breakTarget = nil | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// done with body, goto incr | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		if b := s.endBlock(); b != nil { | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 			b.AddEdgeTo(bIncr) | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: don't produce a past-the-end pointer in range loops
Currently, range loops over slices and arrays are compiled roughly
like:
for i, x := range s { b }
  ⇓
for i, _n, _p := 0, len(s), &s[0]; i < _n; i, _p = i+1, _p + unsafe.Sizeof(s[0]) { b }
  ⇓
i, _n, _p := 0, len(s), &s[0]
goto cond
body:
{ b }
i, _p = i+1, _p + unsafe.Sizeof(s[0])
cond:
if i < _n { goto body } else { goto end }
end:
The problem with this lowering is that _p may temporarily point past
the end of the allocation the moment before the loop terminates. Right
now this isn't a problem because there's never a safe-point during
this brief moment.
We're about to introduce safe-points everywhere, so this bad pointer
is going to be a problem. We could mark the increment as an unsafe
block, but this inhibits reordering opportunities and could result in
infrequent safe-points if the body is short.
Instead, this CL fixes this by changing how we compile range loops to
never produce this past-the-end pointer. It changes the lowering to
roughly:
i, _n, _p := 0, len(s), &s[0]
if i < _n { goto body } else { goto end }
top:
_p += unsafe.Sizeof(s[0])
body:
{ b }
i++
if i < _n { goto top } else { goto end }
end:
Notably, the increment is split into two parts: we increment the index
before checking the condition, but increment the pointer only *after*
the condition check has succeeded.
The implementation builds on the OFORUNTIL construct that was
introduced during the loop preemption experiments, since OFORUNTIL
places the increment and condition after the loop body. To support the
extra "late increment" step, we further define OFORUNTIL's "List"
field to contain the late increment statements. This makes all of this
a relatively small change.
This depends on the improvements to the prove pass in CL 102603. With
the current lowering, bounds-check elimination knows that i < _n in
the body because the body block is dominated by the cond block. In the
new lowering, deriving this fact requires detecting that i < _n on
*both* paths into body and hence is true in body. CL 102603 made prove
able to detect this.
The code size effect of this is minimal. The cmd/go binary on
linux/amd64 increases by 0.17%. Performance-wise, this actually
appears to be a net win, though it's mostly noise:
name                      old time/op    new time/op    delta
BinaryTree17-12              2.80s ± 0%     2.61s ± 1%  -6.88%  (p=0.000 n=20+18)
Fannkuch11-12                2.41s ± 0%     2.42s ± 0%  +0.05%  (p=0.005 n=20+20)
FmtFprintfEmpty-12          41.6ns ± 5%    41.4ns ± 6%    ~     (p=0.765 n=20+19)
FmtFprintfString-12         69.4ns ± 3%    69.3ns ± 1%    ~     (p=0.084 n=19+17)
FmtFprintfInt-12            76.1ns ± 1%    77.3ns ± 1%  +1.57%  (p=0.000 n=19+19)
FmtFprintfIntInt-12          122ns ± 2%     123ns ± 3%  +0.95%  (p=0.015 n=20+20)
FmtFprintfPrefixedInt-12     153ns ± 2%     151ns ± 3%  -1.27%  (p=0.013 n=20+20)
FmtFprintfFloat-12           215ns ± 0%     216ns ± 0%  +0.47%  (p=0.000 n=20+16)
FmtManyArgs-12               486ns ± 1%     498ns ± 0%  +2.40%  (p=0.000 n=20+17)
GobDecode-12                6.43ms ± 0%    6.50ms ± 0%  +1.08%  (p=0.000 n=18+19)
GobEncode-12                5.43ms ± 1%    5.47ms ± 0%  +0.76%  (p=0.000 n=20+20)
Gzip-12                      218ms ± 1%     218ms ± 1%    ~     (p=0.883 n=20+20)
Gunzip-12                   38.8ms ± 0%    38.9ms ± 0%    ~     (p=0.644 n=19+19)
HTTPClientServer-12         76.2µs ± 1%    76.4µs ± 2%    ~     (p=0.218 n=20+20)
JSONEncode-12               12.2ms ± 0%    12.3ms ± 1%  +0.45%  (p=0.000 n=19+19)
JSONDecode-12               54.2ms ± 1%    53.3ms ± 0%  -1.67%  (p=0.000 n=20+20)
Mandelbrot200-12            3.71ms ± 0%    3.71ms ± 0%    ~     (p=0.143 n=19+20)
GoParse-12                  3.22ms ± 0%    3.19ms ± 1%  -0.72%  (p=0.000 n=20+20)
RegexpMatchEasy0_32-12      76.7ns ± 1%    75.8ns ± 1%  -1.19%  (p=0.000 n=20+17)
RegexpMatchEasy0_1K-12       245ns ± 1%     243ns ± 0%  -0.72%  (p=0.000 n=18+17)
RegexpMatchEasy1_32-12      71.9ns ± 0%    71.7ns ± 1%  -0.39%  (p=0.006 n=12+18)
RegexpMatchEasy1_1K-12       358ns ± 1%     354ns ± 1%  -1.13%  (p=0.000 n=20+19)
RegexpMatchMedium_32-12      105ns ± 2%     105ns ± 1%  -0.63%  (p=0.007 n=19+20)
RegexpMatchMedium_1K-12     31.9µs ± 1%    31.9µs ± 1%    ~     (p=1.000 n=17+17)
RegexpMatchHard_32-12       1.51µs ± 1%    1.52µs ± 2%  +0.46%  (p=0.042 n=18+18)
RegexpMatchHard_1K-12       45.3µs ± 1%    45.5µs ± 2%  +0.44%  (p=0.029 n=18+19)
Revcomp-12                   388ms ± 1%     385ms ± 0%  -0.57%  (p=0.000 n=19+18)
Template-12                 63.0ms ± 1%    63.3ms ± 0%  +0.50%  (p=0.000 n=19+20)
TimeParse-12                 309ns ± 1%     307ns ± 0%  -0.62%  (p=0.000 n=20+20)
TimeFormat-12                328ns ± 0%     333ns ± 0%  +1.35%  (p=0.000 n=19+19)
[Geo mean]                  47.0µs         46.9µs       -0.20%
(https://perf.golang.org/search?q=upload:20180326.1)
For #10958.
For #24543.
Change-Id: Icbd52e711fdbe7938a1fea3e6baca1104b53ac3a
Reviewed-on: https://go-review.googlesource.com/102604
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2018-03-22 12:04:51 -04:00
										 |  |  | 		// generate incr (and, for OFORUNTIL, condition) | 
					
						
							| 
									
										
										
										
											2015-07-06 15:29:39 -07:00
										 |  |  | 		s.startBlock(bIncr) | 
					
						
							| 
									
										
										
										
											2015-06-24 17:48:22 -07:00
										 |  |  | 		if n.Right != nil { | 
					
						
							|  |  |  | 			s.stmt(n.Right) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
											  
											
												cmd/compile: don't produce a past-the-end pointer in range loops
Currently, range loops over slices and arrays are compiled roughly
like:
for i, x := range s { b }
  ⇓
for i, _n, _p := 0, len(s), &s[0]; i < _n; i, _p = i+1, _p + unsafe.Sizeof(s[0]) { b }
  ⇓
i, _n, _p := 0, len(s), &s[0]
goto cond
body:
{ b }
i, _p = i+1, _p + unsafe.Sizeof(s[0])
cond:
if i < _n { goto body } else { goto end }
end:
The problem with this lowering is that _p may temporarily point past
the end of the allocation the moment before the loop terminates. Right
now this isn't a problem because there's never a safe-point during
this brief moment.
We're about to introduce safe-points everywhere, so this bad pointer
is going to be a problem. We could mark the increment as an unsafe
block, but this inhibits reordering opportunities and could result in
infrequent safe-points if the body is short.
Instead, this CL fixes this by changing how we compile range loops to
never produce this past-the-end pointer. It changes the lowering to
roughly:
i, _n, _p := 0, len(s), &s[0]
if i < _n { goto body } else { goto end }
top:
_p += unsafe.Sizeof(s[0])
body:
{ b }
i++
if i < _n { goto top } else { goto end }
end:
Notably, the increment is split into two parts: we increment the index
before checking the condition, but increment the pointer only *after*
the condition check has succeeded.
The implementation builds on the OFORUNTIL construct that was
introduced during the loop preemption experiments, since OFORUNTIL
places the increment and condition after the loop body. To support the
extra "late increment" step, we further define OFORUNTIL's "List"
field to contain the late increment statements. This makes all of this
a relatively small change.
This depends on the improvements to the prove pass in CL 102603. With
the current lowering, bounds-check elimination knows that i < _n in
the body because the body block is dominated by the cond block. In the
new lowering, deriving this fact requires detecting that i < _n on
*both* paths into body and hence is true in body. CL 102603 made prove
able to detect this.
The code size effect of this is minimal. The cmd/go binary on
linux/amd64 increases by 0.17%. Performance-wise, this actually
appears to be a net win, though it's mostly noise:
name                      old time/op    new time/op    delta
BinaryTree17-12              2.80s ± 0%     2.61s ± 1%  -6.88%  (p=0.000 n=20+18)
Fannkuch11-12                2.41s ± 0%     2.42s ± 0%  +0.05%  (p=0.005 n=20+20)
FmtFprintfEmpty-12          41.6ns ± 5%    41.4ns ± 6%    ~     (p=0.765 n=20+19)
FmtFprintfString-12         69.4ns ± 3%    69.3ns ± 1%    ~     (p=0.084 n=19+17)
FmtFprintfInt-12            76.1ns ± 1%    77.3ns ± 1%  +1.57%  (p=0.000 n=19+19)
FmtFprintfIntInt-12          122ns ± 2%     123ns ± 3%  +0.95%  (p=0.015 n=20+20)
FmtFprintfPrefixedInt-12     153ns ± 2%     151ns ± 3%  -1.27%  (p=0.013 n=20+20)
FmtFprintfFloat-12           215ns ± 0%     216ns ± 0%  +0.47%  (p=0.000 n=20+16)
FmtManyArgs-12               486ns ± 1%     498ns ± 0%  +2.40%  (p=0.000 n=20+17)
GobDecode-12                6.43ms ± 0%    6.50ms ± 0%  +1.08%  (p=0.000 n=18+19)
GobEncode-12                5.43ms ± 1%    5.47ms ± 0%  +0.76%  (p=0.000 n=20+20)
Gzip-12                      218ms ± 1%     218ms ± 1%    ~     (p=0.883 n=20+20)
Gunzip-12                   38.8ms ± 0%    38.9ms ± 0%    ~     (p=0.644 n=19+19)
HTTPClientServer-12         76.2µs ± 1%    76.4µs ± 2%    ~     (p=0.218 n=20+20)
JSONEncode-12               12.2ms ± 0%    12.3ms ± 1%  +0.45%  (p=0.000 n=19+19)
JSONDecode-12               54.2ms ± 1%    53.3ms ± 0%  -1.67%  (p=0.000 n=20+20)
Mandelbrot200-12            3.71ms ± 0%    3.71ms ± 0%    ~     (p=0.143 n=19+20)
GoParse-12                  3.22ms ± 0%    3.19ms ± 1%  -0.72%  (p=0.000 n=20+20)
RegexpMatchEasy0_32-12      76.7ns ± 1%    75.8ns ± 1%  -1.19%  (p=0.000 n=20+17)
RegexpMatchEasy0_1K-12       245ns ± 1%     243ns ± 0%  -0.72%  (p=0.000 n=18+17)
RegexpMatchEasy1_32-12      71.9ns ± 0%    71.7ns ± 1%  -0.39%  (p=0.006 n=12+18)
RegexpMatchEasy1_1K-12       358ns ± 1%     354ns ± 1%  -1.13%  (p=0.000 n=20+19)
RegexpMatchMedium_32-12      105ns ± 2%     105ns ± 1%  -0.63%  (p=0.007 n=19+20)
RegexpMatchMedium_1K-12     31.9µs ± 1%    31.9µs ± 1%    ~     (p=1.000 n=17+17)
RegexpMatchHard_32-12       1.51µs ± 1%    1.52µs ± 2%  +0.46%  (p=0.042 n=18+18)
RegexpMatchHard_1K-12       45.3µs ± 1%    45.5µs ± 2%  +0.44%  (p=0.029 n=18+19)
Revcomp-12                   388ms ± 1%     385ms ± 0%  -0.57%  (p=0.000 n=19+18)
Template-12                 63.0ms ± 1%    63.3ms ± 0%  +0.50%  (p=0.000 n=19+20)
TimeParse-12                 309ns ± 1%     307ns ± 0%  -0.62%  (p=0.000 n=20+20)
TimeFormat-12                328ns ± 0%     333ns ± 0%  +1.35%  (p=0.000 n=19+19)
[Geo mean]                  47.0µs         46.9µs       -0.20%
(https://perf.golang.org/search?q=upload:20180326.1)
For #10958.
For #24543.
Change-Id: Icbd52e711fdbe7938a1fea3e6baca1104b53ac3a
Reviewed-on: https://go-review.googlesource.com/102604
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2018-03-22 12:04:51 -04:00
										 |  |  | 		if n.Op == OFOR { | 
					
						
							|  |  |  | 			if b := s.endBlock(); b != nil { | 
					
						
							|  |  |  | 				b.AddEdgeTo(bCond) | 
					
						
							|  |  |  | 				// It can happen that bIncr ends in a block containing only VARKILL, | 
					
						
							|  |  |  | 				// and that muddles the debugging experience. | 
					
						
							|  |  |  | 				if n.Op != OFORUNTIL && b.Pos == src.NoXPos { | 
					
						
							|  |  |  | 					b.Pos = bCond.Pos | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 			} | 
					
						
							| 
									
										
											  
											
												cmd/compile: don't produce a past-the-end pointer in range loops
Currently, range loops over slices and arrays are compiled roughly
like:
for i, x := range s { b }
  ⇓
for i, _n, _p := 0, len(s), &s[0]; i < _n; i, _p = i+1, _p + unsafe.Sizeof(s[0]) { b }
  ⇓
i, _n, _p := 0, len(s), &s[0]
goto cond
body:
{ b }
i, _p = i+1, _p + unsafe.Sizeof(s[0])
cond:
if i < _n { goto body } else { goto end }
end:
The problem with this lowering is that _p may temporarily point past
the end of the allocation the moment before the loop terminates. Right
now this isn't a problem because there's never a safe-point during
this brief moment.
We're about to introduce safe-points everywhere, so this bad pointer
is going to be a problem. We could mark the increment as an unsafe
block, but this inhibits reordering opportunities and could result in
infrequent safe-points if the body is short.
Instead, this CL fixes this by changing how we compile range loops to
never produce this past-the-end pointer. It changes the lowering to
roughly:
i, _n, _p := 0, len(s), &s[0]
if i < _n { goto body } else { goto end }
top:
_p += unsafe.Sizeof(s[0])
body:
{ b }
i++
if i < _n { goto top } else { goto end }
end:
Notably, the increment is split into two parts: we increment the index
before checking the condition, but increment the pointer only *after*
the condition check has succeeded.
The implementation builds on the OFORUNTIL construct that was
introduced during the loop preemption experiments, since OFORUNTIL
places the increment and condition after the loop body. To support the
extra "late increment" step, we further define OFORUNTIL's "List"
field to contain the late increment statements. This makes all of this
a relatively small change.
This depends on the improvements to the prove pass in CL 102603. With
the current lowering, bounds-check elimination knows that i < _n in
the body because the body block is dominated by the cond block. In the
new lowering, deriving this fact requires detecting that i < _n on
*both* paths into body and hence is true in body. CL 102603 made prove
able to detect this.
The code size effect of this is minimal. The cmd/go binary on
linux/amd64 increases by 0.17%. Performance-wise, this actually
appears to be a net win, though it's mostly noise:
name                      old time/op    new time/op    delta
BinaryTree17-12              2.80s ± 0%     2.61s ± 1%  -6.88%  (p=0.000 n=20+18)
Fannkuch11-12                2.41s ± 0%     2.42s ± 0%  +0.05%  (p=0.005 n=20+20)
FmtFprintfEmpty-12          41.6ns ± 5%    41.4ns ± 6%    ~     (p=0.765 n=20+19)
FmtFprintfString-12         69.4ns ± 3%    69.3ns ± 1%    ~     (p=0.084 n=19+17)
FmtFprintfInt-12            76.1ns ± 1%    77.3ns ± 1%  +1.57%  (p=0.000 n=19+19)
FmtFprintfIntInt-12          122ns ± 2%     123ns ± 3%  +0.95%  (p=0.015 n=20+20)
FmtFprintfPrefixedInt-12     153ns ± 2%     151ns ± 3%  -1.27%  (p=0.013 n=20+20)
FmtFprintfFloat-12           215ns ± 0%     216ns ± 0%  +0.47%  (p=0.000 n=20+16)
FmtManyArgs-12               486ns ± 1%     498ns ± 0%  +2.40%  (p=0.000 n=20+17)
GobDecode-12                6.43ms ± 0%    6.50ms ± 0%  +1.08%  (p=0.000 n=18+19)
GobEncode-12                5.43ms ± 1%    5.47ms ± 0%  +0.76%  (p=0.000 n=20+20)
Gzip-12                      218ms ± 1%     218ms ± 1%    ~     (p=0.883 n=20+20)
Gunzip-12                   38.8ms ± 0%    38.9ms ± 0%    ~     (p=0.644 n=19+19)
HTTPClientServer-12         76.2µs ± 1%    76.4µs ± 2%    ~     (p=0.218 n=20+20)
JSONEncode-12               12.2ms ± 0%    12.3ms ± 1%  +0.45%  (p=0.000 n=19+19)
JSONDecode-12               54.2ms ± 1%    53.3ms ± 0%  -1.67%  (p=0.000 n=20+20)
Mandelbrot200-12            3.71ms ± 0%    3.71ms ± 0%    ~     (p=0.143 n=19+20)
GoParse-12                  3.22ms ± 0%    3.19ms ± 1%  -0.72%  (p=0.000 n=20+20)
RegexpMatchEasy0_32-12      76.7ns ± 1%    75.8ns ± 1%  -1.19%  (p=0.000 n=20+17)
RegexpMatchEasy0_1K-12       245ns ± 1%     243ns ± 0%  -0.72%  (p=0.000 n=18+17)
RegexpMatchEasy1_32-12      71.9ns ± 0%    71.7ns ± 1%  -0.39%  (p=0.006 n=12+18)
RegexpMatchEasy1_1K-12       358ns ± 1%     354ns ± 1%  -1.13%  (p=0.000 n=20+19)
RegexpMatchMedium_32-12      105ns ± 2%     105ns ± 1%  -0.63%  (p=0.007 n=19+20)
RegexpMatchMedium_1K-12     31.9µs ± 1%    31.9µs ± 1%    ~     (p=1.000 n=17+17)
RegexpMatchHard_32-12       1.51µs ± 1%    1.52µs ± 2%  +0.46%  (p=0.042 n=18+18)
RegexpMatchHard_1K-12       45.3µs ± 1%    45.5µs ± 2%  +0.44%  (p=0.029 n=18+19)
Revcomp-12                   388ms ± 1%     385ms ± 0%  -0.57%  (p=0.000 n=19+18)
Template-12                 63.0ms ± 1%    63.3ms ± 0%  +0.50%  (p=0.000 n=19+20)
TimeParse-12                 309ns ± 1%     307ns ± 0%  -0.62%  (p=0.000 n=20+20)
TimeFormat-12                328ns ± 0%     333ns ± 0%  +1.35%  (p=0.000 n=19+19)
[Geo mean]                  47.0µs         46.9µs       -0.20%
(https://perf.golang.org/search?q=upload:20180326.1)
For #10958.
For #24543.
Change-Id: Icbd52e711fdbe7938a1fea3e6baca1104b53ac3a
Reviewed-on: https://go-review.googlesource.com/102604
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2018-03-22 12:04:51 -04:00
										 |  |  | 		} else { | 
					
						
							|  |  |  | 			// bCond is unused in OFORUNTIL, so repurpose it. | 
					
						
							|  |  |  | 			bLateIncr := bCond | 
					
						
							|  |  |  | 			// test condition | 
					
						
							|  |  |  | 			s.condBranch(n.Left, bLateIncr, bEnd, 1) | 
					
						
							|  |  |  | 			// generate late increment | 
					
						
							|  |  |  | 			s.startBlock(bLateIncr) | 
					
						
							|  |  |  | 			s.stmtList(n.List) | 
					
						
							|  |  |  | 			s.endBlock().AddEdgeTo(bBody) | 
					
						
							| 
									
										
										
										
											2017-02-02 11:53:41 -05:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		s.startBlock(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 	case OSWITCH, OSELECT: | 
					
						
							|  |  |  | 		// These have been mostly rewritten by the front end into their Nbody fields. | 
					
						
							|  |  |  | 		// Our main task is to correctly hook up any break statements. | 
					
						
							|  |  |  | 		bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		prevBreak := s.breakTo | 
					
						
							|  |  |  | 		s.breakTo = bEnd | 
					
						
							|  |  |  | 		lab := s.labeledNodes[n] | 
					
						
							|  |  |  | 		if lab != nil { | 
					
						
							|  |  |  | 			// labeled | 
					
						
							|  |  |  | 			lab.breakTarget = bEnd | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// generate body code | 
					
						
							| 
									
										
										
										
											2016-10-04 09:49:33 -07:00
										 |  |  | 		s.stmtList(n.Nbody) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		s.breakTo = prevBreak | 
					
						
							|  |  |  | 		if lab != nil { | 
					
						
							|  |  |  | 			lab.breakTarget = nil | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-01 15:50:57 -08:00
										 |  |  | 		// walk adds explicit OBREAK nodes to the end of all reachable code paths. | 
					
						
							|  |  |  | 		// If we still have a current block here, then mark it unreachable. | 
					
						
							|  |  |  | 		if s.curBlock != nil { | 
					
						
							|  |  |  | 			m := s.mem() | 
					
						
							|  |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.Kind = ssa.BlockExit | 
					
						
							|  |  |  | 			b.SetControl(m) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:39:14 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		s.startBlock(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case OVARKILL: | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 		// Insert a varkill op to record that a variable is no longer live. | 
					
						
							|  |  |  | 		// We only care about liveness info at call sites, so putting the | 
					
						
							|  |  |  | 		// varkill in the store chain is enough to keep it correctly ordered | 
					
						
							|  |  |  | 		// with respect to call ops. | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		if !s.canSSA(n.Left) { | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | 			s.vars[&memVar] = s.newValue1Apos(ssa.OpVarKill, types.TypeMem, n.Left, s.mem(), false) | 
					
						
							| 
									
										
										
										
											2015-09-19 12:01:39 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-19 09:59:21 -08:00
										 |  |  | 	case OVARLIVE: | 
					
						
							|  |  |  | 		// Insert a varlive op to record that a variable is still live. | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 		if !n.Left.Addrtaken() { | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("VARLIVE variable %v must have Addrtaken set", n.Left) | 
					
						
							| 
									
										
										
										
											2016-01-19 09:59:21 -08:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-29 14:01:41 -04:00
										 |  |  | 		switch n.Left.Class() { | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		case PAUTO, PPARAM, PPARAMOUT: | 
					
						
							| 
									
										
										
										
											2017-03-29 14:01:41 -04:00
										 |  |  | 		default: | 
					
						
							|  |  |  | 			s.Fatalf("VARLIVE variable %v must be Auto or Arg", n.Left) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		s.vars[&memVar] = s.newValue1A(ssa.OpVarLive, types.TypeMem, n.Left, s.mem()) | 
					
						
							| 
									
										
										
										
											2016-01-19 09:59:21 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-12 14:06:44 -07:00
										 |  |  | 	case OCHECKNIL: | 
					
						
							|  |  |  | 		p := s.expr(n.Left) | 
					
						
							|  |  |  | 		s.nilCheck(p) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled stmt %v", n.Op) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | // exit processes any code that needs to be generated just before returning. | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | // It returns a BlockRet block that ends the control flow. Its control value | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | // will be set to the final memory state. | 
					
						
							|  |  |  | func (s *state) exit() *ssa.Block { | 
					
						
							| 
									
										
										
										
											2017-03-15 22:55:21 -07:00
										 |  |  | 	if s.hasdefer { | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 		s.rtcall(Deferreturn, true, nil) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// Run exit code. Typically, this code copies heap-allocated PPARAMOUT | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	// variables back to the stack. | 
					
						
							| 
									
										
										
										
											2018-03-08 02:20:26 -08:00
										 |  |  | 	s.stmtList(s.curfn.Func.Exit) | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Store SSAable PPARAMOUT variables back to stack locations. | 
					
						
							|  |  |  | 	for _, n := range s.returns { | 
					
						
							| 
									
										
										
										
											2016-04-21 19:28:28 -07:00
										 |  |  | 		addr := s.decladdrs[n] | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		val := s.variable(n, n.Type) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		s.vars[&memVar] = s.newValue1A(ssa.OpVarDef, types.TypeMem, n, s.mem()) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(n.Type, addr, val) | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		// TODO: if val is ever spilled, we'd like to use the | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 		// PPARAMOUT slot for spilling it. That won't happen | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		// currently. | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Do actual return. | 
					
						
							|  |  |  | 	m := s.mem() | 
					
						
							|  |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockRet | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(m) | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	return b | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | type opAndType struct { | 
					
						
							| 
									
										
										
										
											2015-11-16 13:20:16 -08:00
										 |  |  | 	op    Op | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	etype types.EType | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | var opToSSA = map[opAndType]ssa.Op{ | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 	opAndType{OADD, TINT8}:    ssa.OpAdd8, | 
					
						
							|  |  |  | 	opAndType{OADD, TUINT8}:   ssa.OpAdd8, | 
					
						
							|  |  |  | 	opAndType{OADD, TINT16}:   ssa.OpAdd16, | 
					
						
							|  |  |  | 	opAndType{OADD, TUINT16}:  ssa.OpAdd16, | 
					
						
							|  |  |  | 	opAndType{OADD, TINT32}:   ssa.OpAdd32, | 
					
						
							|  |  |  | 	opAndType{OADD, TUINT32}:  ssa.OpAdd32, | 
					
						
							|  |  |  | 	opAndType{OADD, TPTR32}:   ssa.OpAdd32, | 
					
						
							|  |  |  | 	opAndType{OADD, TINT64}:   ssa.OpAdd64, | 
					
						
							|  |  |  | 	opAndType{OADD, TUINT64}:  ssa.OpAdd64, | 
					
						
							|  |  |  | 	opAndType{OADD, TPTR64}:   ssa.OpAdd64, | 
					
						
							|  |  |  | 	opAndType{OADD, TFLOAT32}: ssa.OpAdd32F, | 
					
						
							|  |  |  | 	opAndType{OADD, TFLOAT64}: ssa.OpAdd64F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{OSUB, TINT8}:    ssa.OpSub8, | 
					
						
							|  |  |  | 	opAndType{OSUB, TUINT8}:   ssa.OpSub8, | 
					
						
							|  |  |  | 	opAndType{OSUB, TINT16}:   ssa.OpSub16, | 
					
						
							|  |  |  | 	opAndType{OSUB, TUINT16}:  ssa.OpSub16, | 
					
						
							|  |  |  | 	opAndType{OSUB, TINT32}:   ssa.OpSub32, | 
					
						
							|  |  |  | 	opAndType{OSUB, TUINT32}:  ssa.OpSub32, | 
					
						
							|  |  |  | 	opAndType{OSUB, TINT64}:   ssa.OpSub64, | 
					
						
							|  |  |  | 	opAndType{OSUB, TUINT64}:  ssa.OpSub64, | 
					
						
							|  |  |  | 	opAndType{OSUB, TFLOAT32}: ssa.OpSub32F, | 
					
						
							|  |  |  | 	opAndType{OSUB, TFLOAT64}: ssa.OpSub64F, | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-22 19:19:40 -07:00
										 |  |  | 	opAndType{ONOT, TBOOL}: ssa.OpNot, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	opAndType{OMINUS, TINT8}:    ssa.OpNeg8, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TUINT8}:   ssa.OpNeg8, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TINT16}:   ssa.OpNeg16, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TUINT16}:  ssa.OpNeg16, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TINT32}:   ssa.OpNeg32, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TUINT32}:  ssa.OpNeg32, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TINT64}:   ssa.OpNeg64, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TUINT64}:  ssa.OpNeg64, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TFLOAT32}: ssa.OpNeg32F, | 
					
						
							|  |  |  | 	opAndType{OMINUS, TFLOAT64}: ssa.OpNeg64F, | 
					
						
							| 
									
										
										
										
											2015-07-21 16:58:18 +02:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | 	opAndType{OCOM, TINT8}:   ssa.OpCom8, | 
					
						
							|  |  |  | 	opAndType{OCOM, TUINT8}:  ssa.OpCom8, | 
					
						
							|  |  |  | 	opAndType{OCOM, TINT16}:  ssa.OpCom16, | 
					
						
							|  |  |  | 	opAndType{OCOM, TUINT16}: ssa.OpCom16, | 
					
						
							|  |  |  | 	opAndType{OCOM, TINT32}:  ssa.OpCom32, | 
					
						
							|  |  |  | 	opAndType{OCOM, TUINT32}: ssa.OpCom32, | 
					
						
							|  |  |  | 	opAndType{OCOM, TINT64}:  ssa.OpCom64, | 
					
						
							|  |  |  | 	opAndType{OCOM, TUINT64}: ssa.OpCom64, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-06 19:24:59 -07:00
										 |  |  | 	opAndType{OIMAG, TCOMPLEX64}:  ssa.OpComplexImag, | 
					
						
							|  |  |  | 	opAndType{OIMAG, TCOMPLEX128}: ssa.OpComplexImag, | 
					
						
							|  |  |  | 	opAndType{OREAL, TCOMPLEX64}:  ssa.OpComplexReal, | 
					
						
							|  |  |  | 	opAndType{OREAL, TCOMPLEX128}: ssa.OpComplexReal, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 	opAndType{OMUL, TINT8}:    ssa.OpMul8, | 
					
						
							|  |  |  | 	opAndType{OMUL, TUINT8}:   ssa.OpMul8, | 
					
						
							|  |  |  | 	opAndType{OMUL, TINT16}:   ssa.OpMul16, | 
					
						
							|  |  |  | 	opAndType{OMUL, TUINT16}:  ssa.OpMul16, | 
					
						
							|  |  |  | 	opAndType{OMUL, TINT32}:   ssa.OpMul32, | 
					
						
							|  |  |  | 	opAndType{OMUL, TUINT32}:  ssa.OpMul32, | 
					
						
							|  |  |  | 	opAndType{OMUL, TINT64}:   ssa.OpMul64, | 
					
						
							|  |  |  | 	opAndType{OMUL, TUINT64}:  ssa.OpMul64, | 
					
						
							|  |  |  | 	opAndType{OMUL, TFLOAT32}: ssa.OpMul32F, | 
					
						
							|  |  |  | 	opAndType{OMUL, TFLOAT64}: ssa.OpMul64F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{ODIV, TFLOAT32}: ssa.OpDiv32F, | 
					
						
							|  |  |  | 	opAndType{ODIV, TFLOAT64}: ssa.OpDiv64F, | 
					
						
							| 
									
										
										
										
											2015-07-22 13:46:15 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-17 17:46:06 -05:00
										 |  |  | 	opAndType{ODIV, TINT8}:   ssa.OpDiv8, | 
					
						
							|  |  |  | 	opAndType{ODIV, TUINT8}:  ssa.OpDiv8u, | 
					
						
							|  |  |  | 	opAndType{ODIV, TINT16}:  ssa.OpDiv16, | 
					
						
							|  |  |  | 	opAndType{ODIV, TUINT16}: ssa.OpDiv16u, | 
					
						
							|  |  |  | 	opAndType{ODIV, TINT32}:  ssa.OpDiv32, | 
					
						
							|  |  |  | 	opAndType{ODIV, TUINT32}: ssa.OpDiv32u, | 
					
						
							|  |  |  | 	opAndType{ODIV, TINT64}:  ssa.OpDiv64, | 
					
						
							|  |  |  | 	opAndType{ODIV, TUINT64}: ssa.OpDiv64u, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-18 19:51:44 -05:00
										 |  |  | 	opAndType{OMOD, TINT8}:   ssa.OpMod8, | 
					
						
							|  |  |  | 	opAndType{OMOD, TUINT8}:  ssa.OpMod8u, | 
					
						
							|  |  |  | 	opAndType{OMOD, TINT16}:  ssa.OpMod16, | 
					
						
							|  |  |  | 	opAndType{OMOD, TUINT16}: ssa.OpMod16u, | 
					
						
							|  |  |  | 	opAndType{OMOD, TINT32}:  ssa.OpMod32, | 
					
						
							|  |  |  | 	opAndType{OMOD, TUINT32}: ssa.OpMod32u, | 
					
						
							|  |  |  | 	opAndType{OMOD, TINT64}:  ssa.OpMod64, | 
					
						
							|  |  |  | 	opAndType{OMOD, TUINT64}: ssa.OpMod64u, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-28 14:58:49 +02:00
										 |  |  | 	opAndType{OAND, TINT8}:   ssa.OpAnd8, | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	opAndType{OAND, TUINT8}:  ssa.OpAnd8, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:58:49 +02:00
										 |  |  | 	opAndType{OAND, TINT16}:  ssa.OpAnd16, | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	opAndType{OAND, TUINT16}: ssa.OpAnd16, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:58:49 +02:00
										 |  |  | 	opAndType{OAND, TINT32}:  ssa.OpAnd32, | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	opAndType{OAND, TUINT32}: ssa.OpAnd32, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:58:49 +02:00
										 |  |  | 	opAndType{OAND, TINT64}:  ssa.OpAnd64, | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	opAndType{OAND, TUINT64}: ssa.OpAnd64, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:58:49 +02:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-29 17:52:25 +02:00
										 |  |  | 	opAndType{OOR, TINT8}:   ssa.OpOr8, | 
					
						
							|  |  |  | 	opAndType{OOR, TUINT8}:  ssa.OpOr8, | 
					
						
							|  |  |  | 	opAndType{OOR, TINT16}:  ssa.OpOr16, | 
					
						
							|  |  |  | 	opAndType{OOR, TUINT16}: ssa.OpOr16, | 
					
						
							|  |  |  | 	opAndType{OOR, TINT32}:  ssa.OpOr32, | 
					
						
							|  |  |  | 	opAndType{OOR, TUINT32}: ssa.OpOr32, | 
					
						
							|  |  |  | 	opAndType{OOR, TINT64}:  ssa.OpOr64, | 
					
						
							|  |  |  | 	opAndType{OOR, TUINT64}: ssa.OpOr64, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-30 12:33:36 +02:00
										 |  |  | 	opAndType{OXOR, TINT8}:   ssa.OpXor8, | 
					
						
							|  |  |  | 	opAndType{OXOR, TUINT8}:  ssa.OpXor8, | 
					
						
							|  |  |  | 	opAndType{OXOR, TINT16}:  ssa.OpXor16, | 
					
						
							|  |  |  | 	opAndType{OXOR, TUINT16}: ssa.OpXor16, | 
					
						
							|  |  |  | 	opAndType{OXOR, TINT32}:  ssa.OpXor32, | 
					
						
							|  |  |  | 	opAndType{OXOR, TUINT32}: ssa.OpXor32, | 
					
						
							|  |  |  | 	opAndType{OXOR, TINT64}:  ssa.OpXor64, | 
					
						
							|  |  |  | 	opAndType{OXOR, TUINT64}: ssa.OpXor64, | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-24 21:21:07 +02:00
										 |  |  | 	opAndType{OEQ, TBOOL}:      ssa.OpEqB, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{OEQ, TINT8}:      ssa.OpEq8, | 
					
						
							|  |  |  | 	opAndType{OEQ, TUINT8}:     ssa.OpEq8, | 
					
						
							|  |  |  | 	opAndType{OEQ, TINT16}:     ssa.OpEq16, | 
					
						
							|  |  |  | 	opAndType{OEQ, TUINT16}:    ssa.OpEq16, | 
					
						
							|  |  |  | 	opAndType{OEQ, TINT32}:     ssa.OpEq32, | 
					
						
							|  |  |  | 	opAndType{OEQ, TUINT32}:    ssa.OpEq32, | 
					
						
							|  |  |  | 	opAndType{OEQ, TINT64}:     ssa.OpEq64, | 
					
						
							|  |  |  | 	opAndType{OEQ, TUINT64}:    ssa.OpEq64, | 
					
						
							| 
									
										
										
										
											2015-09-10 13:53:27 -07:00
										 |  |  | 	opAndType{OEQ, TINTER}:     ssa.OpEqInter, | 
					
						
							| 
									
										
										
										
											2016-04-18 14:02:08 -07:00
										 |  |  | 	opAndType{OEQ, TSLICE}:     ssa.OpEqSlice, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{OEQ, TFUNC}:      ssa.OpEqPtr, | 
					
						
							|  |  |  | 	opAndType{OEQ, TMAP}:       ssa.OpEqPtr, | 
					
						
							|  |  |  | 	opAndType{OEQ, TCHAN}:      ssa.OpEqPtr, | 
					
						
							| 
									
										
										
										
											2016-05-13 11:25:07 -04:00
										 |  |  | 	opAndType{OEQ, TPTR32}:     ssa.OpEqPtr, | 
					
						
							| 
									
										
										
										
											2015-08-30 20:47:26 -05:00
										 |  |  | 	opAndType{OEQ, TPTR64}:     ssa.OpEqPtr, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{OEQ, TUINTPTR}:   ssa.OpEqPtr, | 
					
						
							|  |  |  | 	opAndType{OEQ, TUNSAFEPTR}: ssa.OpEqPtr, | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 	opAndType{OEQ, TFLOAT64}:   ssa.OpEq64F, | 
					
						
							|  |  |  | 	opAndType{OEQ, TFLOAT32}:   ssa.OpEq32F, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-24 21:21:07 +02:00
										 |  |  | 	opAndType{ONE, TBOOL}:      ssa.OpNeqB, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{ONE, TINT8}:      ssa.OpNeq8, | 
					
						
							|  |  |  | 	opAndType{ONE, TUINT8}:     ssa.OpNeq8, | 
					
						
							|  |  |  | 	opAndType{ONE, TINT16}:     ssa.OpNeq16, | 
					
						
							|  |  |  | 	opAndType{ONE, TUINT16}:    ssa.OpNeq16, | 
					
						
							|  |  |  | 	opAndType{ONE, TINT32}:     ssa.OpNeq32, | 
					
						
							|  |  |  | 	opAndType{ONE, TUINT32}:    ssa.OpNeq32, | 
					
						
							|  |  |  | 	opAndType{ONE, TINT64}:     ssa.OpNeq64, | 
					
						
							|  |  |  | 	opAndType{ONE, TUINT64}:    ssa.OpNeq64, | 
					
						
							| 
									
										
										
										
											2015-09-10 13:53:27 -07:00
										 |  |  | 	opAndType{ONE, TINTER}:     ssa.OpNeqInter, | 
					
						
							| 
									
										
										
										
											2016-04-18 14:02:08 -07:00
										 |  |  | 	opAndType{ONE, TSLICE}:     ssa.OpNeqSlice, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{ONE, TFUNC}:      ssa.OpNeqPtr, | 
					
						
							|  |  |  | 	opAndType{ONE, TMAP}:       ssa.OpNeqPtr, | 
					
						
							|  |  |  | 	opAndType{ONE, TCHAN}:      ssa.OpNeqPtr, | 
					
						
							| 
									
										
										
										
											2016-05-13 11:25:07 -04:00
										 |  |  | 	opAndType{ONE, TPTR32}:     ssa.OpNeqPtr, | 
					
						
							| 
									
										
										
										
											2015-08-30 20:47:26 -05:00
										 |  |  | 	opAndType{ONE, TPTR64}:     ssa.OpNeqPtr, | 
					
						
							| 
									
										
										
										
											2015-07-28 14:14:25 -07:00
										 |  |  | 	opAndType{ONE, TUINTPTR}:   ssa.OpNeqPtr, | 
					
						
							|  |  |  | 	opAndType{ONE, TUNSAFEPTR}: ssa.OpNeqPtr, | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 	opAndType{ONE, TFLOAT64}:   ssa.OpNeq64F, | 
					
						
							|  |  |  | 	opAndType{ONE, TFLOAT32}:   ssa.OpNeq32F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{OLT, TINT8}:    ssa.OpLess8, | 
					
						
							|  |  |  | 	opAndType{OLT, TUINT8}:   ssa.OpLess8U, | 
					
						
							|  |  |  | 	opAndType{OLT, TINT16}:   ssa.OpLess16, | 
					
						
							|  |  |  | 	opAndType{OLT, TUINT16}:  ssa.OpLess16U, | 
					
						
							|  |  |  | 	opAndType{OLT, TINT32}:   ssa.OpLess32, | 
					
						
							|  |  |  | 	opAndType{OLT, TUINT32}:  ssa.OpLess32U, | 
					
						
							|  |  |  | 	opAndType{OLT, TINT64}:   ssa.OpLess64, | 
					
						
							|  |  |  | 	opAndType{OLT, TUINT64}:  ssa.OpLess64U, | 
					
						
							|  |  |  | 	opAndType{OLT, TFLOAT64}: ssa.OpLess64F, | 
					
						
							|  |  |  | 	opAndType{OLT, TFLOAT32}: ssa.OpLess32F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{OGT, TINT8}:    ssa.OpGreater8, | 
					
						
							|  |  |  | 	opAndType{OGT, TUINT8}:   ssa.OpGreater8U, | 
					
						
							|  |  |  | 	opAndType{OGT, TINT16}:   ssa.OpGreater16, | 
					
						
							|  |  |  | 	opAndType{OGT, TUINT16}:  ssa.OpGreater16U, | 
					
						
							|  |  |  | 	opAndType{OGT, TINT32}:   ssa.OpGreater32, | 
					
						
							|  |  |  | 	opAndType{OGT, TUINT32}:  ssa.OpGreater32U, | 
					
						
							|  |  |  | 	opAndType{OGT, TINT64}:   ssa.OpGreater64, | 
					
						
							|  |  |  | 	opAndType{OGT, TUINT64}:  ssa.OpGreater64U, | 
					
						
							|  |  |  | 	opAndType{OGT, TFLOAT64}: ssa.OpGreater64F, | 
					
						
							|  |  |  | 	opAndType{OGT, TFLOAT32}: ssa.OpGreater32F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{OLE, TINT8}:    ssa.OpLeq8, | 
					
						
							|  |  |  | 	opAndType{OLE, TUINT8}:   ssa.OpLeq8U, | 
					
						
							|  |  |  | 	opAndType{OLE, TINT16}:   ssa.OpLeq16, | 
					
						
							|  |  |  | 	opAndType{OLE, TUINT16}:  ssa.OpLeq16U, | 
					
						
							|  |  |  | 	opAndType{OLE, TINT32}:   ssa.OpLeq32, | 
					
						
							|  |  |  | 	opAndType{OLE, TUINT32}:  ssa.OpLeq32U, | 
					
						
							|  |  |  | 	opAndType{OLE, TINT64}:   ssa.OpLeq64, | 
					
						
							|  |  |  | 	opAndType{OLE, TUINT64}:  ssa.OpLeq64U, | 
					
						
							|  |  |  | 	opAndType{OLE, TFLOAT64}: ssa.OpLeq64F, | 
					
						
							|  |  |  | 	opAndType{OLE, TFLOAT32}: ssa.OpLeq32F, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndType{OGE, TINT8}:    ssa.OpGeq8, | 
					
						
							|  |  |  | 	opAndType{OGE, TUINT8}:   ssa.OpGeq8U, | 
					
						
							|  |  |  | 	opAndType{OGE, TINT16}:   ssa.OpGeq16, | 
					
						
							|  |  |  | 	opAndType{OGE, TUINT16}:  ssa.OpGeq16U, | 
					
						
							|  |  |  | 	opAndType{OGE, TINT32}:   ssa.OpGeq32, | 
					
						
							|  |  |  | 	opAndType{OGE, TUINT32}:  ssa.OpGeq32U, | 
					
						
							|  |  |  | 	opAndType{OGE, TINT64}:   ssa.OpGeq64, | 
					
						
							|  |  |  | 	opAndType{OGE, TUINT64}:  ssa.OpGeq64U, | 
					
						
							|  |  |  | 	opAndType{OGE, TFLOAT64}: ssa.OpGeq64F, | 
					
						
							|  |  |  | 	opAndType{OGE, TFLOAT32}: ssa.OpGeq32F, | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) concreteEtype(t *types.Type) types.EType { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	e := t.Etype | 
					
						
							|  |  |  | 	switch e { | 
					
						
							|  |  |  | 	default: | 
					
						
							|  |  |  | 		return e | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 	case TINT: | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 		if s.config.PtrSize == 8 { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 			return TINT64 | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		return TINT32 | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 	case TUINT: | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 		if s.config.PtrSize == 8 { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 			return TUINT64 | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return TUINT32 | 
					
						
							|  |  |  | 	case TUINTPTR: | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 		if s.config.PtrSize == 8 { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 			return TUINT64 | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		return TUINT32 | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) ssaOp(op Op, t *types.Type) ssa.Op { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	etype := s.concreteEtype(t) | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 	x, ok := opToSSA[opAndType{op, etype}] | 
					
						
							|  |  |  | 	if !ok { | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled binary op %v %s", op, etype) | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	return x | 
					
						
							| 
									
										
										
										
											2015-06-24 17:48:22 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func floatForComplex(t *types.Type) *types.Type { | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 	if t.Size() == 8 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return types.Types[TFLOAT32] | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	} else { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return types.Types[TFLOAT64] | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | type opAndTwoTypes struct { | 
					
						
							| 
									
										
										
										
											2015-11-16 13:20:16 -08:00
										 |  |  | 	op     Op | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	etype1 types.EType | 
					
						
							|  |  |  | 	etype2 types.EType | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | type twoTypes struct { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	etype1 types.EType | 
					
						
							|  |  |  | 	etype2 types.EType | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | type twoOpsAndType struct { | 
					
						
							|  |  |  | 	op1              ssa.Op | 
					
						
							|  |  |  | 	op2              ssa.Op | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	intermediateType types.EType | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | var fpConvOpToSSA = map[twoTypes]twoOpsAndType{ | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TINT8, TFLOAT32}:  twoOpsAndType{ssa.OpSignExt8to32, ssa.OpCvt32to32F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT16, TFLOAT32}: twoOpsAndType{ssa.OpSignExt16to32, ssa.OpCvt32to32F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT32, TFLOAT32}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt32to32F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT64, TFLOAT32}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt64to32F, TINT64}, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TINT8, TFLOAT64}:  twoOpsAndType{ssa.OpSignExt8to32, ssa.OpCvt32to64F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT16, TFLOAT64}: twoOpsAndType{ssa.OpSignExt16to32, ssa.OpCvt32to64F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT32, TFLOAT64}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt32to64F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TINT64, TFLOAT64}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt64to64F, TINT64}, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TINT8}:  twoOpsAndType{ssa.OpCvt32Fto32, ssa.OpTrunc32to8, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TINT16}: twoOpsAndType{ssa.OpCvt32Fto32, ssa.OpTrunc32to16, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TINT32}: twoOpsAndType{ssa.OpCvt32Fto32, ssa.OpCopy, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TINT64}: twoOpsAndType{ssa.OpCvt32Fto64, ssa.OpCopy, TINT64}, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TINT8}:  twoOpsAndType{ssa.OpCvt64Fto32, ssa.OpTrunc32to8, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TINT16}: twoOpsAndType{ssa.OpCvt64Fto32, ssa.OpTrunc32to16, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TINT32}: twoOpsAndType{ssa.OpCvt64Fto32, ssa.OpCopy, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TINT64}: twoOpsAndType{ssa.OpCvt64Fto64, ssa.OpCopy, TINT64}, | 
					
						
							|  |  |  | 	// unsigned | 
					
						
							|  |  |  | 	twoTypes{TUINT8, TFLOAT32}:  twoOpsAndType{ssa.OpZeroExt8to32, ssa.OpCvt32to32F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TUINT16, TFLOAT32}: twoOpsAndType{ssa.OpZeroExt16to32, ssa.OpCvt32to32F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TUINT32, TFLOAT32}: twoOpsAndType{ssa.OpZeroExt32to64, ssa.OpCvt64to32F, TINT64}, // go wide to dodge unsigned | 
					
						
							|  |  |  | 	twoTypes{TUINT64, TFLOAT32}: twoOpsAndType{ssa.OpCopy, ssa.OpInvalid, TUINT64},            // Cvt64Uto32F, branchy code expansion instead | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TUINT8, TFLOAT64}:  twoOpsAndType{ssa.OpZeroExt8to32, ssa.OpCvt32to64F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TUINT16, TFLOAT64}: twoOpsAndType{ssa.OpZeroExt16to32, ssa.OpCvt32to64F, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TUINT32, TFLOAT64}: twoOpsAndType{ssa.OpZeroExt32to64, ssa.OpCvt64to64F, TINT64}, // go wide to dodge unsigned | 
					
						
							|  |  |  | 	twoTypes{TUINT64, TFLOAT64}: twoOpsAndType{ssa.OpCopy, ssa.OpInvalid, TUINT64},            // Cvt64Uto64F, branchy code expansion instead | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT8}:  twoOpsAndType{ssa.OpCvt32Fto32, ssa.OpTrunc32to8, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT16}: twoOpsAndType{ssa.OpCvt32Fto32, ssa.OpTrunc32to16, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT32}: twoOpsAndType{ssa.OpCvt32Fto64, ssa.OpTrunc64to32, TINT64}, // go wide to dodge unsigned | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT64}: twoOpsAndType{ssa.OpInvalid, ssa.OpCopy, TUINT64},          // Cvt32Fto64U, branchy code expansion instead | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT8}:  twoOpsAndType{ssa.OpCvt64Fto32, ssa.OpTrunc32to8, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT16}: twoOpsAndType{ssa.OpCvt64Fto32, ssa.OpTrunc32to16, TINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT32}: twoOpsAndType{ssa.OpCvt64Fto64, ssa.OpTrunc64to32, TINT64}, // go wide to dodge unsigned | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT64}: twoOpsAndType{ssa.OpInvalid, ssa.OpCopy, TUINT64},          // Cvt64Fto64U, branchy code expansion instead | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// float | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TFLOAT32}: twoOpsAndType{ssa.OpCvt64Fto32F, ssa.OpCopy, TFLOAT32}, | 
					
						
							| 
									
										
										
										
											2017-02-12 22:12:12 -05:00
										 |  |  | 	twoTypes{TFLOAT64, TFLOAT64}: twoOpsAndType{ssa.OpRound64F, ssa.OpCopy, TFLOAT64}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TFLOAT32}: twoOpsAndType{ssa.OpRound32F, ssa.OpCopy, TFLOAT32}, | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 	twoTypes{TFLOAT32, TFLOAT64}: twoOpsAndType{ssa.OpCvt32Fto64F, ssa.OpCopy, TFLOAT64}, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-05-31 11:27:16 -04:00
										 |  |  | // this map is used only for 32-bit arch, and only includes the difference | 
					
						
							|  |  |  | // on 32-bit arch, don't use int64<->float conversion for uint32 | 
					
						
							|  |  |  | var fpConvOpToSSA32 = map[twoTypes]twoOpsAndType{ | 
					
						
							|  |  |  | 	twoTypes{TUINT32, TFLOAT32}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt32Uto32F, TUINT32}, | 
					
						
							|  |  |  | 	twoTypes{TUINT32, TFLOAT64}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt32Uto64F, TUINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT32}: twoOpsAndType{ssa.OpCvt32Fto32U, ssa.OpCopy, TUINT32}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT32}: twoOpsAndType{ssa.OpCvt64Fto32U, ssa.OpCopy, TUINT32}, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-08-16 14:17:33 -04:00
										 |  |  | // uint64<->float conversions, only on machines that have intructions for that | 
					
						
							|  |  |  | var uint64fpConvOpToSSA = map[twoTypes]twoOpsAndType{ | 
					
						
							|  |  |  | 	twoTypes{TUINT64, TFLOAT32}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt64Uto32F, TUINT64}, | 
					
						
							|  |  |  | 	twoTypes{TUINT64, TFLOAT64}: twoOpsAndType{ssa.OpCopy, ssa.OpCvt64Uto64F, TUINT64}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT32, TUINT64}: twoOpsAndType{ssa.OpCvt32Fto64U, ssa.OpCopy, TUINT64}, | 
					
						
							|  |  |  | 	twoTypes{TFLOAT64, TUINT64}: twoOpsAndType{ssa.OpCvt64Fto64U, ssa.OpCopy, TUINT64}, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | var shiftOpToSSA = map[opAndTwoTypes]ssa.Op{ | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT8, TUINT8}:   ssa.OpLsh8x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT8, TUINT8}:  ssa.OpLsh8x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT8, TUINT16}:  ssa.OpLsh8x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT8, TUINT16}: ssa.OpLsh8x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT8, TUINT32}:  ssa.OpLsh8x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT8, TUINT32}: ssa.OpLsh8x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT8, TUINT64}:  ssa.OpLsh8x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT8, TUINT64}: ssa.OpLsh8x64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT16, TUINT8}:   ssa.OpLsh16x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT16, TUINT8}:  ssa.OpLsh16x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT16, TUINT16}:  ssa.OpLsh16x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT16, TUINT16}: ssa.OpLsh16x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT16, TUINT32}:  ssa.OpLsh16x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT16, TUINT32}: ssa.OpLsh16x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT16, TUINT64}:  ssa.OpLsh16x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT16, TUINT64}: ssa.OpLsh16x64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT32, TUINT8}:   ssa.OpLsh32x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT32, TUINT8}:  ssa.OpLsh32x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT32, TUINT16}:  ssa.OpLsh32x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT32, TUINT16}: ssa.OpLsh32x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT32, TUINT32}:  ssa.OpLsh32x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT32, TUINT32}: ssa.OpLsh32x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT32, TUINT64}:  ssa.OpLsh32x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT32, TUINT64}: ssa.OpLsh32x64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT64, TUINT8}:   ssa.OpLsh64x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT64, TUINT8}:  ssa.OpLsh64x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT64, TUINT16}:  ssa.OpLsh64x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT64, TUINT16}: ssa.OpLsh64x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT64, TUINT32}:  ssa.OpLsh64x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT64, TUINT32}: ssa.OpLsh64x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TINT64, TUINT64}:  ssa.OpLsh64x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{OLSH, TUINT64, TUINT64}: ssa.OpLsh64x64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT8, TUINT8}:   ssa.OpRsh8x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT8, TUINT8}:  ssa.OpRsh8Ux8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT8, TUINT16}:  ssa.OpRsh8x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT8, TUINT16}: ssa.OpRsh8Ux16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT8, TUINT32}:  ssa.OpRsh8x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT8, TUINT32}: ssa.OpRsh8Ux32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT8, TUINT64}:  ssa.OpRsh8x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT8, TUINT64}: ssa.OpRsh8Ux64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT16, TUINT8}:   ssa.OpRsh16x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT16, TUINT8}:  ssa.OpRsh16Ux8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT16, TUINT16}:  ssa.OpRsh16x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT16, TUINT16}: ssa.OpRsh16Ux16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT16, TUINT32}:  ssa.OpRsh16x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT16, TUINT32}: ssa.OpRsh16Ux32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT16, TUINT64}:  ssa.OpRsh16x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT16, TUINT64}: ssa.OpRsh16Ux64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT32, TUINT8}:   ssa.OpRsh32x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT32, TUINT8}:  ssa.OpRsh32Ux8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT32, TUINT16}:  ssa.OpRsh32x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT32, TUINT16}: ssa.OpRsh32Ux16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT32, TUINT32}:  ssa.OpRsh32x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT32, TUINT32}: ssa.OpRsh32Ux32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT32, TUINT64}:  ssa.OpRsh32x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT32, TUINT64}: ssa.OpRsh32Ux64, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT64, TUINT8}:   ssa.OpRsh64x8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT64, TUINT8}:  ssa.OpRsh64Ux8, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT64, TUINT16}:  ssa.OpRsh64x16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT64, TUINT16}: ssa.OpRsh64Ux16, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT64, TUINT32}:  ssa.OpRsh64x32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT64, TUINT32}: ssa.OpRsh64Ux32, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TINT64, TUINT64}:  ssa.OpRsh64x64, | 
					
						
							|  |  |  | 	opAndTwoTypes{ORSH, TUINT64, TUINT64}: ssa.OpRsh64Ux64, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) ssaShiftOp(op Op, t *types.Type, u *types.Type) ssa.Op { | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | 	etype1 := s.concreteEtype(t) | 
					
						
							|  |  |  | 	etype2 := s.concreteEtype(u) | 
					
						
							|  |  |  | 	x, ok := shiftOpToSSA[opAndTwoTypes{op, etype1, etype2}] | 
					
						
							|  |  |  | 	if !ok { | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled shift op %v etype=%s/%s", op, etype1, etype2) | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	return x | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | // expr converts the expression n to ssa, adds it to s and returns the ssa result. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) expr(n *Node) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-04-26 15:22:33 -07:00
										 |  |  | 	if !(n.Op == ONAME || n.Op == OLITERAL && n.Sym != nil) { | 
					
						
							|  |  |  | 		// ONAMEs and named OLITERALs have the line number | 
					
						
							|  |  |  | 		// of the decl, not the use. See issue 14742. | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		s.pushLine(n.Pos) | 
					
						
							| 
									
										
										
										
											2016-04-26 15:22:33 -07:00
										 |  |  | 		defer s.popLine() | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-05-30 01:03:06 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-11 11:39:12 -07:00
										 |  |  | 	s.stmtList(n.Ninit) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	switch n.Op { | 
					
						
							| 
									
										
										
										
											2016-09-10 22:44:00 +02:00
										 |  |  | 	case OARRAYBYTESTRTMP: | 
					
						
							|  |  |  | 		slice := s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		ptr := s.newValue1(ssa.OpSlicePtr, s.f.Config.Types.BytePtr, slice) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len := s.newValue1(ssa.OpSliceLen, types.Types[TINT], slice) | 
					
						
							| 
									
										
										
										
											2016-09-10 22:44:00 +02:00
										 |  |  | 		return s.newValue2(ssa.OpStringMake, n.Type, ptr, len) | 
					
						
							| 
									
										
										
										
											2016-10-27 23:31:38 +02:00
										 |  |  | 	case OSTRARRAYBYTETMP: | 
					
						
							|  |  |  | 		str := s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		ptr := s.newValue1(ssa.OpStringPtr, s.f.Config.Types.BytePtr, str) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len := s.newValue1(ssa.OpStringLen, types.Types[TINT], str) | 
					
						
							| 
									
										
										
										
											2016-10-27 23:31:38 +02:00
										 |  |  | 		return s.newValue3(ssa.OpSliceMake, n.Type, ptr, len, len) | 
					
						
							| 
									
										
										
										
											2015-09-07 19:07:02 -05:00
										 |  |  | 	case OCFUNC: | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		aux := n.Left.Sym.Linksym() | 
					
						
							| 
									
										
										
										
											2015-09-07 19:07:02 -05:00
										 |  |  | 		return s.entryNewValue1A(ssa.OpAddr, n.Type, aux, s.sb) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case ONAME: | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if n.Class() == PFUNC { | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 			// "value" of a function is the address of the function's closure | 
					
						
							| 
									
										
										
										
											2017-04-21 07:51:41 -07:00
										 |  |  | 			sym := funcsym(n.Sym).Linksym() | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 			return s.entryNewValue1A(ssa.OpAddr, types.NewPtr(n.Type), sym, s.sb) | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		if s.canSSA(n) { | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 			return s.variable(n, n.Type) | 
					
						
							| 
									
										
										
										
											2015-05-12 15:16:52 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		addr := s.addr(n, false) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, addr) | 
					
						
							| 
									
										
										
										
											2015-09-11 16:40:05 -04:00
										 |  |  | 	case OCLOSUREVAR: | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		addr := s.addr(n, false) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, addr) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case OLITERAL: | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 		switch u := n.Val().U.(type) { | 
					
						
							|  |  |  | 		case *Mpint: | 
					
						
							|  |  |  | 			i := u.Int64() | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			switch n.Type.Size() { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 			case 1: | 
					
						
							|  |  |  | 				return s.constInt8(n.Type, int8(i)) | 
					
						
							|  |  |  | 			case 2: | 
					
						
							|  |  |  | 				return s.constInt16(n.Type, int16(i)) | 
					
						
							|  |  |  | 			case 4: | 
					
						
							|  |  |  | 				return s.constInt32(n.Type, int32(i)) | 
					
						
							|  |  |  | 			case 8: | 
					
						
							|  |  |  | 				return s.constInt64(n.Type, i) | 
					
						
							|  |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				s.Fatalf("bad integer size %d", n.Type.Size()) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 				return nil | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 		case string: | 
					
						
							|  |  |  | 			if u == "" { | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 				return s.constEmptyString(n.Type) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 			return s.entryNewValue0A(ssa.OpConstString, n.Type, u) | 
					
						
							|  |  |  | 		case bool: | 
					
						
							| 
									
										
										
										
											2016-04-26 15:22:33 -07:00
										 |  |  | 			return s.constBool(u) | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 		case *NilVal: | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			t := n.Type | 
					
						
							|  |  |  | 			switch { | 
					
						
							|  |  |  | 			case t.IsSlice(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 				return s.constSlice(t) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			case t.IsInterface(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 				return s.constInterface(t) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 				return s.constNil(t) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 		case *Mpflt: | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			switch n.Type.Size() { | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 			case 4: | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 				return s.constFloat32(n.Type, u.Float32()) | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 			case 8: | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 				return s.constFloat64(n.Type, u.Float64()) | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				s.Fatalf("bad float size %d", n.Type.Size()) | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 				return nil | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 		case *Mpcplx: | 
					
						
							|  |  |  | 			r := &u.Real | 
					
						
							|  |  |  | 			i := &u.Imag | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			switch n.Type.Size() { | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			case 8: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				pt := types.Types[TFLOAT32] | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 				return s.newValue2(ssa.OpComplexMake, n.Type, | 
					
						
							|  |  |  | 					s.constFloat32(pt, r.Float32()), | 
					
						
							|  |  |  | 					s.constFloat32(pt, i.Float32())) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			case 16: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				pt := types.Types[TFLOAT64] | 
					
						
							| 
									
										
										
										
											2016-04-22 12:27:29 -07:00
										 |  |  | 				return s.newValue2(ssa.OpComplexMake, n.Type, | 
					
						
							|  |  |  | 					s.constFloat64(pt, r.Float64()), | 
					
						
							|  |  |  | 					s.constFloat64(pt, i.Float64())) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				s.Fatalf("bad float size %d", n.Type.Size()) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 				return nil | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-08-12 16:38:11 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 			s.Fatalf("unhandled OLITERAL %v", n.Val().Ctype()) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 			return nil | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-06-12 16:24:33 -07:00
										 |  |  | 	case OCONVNOP: | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		to := n.Type | 
					
						
							|  |  |  | 		from := n.Left.Type | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// Assume everything will work out, so set up our return value. | 
					
						
							|  |  |  | 		// Anything interesting that happens from here is a fatal. | 
					
						
							| 
									
										
										
										
											2015-06-12 16:24:33 -07:00
										 |  |  | 		x := s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2015-10-19 11:36:07 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		// Special case for not confusing GC and liveness. | 
					
						
							|  |  |  | 		// We don't want pointers accidentally classified | 
					
						
							|  |  |  | 		// as not-pointers or vice-versa because of copy | 
					
						
							|  |  |  | 		// elision. | 
					
						
							| 
									
										
										
										
											2016-03-28 10:55:44 -07:00
										 |  |  | 		if to.IsPtrShaped() != from.IsPtrShaped() { | 
					
						
							| 
									
										
										
										
											2015-11-10 15:35:36 -08:00
										 |  |  | 			return s.newValue2(ssa.OpConvert, to, x, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-10-19 11:36:07 -04:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		v := s.newValue1(ssa.OpCopy, to, x) // ensure that v has the right type | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-07 19:07:02 -05:00
										 |  |  | 		// CONVNOP closure | 
					
						
							| 
									
										
										
										
											2016-03-28 10:55:44 -07:00
										 |  |  | 		if to.Etype == TFUNC && from.IsPtrShaped() { | 
					
						
							| 
									
										
										
										
											2015-09-07 19:07:02 -05:00
										 |  |  | 			return v | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		// named <--> unnamed type or typed <--> untyped const | 
					
						
							|  |  |  | 		if from.Etype == to.Etype { | 
					
						
							|  |  |  | 			return v | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-10-19 11:36:07 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		// unsafe.Pointer <--> *T | 
					
						
							| 
									
										
										
										
											2017-03-07 14:26:27 -08:00
										 |  |  | 		if to.Etype == TUNSAFEPTR && from.IsPtrShaped() || from.Etype == TUNSAFEPTR && to.IsPtrShaped() { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 			return v | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-02 18:46:59 +02:00
										 |  |  | 		// map <--> *hmap | 
					
						
							|  |  |  | 		if to.Etype == TMAP && from.IsPtr() && | 
					
						
							|  |  |  | 			to.MapType().Hmap == from.Elem() { | 
					
						
							|  |  |  | 			return v | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-04-27 16:37:25 -07:00
										 |  |  | 		dowidth(from) | 
					
						
							|  |  |  | 		dowidth(to) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		if from.Width != to.Width { | 
					
						
							|  |  |  | 			s.Fatalf("CONVNOP width mismatch %v (%d) -> %v (%d)\n", from, from.Width, to, to.Width) | 
					
						
							|  |  |  | 			return nil | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if etypesign(from.Etype) != etypesign(to.Etype) { | 
					
						
							| 
									
										
										
										
											2016-04-22 08:39:56 -07:00
										 |  |  | 			s.Fatalf("CONVNOP sign mismatch %v (%s) -> %v (%s)\n", from, from.Etype, to, to.Etype) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 			return nil | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 15:17:34 -08:00
										 |  |  | 		if instrumenting { | 
					
						
							| 
									
										
										
										
											2015-10-09 16:48:30 -04:00
										 |  |  | 			// These appear to be fine, but they fail the | 
					
						
							|  |  |  | 			// integer constraint below, so okay them here. | 
					
						
							|  |  |  | 			// Sample non-integer conversion: map[string]string -> *uint8 | 
					
						
							|  |  |  | 			return v | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		if etypesign(from.Etype) == 0 { | 
					
						
							|  |  |  | 			s.Fatalf("CONVNOP unrecognized non-integer %v -> %v\n", from, to) | 
					
						
							|  |  |  | 			return nil | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// integer, same width, same sign | 
					
						
							|  |  |  | 		return v | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-14 11:38:46 -07:00
										 |  |  | 	case OCONV: | 
					
						
							|  |  |  | 		x := s.expr(n.Left) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		ft := n.Left.Type // from type | 
					
						
							|  |  |  | 		tt := n.Type      // to type | 
					
						
							| 
									
										
										
										
											2017-01-21 19:52:09 -08:00
										 |  |  | 		if ft.IsBoolean() && tt.IsKind(TUINT8) { | 
					
						
							|  |  |  | 			// Bool -> uint8 is generated internally when indexing into runtime.staticbyte. | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCopy, n.Type, x) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		if ft.IsInteger() && tt.IsInteger() { | 
					
						
							|  |  |  | 			var op ssa.Op | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			if tt.Size() == ft.Size() { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 				op = ssa.OpCopy | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			} else if tt.Size() < ft.Size() { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				// truncation | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				switch 10*ft.Size() + tt.Size() { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				case 21: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc16to8 | 
					
						
							|  |  |  | 				case 41: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc32to8 | 
					
						
							|  |  |  | 				case 42: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc32to16 | 
					
						
							|  |  |  | 				case 81: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc64to8 | 
					
						
							|  |  |  | 				case 82: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc64to16 | 
					
						
							|  |  |  | 				case 84: | 
					
						
							|  |  |  | 					op = ssa.OpTrunc64to32 | 
					
						
							|  |  |  | 				default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 					s.Fatalf("weird integer truncation %v -> %v", ft, tt) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				} | 
					
						
							|  |  |  | 			} else if ft.IsSigned() { | 
					
						
							|  |  |  | 				// sign extension | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				switch 10*ft.Size() + tt.Size() { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				case 12: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt8to16 | 
					
						
							|  |  |  | 				case 14: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt8to32 | 
					
						
							|  |  |  | 				case 18: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt8to64 | 
					
						
							|  |  |  | 				case 24: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt16to32 | 
					
						
							|  |  |  | 				case 28: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt16to64 | 
					
						
							|  |  |  | 				case 48: | 
					
						
							|  |  |  | 					op = ssa.OpSignExt32to64 | 
					
						
							|  |  |  | 				default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 					s.Fatalf("bad integer sign extension %v -> %v", ft, tt) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				} | 
					
						
							|  |  |  | 			} else { | 
					
						
							|  |  |  | 				// zero extension | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				switch 10*ft.Size() + tt.Size() { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				case 12: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt8to16 | 
					
						
							|  |  |  | 				case 14: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt8to32 | 
					
						
							|  |  |  | 				case 18: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt8to64 | 
					
						
							|  |  |  | 				case 24: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt16to32 | 
					
						
							|  |  |  | 				case 28: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt16to64 | 
					
						
							|  |  |  | 				case 48: | 
					
						
							|  |  |  | 					op = ssa.OpZeroExt32to64 | 
					
						
							|  |  |  | 				default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 					s.Fatalf("weird integer sign extension %v -> %v", ft, tt) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			return s.newValue1(op, n.Type, x) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 		if ft.IsFloat() || tt.IsFloat() { | 
					
						
							|  |  |  | 			conv, ok := fpConvOpToSSA[twoTypes{s.concreteEtype(ft), s.concreteEtype(tt)}] | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			if s.config.RegSize == 4 && thearch.LinkArch.Family != sys.MIPS && !s.softFloat { | 
					
						
							| 
									
										
										
										
											2016-05-31 11:27:16 -04:00
										 |  |  | 				if conv1, ok1 := fpConvOpToSSA32[twoTypes{s.concreteEtype(ft), s.concreteEtype(tt)}]; ok1 { | 
					
						
							|  |  |  | 					conv = conv1 | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2018-03-29 00:55:53 +02:00
										 |  |  | 			if thearch.LinkArch.Family == sys.ARM64 || thearch.LinkArch.Family == sys.Wasm || s.softFloat { | 
					
						
							| 
									
										
										
										
											2016-08-16 14:17:33 -04:00
										 |  |  | 				if conv1, ok1 := uint64fpConvOpToSSA[twoTypes{s.concreteEtype(ft), s.concreteEtype(tt)}]; ok1 { | 
					
						
							|  |  |  | 					conv = conv1 | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			if thearch.LinkArch.Family == sys.MIPS && !s.softFloat { | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				if ft.Size() == 4 && ft.IsInteger() && !ft.IsSigned() { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 					// tt is float32 or float64, and ft is also unsigned | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 					if tt.Size() == 4 { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 						return s.uint32Tofloat32(n, x, ft, tt) | 
					
						
							|  |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 					if tt.Size() == 8 { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 						return s.uint32Tofloat64(n, x, ft, tt) | 
					
						
							|  |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				} else if tt.Size() == 4 && tt.IsInteger() && !tt.IsSigned() { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 					// ft is float32 or float64, and tt is unsigned integer | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 					if ft.Size() == 4 { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 						return s.float32ToUint32(n, x, ft, tt) | 
					
						
							|  |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 					if ft.Size() == 8 { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 						return s.float64ToUint32(n, x, ft, tt) | 
					
						
							|  |  |  | 					} | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 			if !ok { | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 				s.Fatalf("weird float conversion %v -> %v", ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 			op1, op2, it := conv.op1, conv.op2, conv.intermediateType | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			if op1 != ssa.OpInvalid && op2 != ssa.OpInvalid { | 
					
						
							|  |  |  | 				// normal case, not tripping over unsigned 64 | 
					
						
							|  |  |  | 				if op1 == ssa.OpCopy { | 
					
						
							|  |  |  | 					if op2 == ssa.OpCopy { | 
					
						
							|  |  |  | 						return x | 
					
						
							|  |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 					return s.newValueOrSfCall1(op2, n.Type, x) | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 				} | 
					
						
							|  |  |  | 				if op2 == ssa.OpCopy { | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 					return s.newValueOrSfCall1(op1, n.Type, x) | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				return s.newValueOrSfCall1(op2, n.Type, s.newValueOrSfCall1(op1, types.Types[it], x)) | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 			// Tricky 64-bit unsigned cases. | 
					
						
							|  |  |  | 			if ft.IsInteger() { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 				// tt is float32 or float64, and ft is also unsigned | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				if tt.Size() == 4 { | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 					return s.uint64Tofloat32(n, x, ft, tt) | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 				if tt.Size() == 8 { | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 					return s.uint64Tofloat64(n, x, ft, tt) | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 				s.Fatalf("weird unsigned integer to float conversion %v -> %v", ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 			// ft is float32 or float64, and tt is unsigned integer | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			if ft.Size() == 4 { | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 				return s.float32ToUint64(n, x, ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			if ft.Size() == 8 { | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 				return s.float64ToUint64(n, x, ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("weird float to unsigned integer conversion %v -> %v", ft, tt) | 
					
						
							| 
									
										
										
										
											2015-09-01 17:09:00 -04:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		if ft.IsComplex() && tt.IsComplex() { | 
					
						
							|  |  |  | 			var op ssa.Op | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			if ft.Size() == tt.Size() { | 
					
						
							|  |  |  | 				switch ft.Size() { | 
					
						
							| 
									
										
										
										
											2017-02-12 22:12:12 -05:00
										 |  |  | 				case 8: | 
					
						
							|  |  |  | 					op = ssa.OpRound32F | 
					
						
							|  |  |  | 				case 16: | 
					
						
							|  |  |  | 					op = ssa.OpRound64F | 
					
						
							|  |  |  | 				default: | 
					
						
							|  |  |  | 					s.Fatalf("weird complex conversion %v -> %v", ft, tt) | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			} else if ft.Size() == 8 && tt.Size() == 16 { | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 				op = ssa.OpCvt32Fto64F | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 			} else if ft.Size() == 16 && tt.Size() == 8 { | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 				op = ssa.OpCvt64Fto32F | 
					
						
							|  |  |  | 			} else { | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 				s.Fatalf("weird complex conversion %v -> %v", ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 			ftp := floatForComplex(ft) | 
					
						
							|  |  |  | 			ttp := floatForComplex(tt) | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpComplexMake, tt, | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				s.newValueOrSfCall1(op, ttp, s.newValue1(ssa.OpComplexReal, ftp, x)), | 
					
						
							|  |  |  | 				s.newValueOrSfCall1(op, ttp, s.newValue1(ssa.OpComplexImag, ftp, x))) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled OCONV %s -> %s", n.Left.Type.Etype, n.Type.Etype) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		return nil | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	case ODOTTYPE: | 
					
						
							|  |  |  | 		res, _ := s.dottype(n, false) | 
					
						
							|  |  |  | 		return res | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-24 17:48:22 -07:00
										 |  |  | 	// binary ops | 
					
						
							|  |  |  | 	case OLT, OEQ, ONE, OLE, OGE, OGT: | 
					
						
							| 
									
										
										
										
											2015-05-28 10:47:24 -07:00
										 |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:05:42 -07:00
										 |  |  | 		if n.Left.Type.IsComplex() { | 
					
						
							| 
									
										
										
										
											2015-09-10 14:59:00 -07:00
										 |  |  | 			pt := floatForComplex(n.Left.Type) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:05:42 -07:00
										 |  |  | 			op := s.ssaOp(OEQ, pt) | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			r := s.newValueOrSfCall2(op, types.Types[TBOOL], s.newValue1(ssa.OpComplexReal, pt, a), s.newValue1(ssa.OpComplexReal, pt, b)) | 
					
						
							|  |  |  | 			i := s.newValueOrSfCall2(op, types.Types[TBOOL], s.newValue1(ssa.OpComplexImag, pt, a), s.newValue1(ssa.OpComplexImag, pt, b)) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			c := s.newValue2(ssa.OpAndB, types.Types[TBOOL], r, i) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:05:42 -07:00
										 |  |  | 			switch n.Op { | 
					
						
							|  |  |  | 			case OEQ: | 
					
						
							|  |  |  | 				return c | 
					
						
							|  |  |  | 			case ONE: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				return s.newValue1(ssa.OpNot, types.Types[TBOOL], c) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:05:42 -07:00
										 |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 				s.Fatalf("ordered complex compare %v", n.Op) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:05:42 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 		if n.Left.Type.IsFloat() { | 
					
						
							|  |  |  | 			return s.newValueOrSfCall2(s.ssaOp(n.Op, n.Left.Type), types.Types[TBOOL], a, b) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return s.newValue2(s.ssaOp(n.Op, n.Left.Type), types.Types[TBOOL], a, b) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	case OMUL: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							|  |  |  | 		if n.Type.IsComplex() { | 
					
						
							|  |  |  | 			mulop := ssa.OpMul64F | 
					
						
							|  |  |  | 			addop := ssa.OpAdd64F | 
					
						
							|  |  |  | 			subop := ssa.OpSub64F | 
					
						
							|  |  |  | 			pt := floatForComplex(n.Type) // Could be Float32 or Float64 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			wt := types.Types[TFLOAT64]   // Compute in Float64 to minimize cancelation error | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			areal := s.newValue1(ssa.OpComplexReal, pt, a) | 
					
						
							|  |  |  | 			breal := s.newValue1(ssa.OpComplexReal, pt, b) | 
					
						
							|  |  |  | 			aimag := s.newValue1(ssa.OpComplexImag, pt, a) | 
					
						
							|  |  |  | 			bimag := s.newValue1(ssa.OpComplexImag, pt, b) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			if pt != wt { // Widen for calculation | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				areal = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, areal) | 
					
						
							|  |  |  | 				breal = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, breal) | 
					
						
							|  |  |  | 				aimag = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, aimag) | 
					
						
							|  |  |  | 				bimag = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, bimag) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			xreal := s.newValueOrSfCall2(subop, wt, s.newValueOrSfCall2(mulop, wt, areal, breal), s.newValueOrSfCall2(mulop, wt, aimag, bimag)) | 
					
						
							|  |  |  | 			ximag := s.newValueOrSfCall2(addop, wt, s.newValueOrSfCall2(mulop, wt, areal, bimag), s.newValueOrSfCall2(mulop, wt, aimag, breal)) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			if pt != wt { // Narrow to store back | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				xreal = s.newValueOrSfCall1(ssa.OpCvt64Fto32F, pt, xreal) | 
					
						
							|  |  |  | 				ximag = s.newValueOrSfCall1(ssa.OpCvt64Fto32F, pt, ximag) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpComplexMake, n.Type, xreal, ximag) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		if n.Type.IsFloat() { | 
					
						
							|  |  |  | 			return s.newValueOrSfCall2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		return s.newValue2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case ODIV: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							|  |  |  | 		if n.Type.IsComplex() { | 
					
						
							|  |  |  | 			// TODO this is not executed because the front-end substitutes a runtime call. | 
					
						
							|  |  |  | 			// That probably ought to change; with modest optimization the widen/narrow | 
					
						
							|  |  |  | 			// conversions could all be elided in larger expression trees. | 
					
						
							|  |  |  | 			mulop := ssa.OpMul64F | 
					
						
							|  |  |  | 			addop := ssa.OpAdd64F | 
					
						
							|  |  |  | 			subop := ssa.OpSub64F | 
					
						
							|  |  |  | 			divop := ssa.OpDiv64F | 
					
						
							|  |  |  | 			pt := floatForComplex(n.Type) // Could be Float32 or Float64 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			wt := types.Types[TFLOAT64]   // Compute in Float64 to minimize cancelation error | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			areal := s.newValue1(ssa.OpComplexReal, pt, a) | 
					
						
							|  |  |  | 			breal := s.newValue1(ssa.OpComplexReal, pt, b) | 
					
						
							|  |  |  | 			aimag := s.newValue1(ssa.OpComplexImag, pt, a) | 
					
						
							|  |  |  | 			bimag := s.newValue1(ssa.OpComplexImag, pt, b) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			if pt != wt { // Widen for calculation | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				areal = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, areal) | 
					
						
							|  |  |  | 				breal = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, breal) | 
					
						
							|  |  |  | 				aimag = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, aimag) | 
					
						
							|  |  |  | 				bimag = s.newValueOrSfCall1(ssa.OpCvt32Fto64F, wt, bimag) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			denom := s.newValueOrSfCall2(addop, wt, s.newValueOrSfCall2(mulop, wt, breal, breal), s.newValueOrSfCall2(mulop, wt, bimag, bimag)) | 
					
						
							|  |  |  | 			xreal := s.newValueOrSfCall2(addop, wt, s.newValueOrSfCall2(mulop, wt, areal, breal), s.newValueOrSfCall2(mulop, wt, aimag, bimag)) | 
					
						
							|  |  |  | 			ximag := s.newValueOrSfCall2(subop, wt, s.newValueOrSfCall2(mulop, wt, aimag, breal), s.newValueOrSfCall2(mulop, wt, areal, bimag)) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			// TODO not sure if this is best done in wide precision or narrow | 
					
						
							|  |  |  | 			// Double-rounding might be an issue. | 
					
						
							|  |  |  | 			// Note that the pre-SSA implementation does the entire calculation | 
					
						
							|  |  |  | 			// in wide format, so wide is compatible. | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			xreal = s.newValueOrSfCall2(divop, wt, xreal, denom) | 
					
						
							|  |  |  | 			ximag = s.newValueOrSfCall2(divop, wt, ximag, denom) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			if pt != wt { // Narrow to store back | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				xreal = s.newValueOrSfCall1(ssa.OpCvt64Fto32F, pt, xreal) | 
					
						
							|  |  |  | 				ximag = s.newValueOrSfCall1(ssa.OpCvt64Fto32F, pt, ximag) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 			} | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpComplexMake, n.Type, xreal, ximag) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-10-28 13:55:46 -04:00
										 |  |  | 		if n.Type.IsFloat() { | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 			return s.newValueOrSfCall2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							| 
									
										
										
										
											2015-10-28 13:55:46 -04:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-09-04 16:59:46 -07:00
										 |  |  | 		return s.intDivide(n, a, b) | 
					
						
							| 
									
										
										
										
											2015-10-28 13:55:46 -04:00
										 |  |  | 	case OMOD: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2016-09-04 16:59:46 -07:00
										 |  |  | 		return s.intDivide(n, a, b) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	case OADD, OSUB: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							|  |  |  | 		if n.Type.IsComplex() { | 
					
						
							|  |  |  | 			pt := floatForComplex(n.Type) | 
					
						
							|  |  |  | 			op := s.ssaOp(n.Op, pt) | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpComplexMake, n.Type, | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 				s.newValueOrSfCall2(op, pt, s.newValue1(ssa.OpComplexReal, pt, a), s.newValue1(ssa.OpComplexReal, pt, b)), | 
					
						
							|  |  |  | 				s.newValueOrSfCall2(op, pt, s.newValue1(ssa.OpComplexImag, pt, a), s.newValue1(ssa.OpComplexImag, pt, b))) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if n.Type.IsFloat() { | 
					
						
							|  |  |  | 			return s.newValueOrSfCall2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		return s.newValue2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							| 
									
										
										
										
											2017-03-03 02:43:44 -08:00
										 |  |  | 	case OAND, OOR, OXOR: | 
					
						
							| 
									
										
										
										
											2015-05-28 10:47:24 -07:00
										 |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2015-07-19 15:48:20 -07:00
										 |  |  | 		return s.newValue2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							| 
									
										
										
										
											2015-07-29 17:07:09 -07:00
										 |  |  | 	case OLSH, ORSH: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		b := s.expr(n.Right) | 
					
						
							|  |  |  | 		return s.newValue2(s.ssaShiftOp(n.Op, n.Type, n.Right.Type), a.Type, a, b) | 
					
						
							| 
									
										
										
										
											2015-07-10 12:58:53 -06:00
										 |  |  | 	case OANDAND, OOROR: | 
					
						
							|  |  |  | 		// To implement OANDAND (and OOROR), we introduce a | 
					
						
							|  |  |  | 		// new temporary variable to hold the result. The | 
					
						
							|  |  |  | 		// variable is associated with the OANDAND node in the | 
					
						
							|  |  |  | 		// s.vars table (normally variables are only | 
					
						
							|  |  |  | 		// associated with ONAME nodes). We convert | 
					
						
							|  |  |  | 		//     A && B | 
					
						
							|  |  |  | 		// to | 
					
						
							|  |  |  | 		//     var = A | 
					
						
							|  |  |  | 		//     if var { | 
					
						
							|  |  |  | 		//         var = B | 
					
						
							|  |  |  | 		//     } | 
					
						
							|  |  |  | 		// Using var in the subsequent block introduces the | 
					
						
							|  |  |  | 		// necessary phi variable. | 
					
						
							|  |  |  | 		el := s.expr(n.Left) | 
					
						
							|  |  |  | 		s.vars[n] = el | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		b := s.endBlock() | 
					
						
							|  |  |  | 		b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 		b.SetControl(el) | 
					
						
							| 
									
										
										
										
											2015-08-11 17:28:56 -07:00
										 |  |  | 		// In theory, we should set b.Likely here based on context. | 
					
						
							|  |  |  | 		// However, gc only gives us likeliness hints | 
					
						
							|  |  |  | 		// in a single place, for plain OIF statements, | 
					
						
							|  |  |  | 		// and passing around context is finnicky, so don't bother for now. | 
					
						
							| 
									
										
										
										
											2015-07-10 12:58:53 -06:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		bRight := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		bResult := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		if n.Op == OANDAND { | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 			b.AddEdgeTo(bRight) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bResult) | 
					
						
							| 
									
										
										
										
											2015-07-10 12:58:53 -06:00
										 |  |  | 		} else if n.Op == OOROR { | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 			b.AddEdgeTo(bResult) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bRight) | 
					
						
							| 
									
										
										
										
											2015-07-10 12:58:53 -06:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		s.startBlock(bRight) | 
					
						
							|  |  |  | 		er := s.expr(n.Right) | 
					
						
							|  |  |  | 		s.vars[n] = er | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		b = s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 		b.AddEdgeTo(bResult) | 
					
						
							| 
									
										
										
										
											2015-07-10 12:58:53 -06:00
										 |  |  | 
 | 
					
						
							|  |  |  | 		s.startBlock(bResult) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return s.variable(n, types.Types[TBOOL]) | 
					
						
							| 
									
										
										
										
											2015-09-12 14:14:02 -07:00
										 |  |  | 	case OCOMPLEX: | 
					
						
							|  |  |  | 		r := s.expr(n.Left) | 
					
						
							|  |  |  | 		i := s.expr(n.Right) | 
					
						
							|  |  |  | 		return s.newValue2(ssa.OpComplexMake, n.Type, r, i) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-05 19:28:00 -07:00
										 |  |  | 	// unary ops | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	case OMINUS: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		if n.Type.IsComplex() { | 
					
						
							|  |  |  | 			tp := floatForComplex(n.Type) | 
					
						
							|  |  |  | 			negop := s.ssaOp(n.Op, tp) | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpComplexMake, n.Type, | 
					
						
							|  |  |  | 				s.newValue1(negop, tp, s.newValue1(ssa.OpComplexReal, tp, a)), | 
					
						
							|  |  |  | 				s.newValue1(negop, tp, s.newValue1(ssa.OpComplexImag, tp, a))) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return s.newValue1(s.ssaOp(n.Op, n.Type), a.Type, a) | 
					
						
							| 
									
										
										
										
											2016-11-02 17:20:22 +01:00
										 |  |  | 	case ONOT, OCOM: | 
					
						
							| 
									
										
										
										
											2015-07-10 11:25:48 -06:00
										 |  |  | 		a := s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2015-07-21 16:58:18 +02:00
										 |  |  | 		return s.newValue1(s.ssaOp(n.Op, n.Type), a.Type, a) | 
					
						
							| 
									
										
										
										
											2015-09-10 11:37:09 -07:00
										 |  |  | 	case OIMAG, OREAL: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		return s.newValue1(s.ssaOp(n.Op, n.Left.Type), n.Type, a) | 
					
						
							| 
									
										
										
										
											2015-09-05 19:28:00 -07:00
										 |  |  | 	case OPLUS: | 
					
						
							|  |  |  | 		return s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2015-07-10 11:25:48 -06:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	case OADDR: | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 		return s.addr(n.Left, n.Bounded()) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-24 14:33:22 -07:00
										 |  |  | 	case OINDREGSP: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		addr := s.constOffPtrSP(types.NewPtr(n.Type), n.Xoffset) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, addr) | 
					
						
							| 
									
										
										
										
											2015-07-28 12:37:46 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case OIND: | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		p := s.exprPtr(n.Left, false, n.Pos) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, p) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-15 21:33:49 -07:00
										 |  |  | 	case ODOT: | 
					
						
							| 
									
										
										
										
											2017-02-08 15:31:24 -05:00
										 |  |  | 		if n.Left.Op == OSTRUCTLIT { | 
					
						
							|  |  |  | 			// All literals with nonzero fields have already been | 
					
						
							|  |  |  | 			// rewritten during walk. Any that remain are just T{} | 
					
						
							|  |  |  | 			// or equivalents. Use the zero value. | 
					
						
							| 
									
										
										
										
											2018-04-08 13:39:10 +01:00
										 |  |  | 			if !isZero(n.Left) { | 
					
						
							| 
									
										
										
										
											2017-02-08 15:31:24 -05:00
										 |  |  | 				Fatalf("literal with nonzero value in SSA: %v", n.Left) | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			return s.zeroVal(n.Type) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2018-04-25 12:36:36 -07:00
										 |  |  | 		// If n is addressable and can't be represented in | 
					
						
							|  |  |  | 		// SSA, then load just the selected field. This | 
					
						
							|  |  |  | 		// prevents false memory dependencies in race/msan | 
					
						
							|  |  |  | 		// instrumentation. | 
					
						
							|  |  |  | 		if islvalue(n) && !s.canSSA(n) { | 
					
						
							|  |  |  | 			p := s.addr(n, false) | 
					
						
							|  |  |  | 			return s.load(n.Type, p) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		v := s.expr(n.Left) | 
					
						
							|  |  |  | 		return s.newValue1I(ssa.OpStructSelect, n.Type, int64(fieldIdx(n)), v) | 
					
						
							| 
									
										
										
										
											2015-07-15 21:33:49 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	case ODOTPTR: | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		p := s.exprPtr(n.Left, false, n.Pos) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		p = s.newValue1I(ssa.OpOffPtr, types.NewPtr(n.Type), n.Xoffset, p) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, p) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	case OINDEX: | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 		switch { | 
					
						
							|  |  |  | 		case n.Left.Type.IsString(): | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 			if n.Bounded() && Isconst(n.Left, CTSTR) && Isconst(n.Right, CTINT) { | 
					
						
							| 
									
										
										
										
											2015-10-10 15:24:34 -04:00
										 |  |  | 				// Replace "abc"[1] with 'b'. | 
					
						
							|  |  |  | 				// Delayed until now because "abc"[1] is not an ideal constant. | 
					
						
							|  |  |  | 				// See test/fixedbugs/issue11370.go. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				return s.newValue0I(ssa.OpConst8, types.Types[TUINT8], int64(int8(n.Left.Val().U.(string)[n.Right.Int64()]))) | 
					
						
							| 
									
										
										
										
											2015-10-10 15:24:34 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 			a := s.expr(n.Left) | 
					
						
							|  |  |  | 			i := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2016-09-16 00:33:29 +10:00
										 |  |  | 			i = s.extendIndex(i, panicindex) | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 			if !n.Bounded() { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				len := s.newValue1(ssa.OpStringLen, types.Types[TINT], a) | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 				s.boundsCheck(i, len) | 
					
						
							| 
									
										
										
										
											2015-08-18 14:17:30 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 			ptrtyp := s.f.Config.Types.BytePtr | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 			ptr := s.newValue1(ssa.OpStringPtr, ptrtyp, a) | 
					
						
							| 
									
										
										
										
											2016-03-04 12:34:43 -08:00
										 |  |  | 			if Isconst(n.Right, CTINT) { | 
					
						
							| 
									
										
										
										
											2016-04-01 14:51:02 -07:00
										 |  |  | 				ptr = s.newValue1I(ssa.OpOffPtr, ptrtyp, n.Right.Int64(), ptr) | 
					
						
							| 
									
										
										
										
											2016-03-04 12:34:43 -08:00
										 |  |  | 			} else { | 
					
						
							|  |  |  | 				ptr = s.newValue2(ssa.OpAddPtr, ptrtyp, ptr, i) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			return s.load(types.Types[TUINT8], ptr) | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 		case n.Left.Type.IsSlice(): | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			p := s.addr(n, false) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			return s.load(n.Left.Type.Elem(), p) | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 		case n.Left.Type.IsArray(): | 
					
						
							| 
									
										
										
										
											2018-06-28 20:34:05 -04:00
										 |  |  | 			if canSSAType(n.Left.Type) { | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 				// SSA can handle arrays of length at most 1. | 
					
						
							| 
									
										
										
										
											2018-06-28 20:34:05 -04:00
										 |  |  | 				bound := n.Left.Type.NumElem() | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 				a := s.expr(n.Left) | 
					
						
							|  |  |  | 				i := s.expr(n.Right) | 
					
						
							|  |  |  | 				if bound == 0 { | 
					
						
							|  |  |  | 					// Bounds check will never succeed.  Might as well | 
					
						
							|  |  |  | 					// use constants for the bounds check. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 					z := s.constInt(types.Types[TINT], 0) | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 					s.boundsCheck(z, z) | 
					
						
							|  |  |  | 					// The return value won't be live, return junk. | 
					
						
							|  |  |  | 					return s.newValue0(ssa.OpUnknown, n.Type) | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				i = s.extendIndex(i, panicindex) | 
					
						
							| 
									
										
										
										
											2017-04-03 10:00:02 -07:00
										 |  |  | 				if !n.Bounded() { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 					s.boundsCheck(i, s.constInt(types.Types[TINT], bound)) | 
					
						
							| 
									
										
										
										
											2017-04-03 10:00:02 -07:00
										 |  |  | 				} | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 				return s.newValue1I(ssa.OpArraySelect, n.Type, 0, a) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			p := s.addr(n, false) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			return s.load(n.Left.Type.Elem(), p) | 
					
						
							| 
									
										
										
										
											2015-10-09 09:33:29 -07:00
										 |  |  | 		default: | 
					
						
							|  |  |  | 			s.Fatalf("bad type for index %v", n.Left.Type) | 
					
						
							|  |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 	case OLEN, OCAP: | 
					
						
							| 
									
										
										
										
											2015-07-03 18:41:28 -07:00
										 |  |  | 		switch { | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 		case n.Left.Type.IsSlice(): | 
					
						
							|  |  |  | 			op := ssa.OpSliceLen | 
					
						
							|  |  |  | 			if n.Op == OCAP { | 
					
						
							|  |  |  | 				op = ssa.OpSliceCap | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(op, types.Types[TINT], s.expr(n.Left)) | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 		case n.Left.Type.IsString(): // string; not reachable for OCAP | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpStringLen, types.Types[TINT], s.expr(n.Left)) | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 		case n.Left.Type.IsMap(), n.Left.Type.IsChan(): | 
					
						
							|  |  |  | 			return s.referenceTypeBuiltin(n, s.expr(n.Left)) | 
					
						
							| 
									
										
										
										
											2015-07-03 18:41:28 -07:00
										 |  |  | 		default: // array | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.constInt(types.Types[TINT], n.Left.Type.NumElem()) | 
					
						
							| 
									
										
										
										
											2015-07-03 18:41:28 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-12 10:12:14 -07:00
										 |  |  | 	case OSPTR: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		if n.Left.Type.IsSlice() { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpSlicePtr, n.Type, a) | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpStringPtr, n.Type, a) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-04 15:47:22 -07:00
										 |  |  | 	case OITAB: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		return s.newValue1(ssa.OpITab, n.Type, a) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-06-06 12:38:19 -07:00
										 |  |  | 	case OIDATA: | 
					
						
							|  |  |  | 		a := s.expr(n.Left) | 
					
						
							|  |  |  | 		return s.newValue1(ssa.OpIData, n.Type, a) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-05 19:28:27 -07:00
										 |  |  | 	case OEFACE: | 
					
						
							|  |  |  | 		tab := s.expr(n.Left) | 
					
						
							|  |  |  | 		data := s.expr(n.Right) | 
					
						
							|  |  |  | 		return s.newValue2(ssa.OpIMake, n.Type, tab, data) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 	case OSLICE, OSLICEARR, OSLICE3, OSLICE3ARR: | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		v := s.expr(n.Left) | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		var i, j, k *ssa.Value | 
					
						
							|  |  |  | 		low, high, max := n.SliceBounds() | 
					
						
							|  |  |  | 		if low != nil { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			i = s.extendIndex(s.expr(low), panicslice) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		if high != nil { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			j = s.extendIndex(s.expr(high), panicslice) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		if max != nil { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			k = s.extendIndex(s.expr(max), panicslice) | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		p, l, c := s.slice(n.Left.Type, v, i, j, k) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		return s.newValue3(ssa.OpSliceMake, n.Type, p, l, c) | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | 	case OSLICESTR: | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		v := s.expr(n.Left) | 
					
						
							|  |  |  | 		var i, j *ssa.Value | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		low, high, _ := n.SliceBounds() | 
					
						
							|  |  |  | 		if low != nil { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			i = s.extendIndex(s.expr(low), panicslice) | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-04-21 11:55:33 -07:00
										 |  |  | 		if high != nil { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			j = s.extendIndex(s.expr(high), panicslice) | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		p, l, _ := s.slice(n.Left.Type, v, i, j, nil) | 
					
						
							|  |  |  | 		return s.newValue2(ssa.OpStringMake, n.Type, p, l) | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 	case OCALLFUNC: | 
					
						
							| 
									
										
										
										
											2016-08-23 16:49:28 -07:00
										 |  |  | 		if isIntrinsicCall(n) { | 
					
						
							|  |  |  | 			return s.intrinsicCall(n) | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		fallthrough | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	case OCALLINTER, OCALLMETH: | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		a := s.call(n, callNormal) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, a) | 
					
						
							| 
									
										
										
										
											2015-08-12 11:22:16 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	case OGETG: | 
					
						
							| 
									
										
										
										
											2015-10-19 18:54:40 -07:00
										 |  |  | 		return s.newValue1(ssa.OpGetG, n.Type, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-08-12 11:22:16 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-11 11:02:57 -07:00
										 |  |  | 	case OAPPEND: | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 		return s.append(n, false) | 
					
						
							| 
									
										
										
										
											2015-09-11 11:02:57 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-01-16 12:37:11 -05:00
										 |  |  | 	case OSTRUCTLIT, OARRAYLIT: | 
					
						
							|  |  |  | 		// All literals with nonzero fields have already been | 
					
						
							|  |  |  | 		// rewritten during walk. Any that remain are just T{} | 
					
						
							|  |  |  | 		// or equivalents. Use the zero value. | 
					
						
							| 
									
										
										
										
											2018-04-08 13:39:10 +01:00
										 |  |  | 		if !isZero(n) { | 
					
						
							| 
									
										
										
										
											2017-01-16 12:37:11 -05:00
										 |  |  | 			Fatalf("literal with nonzero value in SSA: %v", n) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return s.zeroVal(n.Type) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled expr %v", n.Op) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							| 
									
										
										
										
											2015-09-11 11:02:57 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | // append converts an OAPPEND node to SSA. | 
					
						
							|  |  |  | // If inplace is false, it converts the OAPPEND expression n to an ssa.Value, | 
					
						
							|  |  |  | // adds it to s, and returns the Value. | 
					
						
							|  |  |  | // If inplace is true, it writes the result of the OAPPEND expression n | 
					
						
							|  |  |  | // back to the slice being appended to, and returns nil. | 
					
						
							|  |  |  | // inplace MUST be set to false if the slice can be SSA'd. | 
					
						
							|  |  |  | func (s *state) append(n *Node, inplace bool) *ssa.Value { | 
					
						
							|  |  |  | 	// If inplace is false, process as expression "append(s, e1, e2, e3)": | 
					
						
							|  |  |  | 	// | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid a spill in append fast path
Instead of spilling newlen, recalculate it.
This removes a spill from the fast path,
at the cost of a cheap recalculation
on the (rare) growth path.
This uses 8 bytes less of stack space.
It generates two more bytes of code,
but that is due to suboptimal register allocation;
see far below.
Runtime append microbenchmarks are all over the map,
presumably due to incidental code movement.
Sample code:
func s(b []byte) []byte {
	b = append(b, 1, 2, 3)
	return b
}
Before:
"".s t=1 size=160 args=0x30 locals=0x48
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $72-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	149
	0x0013 00019 (append.go:8)	SUBQ	$72, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+88(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	DX, "".autotmp_0+64(SP)
	0x0025 00037 (append.go:9)	MOVQ	"".b+96(FP), BX
	0x002a 00042 (append.go:9)	CMPQ	DX, BX
	0x002d 00045 (append.go:9)	JGT	$0, 86
	0x002f 00047 (append.go:8)	MOVQ	"".b+80(FP), AX
	0x0034 00052 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x003d 00061 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x0042 00066 (append.go:10)	MOVQ	AX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	DX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	MOVQ	BX, "".~r1+120(FP)
	0x0051 00081 (append.go:10)	ADDQ	$72, SP
	0x0055 00085 (append.go:10)	RET
	0x0056 00086 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x005d 00093 (append.go:9)	MOVQ	AX, (SP)
	0x0061 00097 (append.go:9)	MOVQ	"".b+80(FP), BP
	0x0066 00102 (append.go:9)	MOVQ	BP, 8(SP)
	0x006b 00107 (append.go:9)	MOVQ	CX, 16(SP)
	0x0070 00112 (append.go:9)	MOVQ	BX, 24(SP)
	0x0075 00117 (append.go:9)	MOVQ	DX, 32(SP)
	0x007a 00122 (append.go:9)	PCDATA	$0, $0
	0x007a 00122 (append.go:9)	CALL	runtime.growslice(SB)
	0x007f 00127 (append.go:9)	MOVQ	40(SP), AX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:8)	MOVQ	"".b+88(FP), CX
	0x008e 00142 (append.go:9)	MOVQ	"".autotmp_0+64(SP), DX
	0x0093 00147 (append.go:9)	JMP	52
	0x0095 00149 (append.go:9)	NOP
	0x0095 00149 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009a 00154 (append.go:8)	JMP	0
After:
"".s t=1 size=176 args=0x30 locals=0x40
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $64-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	151
	0x0013 00019 (append.go:8)	SUBQ	$64, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+80(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	"".b+88(FP), BX
	0x0025 00037 (append.go:9)	CMPQ	DX, BX
	0x0028 00040 (append.go:9)	JGT	$0, 81
	0x002a 00042 (append.go:8)	MOVQ	"".b+72(FP), AX
	0x002f 00047 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0033 00051 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x003d 00061 (append.go:10)	MOVQ	AX, "".~r1+96(FP)
	0x0042 00066 (append.go:10)	MOVQ	DX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	BX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	ADDQ	$64, SP
	0x0050 00080 (append.go:10)	RET
	0x0051 00081 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x0058 00088 (append.go:9)	MOVQ	AX, (SP)
	0x005c 00092 (append.go:9)	MOVQ	"".b+72(FP), BP
	0x0061 00097 (append.go:9)	MOVQ	BP, 8(SP)
	0x0066 00102 (append.go:9)	MOVQ	CX, 16(SP)
	0x006b 00107 (append.go:9)	MOVQ	BX, 24(SP)
	0x0070 00112 (append.go:9)	MOVQ	DX, 32(SP)
	0x0075 00117 (append.go:9)	PCDATA	$0, $0
	0x0075 00117 (append.go:9)	CALL	runtime.growslice(SB)
	0x007a 00122 (append.go:9)	MOVQ	40(SP), AX
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
	0x0095 00149 (append.go:9)	JMP	47
	0x0097 00151 (append.go:9)	NOP
	0x0097 00151 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009c 00156 (append.go:8)	JMP	0
Observe that in the following sequence,
we should use DX directly instead of using
CX as a temporary register, which would make
the new code a strict improvement on the old:
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
Change-Id: I4ee50b18fa53865901d2d7f86c2cbb54c6fa6924
Reviewed-on: https://go-review.googlesource.com/21812
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:08:00 -07:00
										 |  |  | 	// ptr, len, cap := s | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	// newlen := len + 3 | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	// if newlen > cap { | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid a spill in append fast path
Instead of spilling newlen, recalculate it.
This removes a spill from the fast path,
at the cost of a cheap recalculation
on the (rare) growth path.
This uses 8 bytes less of stack space.
It generates two more bytes of code,
but that is due to suboptimal register allocation;
see far below.
Runtime append microbenchmarks are all over the map,
presumably due to incidental code movement.
Sample code:
func s(b []byte) []byte {
	b = append(b, 1, 2, 3)
	return b
}
Before:
"".s t=1 size=160 args=0x30 locals=0x48
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $72-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	149
	0x0013 00019 (append.go:8)	SUBQ	$72, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+88(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	DX, "".autotmp_0+64(SP)
	0x0025 00037 (append.go:9)	MOVQ	"".b+96(FP), BX
	0x002a 00042 (append.go:9)	CMPQ	DX, BX
	0x002d 00045 (append.go:9)	JGT	$0, 86
	0x002f 00047 (append.go:8)	MOVQ	"".b+80(FP), AX
	0x0034 00052 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x003d 00061 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x0042 00066 (append.go:10)	MOVQ	AX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	DX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	MOVQ	BX, "".~r1+120(FP)
	0x0051 00081 (append.go:10)	ADDQ	$72, SP
	0x0055 00085 (append.go:10)	RET
	0x0056 00086 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x005d 00093 (append.go:9)	MOVQ	AX, (SP)
	0x0061 00097 (append.go:9)	MOVQ	"".b+80(FP), BP
	0x0066 00102 (append.go:9)	MOVQ	BP, 8(SP)
	0x006b 00107 (append.go:9)	MOVQ	CX, 16(SP)
	0x0070 00112 (append.go:9)	MOVQ	BX, 24(SP)
	0x0075 00117 (append.go:9)	MOVQ	DX, 32(SP)
	0x007a 00122 (append.go:9)	PCDATA	$0, $0
	0x007a 00122 (append.go:9)	CALL	runtime.growslice(SB)
	0x007f 00127 (append.go:9)	MOVQ	40(SP), AX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:8)	MOVQ	"".b+88(FP), CX
	0x008e 00142 (append.go:9)	MOVQ	"".autotmp_0+64(SP), DX
	0x0093 00147 (append.go:9)	JMP	52
	0x0095 00149 (append.go:9)	NOP
	0x0095 00149 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009a 00154 (append.go:8)	JMP	0
After:
"".s t=1 size=176 args=0x30 locals=0x40
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $64-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	151
	0x0013 00019 (append.go:8)	SUBQ	$64, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+80(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	"".b+88(FP), BX
	0x0025 00037 (append.go:9)	CMPQ	DX, BX
	0x0028 00040 (append.go:9)	JGT	$0, 81
	0x002a 00042 (append.go:8)	MOVQ	"".b+72(FP), AX
	0x002f 00047 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0033 00051 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x003d 00061 (append.go:10)	MOVQ	AX, "".~r1+96(FP)
	0x0042 00066 (append.go:10)	MOVQ	DX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	BX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	ADDQ	$64, SP
	0x0050 00080 (append.go:10)	RET
	0x0051 00081 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x0058 00088 (append.go:9)	MOVQ	AX, (SP)
	0x005c 00092 (append.go:9)	MOVQ	"".b+72(FP), BP
	0x0061 00097 (append.go:9)	MOVQ	BP, 8(SP)
	0x0066 00102 (append.go:9)	MOVQ	CX, 16(SP)
	0x006b 00107 (append.go:9)	MOVQ	BX, 24(SP)
	0x0070 00112 (append.go:9)	MOVQ	DX, 32(SP)
	0x0075 00117 (append.go:9)	PCDATA	$0, $0
	0x0075 00117 (append.go:9)	CALL	runtime.growslice(SB)
	0x007a 00122 (append.go:9)	MOVQ	40(SP), AX
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
	0x0095 00149 (append.go:9)	JMP	47
	0x0097 00151 (append.go:9)	NOP
	0x0097 00151 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009c 00156 (append.go:8)	JMP	0
Observe that in the following sequence,
we should use DX directly instead of using
CX as a temporary register, which would make
the new code a strict improvement on the old:
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
Change-Id: I4ee50b18fa53865901d2d7f86c2cbb54c6fa6924
Reviewed-on: https://go-review.googlesource.com/21812
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:08:00 -07:00
										 |  |  | 	//     ptr, len, cap = growslice(s, newlen) | 
					
						
							|  |  |  | 	//     newlen = len + 3 // recalculate to avoid a spill | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	// } | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	// // with write barriers, if needed: | 
					
						
							|  |  |  | 	// *(ptr+len) = e1 | 
					
						
							|  |  |  | 	// *(ptr+len+1) = e2 | 
					
						
							|  |  |  | 	// *(ptr+len+2) = e3 | 
					
						
							|  |  |  | 	// return makeslice(ptr, newlen, cap) | 
					
						
							|  |  |  | 	// | 
					
						
							|  |  |  | 	// | 
					
						
							|  |  |  | 	// If inplace is true, process as statement "s = append(s, e1, e2, e3)": | 
					
						
							|  |  |  | 	// | 
					
						
							|  |  |  | 	// a := &s | 
					
						
							|  |  |  | 	// ptr, len, cap := s | 
					
						
							|  |  |  | 	// newlen := len + 3 | 
					
						
							|  |  |  | 	// if newlen > cap { | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 	//    newptr, len, newcap = growslice(ptr, len, cap, newlen) | 
					
						
							|  |  |  | 	//    vardef(a)       // if necessary, advise liveness we are writing a new a | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	//    *a.cap = newcap // write before ptr to avoid a spill | 
					
						
							|  |  |  | 	//    *a.ptr = newptr // with write barrier | 
					
						
							|  |  |  | 	// } | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 	// newlen = len + 3 // recalculate to avoid a spill | 
					
						
							|  |  |  | 	// *a.len = newlen | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	// // with write barriers, if needed: | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	// *(ptr+len) = e1 | 
					
						
							|  |  |  | 	// *(ptr+len+1) = e2 | 
					
						
							|  |  |  | 	// *(ptr+len+2) = e3 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	et := n.Type.Elem() | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	pt := types.NewPtr(et) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Evaluate slice | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	sn := n.List.First() // the slice node is the first in the list | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	var slice, addr *ssa.Value | 
					
						
							|  |  |  | 	if inplace { | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		addr = s.addr(sn, false) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		slice = s.load(n.Type, addr) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	} else { | 
					
						
							|  |  |  | 		slice = s.expr(sn) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Allocate new blocks | 
					
						
							|  |  |  | 	grow := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	assign := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Decide if we need to grow | 
					
						
							|  |  |  | 	nargs := int64(n.List.Len() - 1) | 
					
						
							|  |  |  | 	p := s.newValue1(ssa.OpSlicePtr, pt, slice) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	l := s.newValue1(ssa.OpSliceLen, types.Types[TINT], slice) | 
					
						
							|  |  |  | 	c := s.newValue1(ssa.OpSliceCap, types.Types[TINT], slice) | 
					
						
							|  |  |  | 	nl := s.newValue2(s.ssaOp(OADD, types.Types[TINT]), types.Types[TINT], l, s.constInt(types.Types[TINT], nargs)) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(s.ssaOp(OGT, types.Types[TINT]), types.Types[TBOOL], nl, c) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	s.vars[&ptrVar] = p | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	if !inplace { | 
					
						
							|  |  |  | 		s.vars[&newlenVar] = nl | 
					
						
							|  |  |  | 		s.vars[&capVar] = c | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 	} else { | 
					
						
							|  |  |  | 		s.vars[&lenVar] = l | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 	b.Likely = ssa.BranchUnlikely | 
					
						
							|  |  |  | 	b.SetControl(cmp) | 
					
						
							|  |  |  | 	b.AddEdgeTo(grow) | 
					
						
							|  |  |  | 	b.AddEdgeTo(assign) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Call growslice | 
					
						
							|  |  |  | 	s.startBlock(grow) | 
					
						
							| 
									
										
										
										
											2017-03-27 14:48:24 -07:00
										 |  |  | 	taddr := s.expr(n.Left) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	r := s.rtcall(growslice, true, []*types.Type{pt, types.Types[TINT], types.Types[TINT]}, taddr, p, l, c, nl) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	if inplace { | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		if sn.Op == ONAME && sn.Class() != PEXTERN { | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 			// Tell liveness we're about to build a new slice | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue1A(ssa.OpVarDef, types.TypeMem, sn, s.mem()) | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		capaddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.IntPtr, int64(array_cap), addr) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TINT], capaddr, r[2]) | 
					
						
							|  |  |  | 		s.store(pt, addr, r[0]) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 		// load the value we just stored to avoid having to spill it | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.vars[&ptrVar] = s.load(pt, addr) | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 		s.vars[&lenVar] = r[1] // avoid a spill in the fast path | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	} else { | 
					
						
							|  |  |  | 		s.vars[&ptrVar] = r[0] | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		s.vars[&newlenVar] = s.newValue2(s.ssaOp(OADD, types.Types[TINT]), types.Types[TINT], r[1], s.constInt(types.Types[TINT], nargs)) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 		s.vars[&capVar] = r[2] | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	b = s.endBlock() | 
					
						
							|  |  |  | 	b.AddEdgeTo(assign) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// assign new elements to slots | 
					
						
							|  |  |  | 	s.startBlock(assign) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 	if inplace { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		l = s.variable(&lenVar, types.Types[TINT]) // generates phi for len | 
					
						
							|  |  |  | 		nl = s.newValue2(s.ssaOp(OADD, types.Types[TINT]), types.Types[TINT], l, s.constInt(types.Types[TINT], nargs)) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		lenaddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.IntPtr, int64(array_nel), addr) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TINT], lenaddr, nl) | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	// Evaluate args | 
					
						
							| 
									
										
										
										
											2016-05-27 14:07:37 -07:00
										 |  |  | 	type argRec struct { | 
					
						
							|  |  |  | 		// if store is true, we're appending the value v.  If false, we're appending the | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		// value at *v. | 
					
						
							|  |  |  | 		v     *ssa.Value | 
					
						
							|  |  |  | 		store bool | 
					
						
							| 
									
										
										
										
											2016-05-27 14:07:37 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	args := make([]argRec, 0, nargs) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	for _, n := range n.List.Slice()[1:] { | 
					
						
							|  |  |  | 		if canSSAType(n.Type) { | 
					
						
							| 
									
										
										
										
											2016-05-27 14:07:37 -07:00
										 |  |  | 			args = append(args, argRec{v: s.expr(n), store: true}) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 		} else { | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			v := s.addr(n, false) | 
					
						
							|  |  |  | 			args = append(args, argRec{v: v}) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	p = s.variable(&ptrVar, pt) // generates phi for ptr | 
					
						
							|  |  |  | 	if !inplace { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		nl = s.variable(&newlenVar, types.Types[TINT]) // generates phi for nl | 
					
						
							|  |  |  | 		c = s.variable(&capVar, types.Types[TINT])     // generates phi for cap | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	p2 := s.newValue2(ssa.OpPtrIndex, pt, p, l) | 
					
						
							|  |  |  | 	for i, arg := range args { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		addr := s.newValue2(ssa.OpPtrIndex, pt, p2, s.constInt(types.Types[TINT], int64(i))) | 
					
						
							| 
									
										
										
										
											2016-05-27 14:07:37 -07:00
										 |  |  | 		if arg.store { | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | 			s.storeType(et, addr, arg.v, 0, true) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 		} else { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.move(et, addr, arg.v) | 
					
						
							| 
									
										
										
										
											2015-09-11 11:02:57 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	delete(s.vars, &ptrVar) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	if inplace { | 
					
						
							| 
									
										
											  
											
												cmd/compile: re-enable in-place append optimization
CL 21891 was too clever in its attempts to avoid spills.
Storing newlen too early caused uses of append in the runtime
itself to receive an inconsistent view of a slice,
leading to corruption.
This CL makes the generate code much more similar to
the old backend. It spills more than before,
but those spills have been contained to the grow path.
It recalculates newlen unnecessarily on the fast path,
but that's measurably cheaper than spilling it.
CL 21891 caused runtime failures in 6 of 2000 runs
of net/http and crypto/x509 in my test setup.
This CL has gone 6000 runs without a failure.
Benchmarks going from master to this CL:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   439ns ± 2%   436ns ± 2%  -0.72%  (p=0.001 n=28+27)
AppendInPlace/NoGrow/1Ptr-8   901ns ± 0%   856ns ± 0%  -4.95%  (p=0.000 n=26+29)
AppendInPlace/NoGrow/2Ptr-8  2.15µs ± 1%  1.95µs ± 0%  -9.07%  (p=0.000 n=28+30)
AppendInPlace/NoGrow/3Ptr-8  2.66µs ± 0%  2.45µs ± 0%  -7.93%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/4Ptr-8  3.24µs ± 1%  3.02µs ± 1%  -6.75%  (p=0.000 n=28+30)
AppendInPlace/Grow/Byte-8     269ns ± 1%   271ns ± 1%  +0.84%  (p=0.000 n=30+29)
AppendInPlace/Grow/1Ptr-8     275ns ± 1%   280ns ± 1%  +1.75%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     384ns ± 0%   391ns ± 0%  +1.94%  (p=0.000 n=27+30)
AppendInPlace/Grow/3Ptr-8     455ns ± 0%   462ns ± 0%  +1.43%  (p=0.000 n=29+29)
AppendInPlace/Grow/4Ptr-8     478ns ± 0%   479ns ± 0%  +0.23%  (p=0.000 n=30+27)
However, for the large no-grow cases, there is still more work to be done.
Going from this CL to the non-SSA backend:
name                         old time/op  new time/op  delta
AppendInPlace/NoGrow/Byte-8   436ns ± 2%   436ns ± 2%     ~     (p=0.967 n=27+29)
AppendInPlace/NoGrow/1Ptr-8   856ns ± 0%   884ns ± 0%   +3.28%  (p=0.000 n=29+26)
AppendInPlace/NoGrow/2Ptr-8  1.95µs ± 0%  1.56µs ± 0%  -20.28%  (p=0.000 n=30+29)
AppendInPlace/NoGrow/3Ptr-8  2.45µs ± 0%  1.89µs ± 0%  -22.88%  (p=0.000 n=26+28)
AppendInPlace/NoGrow/4Ptr-8  3.02µs ± 1%  2.56µs ± 1%  -15.35%  (p=0.000 n=30+28)
AppendInPlace/Grow/Byte-8     271ns ± 1%   283ns ± 1%   +4.56%  (p=0.000 n=29+29)
AppendInPlace/Grow/1Ptr-8     280ns ± 1%   288ns ± 1%   +2.99%  (p=0.000 n=30+30)
AppendInPlace/Grow/2Ptr-8     391ns ± 0%   409ns ± 0%   +4.66%  (p=0.000 n=30+29)
AppendInPlace/Grow/3Ptr-8     462ns ± 0%   481ns ± 0%   +4.13%  (p=0.000 n=29+30)
AppendInPlace/Grow/4Ptr-8     479ns ± 0%   502ns ± 0%   +4.81%  (p=0.000 n=27+26)
New generated code:
var x []byte
func a() {
	x = append(x, 1)
}
"".a t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (a.go:5)	TEXT	"".a(SB), $72-0
	0x0000 00000 (a.go:5)	MOVQ	(TLS), CX
	0x0009 00009 (a.go:5)	CMPQ	SP, 16(CX)
	0x000d 00013 (a.go:5)	JLS	190
	0x0013 00019 (a.go:5)	SUBQ	$72, SP
	0x0017 00023 (a.go:5)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:5)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (a.go:6)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (a.go:6)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (a.go:6)	MOVQ	"".x(SB), BX
	0x002c 00044 (a.go:6)	LEAQ	1(DX), BP
	0x0030 00048 (a.go:6)	CMPQ	BP, CX
	0x0033 00051 (a.go:6)	JGT	$0, 73
	0x0035 00053 (a.go:6)	LEAQ	1(DX), AX
	0x0039 00057 (a.go:6)	MOVQ	AX, "".x+8(SB)
	0x0040 00064 (a.go:6)	MOVB	$1, (BX)(DX*1)
	0x0044 00068 (a.go:7)	ADDQ	$72, SP
	0x0048 00072 (a.go:7)	RET
	0x0049 00073 (a.go:6)	LEAQ	type.[]uint8(SB), AX
	0x0050 00080 (a.go:6)	MOVQ	AX, (SP)
	0x0054 00084 (a.go:6)	MOVQ	BX, 8(SP)
	0x0059 00089 (a.go:6)	MOVQ	DX, 16(SP)
	0x005e 00094 (a.go:6)	MOVQ	CX, 24(SP)
	0x0063 00099 (a.go:6)	MOVQ	BP, 32(SP)
	0x0068 00104 (a.go:6)	PCDATA	$0, $0
	0x0068 00104 (a.go:6)	CALL	runtime.growslice(SB)
	0x006d 00109 (a.go:6)	MOVQ	40(SP), CX
	0x0072 00114 (a.go:6)	MOVQ	48(SP), DX
	0x0077 00119 (a.go:6)	MOVQ	DX, "".autotmp_0+64(SP)
	0x007c 00124 (a.go:6)	MOVQ	56(SP), BX
	0x0081 00129 (a.go:6)	MOVQ	BX, "".x+16(SB)
	0x0088 00136 (a.go:6)	MOVL	runtime.writeBarrier(SB), AX
	0x008e 00142 (a.go:6)	TESTB	AL, AL
	0x0090 00144 (a.go:6)	JNE	$0, 162
	0x0092 00146 (a.go:6)	MOVQ	CX, "".x(SB)
	0x0099 00153 (a.go:6)	MOVQ	"".x(SB), BX
	0x00a0 00160 (a.go:6)	JMP	53
	0x00a2 00162 (a.go:6)	LEAQ	"".x(SB), BX
	0x00a9 00169 (a.go:6)	MOVQ	BX, (SP)
	0x00ad 00173 (a.go:6)	MOVQ	CX, 8(SP)
	0x00b2 00178 (a.go:6)	PCDATA	$0, $0
	0x00b2 00178 (a.go:6)	CALL	runtime.writebarrierptr(SB)
	0x00b7 00183 (a.go:6)	MOVQ	"".autotmp_0+64(SP), DX
	0x00bc 00188 (a.go:6)	JMP	153
	0x00be 00190 (a.go:6)	NOP
	0x00be 00190 (a.go:5)	CALL	runtime.morestack_noctxt(SB)
	0x00c3 00195 (a.go:5)	JMP	0
Fixes #14969 again
Change-Id: Ia50463b1f506011aad0718a4fef1d4738e43c32d
Reviewed-on: https://go-review.googlesource.com/22197
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-18 09:40:30 -07:00
										 |  |  | 		delete(s.vars, &lenVar) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid a spill in append fast path
Instead of spilling newlen, recalculate it.
This removes a spill from the fast path,
at the cost of a cheap recalculation
on the (rare) growth path.
This uses 8 bytes less of stack space.
It generates two more bytes of code,
but that is due to suboptimal register allocation;
see far below.
Runtime append microbenchmarks are all over the map,
presumably due to incidental code movement.
Sample code:
func s(b []byte) []byte {
	b = append(b, 1, 2, 3)
	return b
}
Before:
"".s t=1 size=160 args=0x30 locals=0x48
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $72-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	149
	0x0013 00019 (append.go:8)	SUBQ	$72, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+88(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	DX, "".autotmp_0+64(SP)
	0x0025 00037 (append.go:9)	MOVQ	"".b+96(FP), BX
	0x002a 00042 (append.go:9)	CMPQ	DX, BX
	0x002d 00045 (append.go:9)	JGT	$0, 86
	0x002f 00047 (append.go:8)	MOVQ	"".b+80(FP), AX
	0x0034 00052 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x003d 00061 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x0042 00066 (append.go:10)	MOVQ	AX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	DX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	MOVQ	BX, "".~r1+120(FP)
	0x0051 00081 (append.go:10)	ADDQ	$72, SP
	0x0055 00085 (append.go:10)	RET
	0x0056 00086 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x005d 00093 (append.go:9)	MOVQ	AX, (SP)
	0x0061 00097 (append.go:9)	MOVQ	"".b+80(FP), BP
	0x0066 00102 (append.go:9)	MOVQ	BP, 8(SP)
	0x006b 00107 (append.go:9)	MOVQ	CX, 16(SP)
	0x0070 00112 (append.go:9)	MOVQ	BX, 24(SP)
	0x0075 00117 (append.go:9)	MOVQ	DX, 32(SP)
	0x007a 00122 (append.go:9)	PCDATA	$0, $0
	0x007a 00122 (append.go:9)	CALL	runtime.growslice(SB)
	0x007f 00127 (append.go:9)	MOVQ	40(SP), AX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:8)	MOVQ	"".b+88(FP), CX
	0x008e 00142 (append.go:9)	MOVQ	"".autotmp_0+64(SP), DX
	0x0093 00147 (append.go:9)	JMP	52
	0x0095 00149 (append.go:9)	NOP
	0x0095 00149 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009a 00154 (append.go:8)	JMP	0
After:
"".s t=1 size=176 args=0x30 locals=0x40
	0x0000 00000 (append.go:8)	TEXT	"".s(SB), $64-48
	0x0000 00000 (append.go:8)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:8)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:8)	JLS	151
	0x0013 00019 (append.go:8)	SUBQ	$64, SP
	0x0017 00023 (append.go:8)	FUNCDATA	$0, gclocals·6432f8c6a0d23fa7bee6c5d96f21a92a(SB)
	0x0017 00023 (append.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:9)	MOVQ	"".b+80(FP), CX
	0x001c 00028 (append.go:9)	LEAQ	3(CX), DX
	0x0020 00032 (append.go:9)	MOVQ	"".b+88(FP), BX
	0x0025 00037 (append.go:9)	CMPQ	DX, BX
	0x0028 00040 (append.go:9)	JGT	$0, 81
	0x002a 00042 (append.go:8)	MOVQ	"".b+72(FP), AX
	0x002f 00047 (append.go:9)	MOVB	$1, (AX)(CX*1)
	0x0033 00051 (append.go:9)	MOVB	$2, 1(AX)(CX*1)
	0x0038 00056 (append.go:9)	MOVB	$3, 2(AX)(CX*1)
	0x003d 00061 (append.go:10)	MOVQ	AX, "".~r1+96(FP)
	0x0042 00066 (append.go:10)	MOVQ	DX, "".~r1+104(FP)
	0x0047 00071 (append.go:10)	MOVQ	BX, "".~r1+112(FP)
	0x004c 00076 (append.go:10)	ADDQ	$64, SP
	0x0050 00080 (append.go:10)	RET
	0x0051 00081 (append.go:9)	LEAQ	type.[]uint8(SB), AX
	0x0058 00088 (append.go:9)	MOVQ	AX, (SP)
	0x005c 00092 (append.go:9)	MOVQ	"".b+72(FP), BP
	0x0061 00097 (append.go:9)	MOVQ	BP, 8(SP)
	0x0066 00102 (append.go:9)	MOVQ	CX, 16(SP)
	0x006b 00107 (append.go:9)	MOVQ	BX, 24(SP)
	0x0070 00112 (append.go:9)	MOVQ	DX, 32(SP)
	0x0075 00117 (append.go:9)	PCDATA	$0, $0
	0x0075 00117 (append.go:9)	CALL	runtime.growslice(SB)
	0x007a 00122 (append.go:9)	MOVQ	40(SP), AX
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
	0x0095 00149 (append.go:9)	JMP	47
	0x0097 00151 (append.go:9)	NOP
	0x0097 00151 (append.go:8)	CALL	runtime.morestack_noctxt(SB)
	0x009c 00156 (append.go:8)	JMP	0
Observe that in the following sequence,
we should use DX directly instead of using
CX as a temporary register, which would make
the new code a strict improvement on the old:
	0x007f 00127 (append.go:9)	MOVQ	48(SP), CX
	0x0084 00132 (append.go:9)	MOVQ	56(SP), BX
	0x0089 00137 (append.go:9)	ADDQ	$3, CX
	0x008d 00141 (append.go:9)	MOVQ	CX, DX
	0x0090 00144 (append.go:8)	MOVQ	"".b+80(FP), CX
Change-Id: I4ee50b18fa53865901d2d7f86c2cbb54c6fa6924
Reviewed-on: https://go-review.googlesource.com/21812
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:08:00 -07:00
										 |  |  | 	delete(s.vars, &newlenVar) | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	delete(s.vars, &capVar) | 
					
						
							| 
									
										
											  
											
												cmd/compile: avoid write barrier in append fast path
When we are writing the result of an append back
to the same slice, we don’t need a write barrier
on the fast path.
This re-implements an optimization that was present
in the old backend.
Updates #14921
Fixes #14969
Sample code:
var x []byte
func p() {
	x = append(x, 1, 2, 3)
}
Before:
"".p t=1 size=224 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	199
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	"".x+8(SB), BX
	0x002c 00044 (append.go:19)	MOVQ	BX, "".autotmp_0+64(SP)
	0x0031 00049 (append.go:22)	LEAQ	3(BX), BP
	0x0035 00053 (append.go:22)	CMPQ	BP, CX
	0x0038 00056 (append.go:22)	JGT	$0, 131
	0x003a 00058 (append.go:22)	MOVB	$1, (DX)(BX*1)
	0x003e 00062 (append.go:22)	MOVB	$2, 1(DX)(BX*1)
	0x0043 00067 (append.go:22)	MOVB	$3, 2(DX)(BX*1)
	0x0048 00072 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x004f 00079 (append.go:22)	MOVQ	CX, "".x+16(SB)
	0x0056 00086 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x005c 00092 (append.go:22)	TESTB	AL, AL
	0x005e 00094 (append.go:22)	JNE	$0, 108
	0x0060 00096 (append.go:22)	MOVQ	DX, "".x(SB)
	0x0067 00103 (append.go:23)	ADDQ	$72, SP
	0x006b 00107 (append.go:23)	RET
	0x006c 00108 (append.go:22)	LEAQ	"".x(SB), CX
	0x0073 00115 (append.go:22)	MOVQ	CX, (SP)
	0x0077 00119 (append.go:22)	MOVQ	DX, 8(SP)
	0x007c 00124 (append.go:22)	PCDATA	$0, $0
	0x007c 00124 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x0081 00129 (append.go:23)	JMP	103
	0x0083 00131 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x008a 00138 (append.go:22)	MOVQ	AX, (SP)
	0x008e 00142 (append.go:22)	MOVQ	DX, 8(SP)
	0x0093 00147 (append.go:22)	MOVQ	BX, 16(SP)
	0x0098 00152 (append.go:22)	MOVQ	CX, 24(SP)
	0x009d 00157 (append.go:22)	MOVQ	BP, 32(SP)
	0x00a2 00162 (append.go:22)	PCDATA	$0, $0
	0x00a2 00162 (append.go:22)	CALL	runtime.growslice(SB)
	0x00a7 00167 (append.go:22)	MOVQ	40(SP), DX
	0x00ac 00172 (append.go:22)	MOVQ	48(SP), AX
	0x00b1 00177 (append.go:22)	MOVQ	56(SP), CX
	0x00b6 00182 (append.go:22)	ADDQ	$3, AX
	0x00ba 00186 (append.go:19)	MOVQ	"".autotmp_0+64(SP), BX
	0x00bf 00191 (append.go:22)	MOVQ	AX, BP
	0x00c2 00194 (append.go:22)	JMP	58
	0x00c7 00199 (append.go:22)	NOP
	0x00c7 00199 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00cc 00204 (append.go:21)	JMP	0
After:
"".p t=1 size=208 args=0x0 locals=0x48
	0x0000 00000 (append.go:21)	TEXT	"".p(SB), $72-0
	0x0000 00000 (append.go:21)	MOVQ	(TLS), CX
	0x0009 00009 (append.go:21)	CMPQ	SP, 16(CX)
	0x000d 00013 (append.go:21)	JLS	191
	0x0013 00019 (append.go:21)	SUBQ	$72, SP
	0x0017 00023 (append.go:21)	FUNCDATA	$0, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:21)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0017 00023 (append.go:19)	MOVQ	"".x+16(SB), CX
	0x001e 00030 (append.go:19)	MOVQ	"".x+8(SB), DX
	0x0025 00037 (append.go:19)	MOVQ	DX, "".autotmp_0+64(SP)
	0x002a 00042 (append.go:19)	MOVQ	"".x(SB), BX
	0x0031 00049 (append.go:22)	LEAQ	3(DX), BP
	0x0035 00053 (append.go:22)	MOVQ	BP, "".x+8(SB)
	0x003c 00060 (append.go:22)	CMPQ	BP, CX
	0x003f 00063 (append.go:22)	JGT	$0, 84
	0x0041 00065 (append.go:22)	MOVB	$1, (BX)(DX*1)
	0x0045 00069 (append.go:22)	MOVB	$2, 1(BX)(DX*1)
	0x004a 00074 (append.go:22)	MOVB	$3, 2(BX)(DX*1)
	0x004f 00079 (append.go:23)	ADDQ	$72, SP
	0x0053 00083 (append.go:23)	RET
	0x0054 00084 (append.go:22)	LEAQ	type.[]uint8(SB), AX
	0x005b 00091 (append.go:22)	MOVQ	AX, (SP)
	0x005f 00095 (append.go:22)	MOVQ	BX, 8(SP)
	0x0064 00100 (append.go:22)	MOVQ	DX, 16(SP)
	0x0069 00105 (append.go:22)	MOVQ	CX, 24(SP)
	0x006e 00110 (append.go:22)	MOVQ	BP, 32(SP)
	0x0073 00115 (append.go:22)	PCDATA	$0, $0
	0x0073 00115 (append.go:22)	CALL	runtime.growslice(SB)
	0x0078 00120 (append.go:22)	MOVQ	40(SP), CX
	0x007d 00125 (append.go:22)	MOVQ	56(SP), AX
	0x0082 00130 (append.go:22)	MOVQ	AX, "".x+16(SB)
	0x0089 00137 (append.go:22)	MOVL	runtime.writeBarrier(SB), AX
	0x008f 00143 (append.go:22)	TESTB	AL, AL
	0x0091 00145 (append.go:22)	JNE	$0, 168
	0x0093 00147 (append.go:22)	MOVQ	CX, "".x(SB)
	0x009a 00154 (append.go:22)	MOVQ	"".x(SB), BX
	0x00a1 00161 (append.go:19)	MOVQ	"".autotmp_0+64(SP), DX
	0x00a6 00166 (append.go:22)	JMP	65
	0x00a8 00168 (append.go:22)	LEAQ	"".x(SB), DX
	0x00af 00175 (append.go:22)	MOVQ	DX, (SP)
	0x00b3 00179 (append.go:22)	MOVQ	CX, 8(SP)
	0x00b8 00184 (append.go:22)	PCDATA	$0, $0
	0x00b8 00184 (append.go:22)	CALL	runtime.writebarrierptr(SB)
	0x00bd 00189 (append.go:22)	JMP	154
	0x00bf 00191 (append.go:22)	NOP
	0x00bf 00191 (append.go:21)	CALL	runtime.morestack_noctxt(SB)
	0x00c4 00196 (append.go:21)	JMP	0
Change-Id: I77a41ad3a22557a4bb4654de7d6d24a029efe34a
Reviewed-on: https://go-review.googlesource.com/21813
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-04-10 09:44:17 -07:00
										 |  |  | 	// make result | 
					
						
							| 
									
										
										
										
											2016-04-04 10:58:21 -07:00
										 |  |  | 	return s.newValue3(ssa.OpSliceMake, n.Type, p, nl, c) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | // condBranch evaluates the boolean expression cond and branches to yes | 
					
						
							|  |  |  | // if cond is true and no if cond is false. | 
					
						
							|  |  |  | // This function is intended to handle && and || better than just calling | 
					
						
							|  |  |  | // s.expr(cond) and branching on the result. | 
					
						
							|  |  |  | func (s *state) condBranch(cond *Node, yes, no *ssa.Block, likely int8) { | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	switch cond.Op { | 
					
						
							|  |  |  | 	case OANDAND: | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 		mid := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		s.stmtList(cond.Ninit) | 
					
						
							|  |  |  | 		s.condBranch(cond.Left, mid, no, max8(likely, 0)) | 
					
						
							|  |  |  | 		s.startBlock(mid) | 
					
						
							|  |  |  | 		s.condBranch(cond.Right, yes, no, likely) | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 		// Note: if likely==1, then both recursive calls pass 1. | 
					
						
							|  |  |  | 		// If likely==-1, then we don't have enough information to decide | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 		// whether the first branch is likely or not. So we pass 0 for | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 		// the likeliness of the first branch. | 
					
						
							|  |  |  | 		// TODO: have the frontend give us branch prediction hints for | 
					
						
							|  |  |  | 		// OANDAND and OOROR nodes (if it ever has such info). | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	case OOROR: | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 		mid := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		s.stmtList(cond.Ninit) | 
					
						
							|  |  |  | 		s.condBranch(cond.Left, yes, mid, min8(likely, 0)) | 
					
						
							|  |  |  | 		s.startBlock(mid) | 
					
						
							|  |  |  | 		s.condBranch(cond.Right, yes, no, likely) | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 		// Note: if likely==-1, then both recursive calls pass -1. | 
					
						
							|  |  |  | 		// If likely==1, then we don't have enough info to decide | 
					
						
							|  |  |  | 		// the likelihood of the first branch. | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	case ONOT: | 
					
						
							| 
									
										
										
										
											2015-11-03 09:30:17 -08:00
										 |  |  | 		s.stmtList(cond.Ninit) | 
					
						
							|  |  |  | 		s.condBranch(cond.Left, no, yes, -likely) | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 	c := s.expr(cond) | 
					
						
							|  |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(c) | 
					
						
							| 
									
										
										
										
											2015-11-02 16:56:53 -08:00
										 |  |  | 	b.Likely = ssa.BranchPrediction(likely) // gc and ssa both use -1/0/+1 for likeliness | 
					
						
							|  |  |  | 	b.AddEdgeTo(yes) | 
					
						
							|  |  |  | 	b.AddEdgeTo(no) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | type skipMask uint8 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | const ( | 
					
						
							|  |  |  | 	skipPtr skipMask = 1 << iota | 
					
						
							|  |  |  | 	skipLen | 
					
						
							|  |  |  | 	skipCap | 
					
						
							|  |  |  | ) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | // assign does left = right. | 
					
						
							|  |  |  | // Right has already been evaluated to ssa, left has not. | 
					
						
							|  |  |  | // If deref is true, then we do left = *right instead (and right has already been nil-checked). | 
					
						
							|  |  |  | // If deref is true and right == nil, just do left = 0. | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | // skip indicates assignments (at the top level) that can be avoided. | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | func (s *state) assign(left *Node, right *ssa.Value, deref bool, skip skipMask) { | 
					
						
							| 
									
										
										
										
											2018-04-08 13:39:10 +01:00
										 |  |  | 	if left.Op == ONAME && left.isBlank() { | 
					
						
							| 
									
										
										
										
											2015-08-14 21:47:20 -07:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	t := left.Type | 
					
						
							| 
									
										
										
										
											2017-04-27 16:37:25 -07:00
										 |  |  | 	dowidth(t) | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	if s.canSSA(left) { | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		if deref { | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("can SSA LHS %v but not RHS %s", left, right) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 		if left.Op == ODOT { | 
					
						
							|  |  |  | 			// We're assigning to a field of an ssa-able value. | 
					
						
							|  |  |  | 			// We need to build a new structure with the new value for the | 
					
						
							|  |  |  | 			// field we're assigning and the old values for the other fields. | 
					
						
							|  |  |  | 			// For instance: | 
					
						
							|  |  |  | 			//   type T struct {a, b, c int} | 
					
						
							|  |  |  | 			//   var T x | 
					
						
							|  |  |  | 			//   x.b = 5 | 
					
						
							|  |  |  | 			// For the x.b = 5 assignment we want to generate x = T{x.a, 5, x.c} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Grab information about the structure type. | 
					
						
							|  |  |  | 			t := left.Left.Type | 
					
						
							|  |  |  | 			nf := t.NumFields() | 
					
						
							|  |  |  | 			idx := fieldIdx(left) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Grab old value of structure. | 
					
						
							|  |  |  | 			old := s.expr(left.Left) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Make new structure. | 
					
						
							|  |  |  | 			new := s.newValue0(ssa.StructMakeOp(t.NumFields()), t) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Add fields as args. | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 			for i := 0; i < nf; i++ { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 				if i == idx { | 
					
						
							|  |  |  | 					new.AddArg(right) | 
					
						
							|  |  |  | 				} else { | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 					new.AddArg(s.newValue1I(ssa.OpStructSelect, t.FieldType(i), int64(i), old)) | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Recursively assign the new value we've made to the base of the dot op. | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 			s.assign(left.Left, new, false, 0) | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 			// TODO: do we need to update named values here? | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 		if left.Op == OINDEX && left.Left.Type.IsArray() { | 
					
						
							|  |  |  | 			// We're assigning to an element of an ssa-able array. | 
					
						
							|  |  |  | 			// a[i] = v | 
					
						
							|  |  |  | 			t := left.Left.Type | 
					
						
							|  |  |  | 			n := t.NumElem() | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			i := s.expr(left.Right) // index | 
					
						
							|  |  |  | 			if n == 0 { | 
					
						
							|  |  |  | 				// The bounds check must fail.  Might as well | 
					
						
							|  |  |  | 				// ignore the actual index and just use zeros. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				z := s.constInt(types.Types[TINT], 0) | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 				s.boundsCheck(z, z) | 
					
						
							|  |  |  | 				return | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			if n != 1 { | 
					
						
							|  |  |  | 				s.Fatalf("assigning to non-1-length array") | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			// Rewrite to a = [1]{v} | 
					
						
							|  |  |  | 			i = s.extendIndex(i, panicindex) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			s.boundsCheck(i, s.constInt(types.Types[TINT], 1)) | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 			v := s.newValue1(ssa.OpArrayMake1, t, right) | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 			s.assign(left.Left, v, false, 0) | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-06-12 14:23:29 +01:00
										 |  |  | 		// Update variable assignment. | 
					
						
							| 
									
										
										
										
											2015-08-29 14:54:45 -07:00
										 |  |  | 		s.vars[left] = right | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 		s.addNamedValue(left, right) | 
					
						
							| 
									
										
										
										
											2015-06-12 14:23:29 +01:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// Left is not ssa-able. Compute its address. | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 	if left.Op == ONAME && left.Class() != PEXTERN && skip == 0 { | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | 		s.vars[&memVar] = s.newValue1Apos(ssa.OpVarDef, types.TypeMem, left, s.mem(), !left.IsAutoTmp()) | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 	addr := s.addr(left, false) | 
					
						
							| 
									
										
										
										
											2017-02-05 23:43:31 -05:00
										 |  |  | 	if isReflectHeaderDataField(left) { | 
					
						
							|  |  |  | 		// Package unsafe's documentation says storing pointers into | 
					
						
							|  |  |  | 		// reflect.SliceHeader and reflect.StringHeader's Data fields | 
					
						
							|  |  |  | 		// is valid, even though they have type uintptr (#19168). | 
					
						
							|  |  |  | 		// Mark it pointer type to signal the writebarrier pass to | 
					
						
							|  |  |  | 		// insert a write barrier. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		t = types.Types[TUNSAFEPTR] | 
					
						
							| 
									
										
										
										
											2017-02-05 23:43:31 -05:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	if deref { | 
					
						
							|  |  |  | 		// Treat as a mem->mem move. | 
					
						
							| 
									
										
										
										
											2016-10-18 11:06:28 -04:00
										 |  |  | 		if right == nil { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.zero(t, addr) | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 		} else { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.move(t, addr, right) | 
					
						
							| 
									
										
										
										
											2016-10-18 11:06:28 -04:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	// Treat as a store. | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | 	s.storeType(t, addr, right, skip, !left.IsAutoTmp()) | 
					
						
							| 
									
										
										
										
											2015-06-12 14:23:29 +01:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | // zeroVal returns the zero value for type t. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) zeroVal(t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | 	switch { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 	case t.IsInteger(): | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		switch t.Size() { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 		case 1: | 
					
						
							|  |  |  | 			return s.constInt8(t, 0) | 
					
						
							|  |  |  | 		case 2: | 
					
						
							|  |  |  | 			return s.constInt16(t, 0) | 
					
						
							|  |  |  | 		case 4: | 
					
						
							|  |  |  | 			return s.constInt32(t, 0) | 
					
						
							|  |  |  | 		case 8: | 
					
						
							|  |  |  | 			return s.constInt64(t, 0) | 
					
						
							|  |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("bad sized integer type %v", t) | 
					
						
							| 
									
										
										
										
											2015-07-28 14:19:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-25 19:21:45 -05:00
										 |  |  | 	case t.IsFloat(): | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		switch t.Size() { | 
					
						
							| 
									
										
										
										
											2015-08-25 19:21:45 -05:00
										 |  |  | 		case 4: | 
					
						
							|  |  |  | 			return s.constFloat32(t, 0) | 
					
						
							|  |  |  | 		case 8: | 
					
						
							|  |  |  | 			return s.constFloat64(t, 0) | 
					
						
							|  |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("bad sized float type %v", t) | 
					
						
							| 
									
										
										
										
											2015-08-25 19:21:45 -05:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 	case t.IsComplex(): | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		switch t.Size() { | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		case 8: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			z := s.constFloat32(types.Types[TFLOAT32], 0) | 
					
						
							| 
									
										
										
										
											2015-08-28 13:52:26 -07:00
										 |  |  | 			return s.entryNewValue2(ssa.OpComplexMake, t, z, z) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		case 16: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			z := s.constFloat64(types.Types[TFLOAT64], 0) | 
					
						
							| 
									
										
										
										
											2015-08-28 13:52:26 -07:00
										 |  |  | 			return s.entryNewValue2(ssa.OpComplexMake, t, z, z) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 			s.Fatalf("bad sized complex type %v", t) | 
					
						
							| 
									
										
										
										
											2015-08-28 14:24:10 -04:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | 	case t.IsString(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 		return s.constEmptyString(t) | 
					
						
							| 
									
										
										
										
											2016-03-28 10:55:44 -07:00
										 |  |  | 	case t.IsPtrShaped(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 		return s.constNil(t) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | 	case t.IsBoolean(): | 
					
						
							| 
									
										
										
										
											2015-09-08 16:52:25 -07:00
										 |  |  | 		return s.constBool(false) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	case t.IsInterface(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 		return s.constInterface(t) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	case t.IsSlice(): | 
					
						
							| 
									
										
										
										
											2016-03-06 18:06:09 -08:00
										 |  |  | 		return s.constSlice(t) | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 	case t.IsStruct(): | 
					
						
							|  |  |  | 		n := t.NumFields() | 
					
						
							|  |  |  | 		v := s.entryNewValue0(ssa.StructMakeOp(t.NumFields()), t) | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 		for i := 0; i < n; i++ { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v.AddArg(s.zeroVal(t.FieldType(i))) | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		return v | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	case t.IsArray(): | 
					
						
							|  |  |  | 		switch t.NumElem() { | 
					
						
							|  |  |  | 		case 0: | 
					
						
							|  |  |  | 			return s.entryNewValue0(ssa.OpArrayMake0, t) | 
					
						
							|  |  |  | 		case 1: | 
					
						
							|  |  |  | 			return s.entryNewValue1(ssa.OpArrayMake1, t, s.zeroVal(t.Elem())) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 	s.Fatalf("zero for type %v not implemented", t) | 
					
						
							| 
									
										
										
										
											2015-07-20 15:30:52 -07:00
										 |  |  | 	return nil | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | type callKind int8 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | const ( | 
					
						
							|  |  |  | 	callNormal callKind = iota | 
					
						
							|  |  |  | 	callDefer | 
					
						
							|  |  |  | 	callGo | 
					
						
							|  |  |  | ) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | type sfRtCallDef struct { | 
					
						
							|  |  |  | 	rtfn  *obj.LSym | 
					
						
							|  |  |  | 	rtype types.EType | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | var softFloatOps map[ssa.Op]sfRtCallDef | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func softfloatInit() { | 
					
						
							|  |  |  | 	// Some of these operations get transformed by sfcall. | 
					
						
							|  |  |  | 	softFloatOps = map[ssa.Op]sfRtCallDef{ | 
					
						
							|  |  |  | 		ssa.OpAdd32F: sfRtCallDef{sysfunc("fadd32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpAdd64F: sfRtCallDef{sysfunc("fadd64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpSub32F: sfRtCallDef{sysfunc("fadd32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpSub64F: sfRtCallDef{sysfunc("fadd64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpMul32F: sfRtCallDef{sysfunc("fmul32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpMul64F: sfRtCallDef{sysfunc("fmul64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpDiv32F: sfRtCallDef{sysfunc("fdiv32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpDiv64F: sfRtCallDef{sysfunc("fdiv64"), TFLOAT64}, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		ssa.OpEq64F:      sfRtCallDef{sysfunc("feq64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpEq32F:      sfRtCallDef{sysfunc("feq32"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpNeq64F:     sfRtCallDef{sysfunc("feq64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpNeq32F:     sfRtCallDef{sysfunc("feq32"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpLess64F:    sfRtCallDef{sysfunc("fgt64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpLess32F:    sfRtCallDef{sysfunc("fgt32"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpGreater64F: sfRtCallDef{sysfunc("fgt64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpGreater32F: sfRtCallDef{sysfunc("fgt32"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpLeq64F:     sfRtCallDef{sysfunc("fge64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpLeq32F:     sfRtCallDef{sysfunc("fge32"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpGeq64F:     sfRtCallDef{sysfunc("fge64"), TBOOL}, | 
					
						
							|  |  |  | 		ssa.OpGeq32F:     sfRtCallDef{sysfunc("fge32"), TBOOL}, | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		ssa.OpCvt32to32F:  sfRtCallDef{sysfunc("fint32to32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpCvt32Fto32:  sfRtCallDef{sysfunc("f32toint32"), TINT32}, | 
					
						
							|  |  |  | 		ssa.OpCvt64to32F:  sfRtCallDef{sysfunc("fint64to32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpCvt32Fto64:  sfRtCallDef{sysfunc("f32toint64"), TINT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Uto32F: sfRtCallDef{sysfunc("fuint64to32"), TFLOAT32}, | 
					
						
							|  |  |  | 		ssa.OpCvt32Fto64U: sfRtCallDef{sysfunc("f32touint64"), TUINT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt32to64F:  sfRtCallDef{sysfunc("fint32to64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Fto32:  sfRtCallDef{sysfunc("f64toint32"), TINT32}, | 
					
						
							|  |  |  | 		ssa.OpCvt64to64F:  sfRtCallDef{sysfunc("fint64to64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Fto64:  sfRtCallDef{sysfunc("f64toint64"), TINT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Uto64F: sfRtCallDef{sysfunc("fuint64to64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Fto64U: sfRtCallDef{sysfunc("f64touint64"), TUINT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt32Fto64F: sfRtCallDef{sysfunc("f32to64"), TFLOAT64}, | 
					
						
							|  |  |  | 		ssa.OpCvt64Fto32F: sfRtCallDef{sysfunc("f64to32"), TFLOAT32}, | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // TODO: do not emit sfcall if operation can be optimized to constant in later | 
					
						
							|  |  |  | // opt phase | 
					
						
							|  |  |  | func (s *state) sfcall(op ssa.Op, args ...*ssa.Value) (*ssa.Value, bool) { | 
					
						
							|  |  |  | 	if callDef, ok := softFloatOps[op]; ok { | 
					
						
							|  |  |  | 		switch op { | 
					
						
							|  |  |  | 		case ssa.OpLess32F, | 
					
						
							|  |  |  | 			ssa.OpLess64F, | 
					
						
							|  |  |  | 			ssa.OpLeq32F, | 
					
						
							|  |  |  | 			ssa.OpLeq64F: | 
					
						
							|  |  |  | 			args[0], args[1] = args[1], args[0] | 
					
						
							|  |  |  | 		case ssa.OpSub32F, | 
					
						
							|  |  |  | 			ssa.OpSub64F: | 
					
						
							|  |  |  | 			args[1] = s.newValue1(s.ssaOp(OMINUS, types.Types[callDef.rtype]), args[1].Type, args[1]) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		result := s.rtcall(callDef.rtfn, true, []*types.Type{types.Types[callDef.rtype]}, args...)[0] | 
					
						
							|  |  |  | 		if op == ssa.OpNeq32F || op == ssa.OpNeq64F { | 
					
						
							|  |  |  | 			result = s.newValue1(ssa.OpNot, result.Type, result) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return result, true | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return nil, false | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | var intrinsics map[intrinsicKey]intrinsicBuilder | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | // An intrinsicBuilder converts a call node n into an ssa value that | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | // implements that call as an intrinsic. args is a list of arguments to the func. | 
					
						
							|  |  |  | type intrinsicBuilder func(s *state, n *Node, args []*ssa.Value) *ssa.Value | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | type intrinsicKey struct { | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	arch *sys.Arch | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	pkg  string | 
					
						
							|  |  |  | 	fn   string | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | func init() { | 
					
						
							|  |  |  | 	intrinsics = map[intrinsicKey]intrinsicBuilder{} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	var all []*sys.Arch | 
					
						
							|  |  |  | 	var p4 []*sys.Arch | 
					
						
							|  |  |  | 	var p8 []*sys.Arch | 
					
						
							|  |  |  | 	for _, a := range sys.Archs { | 
					
						
							|  |  |  | 		all = append(all, a) | 
					
						
							|  |  |  | 		if a.PtrSize == 4 { | 
					
						
							|  |  |  | 			p4 = append(p4, a) | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			p8 = append(p8, a) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-09-10 22:44:00 +02:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	// add adds the intrinsic b for pkg.fn for the given list of architectures. | 
					
						
							|  |  |  | 	add := func(pkg, fn string, b intrinsicBuilder, archs ...*sys.Arch) { | 
					
						
							|  |  |  | 		for _, a := range archs { | 
					
						
							|  |  |  | 			intrinsics[intrinsicKey{a, pkg, fn}] = b | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	// addF does the same as add but operates on architecture families. | 
					
						
							|  |  |  | 	addF := func(pkg, fn string, b intrinsicBuilder, archFamilies ...sys.ArchFamily) { | 
					
						
							|  |  |  | 		m := 0 | 
					
						
							|  |  |  | 		for _, f := range archFamilies { | 
					
						
							|  |  |  | 			if f >= 32 { | 
					
						
							|  |  |  | 				panic("too many architecture families") | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			m |= 1 << uint(f) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		for _, a := range all { | 
					
						
							|  |  |  | 			if m>>uint(a.Family)&1 != 0 { | 
					
						
							|  |  |  | 				intrinsics[intrinsicKey{a, pkg, fn}] = b | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	// alias defines pkg.fn = pkg2.fn2 for all architectures in archs for which pkg2.fn2 exists. | 
					
						
							|  |  |  | 	alias := func(pkg, fn, pkg2, fn2 string, archs ...*sys.Arch) { | 
					
						
							|  |  |  | 		for _, a := range archs { | 
					
						
							|  |  |  | 			if b, ok := intrinsics[intrinsicKey{a, pkg2, fn2}]; ok { | 
					
						
							|  |  |  | 				intrinsics[intrinsicKey{a, pkg, fn}] = b | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-08-29 16:26:57 -04:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	/******** runtime ********/ | 
					
						
							|  |  |  | 	if !instrumenting { | 
					
						
							|  |  |  | 		add("runtime", "slicebytetostringtmp", | 
					
						
							|  |  |  | 			func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 				// Compiler frontend optimizations emit OARRAYBYTESTRTMP nodes | 
					
						
							|  |  |  | 				// for the backend instead of slicebytetostringtmp calls | 
					
						
							|  |  |  | 				// when not instrumenting. | 
					
						
							|  |  |  | 				slice := args[0] | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 				ptr := s.newValue1(ssa.OpSlicePtr, s.f.Config.Types.BytePtr, slice) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				len := s.newValue1(ssa.OpSliceLen, types.Types[TINT], slice) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 				return s.newValue2(ssa.OpStringMake, n.Type, ptr, len) | 
					
						
							|  |  |  | 			}, | 
					
						
							|  |  |  | 			all...) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	add("runtime", "KeepAlive", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 			data := s.newValue1(ssa.OpIData, s.f.Config.Types.BytePtr, args[0]) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue2(ssa.OpKeepAlive, types.TypeMem, data, s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-31 15:17:02 -07:00
										 |  |  | 			return nil | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		all...) | 
					
						
							| 
									
										
										
										
											2017-08-05 14:44:00 +08:00
										 |  |  | 	add("runtime", "getclosureptr", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue0(ssa.OpGetClosurePtr, s.f.Config.Types.Uintptr) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		all...) | 
					
						
							| 
									
										
										
										
											2016-09-10 22:44:00 +02:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-05-02 14:25:00 +08:00
										 |  |  | 	add("runtime", "getcallerpc", | 
					
						
							| 
									
										
										
										
											2016-10-24 10:25:05 -04:00
										 |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue0(ssa.OpGetCallerPC, s.f.Config.Types.Uintptr) | 
					
						
							| 
									
										
										
										
											2018-05-02 14:25:00 +08:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		all...) | 
					
						
							| 
									
										
										
										
											2016-10-24 10:25:05 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-10-09 15:33:29 -04:00
										 |  |  | 	add("runtime", "getcallersp", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue0(ssa.OpGetCallerSP, s.f.Config.Types.Uintptr) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		all...) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	/******** runtime/internal/sys ********/ | 
					
						
							|  |  |  | 	addF("runtime/internal/sys", "Ctz32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpCtz32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-12 11:45:35 -07:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/sys", "Ctz64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpCtz64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-12 11:45:35 -07:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/sys", "Bswap32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBswap32, types.Types[TUINT32], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X) | 
					
						
							|  |  |  | 	addF("runtime/internal/sys", "Bswap64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBswap64, types.Types[TUINT64], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	/******** runtime/internal/atomic ********/ | 
					
						
							|  |  |  | 	addF("runtime/internal/atomic", "Load", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue2(ssa.OpAtomicLoad32, types.NewTuple(types.Types[TUINT32], types.TypeMem), args[0], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT32], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Load64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue2(ssa.OpAtomicLoad64, types.NewTuple(types.Types[TUINT64], types.TypeMem), args[0], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT64], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Loadp", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue2(ssa.OpAtomicLoadPtr, types.NewTuple(s.f.Config.Types.BytePtr, types.TypeMem), args[0], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, s.f.Config.Types.BytePtr, v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Store", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue3(ssa.OpAtomicStore32, types.TypeMem, args[0], args[1], s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Store64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue3(ssa.OpAtomicStore64, types.TypeMem, args[0], args[1], s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "StorepNoWB", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue3(ssa.OpAtomicStorePtrNoWB, types.TypeMem, args[0], args[1], s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Xchg", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue3(ssa.OpAtomicExchange32, types.NewTuple(types.Types[TUINT32], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT32], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Xchg64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue3(ssa.OpAtomicExchange64, types.NewTuple(types.Types[TUINT64], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT64], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Xadd", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue3(ssa.OpAtomicAdd32, types.NewTuple(types.Types[TUINT32], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT32], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-11-03 02:05:28 +00:00
										 |  |  | 		sys.AMD64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Xadd64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue3(ssa.OpAtomicAdd64, types.NewTuple(types.Types[TUINT64], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TUINT64], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-11-03 02:05:28 +00:00
										 |  |  | 		sys.AMD64, sys.S390X, sys.MIPS64, sys.PPC64) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	makeXaddARM64 := func(op0 ssa.Op, op1 ssa.Op, ty types.EType) func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 		return func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			// Target Atomic feature is identified by dynamic detection | 
					
						
							|  |  |  | 			addr := s.entryNewValue1A(ssa.OpAddr, types.Types[TBOOL].PtrTo(), arm64SupportAtomics, s.sb) | 
					
						
							|  |  |  | 			v := s.load(types.Types[TBOOL], addr) | 
					
						
							|  |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 			b.SetControl(v) | 
					
						
							|  |  |  | 			bTrue := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bFalse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bTrue) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bFalse) | 
					
						
							|  |  |  | 			b.Likely = ssa.BranchUnlikely // most machines don't have Atomics nowadays | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// We have atomic instructions - use it directly. | 
					
						
							|  |  |  | 			s.startBlock(bTrue) | 
					
						
							|  |  |  | 			v0 := s.newValue3(op1, types.NewTuple(types.Types[ty], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v0) | 
					
						
							|  |  |  | 			s.vars[n] = s.newValue1(ssa.OpSelect0, types.Types[ty], v0) | 
					
						
							|  |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Use original instruction sequence. | 
					
						
							|  |  |  | 			s.startBlock(bFalse) | 
					
						
							|  |  |  | 			v1 := s.newValue3(op0, types.NewTuple(types.Types[ty], types.TypeMem), args[0], args[1], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v1) | 
					
						
							|  |  |  | 			s.vars[n] = s.newValue1(ssa.OpSelect0, types.Types[ty], v1) | 
					
						
							|  |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Merge results. | 
					
						
							|  |  |  | 			s.startBlock(bEnd) | 
					
						
							|  |  |  | 			return s.variable(n, types.Types[ty]) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	addF("runtime/internal/atomic", "Xadd", | 
					
						
							|  |  |  | 		makeXaddARM64(ssa.OpAtomicAdd32, ssa.OpAtomicAdd32Variant, TUINT32), | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							|  |  |  | 	addF("runtime/internal/atomic", "Xadd64", | 
					
						
							|  |  |  | 		makeXaddARM64(ssa.OpAtomicAdd64, ssa.OpAtomicAdd64Variant, TUINT64), | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Cas", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue4(ssa.OpAtomicCompareAndSwap32, types.NewTuple(types.Types[TBOOL], types.TypeMem), args[0], args[1], args[2], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TBOOL], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "Cas64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			v := s.newValue4(ssa.OpAtomicCompareAndSwap64, types.NewTuple(types.Types[TBOOL], types.TypeMem), args[0], args[1], args[2], s.mem()) | 
					
						
							|  |  |  | 			s.vars[&memVar] = s.newValue1(ssa.OpSelect1, types.TypeMem, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSelect0, types.Types[TBOOL], v) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-10-05 10:11:38 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X, sys.MIPS64, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("runtime/internal/atomic", "And8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue3(ssa.OpAtomicAnd8, types.TypeMem, args[0], args[1], s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64, sys.ARM64, sys.MIPS, sys.PPC64) | 
					
						
							|  |  |  | 	addF("runtime/internal/atomic", "Or8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.vars[&memVar] = s.newValue3(ssa.OpAtomicOr8, types.TypeMem, args[0], args[1], s.mem()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64, sys.ARM64, sys.MIPS, sys.PPC64) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Loadint64", "runtime/internal/atomic", "Load64", all...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Xaddint64", "runtime/internal/atomic", "Xadd64", all...) | 
					
						
							| 
									
										
										
										
											2017-04-21 19:16:15 -07:00
										 |  |  | 	alias("runtime/internal/atomic", "Loaduint", "runtime/internal/atomic", "Load", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Loaduint", "runtime/internal/atomic", "Load64", p8...) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	alias("runtime/internal/atomic", "Loaduintptr", "runtime/internal/atomic", "Load", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Loaduintptr", "runtime/internal/atomic", "Load64", p8...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Storeuintptr", "runtime/internal/atomic", "Store", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Storeuintptr", "runtime/internal/atomic", "Store64", p8...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Xchguintptr", "runtime/internal/atomic", "Xchg", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Xchguintptr", "runtime/internal/atomic", "Xchg64", p8...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Xadduintptr", "runtime/internal/atomic", "Xadd", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Xadduintptr", "runtime/internal/atomic", "Xadd64", p8...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Casuintptr", "runtime/internal/atomic", "Cas", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Casuintptr", "runtime/internal/atomic", "Cas64", p8...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Casp1", "runtime/internal/atomic", "Cas", p4...) | 
					
						
							|  |  |  | 	alias("runtime/internal/atomic", "Casp1", "runtime/internal/atomic", "Cas64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-04-01 11:01:36 -07:00
										 |  |  | 	alias("runtime/internal/sys", "Ctz8", "math/bits", "TrailingZeros8", all...) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	/******** math ********/ | 
					
						
							|  |  |  | 	addF("math", "Sqrt", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpSqrt, types.Types[TFLOAT64], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-02-23 10:40:12 +01:00
										 |  |  | 		sys.I386, sys.AMD64, sys.ARM, sys.ARM64, sys.MIPS, sys.MIPS64, sys.PPC64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-08-09 13:53:34 -04:00
										 |  |  | 	addF("math", "Trunc", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpTrunc, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-02-16 09:22:32 -05:00
										 |  |  | 		sys.ARM64, sys.PPC64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-08-09 13:53:34 -04:00
										 |  |  | 	addF("math", "Ceil", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCeil, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-02-16 09:22:32 -05:00
										 |  |  | 		sys.ARM64, sys.PPC64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-08-09 13:53:34 -04:00
										 |  |  | 	addF("math", "Floor", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpFloor, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-02-16 09:22:32 -05:00
										 |  |  | 		sys.ARM64, sys.PPC64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-09-14 20:00:02 +01:00
										 |  |  | 	addF("math", "Round", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpRound, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-03-02 16:47:54 -03:00
										 |  |  | 		sys.ARM64, sys.PPC64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-10-30 09:02:44 -04:00
										 |  |  | 	addF("math", "RoundToEven", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpRoundToEven, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-09-28 17:11:31 -04:00
										 |  |  | 	addF("math", "Abs", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpAbs, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.PPC64) | 
					
						
							|  |  |  | 	addF("math", "Copysign", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpCopysign, types.Types[TFLOAT64], args[0], args[1]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.PPC64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-10-05 15:45:46 -05:00
										 |  |  | 	makeRoundAMD64 := func(op ssa.Op) func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 		return func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-18 08:48:04 -07:00
										 |  |  | 			addr := s.entryNewValue1A(ssa.OpAddr, types.Types[TBOOL].PtrTo(), supportSSE41, s.sb) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			v := s.load(types.Types[TBOOL], addr) | 
					
						
							| 
									
										
										
										
											2017-10-05 15:45:46 -05:00
										 |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 			b.SetControl(v) | 
					
						
							|  |  |  | 			bTrue := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bFalse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bTrue) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bFalse) | 
					
						
							|  |  |  | 			b.Likely = ssa.BranchLikely // most machines have sse4.1 nowadays | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// We have the intrinsic - use it directly. | 
					
						
							|  |  |  | 			s.startBlock(bTrue) | 
					
						
							|  |  |  | 			s.vars[n] = s.newValue1(op, types.Types[TFLOAT64], args[0]) | 
					
						
							|  |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Call the pure Go version. | 
					
						
							|  |  |  | 			s.startBlock(bFalse) | 
					
						
							|  |  |  | 			a := s.call(n, callNormal) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.vars[n] = s.load(types.Types[TFLOAT64], a) | 
					
						
							| 
									
										
										
										
											2017-10-05 15:45:46 -05:00
										 |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Merge results. | 
					
						
							|  |  |  | 			s.startBlock(bEnd) | 
					
						
							|  |  |  | 			return s.variable(n, types.Types[TFLOAT64]) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-10-31 16:49:27 -05:00
										 |  |  | 	addF("math", "RoundToEven", | 
					
						
							|  |  |  | 		makeRoundAMD64(ssa.OpRoundToEven), | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-10-05 15:45:46 -05:00
										 |  |  | 	addF("math", "Floor", | 
					
						
							|  |  |  | 		makeRoundAMD64(ssa.OpFloor), | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							|  |  |  | 	addF("math", "Ceil", | 
					
						
							|  |  |  | 		makeRoundAMD64(ssa.OpCeil), | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							|  |  |  | 	addF("math", "Trunc", | 
					
						
							|  |  |  | 		makeRoundAMD64(ssa.OpTrunc), | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	/******** math/bits ********/ | 
					
						
							|  |  |  | 	addF("math/bits", "TrailingZeros64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpCtz64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpCtz32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt16to32, types.Types[TUINT32], args[0]) | 
					
						
							|  |  |  | 			c := s.constInt32(types.Types[TUINT32], 1<<16) | 
					
						
							|  |  |  | 			y := s.newValue2(ssa.OpOr32, types.Types[TUINT32], x, c) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz32, types.Types[TINT], y) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM, sys.MIPS) | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize TrailingZeros(8|16) on amd64
Introduce Ctz8 and Ctz16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
name               old time/op  new time/op  delta
TrailingZeros8-8   1.33ns ± 6%  0.84ns ± 3%  -36.90%  (p=0.000 n=20+20)
TrailingZeros16-8  1.26ns ± 5%  0.84ns ± 5%  -33.50%  (p=0.000 n=20+18)
Code:
func f8(x uint8)   { z = bits.TrailingZeros8(x) }
func f16(x uint16) { z = bits.TrailingZeros16(x) }
Before:
"".f8 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	BTSQ	$8, AX
	0x000d 00013 (x.go:7)	BSFQ	AX, AX
	0x0011 00017 (x.go:7)	MOVL	$64, CX
	0x0016 00022 (x.go:7)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:7)	RET
"".f16 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BTSQ	$16, AX
	0x000d 00013 (x.go:8)	BSFQ	AX, AX
	0x0011 00017 (x.go:8)	MOVL	$64, CX
	0x0016 00022 (x.go:8)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:8)	RET
After:
"".f8 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	BTSL	$8, AX
	0x0009 00009 (x.go:7)	BSFL	AX, AX
	0x000c 00012 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:7)	RET
"".f16 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	BTSL	$16, AX
	0x0009 00009 (x.go:8)	BSFL	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:8)	RET
Change-Id: I0551e357348de2b724737d569afd6ac9f5c3aa11
Reviewed-on: https://go-review.googlesource.com/108940
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:46:41 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz16, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt16to64, types.Types[TUINT64], args[0]) | 
					
						
							|  |  |  | 			c := s.constInt64(types.Types[TUINT64], 1<<16) | 
					
						
							|  |  |  | 			y := s.newValue2(ssa.OpOr64, types.Types[TUINT64], x, c) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz64, types.Types[TINT], y) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize TrailingZeros(8|16) on amd64
Introduce Ctz8 and Ctz16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
name               old time/op  new time/op  delta
TrailingZeros8-8   1.33ns ± 6%  0.84ns ± 3%  -36.90%  (p=0.000 n=20+20)
TrailingZeros16-8  1.26ns ± 5%  0.84ns ± 5%  -33.50%  (p=0.000 n=20+18)
Code:
func f8(x uint8)   { z = bits.TrailingZeros8(x) }
func f16(x uint16) { z = bits.TrailingZeros16(x) }
Before:
"".f8 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	BTSQ	$8, AX
	0x000d 00013 (x.go:7)	BSFQ	AX, AX
	0x0011 00017 (x.go:7)	MOVL	$64, CX
	0x0016 00022 (x.go:7)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:7)	RET
"".f16 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BTSQ	$16, AX
	0x000d 00013 (x.go:8)	BSFQ	AX, AX
	0x0011 00017 (x.go:8)	MOVL	$64, CX
	0x0016 00022 (x.go:8)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:8)	RET
After:
"".f8 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	BTSL	$8, AX
	0x0009 00009 (x.go:7)	BSFL	AX, AX
	0x000c 00012 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:7)	RET
"".f16 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	BTSL	$16, AX
	0x0009 00009 (x.go:8)	BSFL	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:8)	RET
Change-Id: I0551e357348de2b724737d569afd6ac9f5c3aa11
Reviewed-on: https://go-review.googlesource.com/108940
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:46:41 -07:00
										 |  |  | 		sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt8to32, types.Types[TUINT32], args[0]) | 
					
						
							|  |  |  | 			c := s.constInt32(types.Types[TUINT32], 1<<8) | 
					
						
							|  |  |  | 			y := s.newValue2(ssa.OpOr32, types.Types[TUINT32], x, c) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz32, types.Types[TINT], y) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM, sys.MIPS) | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize TrailingZeros(8|16) on amd64
Introduce Ctz8 and Ctz16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
name               old time/op  new time/op  delta
TrailingZeros8-8   1.33ns ± 6%  0.84ns ± 3%  -36.90%  (p=0.000 n=20+20)
TrailingZeros16-8  1.26ns ± 5%  0.84ns ± 5%  -33.50%  (p=0.000 n=20+18)
Code:
func f8(x uint8)   { z = bits.TrailingZeros8(x) }
func f16(x uint16) { z = bits.TrailingZeros16(x) }
Before:
"".f8 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	BTSQ	$8, AX
	0x000d 00013 (x.go:7)	BSFQ	AX, AX
	0x0011 00017 (x.go:7)	MOVL	$64, CX
	0x0016 00022 (x.go:7)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:7)	RET
"".f16 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BTSQ	$16, AX
	0x000d 00013 (x.go:8)	BSFQ	AX, AX
	0x0011 00017 (x.go:8)	MOVL	$64, CX
	0x0016 00022 (x.go:8)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:8)	RET
After:
"".f8 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	BTSL	$8, AX
	0x0009 00009 (x.go:7)	BSFL	AX, AX
	0x000c 00012 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:7)	RET
"".f16 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	BTSL	$16, AX
	0x0009 00009 (x.go:8)	BSFL	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:8)	RET
Change-Id: I0551e357348de2b724737d569afd6ac9f5c3aa11
Reviewed-on: https://go-review.googlesource.com/108940
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:46:41 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz8, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	addF("math/bits", "TrailingZeros8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt8to64, types.Types[TUINT64], args[0]) | 
					
						
							|  |  |  | 			c := s.constInt64(types.Types[TUINT64], 1<<8) | 
					
						
							|  |  |  | 			y := s.newValue2(ssa.OpOr64, types.Types[TUINT64], x, c) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpCtz64, types.Types[TINT], y) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize TrailingZeros(8|16) on amd64
Introduce Ctz8 and Ctz16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
name               old time/op  new time/op  delta
TrailingZeros8-8   1.33ns ± 6%  0.84ns ± 3%  -36.90%  (p=0.000 n=20+20)
TrailingZeros16-8  1.26ns ± 5%  0.84ns ± 5%  -33.50%  (p=0.000 n=20+18)
Code:
func f8(x uint8)   { z = bits.TrailingZeros8(x) }
func f16(x uint16) { z = bits.TrailingZeros16(x) }
Before:
"".f8 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	BTSQ	$8, AX
	0x000d 00013 (x.go:7)	BSFQ	AX, AX
	0x0011 00017 (x.go:7)	MOVL	$64, CX
	0x0016 00022 (x.go:7)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:7)	RET
"".f16 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BTSQ	$16, AX
	0x000d 00013 (x.go:8)	BSFQ	AX, AX
	0x0011 00017 (x.go:8)	MOVL	$64, CX
	0x0016 00022 (x.go:8)	CMOVQEQ	CX, AX
	0x001a 00026 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:8)	RET
After:
"".f8 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	BTSL	$8, AX
	0x0009 00009 (x.go:7)	BSFL	AX, AX
	0x000c 00012 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:7)	RET
"".f16 STEXT nosplit size=20 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	BTSL	$16, AX
	0x0009 00009 (x.go:8)	BSFL	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0013 00019 (x.go:8)	RET
Change-Id: I0551e357348de2b724737d569afd6ac9f5c3aa11
Reviewed-on: https://go-review.googlesource.com/108940
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:46:41 -07:00
										 |  |  | 		sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-03-15 21:28:29 -07:00
										 |  |  | 	alias("math/bits", "ReverseBytes64", "runtime/internal/sys", "Bswap64", all...) | 
					
						
							|  |  |  | 	alias("math/bits", "ReverseBytes32", "runtime/internal/sys", "Bswap32", all...) | 
					
						
							|  |  |  | 	// ReverseBytes inlines correctly, no need to intrinsify it. | 
					
						
							|  |  |  | 	// ReverseBytes16 lowers to a rotate, no need for anything special here. | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	addF("math/bits", "Len64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitLen64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize LeadingZeros(16|32) on amd64
Introduce Len8 and Len16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
Also use and optimize the Len32 lowering, along the same lines.
Leave Len8 unused for the moment; a subsequent CL will enable it.
For 16 and 32 bits, this leads to a speed-up.
name              old time/op  new time/op  delta
LeadingZeros16-8  1.42ns ± 5%  1.23ns ± 5%  -13.42%  (p=0.000 n=20+20)
LeadingZeros32-8  1.25ns ± 5%  1.03ns ± 5%  -17.63%  (p=0.000 n=20+16)
Code:
func f16(x uint16) { z = bits.LeadingZeros16(x) }
func f32(x uint32) { z = bits.LeadingZeros32(x) }
Before:
"".f16 STEXT nosplit size=38 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BSRQ	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	$-1, CX
	0x0013 00019 (x.go:8)	CMOVQEQ	CX, AX
	0x0017 00023 (x.go:8)	ADDQ	$-15, AX
	0x001b 00027 (x.go:8)	NEGQ	AX
	0x001e 00030 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0025 00037 (x.go:8)	RET
"".f32 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	BSRQ	AX, AX
	0x0008 00008 (x.go:9)	MOVQ	$-1, CX
	0x000f 00015 (x.go:9)	CMOVQEQ	CX, AX
	0x0013 00019 (x.go:9)	ADDQ	$-31, AX
	0x0017 00023 (x.go:9)	NEGQ	AX
	0x001a 00026 (x.go:9)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:9)	RET
After:
"".f16 STEXT nosplit size=30 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	LEAL	1(AX)(AX*1), AX
	0x000c 00012 (x.go:8)	BSRL	AX, AX
	0x000f 00015 (x.go:8)	ADDQ	$-16, AX
	0x0013 00019 (x.go:8)	NEGQ	AX
	0x0016 00022 (x.go:8)	MOVQ	AX, "".z(SB)
	0x001d 00029 (x.go:8)	RET
"".f32 STEXT nosplit size=28 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	LEAQ	1(AX)(AX*1), AX
	0x0009 00009 (x.go:9)	BSRQ	AX, AX
	0x000d 00013 (x.go:9)	ADDQ	$-32, AX
	0x0011 00017 (x.go:9)	NEGQ	AX
	0x0014 00020 (x.go:9)	MOVQ	AX, "".z(SB)
	0x001b 00027 (x.go:9)	RET
Change-Id: I6c93c173752a7bfdeab8be30777ae05a736e1f4b
Reviewed-on: https://go-review.googlesource.com/108941
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:54:45 -07:00
										 |  |  | 	addF("math/bits", "Len32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen32, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	addF("math/bits", "Len32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				return s.newValue1(ssa.OpBitLen32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt32to64, types.Types[TUINT64], args[0]) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen64, types.Types[TINT], x) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize LeadingZeros(16|32) on amd64
Introduce Len8 and Len16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
Also use and optimize the Len32 lowering, along the same lines.
Leave Len8 unused for the moment; a subsequent CL will enable it.
For 16 and 32 bits, this leads to a speed-up.
name              old time/op  new time/op  delta
LeadingZeros16-8  1.42ns ± 5%  1.23ns ± 5%  -13.42%  (p=0.000 n=20+20)
LeadingZeros32-8  1.25ns ± 5%  1.03ns ± 5%  -17.63%  (p=0.000 n=20+16)
Code:
func f16(x uint16) { z = bits.LeadingZeros16(x) }
func f32(x uint32) { z = bits.LeadingZeros32(x) }
Before:
"".f16 STEXT nosplit size=38 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BSRQ	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	$-1, CX
	0x0013 00019 (x.go:8)	CMOVQEQ	CX, AX
	0x0017 00023 (x.go:8)	ADDQ	$-15, AX
	0x001b 00027 (x.go:8)	NEGQ	AX
	0x001e 00030 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0025 00037 (x.go:8)	RET
"".f32 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	BSRQ	AX, AX
	0x0008 00008 (x.go:9)	MOVQ	$-1, CX
	0x000f 00015 (x.go:9)	CMOVQEQ	CX, AX
	0x0013 00019 (x.go:9)	ADDQ	$-31, AX
	0x0017 00023 (x.go:9)	NEGQ	AX
	0x001a 00026 (x.go:9)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:9)	RET
After:
"".f16 STEXT nosplit size=30 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	LEAL	1(AX)(AX*1), AX
	0x000c 00012 (x.go:8)	BSRL	AX, AX
	0x000f 00015 (x.go:8)	ADDQ	$-16, AX
	0x0013 00019 (x.go:8)	NEGQ	AX
	0x0016 00022 (x.go:8)	MOVQ	AX, "".z(SB)
	0x001d 00029 (x.go:8)	RET
"".f32 STEXT nosplit size=28 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	LEAQ	1(AX)(AX*1), AX
	0x0009 00009 (x.go:9)	BSRQ	AX, AX
	0x000d 00013 (x.go:9)	ADDQ	$-32, AX
	0x0011 00017 (x.go:9)	NEGQ	AX
	0x0014 00020 (x.go:9)	MOVQ	AX, "".z(SB)
	0x001b 00027 (x.go:9)	RET
Change-Id: I6c93c173752a7bfdeab8be30777ae05a736e1f4b
Reviewed-on: https://go-review.googlesource.com/108941
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:54:45 -07:00
										 |  |  | 		sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	addF("math/bits", "Len16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				x := s.newValue1(ssa.OpZeroExt16to32, types.Types[TUINT32], args[0]) | 
					
						
							|  |  |  | 				return s.newValue1(ssa.OpBitLen32, types.Types[TINT], x) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt16to64, types.Types[TUINT64], args[0]) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen64, types.Types[TINT], x) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: optimize LeadingZeros(16|32) on amd64
Introduce Len8 and Len16 ops and provide optimized lowerings for them.
amd64 only for this CL, although it wouldn't surprise me
if other architectures also admit of optimized lowerings.
Also use and optimize the Len32 lowering, along the same lines.
Leave Len8 unused for the moment; a subsequent CL will enable it.
For 16 and 32 bits, this leads to a speed-up.
name              old time/op  new time/op  delta
LeadingZeros16-8  1.42ns ± 5%  1.23ns ± 5%  -13.42%  (p=0.000 n=20+20)
LeadingZeros32-8  1.25ns ± 5%  1.03ns ± 5%  -17.63%  (p=0.000 n=20+16)
Code:
func f16(x uint16) { z = bits.LeadingZeros16(x) }
func f32(x uint32) { z = bits.LeadingZeros32(x) }
Before:
"".f16 STEXT nosplit size=38 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	BSRQ	AX, AX
	0x000c 00012 (x.go:8)	MOVQ	$-1, CX
	0x0013 00019 (x.go:8)	CMOVQEQ	CX, AX
	0x0017 00023 (x.go:8)	ADDQ	$-15, AX
	0x001b 00027 (x.go:8)	NEGQ	AX
	0x001e 00030 (x.go:8)	MOVQ	AX, "".z(SB)
	0x0025 00037 (x.go:8)	RET
"".f32 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	BSRQ	AX, AX
	0x0008 00008 (x.go:9)	MOVQ	$-1, CX
	0x000f 00015 (x.go:9)	CMOVQEQ	CX, AX
	0x0013 00019 (x.go:9)	ADDQ	$-31, AX
	0x0017 00023 (x.go:9)	NEGQ	AX
	0x001a 00026 (x.go:9)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:9)	RET
After:
"".f16 STEXT nosplit size=30 args=0x8 locals=0x0
	0x0000 00000 (x.go:8)	TEXT	"".f16(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:8)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:8)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:8)	MOVWLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:8)	MOVWLZX	AX, AX
	0x0008 00008 (x.go:8)	LEAL	1(AX)(AX*1), AX
	0x000c 00012 (x.go:8)	BSRL	AX, AX
	0x000f 00015 (x.go:8)	ADDQ	$-16, AX
	0x0013 00019 (x.go:8)	NEGQ	AX
	0x0016 00022 (x.go:8)	MOVQ	AX, "".z(SB)
	0x001d 00029 (x.go:8)	RET
"".f32 STEXT nosplit size=28 args=0x8 locals=0x0
	0x0000 00000 (x.go:9)	TEXT	"".f32(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:9)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:9)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:9)	MOVL	"".x+8(SP), AX
	0x0004 00004 (x.go:9)	LEAQ	1(AX)(AX*1), AX
	0x0009 00009 (x.go:9)	BSRQ	AX, AX
	0x000d 00013 (x.go:9)	ADDQ	$-32, AX
	0x0011 00017 (x.go:9)	NEGQ	AX
	0x0014 00020 (x.go:9)	MOVQ	AX, "".z(SB)
	0x001b 00027 (x.go:9)	RET
Change-Id: I6c93c173752a7bfdeab8be30777ae05a736e1f4b
Reviewed-on: https://go-review.googlesource.com/108941
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Giovanni Bajo <rasky@develer.com>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 14:54:45 -07:00
										 |  |  | 		sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							|  |  |  | 	addF("math/bits", "Len16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen16, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	addF("math/bits", "Len8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				x := s.newValue1(ssa.OpZeroExt8to32, types.Types[TUINT32], args[0]) | 
					
						
							|  |  |  | 				return s.newValue1(ssa.OpBitLen32, types.Types[TINT], x) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			x := s.newValue1(ssa.OpZeroExt8to64, types.Types[TUINT64], args[0]) | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen64, types.Types[TINT], x) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
											  
											
												cmd/compile: use intrinsic for LeadingZeros8 on amd64
The previous change sped up the pure computation form of LeadingZeros8.
This places it somewhat close to the table lookup form.
Depending on something that varies from toolchain to toolchain
(alignment, perhaps?), the slowdown from ditching the table lookup
is either 20% or 5%.
This benchmark is the best case scenario for the table lookup:
It is in the L1 cache already.
I think we're close enough that we can switch to the computational version,
and trust that the memory effects and binary size savings will be worth it.
Code:
func f8(x uint8)   { z = bits.LeadingZeros8(x) }
Before:
"".f8 STEXT nosplit size=34 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	LEAQ	math/bits.len8tab(SB), CX
	0x000f 00015 (x.go:7)	MOVBLZX	(CX)(AX*1), AX
	0x0013 00019 (x.go:7)	ADDQ	$-8, AX
	0x0017 00023 (x.go:7)	NEGQ	AX
	0x001a 00026 (x.go:7)	MOVQ	AX, "".z(SB)
	0x0021 00033 (x.go:7)	RET
After:
"".f8 STEXT nosplit size=30 args=0x8 locals=0x0
	0x0000 00000 (x.go:7)	TEXT	"".f8(SB), NOSPLIT, $0-8
	0x0000 00000 (x.go:7)	FUNCDATA	$0, gclocals·2a5305abe05176240e61b8620e19a815(SB)
	0x0000 00000 (x.go:7)	FUNCDATA	$1, gclocals·33cdeccccebe80329f1fdbee7f5874cb(SB)
	0x0000 00000 (x.go:7)	MOVBLZX	"".x+8(SP), AX
	0x0005 00005 (x.go:7)	MOVBLZX	AL, AX
	0x0008 00008 (x.go:7)	LEAL	1(AX)(AX*1), AX
	0x000c 00012 (x.go:7)	BSRL	AX, AX
	0x000f 00015 (x.go:7)	ADDQ	$-8, AX
	0x0013 00019 (x.go:7)	NEGQ	AX
	0x0016 00022 (x.go:7)	MOVQ	AX, "".z(SB)
	0x001d 00029 (x.go:7)	RET
Change-Id: Icc7db50a7820fb9a3da8a816d6b6940d7f8e193e
Reviewed-on: https://go-review.googlesource.com/108942
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2018-04-23 15:38:50 -07:00
										 |  |  | 	addF("math/bits", "Len8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpBitLen8, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	addF("math/bits", "Len", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				return s.newValue1(ssa.OpBitLen32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitLen64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.ARM, sys.S390X, sys.MIPS, sys.PPC64) | 
					
						
							| 
									
										
										
										
											2017-03-16 14:08:31 -07:00
										 |  |  | 	// LeadingZeros is handled because it trivially calls Len. | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 	addF("math/bits", "Reverse64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitRev64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							|  |  |  | 	addF("math/bits", "Reverse32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitRev32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							|  |  |  | 	addF("math/bits", "Reverse16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitRev16, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							|  |  |  | 	addF("math/bits", "Reverse8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitRev8, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							|  |  |  | 	addF("math/bits", "Reverse", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				return s.newValue1(ssa.OpBitRev32, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue1(ssa.OpBitRev64, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 22:34:38 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ARM64) | 
					
						
							| 
									
										
										
										
											2018-08-30 15:47:04 -06:00
										 |  |  | 	addF("math/bits", "RotateLeft8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpRotateLeft8, types.Types[TUINT8], args[0], args[1]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							|  |  |  | 	addF("math/bits", "RotateLeft16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpRotateLeft16, types.Types[TUINT16], args[0], args[1]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.AMD64) | 
					
						
							|  |  |  | 	addF("math/bits", "RotateLeft32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpRotateLeft32, types.Types[TUINT32], args[0], args[1]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-06-30 06:48:51 +00:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2018-08-30 15:47:04 -06:00
										 |  |  | 	addF("math/bits", "RotateLeft64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue2(ssa.OpRotateLeft64, types.Types[TUINT64], args[0], args[1]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-06-30 06:48:51 +00:00
										 |  |  | 		sys.AMD64, sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2018-08-30 15:47:04 -06:00
										 |  |  | 	alias("math/bits", "RotateLeft", "math/bits", "RotateLeft64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 	makeOnesCountAMD64 := func(op64 ssa.Op, op32 ssa.Op) func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		return func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2018-04-18 08:48:04 -07:00
										 |  |  | 			addr := s.entryNewValue1A(ssa.OpAddr, types.Types[TBOOL].PtrTo(), supportPopcnt, s.sb) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			v := s.load(types.Types[TBOOL], addr) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 			b.SetControl(v) | 
					
						
							|  |  |  | 			bTrue := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bFalse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bTrue) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bFalse) | 
					
						
							|  |  |  | 			b.Likely = ssa.BranchLikely // most machines have popcnt nowadays | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// We have the intrinsic - use it directly. | 
					
						
							|  |  |  | 			s.startBlock(bTrue) | 
					
						
							|  |  |  | 			op := op64 | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			if s.config.PtrSize == 4 { | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 				op = op32 | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			s.vars[n] = s.newValue1(op, types.Types[TINT], args[0]) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Call the pure Go version. | 
					
						
							|  |  |  | 			s.startBlock(bFalse) | 
					
						
							|  |  |  | 			a := s.call(n, callNormal) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.vars[n] = s.load(types.Types[TINT], a) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 			s.endBlock().AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Merge results. | 
					
						
							|  |  |  | 			s.startBlock(bEnd) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.variable(n, types.Types[TINT]) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	addF("math/bits", "OnesCount64", | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		makeOnesCountAMD64(ssa.OpPopCount64, ssa.OpPopCount64), | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 	addF("math/bits", "OnesCount64", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpPopCount64, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: implement OnesCount{8,16,32,64} intrinsics on s390x
This CL implements the math/bits.OnesCount{8,16,32,64} functions
as intrinsics on s390x using the 'population count' (popcnt)
instruction. This instruction was released as the 'population-count'
facility which uses the same facility bit (45) as the
'distinct-operands' facility which is a pre-requisite for Go on
s390x. We can therefore use it without a feature check.
The s390x popcnt instruction treats a 64 bit register as a vector
of 8 bytes, summing the number of ones in each byte individually.
It then writes the results to the corresponding bytes in the
output register. Therefore to implement OnesCount{16,32,64} we
need to sum the individual byte counts using some extra
instructions. To do this efficiently I've added some additional
pseudo operations to the s390x SSA backend.
Unlike other architectures the new instruction sequence is faster
for OnesCount8, so that is implemented using the intrinsic.
name         old time/op  new time/op  delta
OnesCount    3.21ns ± 1%  1.35ns ± 0%  -58.00%  (p=0.000 n=20+20)
OnesCount8   0.91ns ± 1%  0.81ns ± 0%  -11.43%  (p=0.000 n=20+20)
OnesCount16  1.51ns ± 3%  1.21ns ± 0%  -19.71%  (p=0.000 n=20+17)
OnesCount32  1.91ns ± 0%  1.12ns ± 1%  -41.60%  (p=0.000 n=19+20)
OnesCount64  3.18ns ± 4%  1.35ns ± 0%  -57.52%  (p=0.000 n=20+20)
Change-Id: Id54f0bd28b6db9a887ad12c0d72fcc168ef9c4e0
Reviewed-on: https://go-review.googlesource.com/114675
Run-TryBot: Michael Munday <mike.munday@ibm.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2018-05-25 17:54:58 +01:00
										 |  |  | 		sys.PPC64, sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 	addF("math/bits", "OnesCount32", | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		makeOnesCountAMD64(ssa.OpPopCount32, ssa.OpPopCount32), | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 	addF("math/bits", "OnesCount32", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpPopCount32, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: implement OnesCount{8,16,32,64} intrinsics on s390x
This CL implements the math/bits.OnesCount{8,16,32,64} functions
as intrinsics on s390x using the 'population count' (popcnt)
instruction. This instruction was released as the 'population-count'
facility which uses the same facility bit (45) as the
'distinct-operands' facility which is a pre-requisite for Go on
s390x. We can therefore use it without a feature check.
The s390x popcnt instruction treats a 64 bit register as a vector
of 8 bytes, summing the number of ones in each byte individually.
It then writes the results to the corresponding bytes in the
output register. Therefore to implement OnesCount{16,32,64} we
need to sum the individual byte counts using some extra
instructions. To do this efficiently I've added some additional
pseudo operations to the s390x SSA backend.
Unlike other architectures the new instruction sequence is faster
for OnesCount8, so that is implemented using the intrinsic.
name         old time/op  new time/op  delta
OnesCount    3.21ns ± 1%  1.35ns ± 0%  -58.00%  (p=0.000 n=20+20)
OnesCount8   0.91ns ± 1%  0.81ns ± 0%  -11.43%  (p=0.000 n=20+20)
OnesCount16  1.51ns ± 3%  1.21ns ± 0%  -19.71%  (p=0.000 n=20+17)
OnesCount32  1.91ns ± 0%  1.12ns ± 1%  -41.60%  (p=0.000 n=19+20)
OnesCount64  3.18ns ± 4%  1.35ns ± 0%  -57.52%  (p=0.000 n=20+20)
Change-Id: Id54f0bd28b6db9a887ad12c0d72fcc168ef9c4e0
Reviewed-on: https://go-review.googlesource.com/114675
Run-TryBot: Michael Munday <mike.munday@ibm.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2018-05-25 17:54:58 +01:00
										 |  |  | 		sys.PPC64, sys.ARM64, sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 	addF("math/bits", "OnesCount16", | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		makeOnesCountAMD64(ssa.OpPopCount16, ssa.OpPopCount16), | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2018-01-30 12:16:52 -05:00
										 |  |  | 	addF("math/bits", "OnesCount16", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpPopCount16, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							| 
									
										
											  
											
												cmd/compile: implement OnesCount{8,16,32,64} intrinsics on s390x
This CL implements the math/bits.OnesCount{8,16,32,64} functions
as intrinsics on s390x using the 'population count' (popcnt)
instruction. This instruction was released as the 'population-count'
facility which uses the same facility bit (45) as the
'distinct-operands' facility which is a pre-requisite for Go on
s390x. We can therefore use it without a feature check.
The s390x popcnt instruction treats a 64 bit register as a vector
of 8 bytes, summing the number of ones in each byte individually.
It then writes the results to the corresponding bytes in the
output register. Therefore to implement OnesCount{16,32,64} we
need to sum the individual byte counts using some extra
instructions. To do this efficiently I've added some additional
pseudo operations to the s390x SSA backend.
Unlike other architectures the new instruction sequence is faster
for OnesCount8, so that is implemented using the intrinsic.
name         old time/op  new time/op  delta
OnesCount    3.21ns ± 1%  1.35ns ± 0%  -58.00%  (p=0.000 n=20+20)
OnesCount8   0.91ns ± 1%  0.81ns ± 0%  -11.43%  (p=0.000 n=20+20)
OnesCount16  1.51ns ± 3%  1.21ns ± 0%  -19.71%  (p=0.000 n=20+17)
OnesCount32  1.91ns ± 0%  1.12ns ± 1%  -41.60%  (p=0.000 n=19+20)
OnesCount64  3.18ns ± 4%  1.35ns ± 0%  -57.52%  (p=0.000 n=20+20)
Change-Id: Id54f0bd28b6db9a887ad12c0d72fcc168ef9c4e0
Reviewed-on: https://go-review.googlesource.com/114675
Run-TryBot: Michael Munday <mike.munday@ibm.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2018-05-25 17:54:58 +01:00
										 |  |  | 		sys.ARM64, sys.S390X) | 
					
						
							|  |  |  | 	addF("math/bits", "OnesCount8", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpPopCount8, types.Types[TINT], args[0]) | 
					
						
							|  |  |  | 		}, | 
					
						
							|  |  |  | 		sys.S390X) | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 	addF("math/bits", "OnesCount", | 
					
						
							| 
									
										
										
										
											2017-04-24 15:11:39 -04:00
										 |  |  | 		makeOnesCountAMD64(ssa.OpPopCount64, ssa.OpPopCount32), | 
					
						
							| 
									
										
										
										
											2017-03-16 21:33:03 -07:00
										 |  |  | 		sys.AMD64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	/******** sync/atomic ********/ | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Note: these are disabled by flag_race in findIntrinsic below. | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadInt32", "runtime/internal/atomic", "Load", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadInt64", "runtime/internal/atomic", "Load64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadPointer", "runtime/internal/atomic", "Loadp", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadUint32", "runtime/internal/atomic", "Load", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadUint64", "runtime/internal/atomic", "Load64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadUintptr", "runtime/internal/atomic", "Load", p4...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "LoadUintptr", "runtime/internal/atomic", "Load64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	alias("sync/atomic", "StoreInt32", "runtime/internal/atomic", "Store", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "StoreInt64", "runtime/internal/atomic", "Store64", all...) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	// Note: not StorePointer, that needs a write barrier.  Same below for {CompareAnd}Swap. | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	alias("sync/atomic", "StoreUint32", "runtime/internal/atomic", "Store", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "StoreUint64", "runtime/internal/atomic", "Store64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "StoreUintptr", "runtime/internal/atomic", "Store", p4...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "StoreUintptr", "runtime/internal/atomic", "Store64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapInt32", "runtime/internal/atomic", "Xchg", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapInt64", "runtime/internal/atomic", "Xchg64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapUint32", "runtime/internal/atomic", "Xchg", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapUint64", "runtime/internal/atomic", "Xchg64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapUintptr", "runtime/internal/atomic", "Xchg", p4...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "SwapUintptr", "runtime/internal/atomic", "Xchg64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapInt32", "runtime/internal/atomic", "Cas", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapInt64", "runtime/internal/atomic", "Cas64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapUint32", "runtime/internal/atomic", "Cas", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapUint64", "runtime/internal/atomic", "Cas64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapUintptr", "runtime/internal/atomic", "Cas", p4...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "CompareAndSwapUintptr", "runtime/internal/atomic", "Cas64", p8...) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddInt32", "runtime/internal/atomic", "Xadd", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddInt64", "runtime/internal/atomic", "Xadd64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddUint32", "runtime/internal/atomic", "Xadd", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddUint64", "runtime/internal/atomic", "Xadd64", all...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddUintptr", "runtime/internal/atomic", "Xadd", p4...) | 
					
						
							|  |  |  | 	alias("sync/atomic", "AddUintptr", "runtime/internal/atomic", "Xadd64", p8...) | 
					
						
							| 
									
										
										
										
											2016-10-06 15:43:47 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	/******** math/big ********/ | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	add("math/big", "mulWW", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			return s.newValue2(ssa.OpMul64uhilo, types.NewTuple(types.Types[TUINT64], types.Types[TUINT64]), args[0], args[1]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							| 
									
										
										
										
											2018-04-02 16:22:08 -04:00
										 |  |  | 		sys.ArchAMD64, sys.ArchARM64) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	add("math/big", "divWW", | 
					
						
							|  |  |  | 		func(s *state, n *Node, args []*ssa.Value) *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			return s.newValue3(ssa.OpDiv128u, types.NewTuple(types.Types[TUINT64], types.Types[TUINT64]), args[0], args[1], args[2]) | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 		}, | 
					
						
							|  |  |  | 		sys.ArchAMD64) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // findIntrinsic returns a function which builds the SSA equivalent of the | 
					
						
							|  |  |  | // function identified by the symbol sym.  If sym is not an intrinsic call, returns nil. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func findIntrinsic(sym *types.Sym) intrinsicBuilder { | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 	if ssa.IntrinsicsDisable { | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 		return nil | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	if sym == nil || sym.Pkg == nil { | 
					
						
							|  |  |  | 		return nil | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	pkg := sym.Pkg.Path | 
					
						
							| 
									
										
										
										
											2016-10-06 15:43:47 -04:00
										 |  |  | 	if sym.Pkg == localpkg { | 
					
						
							|  |  |  | 		pkg = myimportpath | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	if flag_race && pkg == "sync/atomic" { | 
					
						
							|  |  |  | 		// The race detector needs to be able to intercept these calls. | 
					
						
							|  |  |  | 		// We can't intrinsify them. | 
					
						
							|  |  |  | 		return nil | 
					
						
							| 
									
										
										
										
											2016-08-23 16:49:28 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-11-10 18:08:48 +01:00
										 |  |  | 	// Skip intrinsifying math functions (which may contain hard-float | 
					
						
							|  |  |  | 	// instructions) when soft-float | 
					
						
							|  |  |  | 	if thearch.SoftFloat && pkg == "math" { | 
					
						
							|  |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-14 13:25:12 -07:00
										 |  |  | 	fn := sym.Name | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	return intrinsics[intrinsicKey{thearch.LinkArch.Arch, pkg, fn}] | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-08-23 16:49:28 -07:00
										 |  |  | func isIntrinsicCall(n *Node) bool { | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 	if n == nil || n.Left == nil { | 
					
						
							|  |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	return findIntrinsic(n.Left.Sym) != nil | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // intrinsicCall converts a call to a recognized intrinsic function into the intrinsic SSA operation. | 
					
						
							|  |  |  | func (s *state) intrinsicCall(n *Node) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | 	v := findIntrinsic(n.Left.Sym)(s, n, s.intrinsicArgs(n)) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	if ssa.IntrinsicsDebug > 0 { | 
					
						
							|  |  |  | 		x := v | 
					
						
							|  |  |  | 		if x == nil { | 
					
						
							|  |  |  | 			x = s.mem() | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if x.Op == ssa.OpSelect0 || x.Op == ssa.OpSelect1 { | 
					
						
							|  |  |  | 			x = x.Args[0] | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		Warnl(n.Pos, "intrinsic substitution for %v with %s", n.Left.Sym.Name, x.LongString()) | 
					
						
							| 
									
										
										
										
											2016-08-29 20:28:20 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	return v | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | type callArg struct { | 
					
						
							|  |  |  | 	offset int64 | 
					
						
							|  |  |  | 	v      *ssa.Value | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | type byOffset []callArg | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (x byOffset) Len() int      { return len(x) } | 
					
						
							|  |  |  | func (x byOffset) Swap(i, j int) { x[i], x[j] = x[j], x[i] } | 
					
						
							|  |  |  | func (x byOffset) Less(i, j int) bool { | 
					
						
							|  |  |  | 	return x[i].offset < x[j].offset | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // intrinsicArgs extracts args from n, evaluates them to SSA values, and returns them. | 
					
						
							|  |  |  | func (s *state) intrinsicArgs(n *Node) []*ssa.Value { | 
					
						
							|  |  |  | 	// This code is complicated because of how walk transforms calls. For a call node, | 
					
						
							|  |  |  | 	// each entry in n.List is either an assignment to OINDREGSP which actually | 
					
						
							|  |  |  | 	// stores an arg, or an assignment to a temporary which computes an arg | 
					
						
							|  |  |  | 	// which is later assigned. | 
					
						
							|  |  |  | 	// The args can also be out of order. | 
					
						
							|  |  |  | 	// TODO: when walk goes away someday, this code can go away also. | 
					
						
							|  |  |  | 	var args []callArg | 
					
						
							|  |  |  | 	temps := map[*Node]*ssa.Value{} | 
					
						
							|  |  |  | 	for _, a := range n.List.Slice() { | 
					
						
							|  |  |  | 		if a.Op != OAS { | 
					
						
							| 
									
										
										
										
											2017-11-09 23:10:43 +00:00
										 |  |  | 			s.Fatalf("non-assignment as a function argument %v", a.Op) | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		l, r := a.Left, a.Right | 
					
						
							|  |  |  | 		switch l.Op { | 
					
						
							|  |  |  | 		case ONAME: | 
					
						
							|  |  |  | 			// Evaluate and store to "temporary". | 
					
						
							|  |  |  | 			// Walk ensures these temporaries are dead outside of n. | 
					
						
							|  |  |  | 			temps[l] = s.expr(r) | 
					
						
							|  |  |  | 		case OINDREGSP: | 
					
						
							|  |  |  | 			// Store a value to an argument slot. | 
					
						
							|  |  |  | 			var v *ssa.Value | 
					
						
							|  |  |  | 			if x, ok := temps[r]; ok { | 
					
						
							|  |  |  | 				// This is a previously computed temporary. | 
					
						
							|  |  |  | 				v = x | 
					
						
							|  |  |  | 			} else { | 
					
						
							|  |  |  | 				// This is an explicit value; evaluate it. | 
					
						
							|  |  |  | 				v = s.expr(r) | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			args = append(args, callArg{l.Xoffset, v}) | 
					
						
							|  |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2017-11-09 23:10:43 +00:00
										 |  |  | 			s.Fatalf("function argument assignment target not allowed: %v", l.Op) | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-11-01 15:28:10 -07:00
										 |  |  | 	sort.Sort(byOffset(args)) | 
					
						
							|  |  |  | 	res := make([]*ssa.Value, len(args)) | 
					
						
							|  |  |  | 	for i, a := range args { | 
					
						
							|  |  |  | 		res[i] = a.v | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return res | 
					
						
							| 
									
										
										
										
											2016-08-23 16:49:28 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2016-03-11 00:10:52 -05:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | // Calls the function n using the specified call type. | 
					
						
							|  |  |  | // Returns the address of the return value (or nil if none). | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | func (s *state) call(n *Node, k callKind) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	var sym *types.Sym     // target symbol (if static) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	var closure *ssa.Value // ptr to closure to run (if dynamic) | 
					
						
							|  |  |  | 	var codeptr *ssa.Value // ptr to target code (if dynamic) | 
					
						
							|  |  |  | 	var rcvr *ssa.Value    // receiver to set | 
					
						
							|  |  |  | 	fn := n.Left | 
					
						
							|  |  |  | 	switch n.Op { | 
					
						
							|  |  |  | 	case OCALLFUNC: | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if k == callNormal && fn.Op == ONAME && fn.Class() == PFUNC { | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 			sym = fn.Sym | 
					
						
							|  |  |  | 			break | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		closure = s.expr(fn) | 
					
						
							| 
									
										
										
										
											2018-08-05 18:52:15 +02:00
										 |  |  | 		if thearch.LinkArch.Family == sys.Wasm { | 
					
						
							|  |  |  | 			// TODO(neelance): On other architectures this should be eliminated by the optimization steps | 
					
						
							|  |  |  | 			s.nilCheck(closure) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case OCALLMETH: | 
					
						
							|  |  |  | 		if fn.Op != ODOTMETH { | 
					
						
							|  |  |  | 			Fatalf("OCALLMETH: n.Left not an ODOTMETH: %v", fn) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if k == callNormal { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ODOT and friends to use Sym, not Right
The Node type ODOT and its variants all represent a selector, with a
simple name to the right of the dot.  Before this change this was
represented by using an ONAME Node in the Right field.  This ONAME node
served no useful purpose.  This CL changes these Node types to store the
symbol in the Sym field instead, thus not requiring allocating a Node
for each selector.
When compiling x/tools/go/types this CL eliminates nearly 5000 calls to
newname and reduces the total number of Nodes allocated by about 6.6%.
It seems to cut compilation time by 1 to 2 percent.
Getting this right was somewhat subtle, and I added two dubious changes
to produce the exact same output as before.  One is to ishairy in
inl.go: the ONAME node increased the cost of ODOT and friends by 1, and
I retained that, although really ODOT is not more expensive than any
other node.  The other is to varexpr in walk.go: because the ONAME in
the Right field of an ODOT has no class, varexpr would always return
false for an ODOT, although in fact for some ODOT's it seemingly ought
to return true; I added an && false for now.  I will send separate CLs,
that will break toolstash -cmp, to clean these up.
This CL passes toolstash -cmp.
Change-Id: I4af8a10cc59078c436130ce472f25abc3a9b2f80
Reviewed-on: https://go-review.googlesource.com/20890
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
											
										 
											2016-03-18 16:52:30 -07:00
										 |  |  | 			sym = fn.Sym | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 			break | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-09-20 16:34:30 -07:00
										 |  |  | 		// Make a name n2 for the function. | 
					
						
							|  |  |  | 		// fn.Sym might be sync.(*Mutex).Unlock. | 
					
						
							|  |  |  | 		// Make a PFUNC node out of that, then evaluate it. | 
					
						
							|  |  |  | 		// We get back an SSA value representing &sync.(*Mutex).Unlock·f. | 
					
						
							|  |  |  | 		// We can then pass that to defer or go. | 
					
						
							| 
									
										
										
										
											2017-03-28 15:51:05 -07:00
										 |  |  | 		n2 := newnamel(fn.Pos, fn.Sym) | 
					
						
							|  |  |  | 		n2.Name.Curfn = s.curfn | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		n2.SetClass(PFUNC) | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		n2.Pos = fn.Pos | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		n2.Type = types.Types[TUINT8] // dummy type for a static closure. Could use runtime.funcval if we had it. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ODOT and friends to use Sym, not Right
The Node type ODOT and its variants all represent a selector, with a
simple name to the right of the dot.  Before this change this was
represented by using an ONAME Node in the Right field.  This ONAME node
served no useful purpose.  This CL changes these Node types to store the
symbol in the Sym field instead, thus not requiring allocating a Node
for each selector.
When compiling x/tools/go/types this CL eliminates nearly 5000 calls to
newname and reduces the total number of Nodes allocated by about 6.6%.
It seems to cut compilation time by 1 to 2 percent.
Getting this right was somewhat subtle, and I added two dubious changes
to produce the exact same output as before.  One is to ishairy in
inl.go: the ONAME node increased the cost of ODOT and friends by 1, and
I retained that, although really ODOT is not more expensive than any
other node.  The other is to varexpr in walk.go: because the ONAME in
the Right field of an ODOT has no class, varexpr would always return
false for an ODOT, although in fact for some ODOT's it seemingly ought
to return true; I added an && false for now.  I will send separate CLs,
that will break toolstash -cmp, to clean these up.
This CL passes toolstash -cmp.
Change-Id: I4af8a10cc59078c436130ce472f25abc3a9b2f80
Reviewed-on: https://go-review.googlesource.com/20890
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
											
										 
											2016-03-18 16:52:30 -07:00
										 |  |  | 		closure = s.expr(n2) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		// Note: receiver is already assigned in n.List, so we don't | 
					
						
							|  |  |  | 		// want to set it here. | 
					
						
							|  |  |  | 	case OCALLINTER: | 
					
						
							|  |  |  | 		if fn.Op != ODOTINTER { | 
					
						
							| 
									
										
										
										
											2016-04-27 15:10:10 +10:00
										 |  |  | 			Fatalf("OCALLINTER: n.Left not an ODOTINTER: %v", fn.Op) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		i := s.expr(fn.Left) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		itab := s.newValue1(ssa.OpITab, types.Types[TUINTPTR], i) | 
					
						
							| 
									
										
										
										
											2017-11-13 19:03:31 -08:00
										 |  |  | 		s.nilCheck(itab) | 
					
						
							| 
									
										
										
										
											2017-05-31 10:08:51 -07:00
										 |  |  | 		itabidx := fn.Xoffset + 2*int64(Widthptr) + 8 // offset of fun field in runtime.itab | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		itab = s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.UintptrPtr, itabidx, itab) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		if k == callNormal { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			codeptr = s.load(types.Types[TUINTPTR], itab) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		} else { | 
					
						
							|  |  |  | 			closure = itab | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		rcvr = s.newValue1(ssa.OpIData, types.Types[TUINTPTR], i) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-27 16:37:25 -07:00
										 |  |  | 	dowidth(fn.Type) | 
					
						
							| 
									
										
										
										
											2016-03-28 14:31:57 -07:00
										 |  |  | 	stksize := fn.Type.ArgWidth() // includes receiver | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// Run all argument assignments. The arg slots have already | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	// been offset by the appropriate amount (+2*widthptr for go/defer, | 
					
						
							|  |  |  | 	// +widthptr for interface calls). | 
					
						
							|  |  |  | 	// For OCALLMETH, the receiver is set in these statements. | 
					
						
							|  |  |  | 	s.stmtList(n.List) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Set receiver (for interface calls) | 
					
						
							|  |  |  | 	if rcvr != nil { | 
					
						
							| 
									
										
										
										
											2015-10-19 13:56:55 -07:00
										 |  |  | 		argStart := Ctxt.FixedFrameSize() | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		if k != callNormal { | 
					
						
							|  |  |  | 			argStart += int64(2 * Widthptr) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		addr := s.constOffPtrSP(s.f.Config.Types.UintptrPtr, argStart) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TUINTPTR], addr, rcvr) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Defer/go args | 
					
						
							|  |  |  | 	if k != callNormal { | 
					
						
							| 
									
										
										
										
											2018-08-20 10:55:26 +02:00
										 |  |  | 		// Write argsize and closure (args to newproc/deferproc). | 
					
						
							| 
									
										
										
										
											2016-05-15 00:12:56 -04:00
										 |  |  | 		argStart := Ctxt.FixedFrameSize() | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		argsize := s.constInt32(types.Types[TUINT32], int32(stksize)) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		addr := s.constOffPtrSP(s.f.Config.Types.UInt32Ptr, argStart) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TUINT32], addr, argsize) | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		addr = s.constOffPtrSP(s.f.Config.Types.UintptrPtr, argStart+int64(Widthptr)) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TUINTPTR], addr, closure) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		stksize += 2 * int64(Widthptr) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// call target | 
					
						
							|  |  |  | 	var call *ssa.Value | 
					
						
							|  |  |  | 	switch { | 
					
						
							|  |  |  | 	case k == callDefer: | 
					
						
							| 
									
										
										
										
											2018-08-20 10:55:26 +02:00
										 |  |  | 		call = s.newValue1A(ssa.OpStaticCall, types.TypeMem, deferproc, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case k == callGo: | 
					
						
							| 
									
										
										
										
											2018-08-20 10:55:26 +02:00
										 |  |  | 		call = s.newValue1A(ssa.OpStaticCall, types.TypeMem, newproc, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case closure != nil: | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 		// rawLoad because loading the code pointer from a | 
					
						
							|  |  |  | 		// closure is always safe, but IsSanitizerSafeAddr | 
					
						
							|  |  |  | 		// can't always figure that out currently, and it's | 
					
						
							|  |  |  | 		// critical that we not clobber any arguments already | 
					
						
							|  |  |  | 		// stored onto the stack. | 
					
						
							|  |  |  | 		codeptr = s.rawLoad(types.Types[TUINTPTR], closure) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		call = s.newValue3(ssa.OpClosureCall, types.TypeMem, codeptr, closure, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case codeptr != nil: | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		call = s.newValue2(ssa.OpInterCall, types.TypeMem, codeptr, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	case sym != nil: | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		call = s.newValue1A(ssa.OpStaticCall, types.TypeMem, sym.Linksym(), s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 		Fatalf("bad call type %v %v", n.Op, n) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	call.AuxInt = stksize // Call operations carry the argsize of the callee along with them | 
					
						
							| 
									
										
										
										
											2015-09-17 16:45:10 -07:00
										 |  |  | 	s.vars[&memVar] = call | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Finish block for defers | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 	if k == callDefer { | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 		b := s.endBlock() | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 		b.Kind = ssa.BlockDefer | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 		b.SetControl(call) | 
					
						
							|  |  |  | 		bNext := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		b.AddEdgeTo(bNext) | 
					
						
							|  |  |  | 		// Add recover edge to exit code. | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 		r := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 		s.startBlock(r) | 
					
						
							|  |  |  | 		s.exit() | 
					
						
							|  |  |  | 		b.AddEdgeTo(r) | 
					
						
							|  |  |  | 		b.Likely = ssa.BranchLikely | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 		s.startBlock(bNext) | 
					
						
							| 
									
										
										
										
											2016-03-09 19:27:57 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-15 11:06:03 -07:00
										 |  |  | 	res := n.Left.Type.Results() | 
					
						
							|  |  |  | 	if res.NumFields() == 0 || k != callNormal { | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | 		// call has no return value. Continue with the next statement. | 
					
						
							|  |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-03-15 11:06:03 -07:00
										 |  |  | 	fp := res.Field(0) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	return s.constOffPtrSP(types.NewPtr(fp.Type), fp.Offset+Ctxt.FixedFrameSize()) | 
					
						
							| 
									
										
										
										
											2015-09-09 23:56:59 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | // etypesign returns the signed-ness of e, for integer/pointer etypes. | 
					
						
							|  |  |  | // -1 means signed, +1 means unsigned, 0 means non-integer/non-pointer. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func etypesign(e types.EType) int8 { | 
					
						
							| 
									
										
										
										
											2015-07-28 14:31:25 -07:00
										 |  |  | 	switch e { | 
					
						
							|  |  |  | 	case TINT8, TINT16, TINT32, TINT64, TINT: | 
					
						
							|  |  |  | 		return -1 | 
					
						
							|  |  |  | 	case TUINT8, TUINT16, TUINT32, TUINT64, TUINT, TUINTPTR, TUNSAFEPTR: | 
					
						
							|  |  |  | 		return +1 | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return 0 | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-02 09:16:22 -07:00
										 |  |  | // addr converts the address of the expression n to SSA, adds it to s and returns the SSA result. | 
					
						
							| 
									
										
										
										
											2015-07-13 15:55:37 -07:00
										 |  |  | // The value that the returned Value represents is guaranteed to be non-nil. | 
					
						
							| 
									
										
										
										
											2015-10-09 16:48:30 -04:00
										 |  |  | // If bounded is true then this address does not require a nil check for its operand | 
					
						
							|  |  |  | // even if that would otherwise be implied. | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | func (s *state) addr(n *Node, bounded bool) *ssa.Value { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	t := types.NewPtr(n.Type) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	switch n.Op { | 
					
						
							|  |  |  | 	case ONAME: | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		switch n.Class() { | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		case PEXTERN: | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 			// global variable | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 			v := s.entryNewValue1A(ssa.OpAddr, t, n.Sym.Linksym(), s.sb) | 
					
						
							| 
									
										
										
										
											2015-07-28 11:08:44 -07:00
										 |  |  | 			// TODO: Make OpAddr use AuxInt as well as Aux. | 
					
						
							|  |  |  | 			if n.Xoffset != 0 { | 
					
						
							|  |  |  | 				v = s.entryNewValue1I(ssa.OpOffPtr, v.Type, n.Xoffset, v) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			return v | 
					
						
							| 
									
										
										
										
											2015-09-11 16:40:05 -04:00
										 |  |  | 		case PPARAM: | 
					
						
							|  |  |  | 			// parameter slot | 
					
						
							| 
									
										
										
										
											2015-06-29 11:56:28 -07:00
										 |  |  | 			v := s.decladdrs[n] | 
					
						
							| 
									
										
										
										
											2015-11-16 13:20:16 -08:00
										 |  |  | 			if v != nil { | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 				return v | 
					
						
							| 
									
										
										
										
											2015-06-29 11:56:28 -07:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2018-03-08 21:00:36 +00:00
										 |  |  | 			if n == nodfp { | 
					
						
							|  |  |  | 				// Special arg that points to the frame pointer (Used by ORECOVER). | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 				return s.entryNewValue2A(ssa.OpLocalAddr, t, n, s.sp, s.startmem) | 
					
						
							| 
									
										
										
										
											2018-03-08 21:00:36 +00:00
										 |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-11-16 13:20:16 -08:00
										 |  |  | 			s.Fatalf("addr of undeclared ONAME %v. declared: %v", n, s.decladdrs) | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 		case PAUTO: | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 			return s.newValue2Apos(ssa.OpLocalAddr, t, n, s.sp, s.mem(), !n.IsAutoTmp()) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-11 16:40:05 -04:00
										 |  |  | 		case PPARAMOUT: // Same as PAUTO -- cannot generate LEA early. | 
					
						
							| 
									
										
										
										
											2015-10-15 20:25:32 -05:00
										 |  |  | 			// ensure that we reuse symbols for out parameters so | 
					
						
							|  |  |  | 			// that cse works on their addresses | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 			return s.newValue2Apos(ssa.OpLocalAddr, t, n, s.sp, s.mem(), true) | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		default: | 
					
						
							| 
									
										
										
										
											2017-11-10 18:35:30 +00:00
										 |  |  | 			s.Fatalf("variable address class %v not implemented", n.Class()) | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			return nil | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-10-24 14:33:22 -07:00
										 |  |  | 	case OINDREGSP: | 
					
						
							|  |  |  | 		// indirect off REGSP | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 		// used for storing/loading arguments/returns to/from callees | 
					
						
							| 
									
										
										
										
											2017-03-08 12:50:00 -08:00
										 |  |  | 		return s.constOffPtrSP(t, n.Xoffset) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	case OINDEX: | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 		if n.Left.Type.IsSlice() { | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 			a := s.expr(n.Left) | 
					
						
							|  |  |  | 			i := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2016-09-16 00:33:29 +10:00
										 |  |  | 			i = s.extendIndex(i, panicindex) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			len := s.newValue1(ssa.OpSliceLen, types.Types[TINT], a) | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 			if !n.Bounded() { | 
					
						
							| 
									
										
										
										
											2015-08-18 14:17:30 -07:00
										 |  |  | 				s.boundsCheck(i, len) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 			p := s.newValue1(ssa.OpSlicePtr, t, a) | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			return s.newValue2(ssa.OpPtrIndex, t, p, i) | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 		} else { // array | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 			a := s.addr(n.Left, bounded) | 
					
						
							| 
									
										
										
										
											2015-07-10 10:47:28 -06:00
										 |  |  | 			i := s.expr(n.Right) | 
					
						
							| 
									
										
										
										
											2016-09-16 00:33:29 +10:00
										 |  |  | 			i = s.extendIndex(i, panicindex) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			len := s.constInt(types.Types[TINT], n.Left.Type.NumElem()) | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 			if !n.Bounded() { | 
					
						
							| 
									
										
										
										
											2015-08-18 14:17:30 -07:00
										 |  |  | 				s.boundsCheck(i, len) | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.newValue2(ssa.OpPtrIndex, types.NewPtr(n.Left.Type.Elem()), a, i) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-07-13 21:22:16 -05:00
										 |  |  | 	case OIND: | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		return s.exprPtr(n.Left, bounded, n.Pos) | 
					
						
							| 
									
										
										
										
											2015-07-13 15:55:37 -07:00
										 |  |  | 	case ODOT: | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		p := s.addr(n.Left, bounded) | 
					
						
							|  |  |  | 		return s.newValue1I(ssa.OpOffPtr, t, n.Xoffset, p) | 
					
						
							| 
									
										
										
										
											2015-07-13 15:55:37 -07:00
										 |  |  | 	case ODOTPTR: | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		p := s.exprPtr(n.Left, bounded, n.Pos) | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		return s.newValue1I(ssa.OpOffPtr, t, n.Xoffset, p) | 
					
						
							| 
									
										
										
										
											2015-09-11 16:40:05 -04:00
										 |  |  | 	case OCLOSUREVAR: | 
					
						
							| 
									
										
										
										
											2016-03-04 12:34:43 -08:00
										 |  |  | 		return s.newValue1I(ssa.OpOffPtr, t, n.Xoffset, | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 			s.entryNewValue0(ssa.OpGetClosurePtr, s.f.Config.Types.BytePtr)) | 
					
						
							| 
									
										
										
										
											2015-10-09 16:48:30 -04:00
										 |  |  | 	case OCONVNOP: | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		addr := s.addr(n.Left, bounded) | 
					
						
							|  |  |  | 		return s.newValue1(ssa.OpCopy, t, addr) // ensure that addr has the right type | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case OCALLFUNC, OCALLINTER, OCALLMETH: | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		return s.call(n, callNormal) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	case ODOTTYPE: | 
					
						
							|  |  |  | 		v, _ := s.dottype(n, false) | 
					
						
							|  |  |  | 		if v.Op != ssa.OpLoad { | 
					
						
							|  |  |  | 			s.Fatalf("dottype of non-load") | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if v.Args[1] != s.mem() { | 
					
						
							|  |  |  | 			s.Fatalf("memory no longer live from dottype load") | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		return v.Args[0] | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 		s.Fatalf("unhandled addr %v", n.Op) | 
					
						
							| 
									
										
										
										
											2017-02-02 19:47:59 -05:00
										 |  |  | 		return nil | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | // canSSA reports whether n is SSA-able. | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | // n must be an ONAME (or an ODOT sequence with an ONAME base). | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | func (s *state) canSSA(n *Node) bool { | 
					
						
							| 
									
										
										
										
											2016-04-29 12:09:32 -07:00
										 |  |  | 	if Debug['N'] != 0 { | 
					
						
							|  |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	for n.Op == ODOT || (n.Op == OINDEX && n.Left.Type.IsArray()) { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 		n = n.Left | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 	if n.Op != ONAME { | 
					
						
							| 
									
										
										
										
											2015-06-27 15:45:20 +01:00
										 |  |  | 		return false | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 	if n.Addrtaken() { | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
											  
											
												cmd/compile: fix liveness computation for heap-escaped parameters
The liveness computation of parameters generally was never
correct, but forcing all parameters to be live throughout the
function covered up that problem. The new SSA back end is
too clever: even though it currently keeps the parameter values live
throughout the function, it may find optimizations that mean
the current values are not written back to the original parameter
stack slots immediately or ever (for example if a parameter is set
to nil, SSA constant propagation may replace all later uses of the
parameter with a constant nil, eliminating the need to write the nil
value back to the stack slot), so the liveness code must now
track the actual operations on the stack slots, exposing these
problems.
One small problem in the handling of arguments is that nodarg
can return ONAME PPARAM nodes with adjusted offsets, so that
there are actually multiple *Node pointers for the same parameter
in the instruction stream. This might be possible to correct, but
not in this CL. For now, we fix this by using n.Orig instead of n
when considering PPARAM and PPARAMOUT nodes.
The major problem in the handling of arguments is general
confusion in the liveness code about the meaning of PPARAM|PHEAP
and PPARAMOUT|PHEAP nodes, especially as contrasted with PAUTO|PHEAP.
The difference between these two is that when a local variable "moves"
to the heap, it's really just allocated there to start with; in contrast,
when an argument moves to the heap, the actual data has to be copied
there from the stack at the beginning of the function, and when a
result "moves" to the heap the value in the heap has to be copied
back to the stack when the function returns
This general confusion is also present in the SSA back end.
The PHEAP bit worked decently when I first introduced it 7 years ago (!)
in 391425ae. The back end did nothing sophisticated, and in particular
there was no analysis at all: no escape analysis, no liveness analysis,
and certainly no SSA back end. But the complications caused in the
various downstream consumers suggest that this should be a detail
kept mainly in the front end.
This CL therefore eliminates both the PHEAP bit and even the idea of
"heap variables" from the back ends.
First, it replaces the PPARAM|PHEAP, PPARAMOUT|PHEAP, and PAUTO|PHEAP
variable classes with the single PAUTOHEAP, a pseudo-class indicating
a variable maintained on the heap and available by indirecting a
local variable kept on the stack (a plain PAUTO).
Second, walkexpr replaces all references to PAUTOHEAP variables
with indirections of the corresponding PAUTO variable.
The back ends and the liveness code now just see plain indirected
variables. This may actually produce better code, but the real goal
here is to eliminate these little-used and somewhat suspect code
paths in the back end analyses.
The OPARAM node type goes away too.
A followup CL will do the same to PPARAMREF. I'm not sure that
the back ends (SSA in particular) are handling those right either,
and with the framework established in this CL that change is trivial
and the result clearly more correct.
Fixes #15747.
Change-Id: I2770b1ce3cbc93981bfc7166be66a9da12013d74
Reviewed-on: https://go-review.googlesource.com/23393
Reviewed-by: Keith Randall <khr@golang.org>
Run-TryBot: Russ Cox <rsc@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2016-05-25 01:33:24 -04:00
										 |  |  | 	if n.isParamHeapCopy() { | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTOHEAP { | 
					
						
							| 
									
										
											  
											
												cmd/compile: fix liveness computation for heap-escaped parameters
The liveness computation of parameters generally was never
correct, but forcing all parameters to be live throughout the
function covered up that problem. The new SSA back end is
too clever: even though it currently keeps the parameter values live
throughout the function, it may find optimizations that mean
the current values are not written back to the original parameter
stack slots immediately or ever (for example if a parameter is set
to nil, SSA constant propagation may replace all later uses of the
parameter with a constant nil, eliminating the need to write the nil
value back to the stack slot), so the liveness code must now
track the actual operations on the stack slots, exposing these
problems.
One small problem in the handling of arguments is that nodarg
can return ONAME PPARAM nodes with adjusted offsets, so that
there are actually multiple *Node pointers for the same parameter
in the instruction stream. This might be possible to correct, but
not in this CL. For now, we fix this by using n.Orig instead of n
when considering PPARAM and PPARAMOUT nodes.
The major problem in the handling of arguments is general
confusion in the liveness code about the meaning of PPARAM|PHEAP
and PPARAMOUT|PHEAP nodes, especially as contrasted with PAUTO|PHEAP.
The difference between these two is that when a local variable "moves"
to the heap, it's really just allocated there to start with; in contrast,
when an argument moves to the heap, the actual data has to be copied
there from the stack at the beginning of the function, and when a
result "moves" to the heap the value in the heap has to be copied
back to the stack when the function returns
This general confusion is also present in the SSA back end.
The PHEAP bit worked decently when I first introduced it 7 years ago (!)
in 391425ae. The back end did nothing sophisticated, and in particular
there was no analysis at all: no escape analysis, no liveness analysis,
and certainly no SSA back end. But the complications caused in the
various downstream consumers suggest that this should be a detail
kept mainly in the front end.
This CL therefore eliminates both the PHEAP bit and even the idea of
"heap variables" from the back ends.
First, it replaces the PPARAM|PHEAP, PPARAMOUT|PHEAP, and PAUTO|PHEAP
variable classes with the single PAUTOHEAP, a pseudo-class indicating
a variable maintained on the heap and available by indirecting a
local variable kept on the stack (a plain PAUTO).
Second, walkexpr replaces all references to PAUTOHEAP variables
with indirections of the corresponding PAUTO variable.
The back ends and the liveness code now just see plain indirected
variables. This may actually produce better code, but the real goal
here is to eliminate these little-used and somewhat suspect code
paths in the back end analyses.
The OPARAM node type goes away too.
A followup CL will do the same to PPARAMREF. I'm not sure that
the back ends (SSA in particular) are handling those right either,
and with the framework established in this CL that change is trivial
and the result clearly more correct.
Fixes #15747.
Change-Id: I2770b1ce3cbc93981bfc7166be66a9da12013d74
Reviewed-on: https://go-review.googlesource.com/23393
Reviewed-by: Keith Randall <khr@golang.org>
Run-TryBot: Russ Cox <rsc@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
											
										 
											2016-05-25 01:33:24 -04:00
										 |  |  | 		Fatalf("canSSA of PAUTOHEAP %v", n) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	switch n.Class() { | 
					
						
							| 
									
										
										
										
											2016-05-25 10:29:50 -04:00
										 |  |  | 	case PEXTERN: | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 		return false | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 	case PPARAMOUT: | 
					
						
							| 
									
										
										
										
											2017-03-15 22:55:21 -07:00
										 |  |  | 		if s.hasdefer { | 
					
						
							| 
									
										
										
										
											2017-08-19 22:33:51 +02:00
										 |  |  | 			// TODO: handle this case? Named return values must be | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 			// in memory so that the deferred function can see them. | 
					
						
							|  |  |  | 			// Maybe do: if !strings.HasPrefix(n.String(), "~") { return false } | 
					
						
							| 
									
										
										
										
											2017-01-30 14:55:12 -08:00
										 |  |  | 			// Or maybe not, see issue 18860.  Even unnamed return values | 
					
						
							|  |  |  | 			// must be written back so if a defer recovers, the caller can see them. | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 			return false | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		if s.cgoUnsafeArgs { | 
					
						
							|  |  |  | 			// Cgo effectively takes the address of all result args, | 
					
						
							|  |  |  | 			// but the compiler can't see that. | 
					
						
							|  |  |  | 			return false | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PPARAM && n.Sym != nil && n.Sym.Name == ".this" { | 
					
						
							| 
									
										
										
										
											2015-09-08 21:28:44 -07:00
										 |  |  | 		// wrappers generated by genwrapper need to update | 
					
						
							|  |  |  | 		// the .this pointer in place. | 
					
						
							| 
									
										
										
										
											2016-02-27 17:49:31 -08:00
										 |  |  | 		// TODO: treat as a PPARMOUT? | 
					
						
							| 
									
										
										
										
											2015-09-08 21:28:44 -07:00
										 |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	return canSSAType(n.Type) | 
					
						
							|  |  |  | 	// TODO: try to make more variables SSAable? | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // canSSA reports whether variables of type t are SSA-able. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func canSSAType(t *types.Type) bool { | 
					
						
							| 
									
										
										
										
											2017-04-27 16:37:25 -07:00
										 |  |  | 	dowidth(t) | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	if t.Width > int64(4*Widthptr) { | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 		// 4*Widthptr is an arbitrary constant. We want it | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 		// to be at least 3*Widthptr so slices can be registerized. | 
					
						
							|  |  |  | 		// Too big and we'll introduce too much register pressure. | 
					
						
							|  |  |  | 		return false | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	switch t.Etype { | 
					
						
							|  |  |  | 	case TARRAY: | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 		// We can't do larger arrays because dynamic indexing is | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 		// not supported on SSA variables. | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 		// TODO: allow if all indexes are constant. | 
					
						
							| 
									
										
										
										
											2017-03-24 09:36:31 -07:00
										 |  |  | 		if t.NumElem() <= 1 { | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 			return canSSAType(t.Elem()) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-06-27 15:45:20 +01:00
										 |  |  | 		return false | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	case TSTRUCT: | 
					
						
							| 
									
										
										
										
											2016-03-17 13:26:08 -07:00
										 |  |  | 		if t.NumFields() > ssa.MaxStruct { | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			return false | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-03-17 01:32:18 -07:00
										 |  |  | 		for _, t1 := range t.Fields().Slice() { | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 			if !canSSAType(t1.Type) { | 
					
						
							|  |  |  | 				return false | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 		return true | 
					
						
							| 
									
										
										
										
											2015-08-18 10:26:28 -07:00
										 |  |  | 	default: | 
					
						
							|  |  |  | 		return true | 
					
						
							| 
									
										
										
										
											2015-06-27 15:45:20 +01:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-06-10 15:03:06 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-19 21:06:53 -07:00
										 |  |  | // exprPtr evaluates n to a pointer and nil-checks it. | 
					
						
							| 
									
										
										
										
											2016-12-15 17:17:01 -08:00
										 |  |  | func (s *state) exprPtr(n *Node, bounded bool, lineno src.XPos) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-04-19 21:06:53 -07:00
										 |  |  | 	p := s.expr(n) | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 	if bounded || n.NonNil() { | 
					
						
							| 
									
										
										
										
											2017-03-16 22:42:10 -07:00
										 |  |  | 		if s.f.Frontend().Debug_checknil() && lineno.Line() > 1 { | 
					
						
							|  |  |  | 			s.f.Warnl(lineno, "removed nil check") | 
					
						
							| 
									
										
										
										
											2016-04-19 21:06:53 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		return p | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	s.nilCheck(p) | 
					
						
							|  |  |  | 	return p | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | // nilCheck generates nil pointer checking code. | 
					
						
							| 
									
										
										
										
											2015-07-24 11:55:52 -07:00
										 |  |  | // Used only for automatically inserted nil checks, | 
					
						
							|  |  |  | // not for user code like 'x != nil'. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) nilCheck(ptr *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2017-05-04 07:55:00 -07:00
										 |  |  | 	if disable_checknil != 0 || s.curfn.Func.NilCheckDisabled() { | 
					
						
							| 
									
										
										
										
											2015-08-11 09:47:45 -07:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	s.newValue2(ssa.OpNilCheck, types.TypeVoid, ptr, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | // boundsCheck generates bounds checking code. Checks if 0 <= idx < len, branches to exit if not. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | // Starts a new block on return. | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | // idx is already converted to full int width. | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | func (s *state) boundsCheck(idx, len *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2015-08-18 15:25:40 -07:00
										 |  |  | 	if Debug['B'] != 0 { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// bounds check | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(ssa.OpIsInBounds, types.Types[TBOOL], idx, len) | 
					
						
							| 
									
										
										
										
											2016-09-16 00:33:29 +10:00
										 |  |  | 	s.check(cmp, panicindex) | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | // sliceBoundsCheck generates slice bounds checking code. Checks if 0 <= idx <= len, branches to exit if not. | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | // Starts a new block on return. | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | // idx and len are already converted to full int width. | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | func (s *state) sliceBoundsCheck(idx, len *ssa.Value) { | 
					
						
							|  |  |  | 	if Debug['B'] != 0 { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// bounds check | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(ssa.OpIsSliceInBounds, types.Types[TBOOL], idx, len) | 
					
						
							| 
									
										
										
										
											2015-09-17 16:54:15 -07:00
										 |  |  | 	s.check(cmp, panicslice) | 
					
						
							| 
									
										
										
										
											2015-08-24 23:52:03 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | // If cmp (a bool) is false, panic using the given function. | 
					
						
							| 
									
										
										
										
											2017-02-06 13:40:19 -08:00
										 |  |  | func (s *state) check(cmp *ssa.Value, fn *obj.LSym) { | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(cmp) | 
					
						
							| 
									
										
										
										
											2015-08-11 17:28:56 -07:00
										 |  |  | 	b.Likely = ssa.BranchLikely | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	bNext := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2016-12-07 18:14:35 -08:00
										 |  |  | 	line := s.peekPos() | 
					
						
							| 
									
										
										
										
											2017-05-11 11:41:02 -07:00
										 |  |  | 	pos := Ctxt.PosTable.Pos(line) | 
					
						
							| 
									
										
										
										
											2017-10-02 15:47:41 -07:00
										 |  |  | 	fl := funcLine{f: fn, base: pos.Base(), line: pos.Line()} | 
					
						
							| 
									
										
										
										
											2017-05-11 11:41:02 -07:00
										 |  |  | 	bPanic := s.panics[fl] | 
					
						
							| 
									
										
										
										
											2015-11-09 21:35:40 -08:00
										 |  |  | 	if bPanic == nil { | 
					
						
							|  |  |  | 		bPanic = s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2017-05-11 11:41:02 -07:00
										 |  |  | 		s.panics[fl] = bPanic | 
					
						
							| 
									
										
										
										
											2015-11-09 21:35:40 -08:00
										 |  |  | 		s.startBlock(bPanic) | 
					
						
							|  |  |  | 		// The panic call takes/returns memory to ensure that the right | 
					
						
							|  |  |  | 		// memory state is observed if the panic happens. | 
					
						
							|  |  |  | 		s.rtcall(fn, false, nil) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bNext) | 
					
						
							|  |  |  | 	b.AddEdgeTo(bPanic) | 
					
						
							| 
									
										
										
										
											2015-05-18 16:44:20 -07:00
										 |  |  | 	s.startBlock(bNext) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-04 16:59:46 -07:00
										 |  |  | func (s *state) intDivide(n *Node, a, b *ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 	needcheck := true | 
					
						
							|  |  |  | 	switch b.Op { | 
					
						
							|  |  |  | 	case ssa.OpConst8, ssa.OpConst16, ssa.OpConst32, ssa.OpConst64: | 
					
						
							|  |  |  | 		if b.AuxInt != 0 { | 
					
						
							|  |  |  | 			needcheck = false | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if needcheck { | 
					
						
							|  |  |  | 		// do a size-appropriate check for zero | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		cmp := s.newValue2(s.ssaOp(ONE, n.Type), types.Types[TBOOL], b, s.zeroVal(n.Type)) | 
					
						
							| 
									
										
										
										
											2016-09-04 16:59:46 -07:00
										 |  |  | 		s.check(cmp, panicdivide) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return s.newValue2(s.ssaOp(n.Op, n.Type), a.Type, a, b) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | // rtcall issues a call to the given runtime function fn with the listed args. | 
					
						
							|  |  |  | // Returns a slice of results of the given result types. | 
					
						
							|  |  |  | // The call is added to the end of the current block. | 
					
						
							|  |  |  | // If returns is false, the block is marked as an exit block. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) rtcall(fn *obj.LSym, returns bool, results []*types.Type, args ...*ssa.Value) []*ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 	// Write args to the stack | 
					
						
							| 
									
										
										
										
											2016-05-15 00:12:56 -04:00
										 |  |  | 	off := Ctxt.FixedFrameSize() | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 	for _, arg := range args { | 
					
						
							|  |  |  | 		t := arg.Type | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		off = Rnd(off, t.Alignment()) | 
					
						
							| 
									
										
										
										
											2017-03-08 12:50:00 -08:00
										 |  |  | 		ptr := s.constOffPtrSP(t.PtrTo(), off) | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		size := t.Size() | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(t, ptr, arg) | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 		off += size | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-21 19:16:15 -07:00
										 |  |  | 	off = Rnd(off, int64(Widthreg)) | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// Issue call | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	call := s.newValue1A(ssa.OpStaticCall, types.TypeMem, fn, s.mem()) | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 	s.vars[&memVar] = call | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	if !returns { | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 		// Finish block | 
					
						
							|  |  |  | 		b := s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 		b.Kind = ssa.BlockExit | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 		b.SetControl(call) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: fix argument size of runtime call in SSA for ARM
The argument size for runtime call was incorrectly includes the size
of LR (FixedFrameSize in general). This makes the stack frame
sometimes unnecessarily 4 bytes larger on ARM.
For example,
	func f(b []byte) byte { return b[0] }
compiles to
	0x0000 00000 (h.go:6)	TEXT	"".f(SB), $4-16 // <-- framesize = 4
	0x0000 00000 (h.go:6)	MOVW	8(g), R1
	0x0004 00004 (h.go:6)	CMP	R1, R13
	0x0008 00008 (h.go:6)	BLS	52
	0x000c 00012 (h.go:6)	MOVW.W	R14, -8(R13)
	0x0010 00016 (h.go:6)	FUNCDATA	$0, gclocals·8355ad952265fec823c17fcf739bd009(SB)
	0x0010 00016 (h.go:6)	FUNCDATA	$1, gclocals·69c1753bd5f81501d95132d08af04464(SB)
	0x0010 00016 (h.go:6)	MOVW	"".b+4(FP), R0
	0x0014 00020 (h.go:6)	CMP	$0, R0
	0x0018 00024 (h.go:6)	BLS	44
	0x001c 00028 (h.go:6)	MOVW	"".b(FP), R0
	0x0020 00032 (h.go:6)	MOVBU	(R0), R0
	0x0024 00036 (h.go:6)	MOVB	R0, "".~r1+12(FP)
	0x0028 00040 (h.go:6)	MOVW.P	8(R13), R15
	0x002c 00044 (h.go:6)	PCDATA	$0, $1
	0x002c 00044 (h.go:6)	CALL	runtime.panicindex(SB)
	0x0030 00048 (h.go:6)	UNDEF
	0x0034 00052 (h.go:6)	NOP
	0x0034 00052 (h.go:6)	MOVW	R14, R3
	0x0038 00056 (h.go:6)	CALL	runtime.morestack_noctxt(SB)
	0x003c 00060 (h.go:6)	JMP	0
Note that the frame size is 4, but there is actually no local. It
incorrectly thinks call to runtime.panicindex needs 4 bytes space
for argument.
This CL fixes it.
Updates #15365.
Change-Id: Ic65d55283a6aa8a7861d7a3fbc7b63c35785eeec
Reviewed-on: https://go-review.googlesource.com/24909
Run-TryBot: Cherry Zhang <cherryyz@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-07-13 09:22:48 -06:00
										 |  |  | 		call.AuxInt = off - Ctxt.FixedFrameSize() | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 		if len(results) > 0 { | 
					
						
							|  |  |  | 			Fatalf("panic call can't have results") | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return nil | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Load results | 
					
						
							|  |  |  | 	res := make([]*ssa.Value, len(results)) | 
					
						
							|  |  |  | 	for i, t := range results { | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		off = Rnd(off, t.Alignment()) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		ptr := s.constOffPtrSP(types.NewPtr(t), off) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		res[i] = s.load(t, ptr) | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		off += t.Size() | 
					
						
							| 
									
										
										
										
											2015-09-18 15:11:30 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	off = Rnd(off, int64(Widthptr)) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Remember how much callee stack space we needed. | 
					
						
							|  |  |  | 	call.AuxInt = off | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	return res | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | // do *left = right for type t. | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | func (s *state) storeType(t *types.Type, left, right *ssa.Value, skip skipMask, leftIsStmt bool) { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:50:08 -07:00
										 |  |  | 	s.instrument(t, left, true) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	if skip == 0 && (!types.Haspointers(t) || ssa.IsStackAddr(left)) { | 
					
						
							| 
									
										
										
										
											2017-03-17 11:53:24 -04:00
										 |  |  | 		// Known to not have write barrier. Store the whole type. | 
					
						
							| 
									
										
										
										
											2018-03-02 20:33:15 -05:00
										 |  |  | 		s.vars[&memVar] = s.newValue3Apos(ssa.OpStore, types.TypeMem, t, left, right, s.mem(), leftIsStmt) | 
					
						
							| 
									
										
										
										
											2017-03-17 11:53:24 -04:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 	// store scalar fields first, so write barrier stores for | 
					
						
							|  |  |  | 	// pointer fields can be grouped together, and scalar values | 
					
						
							|  |  |  | 	// don't need to be live across the write barrier call. | 
					
						
							| 
									
										
										
										
											2017-03-17 11:53:24 -04:00
										 |  |  | 	// TODO: if the writebarrier pass knows how to reorder stores, | 
					
						
							|  |  |  | 	// we can do a single store here as long as skip==0. | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 	s.storeTypeScalars(t, left, right, skip) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	if skip&skipPtr == 0 && types.Haspointers(t) { | 
					
						
							| 
									
										
										
										
											2017-02-10 10:15:10 -05:00
										 |  |  | 		s.storeTypePtrs(t, left, right) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | // do *left = right for all scalar (non-pointer) parts of t. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) storeTypeScalars(t *types.Type, left, right *ssa.Value, skip skipMask) { | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	switch { | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 	case t.IsBoolean() || t.IsInteger() || t.IsFloat() || t.IsComplex(): | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(t, left, right) | 
					
						
							| 
									
										
										
										
											2016-03-28 10:55:44 -07:00
										 |  |  | 	case t.IsPtrShaped(): | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 		// no scalar fields. | 
					
						
							|  |  |  | 	case t.IsString(): | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 		if skip&skipLen != 0 { | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len := s.newValue1(ssa.OpStringLen, types.Types[TINT], right) | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 		lenAddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.IntPtr, s.config.PtrSize, left) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TINT], lenAddr, len) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case t.IsSlice(): | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 		if skip&skipLen == 0 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			len := s.newValue1(ssa.OpSliceLen, types.Types[TINT], right) | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			lenAddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.IntPtr, s.config.PtrSize, left) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.store(types.Types[TINT], lenAddr, len) | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		if skip&skipCap == 0 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			cap := s.newValue1(ssa.OpSliceCap, types.Types[TINT], right) | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 			capAddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.IntPtr, 2*s.config.PtrSize, left) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.store(types.Types[TINT], capAddr, cap) | 
					
						
							| 
									
										
										
										
											2016-03-21 10:22:03 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case t.IsInterface(): | 
					
						
							|  |  |  | 		// itab field doesn't need a write barrier (even though it is a pointer). | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		itab := s.newValue1(ssa.OpITab, s.f.Config.Types.BytePtr, right) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(types.Types[TUINTPTR], left, itab) | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 	case t.IsStruct(): | 
					
						
							|  |  |  | 		n := t.NumFields() | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 		for i := 0; i < n; i++ { | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 			ft := t.FieldType(i) | 
					
						
							|  |  |  | 			addr := s.newValue1I(ssa.OpOffPtr, ft.PtrTo(), t.FieldOff(i), left) | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 			val := s.newValue1I(ssa.OpStructSelect, ft, int64(i), right) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.storeTypeScalars(ft, addr, val, 0) | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	case t.IsArray() && t.NumElem() == 0: | 
					
						
							|  |  |  | 		// nothing | 
					
						
							|  |  |  | 	case t.IsArray() && t.NumElem() == 1: | 
					
						
							|  |  |  | 		s.storeTypeScalars(t.Elem(), left, s.newValue1I(ssa.OpArraySelect, t.Elem(), 0, right), 0) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 		s.Fatalf("bad write barrier type %v", t) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | // do *left = right for all pointer parts of t. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) storeTypePtrs(t *types.Type, left, right *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	switch { | 
					
						
							| 
									
										
										
										
											2016-03-28 10:55:44 -07:00
										 |  |  | 	case t.IsPtrShaped(): | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(t, left, right) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case t.IsString(): | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		ptr := s.newValue1(ssa.OpStringPtr, s.f.Config.Types.BytePtr, right) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(s.f.Config.Types.BytePtr, left, ptr) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case t.IsSlice(): | 
					
						
							| 
									
										
										
										
											2017-11-02 12:37:25 -04:00
										 |  |  | 		elType := types.NewPtr(t.Elem()) | 
					
						
							|  |  |  | 		ptr := s.newValue1(ssa.OpSlicePtr, elType, right) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(elType, left, ptr) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	case t.IsInterface(): | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 		// itab field is treated as a scalar. | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 		idata := s.newValue1(ssa.OpIData, s.f.Config.Types.BytePtr, right) | 
					
						
							|  |  |  | 		idataAddr := s.newValue1I(ssa.OpOffPtr, s.f.Config.Types.BytePtrPtr, s.config.PtrSize, left) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.store(s.f.Config.Types.BytePtr, idataAddr, idata) | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 	case t.IsStruct(): | 
					
						
							|  |  |  | 		n := t.NumFields() | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 		for i := 0; i < n; i++ { | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 			ft := t.FieldType(i) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			if !types.Haspointers(ft) { | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 				continue | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			addr := s.newValue1I(ssa.OpOffPtr, ft.PtrTo(), t.FieldOff(i), left) | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 			val := s.newValue1I(ssa.OpStructSelect, ft, int64(i), right) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 			s.storeTypePtrs(ft, addr, val) | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	case t.IsArray() && t.NumElem() == 0: | 
					
						
							|  |  |  | 		// nothing | 
					
						
							|  |  |  | 	case t.IsArray() && t.NumElem() == 1: | 
					
						
							|  |  |  | 		s.storeTypePtrs(t.Elem(), left, s.newValue1I(ssa.OpArraySelect, t.Elem(), 0, right)) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 		s.Fatalf("bad write barrier type %v", t) | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-01-29 21:57:57 -08:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2016-01-25 17:06:54 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | // slice computes the slice v[i:j:k] and returns ptr, len, and cap of result. | 
					
						
							|  |  |  | // i,j,k may be nil, in which case they are set to their default value. | 
					
						
							|  |  |  | // t is a slice, ptr to array, or string type. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) slice(t *types.Type, v, i, j, k *ssa.Value) (p, l, c *ssa.Value) { | 
					
						
							|  |  |  | 	var elemtype *types.Type | 
					
						
							|  |  |  | 	var ptrtype *types.Type | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	var ptr *ssa.Value | 
					
						
							|  |  |  | 	var len *ssa.Value | 
					
						
							|  |  |  | 	var cap *ssa.Value | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	zero := s.constInt(types.Types[TINT], 0) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	switch { | 
					
						
							|  |  |  | 	case t.IsSlice(): | 
					
						
							| 
									
										
										
										
											2016-03-30 10:57:47 -07:00
										 |  |  | 		elemtype = t.Elem() | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		ptrtype = types.NewPtr(elemtype) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		ptr = s.newValue1(ssa.OpSlicePtr, ptrtype, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len = s.newValue1(ssa.OpSliceLen, types.Types[TINT], v) | 
					
						
							|  |  |  | 		cap = s.newValue1(ssa.OpSliceCap, types.Types[TINT], v) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	case t.IsString(): | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		elemtype = types.Types[TUINT8] | 
					
						
							|  |  |  | 		ptrtype = types.NewPtr(elemtype) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		ptr = s.newValue1(ssa.OpStringPtr, ptrtype, v) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len = s.newValue1(ssa.OpStringLen, types.Types[TINT], v) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		cap = len | 
					
						
							|  |  |  | 	case t.IsPtr(): | 
					
						
							| 
									
										
										
										
											2016-03-30 10:57:47 -07:00
										 |  |  | 		if !t.Elem().IsArray() { | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 			s.Fatalf("bad ptr to array in slice %v\n", t) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-03-30 10:57:47 -07:00
										 |  |  | 		elemtype = t.Elem().Elem() | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		ptrtype = types.NewPtr(elemtype) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		s.nilCheck(v) | 
					
						
							|  |  |  | 		ptr = v | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		len = s.constInt(types.Types[TINT], t.Elem().NumElem()) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		cap = len | 
					
						
							|  |  |  | 	default: | 
					
						
							|  |  |  | 		s.Fatalf("bad type in slice %v\n", t) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Set default values | 
					
						
							|  |  |  | 	if i == nil { | 
					
						
							|  |  |  | 		i = zero | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if j == nil { | 
					
						
							|  |  |  | 		j = len | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if k == nil { | 
					
						
							|  |  |  | 		k = cap | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Panic if slice indices are not in bounds. | 
					
						
							|  |  |  | 	s.sliceBoundsCheck(i, j) | 
					
						
							|  |  |  | 	if j != k { | 
					
						
							|  |  |  | 		s.sliceBoundsCheck(j, k) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if k != cap { | 
					
						
							|  |  |  | 		s.sliceBoundsCheck(k, cap) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Generate the following code assuming that indexes are in bounds. | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 	// The masking is to make sure that we don't generate a slice | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	// that points to the next object in memory. | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 	// rlen = j - i | 
					
						
							|  |  |  | 	// rcap = k - i | 
					
						
							|  |  |  | 	// delta = i * elemsize | 
					
						
							|  |  |  | 	// rptr = p + delta&mask(rcap) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change the way SSA does slice zero-cap detection
There is a special case for slicing s[i:j] when the resulting
slice has zero capacity, to prevent pointing to the next object
in memory.
Change this special case code from:
  rptr := rcap == 0 ? ptr : ptr+i*elemsize
to
  rptr := ptr + (rcap == 0 ? 0 : i) * elemsize
This change leads to slightly smaller generated code, replacing
a load with a register zero.
old:
	0x002e 00046 (slice.go:8)	CMPQ	BX, $0
	0x0032 00050 (slice.go:8)	JEQ	$0, 78
	0x0034 00052 (slice.go:8)	MOVQ	"".a+8(FP), BP
	0x0039 00057 (slice.go:8)	LEAQ	(BP)(CX*8), AX
	0x003e 00062 ... rest of function ...
	0x004e 00078 (slice.go:7)	MOVQ	"".a+8(FP), AX
	0x0053 00083 (slice.go:8)	JMP	62
new:
	0x002e 00046 (slice.go:8)	CMPQ	BX, $0
	0x0032 00050 (slice.go:8)	JEQ	$0, 78
	0x0034 00052 (slice.go:8)	MOVQ	"".a+8(FP), BP
	0x0039 00057 (slice.go:8)	LEAQ	(BP)(CX*8), AX
	0x003e 00062 ... rest of function...
	0x004e 00078 (slice.go:8)	MOVQ	$0, CX
	0x0050 00080 (slice.go:8)	JMP	52
Change-Id: I2a396616b0d7b090c226a47c92a7ba14b128401f
Reviewed-on: https://go-review.googlesource.com/20994
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-21 15:24:08 -07:00
										 |  |  | 	// result = (SliceMake rptr rlen rcap) | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 	// where mask(x) is 0 if x==0 and -1 if x>0. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	subOp := s.ssaOp(OSUB, types.Types[TINT]) | 
					
						
							|  |  |  | 	mulOp := s.ssaOp(OMUL, types.Types[TINT]) | 
					
						
							|  |  |  | 	andOp := s.ssaOp(OAND, types.Types[TINT]) | 
					
						
							|  |  |  | 	rlen := s.newValue2(subOp, types.Types[TINT], j, i) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	var rcap *ssa.Value | 
					
						
							|  |  |  | 	switch { | 
					
						
							|  |  |  | 	case t.IsString(): | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 		// Capacity of the result is unimportant. However, we use | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 		// rcap to test if we've generated a zero-length slice. | 
					
						
							|  |  |  | 		// Use length of strings for that. | 
					
						
							|  |  |  | 		rcap = rlen | 
					
						
							|  |  |  | 	case j == k: | 
					
						
							|  |  |  | 		rcap = rlen | 
					
						
							|  |  |  | 	default: | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		rcap = s.newValue2(subOp, types.Types[TINT], k, i) | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 	var rptr *ssa.Value | 
					
						
							|  |  |  | 	if (i.Op == ssa.OpConst64 || i.Op == ssa.OpConst32) && i.AuxInt == 0 { | 
					
						
							|  |  |  | 		// No pointer arithmetic necessary. | 
					
						
							|  |  |  | 		rptr = ptr | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		// delta = # of bytes to offset pointer by. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		delta := s.newValue2(mulOp, types.Types[TINT], i, s.constInt(types.Types[TINT], elemtype.Width)) | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 		// If we're slicing to the point where the capacity is zero, | 
					
						
							|  |  |  | 		// zero out the delta. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		mask := s.newValue1(ssa.OpSlicemask, types.Types[TINT], rcap) | 
					
						
							|  |  |  | 		delta = s.newValue2(andOp, types.Types[TINT], delta, mask) | 
					
						
							| 
									
										
										
										
											2016-10-25 15:49:52 -07:00
										 |  |  | 		// Compute rptr = ptr + delta | 
					
						
							|  |  |  | 		rptr = s.newValue2(ssa.OpAddPtr, ptrtype, ptr, delta) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change the way SSA does slice zero-cap detection
There is a special case for slicing s[i:j] when the resulting
slice has zero capacity, to prevent pointing to the next object
in memory.
Change this special case code from:
  rptr := rcap == 0 ? ptr : ptr+i*elemsize
to
  rptr := ptr + (rcap == 0 ? 0 : i) * elemsize
This change leads to slightly smaller generated code, replacing
a load with a register zero.
old:
	0x002e 00046 (slice.go:8)	CMPQ	BX, $0
	0x0032 00050 (slice.go:8)	JEQ	$0, 78
	0x0034 00052 (slice.go:8)	MOVQ	"".a+8(FP), BP
	0x0039 00057 (slice.go:8)	LEAQ	(BP)(CX*8), AX
	0x003e 00062 ... rest of function ...
	0x004e 00078 (slice.go:7)	MOVQ	"".a+8(FP), AX
	0x0053 00083 (slice.go:8)	JMP	62
new:
	0x002e 00046 (slice.go:8)	CMPQ	BX, $0
	0x0032 00050 (slice.go:8)	JEQ	$0, 78
	0x0034 00052 (slice.go:8)	MOVQ	"".a+8(FP), BP
	0x0039 00057 (slice.go:8)	LEAQ	(BP)(CX*8), AX
	0x003e 00062 ... rest of function...
	0x004e 00078 (slice.go:8)	MOVQ	$0, CX
	0x0050 00080 (slice.go:8)	JMP	52
Change-Id: I2a396616b0d7b090c226a47c92a7ba14b128401f
Reviewed-on: https://go-review.googlesource.com/20994
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-21 15:24:08 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-12 23:27:26 -07:00
										 |  |  | 	return rptr, rlen, rcap | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | type u642fcvtTab struct { | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	geq, cvt2F, and, rsh, or, add ssa.Op | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	one                           func(*state, *types.Type, int64) *ssa.Value | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var u64_f64 = u642fcvtTab{ | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	geq:   ssa.OpGeq64, | 
					
						
							|  |  |  | 	cvt2F: ssa.OpCvt64to64F, | 
					
						
							|  |  |  | 	and:   ssa.OpAnd64, | 
					
						
							|  |  |  | 	rsh:   ssa.OpRsh64Ux64, | 
					
						
							|  |  |  | 	or:    ssa.OpOr64, | 
					
						
							|  |  |  | 	add:   ssa.OpAdd64F, | 
					
						
							|  |  |  | 	one:   (*state).constInt64, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var u64_f32 = u642fcvtTab{ | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	geq:   ssa.OpGeq64, | 
					
						
							|  |  |  | 	cvt2F: ssa.OpCvt64to32F, | 
					
						
							|  |  |  | 	and:   ssa.OpAnd64, | 
					
						
							|  |  |  | 	rsh:   ssa.OpRsh64Ux64, | 
					
						
							|  |  |  | 	or:    ssa.OpOr64, | 
					
						
							|  |  |  | 	add:   ssa.OpAdd32F, | 
					
						
							|  |  |  | 	one:   (*state).constInt64, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint64Tofloat64(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.uint64Tofloat(&u64_f64, n, x, ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint64Tofloat32(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.uint64Tofloat(&u64_f32, n, x, ft, tt) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint64Tofloat(cvttab *u642fcvtTab, n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	// if x >= 0 { | 
					
						
							|  |  |  | 	//    result = (floatY) x | 
					
						
							|  |  |  | 	// } else { | 
					
						
							|  |  |  | 	// 	  y = uintX(x) ; y = x & 1 | 
					
						
							|  |  |  | 	// 	  z = uintX(x) ; z = z >> 1 | 
					
						
							|  |  |  | 	// 	  z = z >> 1 | 
					
						
							|  |  |  | 	// 	  z = z | y | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	// 	  result = floatY(z) | 
					
						
							|  |  |  | 	// 	  result = result + result | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	// } | 
					
						
							|  |  |  | 	// | 
					
						
							|  |  |  | 	// Code borrowed from old code generator. | 
					
						
							|  |  |  | 	// What's going on: large 64-bit "unsigned" looks like | 
					
						
							|  |  |  | 	// negative number to hardware's integer-to-float | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// conversion. However, because the mantissa is only | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	// 63 bits, we don't need the LSB, so instead we do an | 
					
						
							|  |  |  | 	// unsigned right shift (divide by two), convert, and | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// double. However, before we do that, we need to be | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	// sure that we do not lose a "1" if that made the | 
					
						
							| 
									
										
										
										
											2016-03-01 23:21:55 +00:00
										 |  |  | 	// difference in the resulting rounding. Therefore, we | 
					
						
							|  |  |  | 	// preserve it, and OR (not ADD) it back in. The case | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	// that matters is when the eleven discarded bits are | 
					
						
							|  |  |  | 	// equal to 10000000001; that rounds up, and the 1 cannot | 
					
						
							|  |  |  | 	// be lost else it would round down if the LSB of the | 
					
						
							|  |  |  | 	// candidate mantissa is 0. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(cvttab.geq, types.Types[TBOOL], x, s.zeroVal(ft)) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(cmp) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	b.Likely = ssa.BranchLikely | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	bThen := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bElse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bAfter := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bThen) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	s.startBlock(bThen) | 
					
						
							|  |  |  | 	a0 := s.newValue1(cvttab.cvt2F, tt, x) | 
					
						
							|  |  |  | 	s.vars[n] = a0 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bThen.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bElse) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 	s.startBlock(bElse) | 
					
						
							|  |  |  | 	one := cvttab.one(s, ft, 1) | 
					
						
							|  |  |  | 	y := s.newValue2(cvttab.and, ft, x, one) | 
					
						
							|  |  |  | 	z := s.newValue2(cvttab.rsh, ft, x, one) | 
					
						
							|  |  |  | 	z = s.newValue2(cvttab.or, ft, z, y) | 
					
						
							|  |  |  | 	a := s.newValue1(cvttab.cvt2F, tt, z) | 
					
						
							|  |  |  | 	a1 := s.newValue2(cvttab.add, tt, a, a) | 
					
						
							|  |  |  | 	s.vars[n] = a1 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bElse.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-20 15:14:20 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	s.startBlock(bAfter) | 
					
						
							|  |  |  | 	return s.variable(n, n.Type) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | type u322fcvtTab struct { | 
					
						
							|  |  |  | 	cvtI2F, cvtF2F ssa.Op | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var u32_f64 = u322fcvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cvtI2F: ssa.OpCvt32to64F, | 
					
						
							|  |  |  | 	cvtF2F: ssa.OpCopy, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var u32_f32 = u322fcvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cvtI2F: ssa.OpCvt32to32F, | 
					
						
							|  |  |  | 	cvtF2F: ssa.OpCvt64Fto32F, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint32Tofloat64(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.uint32Tofloat(&u32_f64, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint32Tofloat32(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.uint32Tofloat(&u32_f32, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) uint32Tofloat(cvttab *u322fcvtTab, n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	// if x >= 0 { | 
					
						
							|  |  |  | 	// 	result = floatY(x) | 
					
						
							|  |  |  | 	// } else { | 
					
						
							|  |  |  | 	// 	result = floatY(float64(x) + (1<<32)) | 
					
						
							|  |  |  | 	// } | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(ssa.OpGeq32, types.Types[TBOOL], x, s.zeroVal(ft)) | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 	b.SetControl(cmp) | 
					
						
							|  |  |  | 	b.Likely = ssa.BranchLikely | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	bThen := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bElse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bAfter := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	b.AddEdgeTo(bThen) | 
					
						
							|  |  |  | 	s.startBlock(bThen) | 
					
						
							|  |  |  | 	a0 := s.newValue1(cvttab.cvtI2F, tt, x) | 
					
						
							|  |  |  | 	s.vars[n] = a0 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							|  |  |  | 	bThen.AddEdgeTo(bAfter) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	b.AddEdgeTo(bElse) | 
					
						
							|  |  |  | 	s.startBlock(bElse) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	a1 := s.newValue1(ssa.OpCvt32to64F, types.Types[TFLOAT64], x) | 
					
						
							|  |  |  | 	twoToThe32 := s.constFloat64(types.Types[TFLOAT64], float64(1<<32)) | 
					
						
							|  |  |  | 	a2 := s.newValue2(ssa.OpAdd64F, types.Types[TFLOAT64], a1, twoToThe32) | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	a3 := s.newValue1(cvttab.cvtF2F, tt, a2) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	s.vars[n] = a3 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							|  |  |  | 	bElse.AddEdgeTo(bAfter) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	s.startBlock(bAfter) | 
					
						
							|  |  |  | 	return s.variable(n, n.Type) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | // referenceTypeBuiltin generates code for the len/cap builtins for maps and channels. | 
					
						
							|  |  |  | func (s *state) referenceTypeBuiltin(n *Node, x *ssa.Value) *ssa.Value { | 
					
						
							|  |  |  | 	if !n.Left.Type.IsMap() && !n.Left.Type.IsChan() { | 
					
						
							|  |  |  | 		s.Fatalf("node must be a map or a channel") | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	// if n == nil { | 
					
						
							|  |  |  | 	//   return 0 | 
					
						
							|  |  |  | 	// } else { | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 	//   // len | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	//   return *((*int)n) | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 	//   // cap | 
					
						
							|  |  |  | 	//   return *(((*int)n)+1) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	// } | 
					
						
							|  |  |  | 	lenType := n.Type | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	nilValue := s.constNil(types.Types[TUINTPTR]) | 
					
						
							|  |  |  | 	cmp := s.newValue2(ssa.OpEqPtr, types.Types[TBOOL], x, nilValue) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(cmp) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	b.Likely = ssa.BranchUnlikely | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	bThen := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bElse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bAfter := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 	// length/capacity of a nil map/chan is zero | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bThen) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	s.startBlock(bThen) | 
					
						
							|  |  |  | 	s.vars[n] = s.zeroVal(lenType) | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bThen.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bElse) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	s.startBlock(bElse) | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	switch n.Op { | 
					
						
							|  |  |  | 	case OLEN: | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 		// length is stored in the first word for map/chan | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.vars[n] = s.load(lenType, x) | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	case OCAP: | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 		// capacity is stored in the second word for chan | 
					
						
							|  |  |  | 		sw := s.newValue1I(ssa.OpOffPtr, lenType.PtrTo(), lenType.Width, x) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.vars[n] = s.load(lenType, sw) | 
					
						
							| 
									
										
										
										
											2017-10-15 22:49:52 +01:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2015-08-28 15:56:43 -05:00
										 |  |  | 		s.Fatalf("op must be OLEN or OCAP") | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bElse.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-26 18:40:52 -05:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	s.startBlock(bAfter) | 
					
						
							|  |  |  | 	return s.variable(n, lenType) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | type f2uCvtTab struct { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	ltf, cvt2U, subf, or ssa.Op | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	floatValue           func(*state, *types.Type, float64) *ssa.Value | 
					
						
							|  |  |  | 	intValue             func(*state, *types.Type, int64) *ssa.Value | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cutoff               uint64 | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var f32_u64 = f2uCvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	ltf:        ssa.OpLess32F, | 
					
						
							|  |  |  | 	cvt2U:      ssa.OpCvt32Fto64, | 
					
						
							|  |  |  | 	subf:       ssa.OpSub32F, | 
					
						
							|  |  |  | 	or:         ssa.OpOr64, | 
					
						
							|  |  |  | 	floatValue: (*state).constFloat32, | 
					
						
							|  |  |  | 	intValue:   (*state).constInt64, | 
					
						
							|  |  |  | 	cutoff:     9223372036854775808, | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var f64_u64 = f2uCvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	ltf:        ssa.OpLess64F, | 
					
						
							|  |  |  | 	cvt2U:      ssa.OpCvt64Fto64, | 
					
						
							|  |  |  | 	subf:       ssa.OpSub64F, | 
					
						
							|  |  |  | 	or:         ssa.OpOr64, | 
					
						
							|  |  |  | 	floatValue: (*state).constFloat64, | 
					
						
							|  |  |  | 	intValue:   (*state).constInt64, | 
					
						
							|  |  |  | 	cutoff:     9223372036854775808, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var f32_u32 = f2uCvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	ltf:        ssa.OpLess32F, | 
					
						
							|  |  |  | 	cvt2U:      ssa.OpCvt32Fto32, | 
					
						
							|  |  |  | 	subf:       ssa.OpSub32F, | 
					
						
							|  |  |  | 	or:         ssa.OpOr32, | 
					
						
							|  |  |  | 	floatValue: (*state).constFloat32, | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	intValue:   func(s *state, t *types.Type, v int64) *ssa.Value { return s.constInt32(t, int32(v)) }, | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cutoff:     2147483648, | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-09-16 22:11:43 +01:00
										 |  |  | var f64_u32 = f2uCvtTab{ | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	ltf:        ssa.OpLess64F, | 
					
						
							|  |  |  | 	cvt2U:      ssa.OpCvt64Fto32, | 
					
						
							|  |  |  | 	subf:       ssa.OpSub64F, | 
					
						
							|  |  |  | 	or:         ssa.OpOr32, | 
					
						
							|  |  |  | 	floatValue: (*state).constFloat64, | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	intValue:   func(s *state, t *types.Type, v int64) *ssa.Value { return s.constInt32(t, int32(v)) }, | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cutoff:     2147483648, | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) float32ToUint64(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	return s.floatToUint(&f32_u64, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) float64ToUint64(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	return s.floatToUint(&f64_u64, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) float32ToUint32(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.floatToUint(&f32_u32, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) float64ToUint32(n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	return s.floatToUint(&f64_u32, n, x, ft, tt) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | func (s *state) floatToUint(cvttab *f2uCvtTab, n *Node, x *ssa.Value, ft, tt *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	// cutoff:=1<<(intY_Size-1) | 
					
						
							|  |  |  | 	// if x < floatX(cutoff) { | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	// 	result = uintY(x) | 
					
						
							|  |  |  | 	// } else { | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	// 	y = x - floatX(cutoff) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	// 	z = uintY(y) | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	// 	result = z | -(cutoff) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	// } | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	cutoff := cvttab.floatValue(s, ft, float64(cvttab.cutoff)) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cmp := s.newValue2(cvttab.ltf, types.Types[TBOOL], x, cutoff) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(cmp) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	b.Likely = ssa.BranchLikely | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	bThen := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bElse := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bAfter := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bThen) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	s.startBlock(bThen) | 
					
						
							|  |  |  | 	a0 := s.newValue1(cvttab.cvt2U, tt, x) | 
					
						
							|  |  |  | 	s.vars[n] = a0 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bThen.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	b.AddEdgeTo(bElse) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	s.startBlock(bElse) | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	y := s.newValue2(cvttab.subf, ft, x, cutoff) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	y = s.newValue1(cvttab.cvt2U, tt, y) | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	z := cvttab.intValue(s, tt, int64(-cvttab.cutoff)) | 
					
						
							|  |  |  | 	a1 := s.newValue2(cvttab.or, tt, y, z) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 	s.vars[n] = a1 | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							| 
									
										
										
										
											2015-08-28 21:36:29 -05:00
										 |  |  | 	bElse.AddEdgeTo(bAfter) | 
					
						
							| 
									
										
										
										
											2015-08-26 14:25:40 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	s.startBlock(bAfter) | 
					
						
							|  |  |  | 	return s.variable(n, n.Type) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | // dottype generates SSA for a type assertion node. | 
					
						
							|  |  |  | // commaok indicates whether to panic or return a bool. | 
					
						
							|  |  |  | // If commaok is false, resok will be nil. | 
					
						
							|  |  |  | func (s *state) dottype(n *Node, commaok bool) (res, resok *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2017-03-27 14:48:24 -07:00
										 |  |  | 	iface := s.expr(n.Left)   // input interface | 
					
						
							|  |  |  | 	target := s.expr(n.Right) // target type | 
					
						
							| 
									
										
										
										
											2017-03-18 10:16:03 -07:00
										 |  |  | 	byteptr := s.f.Config.Types.BytePtr | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	if n.Type.IsInterface() { | 
					
						
							|  |  |  | 		if n.Type.IsEmptyInterface() { | 
					
						
							|  |  |  | 			// Converting to an empty interface. | 
					
						
							|  |  |  | 			// Input could be an empty or nonempty interface. | 
					
						
							|  |  |  | 			if Debug_typeassert > 0 { | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 				Warnl(n.Pos, "type assertion inlined") | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Get itab/type field from input. | 
					
						
							|  |  |  | 			itab := s.newValue1(ssa.OpITab, byteptr, iface) | 
					
						
							|  |  |  | 			// Conversion succeeds iff that field is not nil. | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			cond := s.newValue2(ssa.OpNeqPtr, types.Types[TBOOL], itab, s.constNil(byteptr)) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			if n.Left.Type.IsEmptyInterface() && commaok { | 
					
						
							|  |  |  | 				// Converting empty interface to empty interface with ,ok is just a nil check. | 
					
						
							|  |  |  | 				return iface, cond | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Branch on nilness. | 
					
						
							|  |  |  | 			b := s.endBlock() | 
					
						
							|  |  |  | 			b.Kind = ssa.BlockIf | 
					
						
							|  |  |  | 			b.SetControl(cond) | 
					
						
							|  |  |  | 			b.Likely = ssa.BranchLikely | 
					
						
							|  |  |  | 			bOk := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bFail := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bOk) | 
					
						
							|  |  |  | 			b.AddEdgeTo(bFail) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			if !commaok { | 
					
						
							|  |  |  | 				// On failure, panic by calling panicnildottype. | 
					
						
							|  |  |  | 				s.startBlock(bFail) | 
					
						
							|  |  |  | 				s.rtcall(panicnildottype, false, nil, target) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 				// On success, return (perhaps modified) input interface. | 
					
						
							|  |  |  | 				s.startBlock(bOk) | 
					
						
							|  |  |  | 				if n.Left.Type.IsEmptyInterface() { | 
					
						
							|  |  |  | 					res = iface // Use input interface unchanged. | 
					
						
							|  |  |  | 					return | 
					
						
							|  |  |  | 				} | 
					
						
							|  |  |  | 				// Load type out of itab, build interface with existing idata. | 
					
						
							|  |  |  | 				off := s.newValue1I(ssa.OpOffPtr, byteptr, int64(Widthptr), itab) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 				typ := s.load(byteptr, off) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 				idata := s.newValue1(ssa.OpIData, n.Type, iface) | 
					
						
							|  |  |  | 				res = s.newValue2(ssa.OpIMake, n.Type, typ, idata) | 
					
						
							|  |  |  | 				return | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			s.startBlock(bOk) | 
					
						
							|  |  |  | 			// nonempty -> empty | 
					
						
							|  |  |  | 			// Need to load type from itab | 
					
						
							|  |  |  | 			off := s.newValue1I(ssa.OpOffPtr, byteptr, int64(Widthptr), itab) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.vars[&typVar] = s.load(byteptr, off) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 			s.endBlock() | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// itab is nil, might as well use that as the nil result. | 
					
						
							|  |  |  | 			s.startBlock(bFail) | 
					
						
							|  |  |  | 			s.vars[&typVar] = itab | 
					
						
							|  |  |  | 			s.endBlock() | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 			// Merge point. | 
					
						
							|  |  |  | 			bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 			bOk.AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 			bFail.AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 			s.startBlock(bEnd) | 
					
						
							|  |  |  | 			idata := s.newValue1(ssa.OpIData, n.Type, iface) | 
					
						
							|  |  |  | 			res = s.newValue2(ssa.OpIMake, n.Type, s.variable(&typVar, byteptr), idata) | 
					
						
							|  |  |  | 			resok = cond | 
					
						
							|  |  |  | 			delete(s.vars, &typVar) | 
					
						
							|  |  |  | 			return | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		// converting to a nonempty interface needs a runtime call. | 
					
						
							|  |  |  | 		if Debug_typeassert > 0 { | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 			Warnl(n.Pos, "type assertion not inlined") | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		if n.Left.Type.IsEmptyInterface() { | 
					
						
							|  |  |  | 			if commaok { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 				call := s.rtcall(assertE2I2, true, []*types.Type{n.Type, types.Types[TBOOL]}, target, iface) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 				return call[0], call[1] | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			return s.rtcall(assertE2I, true, []*types.Type{n.Type}, target, iface)[0], nil | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 		if commaok { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			call := s.rtcall(assertI2I2, true, []*types.Type{n.Type, types.Types[TBOOL]}, target, iface) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 			return call[0], call[1] | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return s.rtcall(assertI2I, true, []*types.Type{n.Type}, target, iface)[0], nil | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-10-26 17:34:06 -04:00
										 |  |  | 	if Debug_typeassert > 0 { | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		Warnl(n.Pos, "type assertion inlined") | 
					
						
							| 
									
										
										
										
											2015-10-26 17:34:06 -04:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	// Converting to a concrete type. | 
					
						
							|  |  |  | 	direct := isdirectiface(n.Type) | 
					
						
							| 
									
										
										
										
											2017-01-03 16:15:38 -08:00
										 |  |  | 	itab := s.newValue1(ssa.OpITab, byteptr, iface) // type word of interface | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	if Debug_typeassert > 0 { | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | 		Warnl(n.Pos, "type assertion inlined") | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-01-03 16:15:38 -08:00
										 |  |  | 	var targetITab *ssa.Value | 
					
						
							|  |  |  | 	if n.Left.Type.IsEmptyInterface() { | 
					
						
							|  |  |  | 		// Looking for pointer to target type. | 
					
						
							|  |  |  | 		targetITab = target | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		// Looking for pointer to itab for target type and source interface. | 
					
						
							| 
									
										
										
										
											2017-03-28 13:52:33 -07:00
										 |  |  | 		targetITab = s.expr(n.List.First()) | 
					
						
							| 
									
										
										
										
											2017-01-03 16:15:38 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	var tmp *Node       // temporary for use with large types | 
					
						
							|  |  |  | 	var addr *ssa.Value // address of tmp | 
					
						
							|  |  |  | 	if commaok && !canSSAType(n.Type) { | 
					
						
							|  |  |  | 		// unSSAable type, use temporary. | 
					
						
							|  |  |  | 		// TODO: get rid of some of these temporaries. | 
					
						
							| 
									
										
										
										
											2017-03-22 20:27:54 -07:00
										 |  |  | 		tmp = tempAt(n.Pos, s.curfn, n.Type) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		s.vars[&memVar] = s.newValue1A(ssa.OpVarDef, types.TypeMem, tmp, s.mem()) | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 		addr = s.addr(tmp, false) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	cond := s.newValue2(ssa.OpEqPtr, types.Types[TBOOL], itab, targetITab) | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	b := s.endBlock() | 
					
						
							|  |  |  | 	b.Kind = ssa.BlockIf | 
					
						
							| 
									
										
										
										
											2016-03-15 20:45:50 -07:00
										 |  |  | 	b.SetControl(cond) | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	b.Likely = ssa.BranchLikely | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	bOk := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	bFail := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							|  |  |  | 	b.AddEdgeTo(bOk) | 
					
						
							|  |  |  | 	b.AddEdgeTo(bFail) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	if !commaok { | 
					
						
							|  |  |  | 		// on failure, panic by calling panicdottype | 
					
						
							|  |  |  | 		s.startBlock(bFail) | 
					
						
							| 
									
										
										
										
											2017-03-27 14:48:24 -07:00
										 |  |  | 		taddr := s.expr(n.Right.Right) | 
					
						
							| 
									
										
										
										
											2017-01-03 16:15:38 -08:00
										 |  |  | 		if n.Left.Type.IsEmptyInterface() { | 
					
						
							|  |  |  | 			s.rtcall(panicdottypeE, false, nil, itab, target, taddr) | 
					
						
							|  |  |  | 		} else { | 
					
						
							|  |  |  | 			s.rtcall(panicdottypeI, false, nil, itab, target, taddr) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		// on success, return data from interface | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 		s.startBlock(bOk) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		if direct { | 
					
						
							|  |  |  | 			return s.newValue1(ssa.OpIData, n.Type, iface), nil | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		p := s.newValue1(ssa.OpIData, types.NewPtr(n.Type), iface) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		return s.load(n.Type, p), nil | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// commaok is the more complicated case because we have | 
					
						
							|  |  |  | 	// a control flow merge point. | 
					
						
							|  |  |  | 	bEnd := s.f.NewBlock(ssa.BlockPlain) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	// Note that we need a new valVar each time (unlike okVar where we can | 
					
						
							|  |  |  | 	// reuse the variable) because it might have a different type every time. | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	valVar := &Node{Op: ONAME, Sym: &types.Sym{Name: "val"}} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// type assertion succeeded | 
					
						
							|  |  |  | 	s.startBlock(bOk) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	if tmp == nil { | 
					
						
							|  |  |  | 		if direct { | 
					
						
							|  |  |  | 			s.vars[valVar] = s.newValue1(ssa.OpIData, n.Type, iface) | 
					
						
							|  |  |  | 		} else { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			p := s.newValue1(ssa.OpIData, types.NewPtr(n.Type), iface) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 			s.vars[valVar] = s.load(n.Type, p) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} else { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		p := s.newValue1(ssa.OpIData, types.NewPtr(n.Type), iface) | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.move(n.Type, addr, p) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	s.vars[&okVar] = s.constBool(true) | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							|  |  |  | 	bOk.AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// type assertion failed | 
					
						
							|  |  |  | 	s.startBlock(bFail) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	if tmp == nil { | 
					
						
							|  |  |  | 		s.vars[valVar] = s.zeroVal(n.Type) | 
					
						
							|  |  |  | 	} else { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		s.zero(n.Type, addr) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	s.vars[&okVar] = s.constBool(false) | 
					
						
							|  |  |  | 	s.endBlock() | 
					
						
							|  |  |  | 	bFail.AddEdgeTo(bEnd) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// merge point | 
					
						
							|  |  |  | 	s.startBlock(bEnd) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	if tmp == nil { | 
					
						
							|  |  |  | 		res = s.variable(valVar, n.Type) | 
					
						
							|  |  |  | 		delete(s.vars, valVar) | 
					
						
							|  |  |  | 	} else { | 
					
						
							| 
									
										
										
										
											2018-03-27 13:35:39 -07:00
										 |  |  | 		res = s.load(n.Type, addr) | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 		s.vars[&memVar] = s.newValue1A(ssa.OpVarKill, types.TypeMem, tmp, s.mem()) | 
					
						
							| 
									
										
										
										
											2016-10-28 11:37:45 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	resok = s.variable(&okVar, types.Types[TBOOL]) | 
					
						
							| 
									
										
										
										
											2015-09-17 10:31:16 -07:00
										 |  |  | 	delete(s.vars, &okVar) | 
					
						
							|  |  |  | 	return res, resok | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | // variable returns the value of a variable at the current location. | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (s *state) variable(name *Node, t *types.Type) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	v := s.vars[name] | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	if v != nil { | 
					
						
							|  |  |  | 		return v | 
					
						
							| 
									
										
										
										
											2016-01-14 16:02:23 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	v = s.fwdVars[name] | 
					
						
							|  |  |  | 	if v != nil { | 
					
						
							|  |  |  | 		return v | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	if s.curBlock == s.f.Entry { | 
					
						
							|  |  |  | 		// No variable should be live at entry. | 
					
						
							|  |  |  | 		s.Fatalf("Value live at entry. It shouldn't be. func %s, node %v, value %v", s.f.Name, name, v) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | 	// Make a FwdRef, which records a value that's live on block input. | 
					
						
							|  |  |  | 	// We'll find the matching definition as part of insertPhis. | 
					
						
							|  |  |  | 	v = s.newValue0A(ssa.OpFwdRef, t, name) | 
					
						
							|  |  |  | 	s.fwdVars[name] = v | 
					
						
							| 
									
										
										
										
											2016-01-14 16:02:23 -08:00
										 |  |  | 	s.addNamedValue(name, v) | 
					
						
							| 
									
										
										
										
											2015-04-15 15:51:25 -07:00
										 |  |  | 	return v | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | func (s *state) mem() *ssa.Value { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | 	return s.variable(&memVar, types.TypeMem) | 
					
						
							| 
									
										
										
										
											2016-09-30 10:12:32 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | func (s *state) addNamedValue(n *Node, v *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == Pxxx { | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 		// Don't track our dummy nodes (&memVar etc.). | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-10-28 13:33:57 -04:00
										 |  |  | 	if n.IsAutoTmp() { | 
					
						
							|  |  |  | 		// Don't track temporary variables. | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PPARAMOUT { | 
					
						
							| 
									
										
										
										
											2016-03-08 20:09:48 -08:00
										 |  |  | 		// Don't track named output values.  This prevents return values | 
					
						
							|  |  |  | 		// from being assigned too early. See #14591 and #14762. TODO: allow this. | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && n.Xoffset != 0 { | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 		s.Fatalf("AUTO var with offset %v %d", n, n.Xoffset) | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | 	loc := ssa.LocalSlot{N: n, Type: n.Type, Off: 0} | 
					
						
							|  |  |  | 	values, ok := s.f.NamedValues[loc] | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 	if !ok { | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | 		s.f.Names = append(s.f.Names, loc) | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | 	s.f.NamedValues[loc] = append(values, v) | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | // Branch is an unresolved branch. | 
					
						
							|  |  |  | type Branch struct { | 
					
						
							|  |  |  | 	P *obj.Prog  // branch instruction | 
					
						
							|  |  |  | 	B *ssa.Block // target | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | // SSAGenState contains state needed during Prog generation. | 
					
						
							|  |  |  | type SSAGenState struct { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 	pp *Progs | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | 	// Branches remembers all the branch instructions we've seen | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 	// and where they would like to go. | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | 	Branches []Branch | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// bstart remembers where each block starts (indexed by block ID) | 
					
						
							|  |  |  | 	bstart []*obj.Prog | 
					
						
							| 
									
										
										
										
											2016-07-26 11:51:33 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// 387 port: maps from SSE registers (REG_X?) to 387 registers (REG_F?) | 
					
						
							| 
									
										
										
										
											2016-08-10 11:44:57 -07:00
										 |  |  | 	SSEto387 map[int16]int16 | 
					
						
							|  |  |  | 	// Some architectures require a 64-bit temporary for FP-related register shuffling. Examples include x86-387, PPC, and Sparc V8. | 
					
						
							|  |  |  | 	ScratchFpMem *Node | 
					
						
							| 
									
										
										
										
											2017-03-17 08:58:36 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	maxarg int64 // largest frame size for arguments to calls made by the function | 
					
						
							| 
									
										
										
										
											2017-03-09 18:32:17 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-22 12:04:51 -04:00
										 |  |  | 	// Map from GC safe points to liveness index, generated by | 
					
						
							| 
									
										
										
										
											2017-03-09 18:32:17 -08:00
										 |  |  | 	// liveness analysis. | 
					
						
							| 
									
										
										
										
											2018-03-22 12:04:51 -04:00
										 |  |  | 	livenessMap LivenessMap | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// lineRunStart records the beginning of the current run of instructions | 
					
						
							|  |  |  | 	// within a single block sharing the same line number | 
					
						
							|  |  |  | 	// Used to move statement marks to the beginning of such runs. | 
					
						
							|  |  |  | 	lineRunStart *obj.Prog | 
					
						
							| 
									
										
											  
											
												cmd/compile: add wasm stack optimization
Go's SSA instructions only operate on registers. For example, an add
instruction would read two registers, do the addition and then write
to a register. WebAssembly's instructions, on the other hand, operate
on the stack. The add instruction first pops two values from the stack,
does the addition, then pushes the result to the stack. To fulfill
Go's semantics, one needs to map Go's single add instruction to
4 WebAssembly instructions:
- Push the value of local variable A to the stack
- Push the value of local variable B to the stack
- Do addition
- Write value from stack to local variable C
Now consider that B was set to the constant 42 before the addition:
- Push constant 42 to the stack
- Write value from stack to local variable B
This works, but is inefficient. Instead, the stack is used directly
by inlining instructions if possible. With inlining it becomes:
- Push the value of local variable A to the stack (add)
- Push constant 42 to the stack (constant)
- Do addition (add)
- Write value from stack to local variable C (add)
Note that the two SSA instructions can not be generated sequentially
anymore, because their WebAssembly instructions are interleaved.
Design doc: https://docs.google.com/document/d/131vjr4DH6JFnb-blm_uRdaC0_Nv3OUwjEY5qVCxCup4
Updates #18892
Change-Id: Ie35e1c0bebf4985fddda0d6330eb2066f9ad6dec
Reviewed-on: https://go-review.googlesource.com/103535
Run-TryBot: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2018-03-09 00:14:58 +01:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// wasm: The number of values on the WebAssembly stack. This is only used as a safeguard. | 
					
						
							|  |  |  | 	OnWasmStackSkipped int | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-20 08:01:28 -07:00
										 |  |  | // Prog appends a new Prog. | 
					
						
							|  |  |  | func (s *SSAGenState) Prog(as obj.As) *obj.Prog { | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 	p := s.pp.Prog(as) | 
					
						
							| 
									
										
										
										
											2018-05-18 15:11:30 -04:00
										 |  |  | 	if ssa.LosesStmtMark(as) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		return p | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	// Float a statement start to the beginning of any same-line run. | 
					
						
							|  |  |  | 	// lineRunStart is reset at block boundaries, which appears to work well. | 
					
						
							|  |  |  | 	if s.lineRunStart == nil || s.lineRunStart.Pos.Line() != p.Pos.Line() { | 
					
						
							|  |  |  | 		s.lineRunStart = p | 
					
						
							|  |  |  | 	} else if p.Pos.IsStmt() == src.PosIsStmt { | 
					
						
							|  |  |  | 		s.lineRunStart.Pos = s.lineRunStart.Pos.WithIsStmt() | 
					
						
							|  |  |  | 		p.Pos = p.Pos.WithNotStmt() | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return p | 
					
						
							| 
									
										
										
										
											2017-03-20 08:01:28 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | // Pc returns the current Prog. | 
					
						
							|  |  |  | func (s *SSAGenState) Pc() *obj.Prog { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 	return s.pp.next | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-12-07 17:40:46 -08:00
										 |  |  | // SetPos sets the current source position. | 
					
						
							| 
									
										
										
										
											2016-12-15 17:17:01 -08:00
										 |  |  | func (s *SSAGenState) SetPos(pos src.XPos) { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 	s.pp.pos = pos | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-04-05 16:14:42 -04:00
										 |  |  | // Br emits a single branch instruction and returns the instruction. | 
					
						
							|  |  |  | // Not all architectures need the returned instruction, but otherwise | 
					
						
							|  |  |  | // the boilerplate is common to all. | 
					
						
							|  |  |  | func (s *SSAGenState) Br(op obj.As, target *ssa.Block) *obj.Prog { | 
					
						
							|  |  |  | 	p := s.Prog(op) | 
					
						
							|  |  |  | 	p.To.Type = obj.TYPE_BRANCH | 
					
						
							|  |  |  | 	s.Branches = append(s.Branches, Branch{P: p, B: target}) | 
					
						
							|  |  |  | 	return p | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | // DebugFriendlySetPos adjusts Pos.IsStmt subject to heuristics | 
					
						
							| 
									
										
											  
											
												cmd/compile: reduce debugger-worsening line number churn
Reuse block head or preceding instruction's line number for
register allocator's spill, fill, copy, rematerialization
instructionsl; and also for phi, and for no-src-pos
instructions.  Assembler creates same line number tables
for copy-predecessor-line and for no-src-pos,
but copy-predecessor produces better-looking assembly
language output with -S and with GOSSAFUNC, and does not
require changes to tests of existing assembly language.
Split "copyInto" into two cases, one for register allocation,
one for otherwise.  This caused the test score line change
count to increase by one, which may reflect legitimately
useful information preserved.  Without any special treatment
for copyInto, the change count increases by 21 more, from
51 to 72 (i.e., quite a lot).
There is a test; using two naive "scores" for line number
churn, the old numbering is 2x or 4x worse.
Fixes #18902.
Change-Id: I0a0a69659d30ee4e5d10116a0dd2b8c5df8457b1
Reviewed-on: https://go-review.googlesource.com/36207
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-02-02 14:51:15 -05:00
										 |  |  | // that reduce "jumpy" line number churn when debugging. | 
					
						
							|  |  |  | // Spill/fill/copy instructions from the register allocator, | 
					
						
							|  |  |  | // phi functions, and instructions with a no-pos position | 
					
						
							|  |  |  | // are examples of instructions that can cause churn. | 
					
						
							|  |  |  | func (s *SSAGenState) DebugFriendlySetPosFrom(v *ssa.Value) { | 
					
						
							|  |  |  | 	switch v.Op { | 
					
						
							|  |  |  | 	case ssa.OpPhi, ssa.OpCopy, ssa.OpLoadReg, ssa.OpStoreReg: | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		// These are not statements | 
					
						
							|  |  |  | 		s.SetPos(v.Pos.WithNotStmt()) | 
					
						
							| 
									
										
											  
											
												cmd/compile: reduce debugger-worsening line number churn
Reuse block head or preceding instruction's line number for
register allocator's spill, fill, copy, rematerialization
instructionsl; and also for phi, and for no-src-pos
instructions.  Assembler creates same line number tables
for copy-predecessor-line and for no-src-pos,
but copy-predecessor produces better-looking assembly
language output with -S and with GOSSAFUNC, and does not
require changes to tests of existing assembly language.
Split "copyInto" into two cases, one for register allocation,
one for otherwise.  This caused the test score line change
count to increase by one, which may reflect legitimately
useful information preserved.  Without any special treatment
for copyInto, the change count increases by 21 more, from
51 to 72 (i.e., quite a lot).
There is a test; using two naive "scores" for line number
churn, the old numbering is 2x or 4x worse.
Fixes #18902.
Change-Id: I0a0a69659d30ee4e5d10116a0dd2b8c5df8457b1
Reviewed-on: https://go-review.googlesource.com/36207
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-02-02 14:51:15 -05:00
										 |  |  | 	default: | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		p := v.Pos | 
					
						
							|  |  |  | 		if p != src.NoXPos { | 
					
						
							|  |  |  | 			// If the position is defined, update the position. | 
					
						
							|  |  |  | 			// Also convert default IsStmt to NotStmt; only | 
					
						
							|  |  |  | 			// explicit statement boundaries should appear | 
					
						
							|  |  |  | 			// in the generated code. | 
					
						
							|  |  |  | 			if p.IsStmt() != src.PosIsStmt { | 
					
						
							|  |  |  | 				p = p.WithNotStmt() | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			s.SetPos(p) | 
					
						
							| 
									
										
											  
											
												cmd/compile: reduce debugger-worsening line number churn
Reuse block head or preceding instruction's line number for
register allocator's spill, fill, copy, rematerialization
instructionsl; and also for phi, and for no-src-pos
instructions.  Assembler creates same line number tables
for copy-predecessor-line and for no-src-pos,
but copy-predecessor produces better-looking assembly
language output with -S and with GOSSAFUNC, and does not
require changes to tests of existing assembly language.
Split "copyInto" into two cases, one for register allocation,
one for otherwise.  This caused the test score line change
count to increase by one, which may reflect legitimately
useful information preserved.  Without any special treatment
for copyInto, the change count increases by 21 more, from
51 to 72 (i.e., quite a lot).
There is a test; using two naive "scores" for line number
churn, the old numbering is 2x or 4x worse.
Fixes #18902.
Change-Id: I0a0a69659d30ee4e5d10116a0dd2b8c5df8457b1
Reviewed-on: https://go-review.googlesource.com/36207
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-02-02 14:51:15 -05:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | // genssa appends entries to pp for each instruction in f. | 
					
						
							|  |  |  | func genssa(f *ssa.Func, pp *Progs) { | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | 	var s SSAGenState | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | 	e := f.Frontend().(*ssafn) | 
					
						
							| 
									
										
										
										
											2015-07-22 13:13:53 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-22 12:04:51 -04:00
										 |  |  | 	s.livenessMap = liveness(e, f) | 
					
						
							| 
									
										
										
										
											2017-03-09 18:32:17 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	// Remember where each block starts. | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 	s.bstart = make([]*obj.Prog, f.NumBlocks()) | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 	s.pp = pp | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 	var progToValue map[*obj.Prog]*ssa.Value | 
					
						
							|  |  |  | 	var progToBlock map[*obj.Prog]*ssa.Block | 
					
						
							| 
									
										
											  
											
												cmd/compile: adjust locationlist lifetimes
A statement like
  foo = bar + qux
might compile to
  AX := AX + BX
resulting in a regkill for AX before this instruction.
The buggy behavior is to kill AX "at" this instruction,
before it has executed.  (Code generation of no-instruction
values like RegKills applies their effects at the
next actual instruction emitted).
However, bar is still associated with AX until after the
instruction executes, so the effect of the regkill must
occur at the boundary between this instruction and the
next.  Similarly, the new value bound to AX is not visible
until this instruction executes (and in the case of values
that require multiple instructions in code generation, until
all of them have executed).
The ranges are adjusted so that a value's start occurs
at the next following instruction after its evaluation,
and the end occurs after (execution of) the first
instruction following the end of the lifetime as a value.
(Notice the asymmetry; the entire value must be finished
before it is visible, but execution of a single instruction
invalidates.  However, the value *is* visible before that
next instruction executes).
The test was adjusted to make it insensitive to the result
numbering for variables printed by gdb, since that is not
relevant to the test and makes the differences introduced
by small changes larger than necessary/useful.
The test was also improved to present variable probes
more intuitively, and also to allow explicit indication
of "this variable was optimized out"
Change-Id: I39453eead8399e6bb05ebd957289b112d1100c0e
Reviewed-on: https://go-review.googlesource.com/74090
Run-TryBot: David Chase <drchase@google.com>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2017-10-27 17:02:11 -04:00
										 |  |  | 	var valueToProgAfter []*obj.Prog // The first Prog following computation of a value v; v is visible at this point. | 
					
						
							| 
									
										
										
										
											2016-03-17 13:46:43 +11:00
										 |  |  | 	var logProgs = e.log | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 	if f.HTMLWriter != nil { | 
					
						
							|  |  |  | 		// logProgs can be false, meaning that we do not dump to the Stdout. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 		progToValue = make(map[*obj.Prog]*ssa.Value, f.NumValues()) | 
					
						
							|  |  |  | 		progToBlock = make(map[*obj.Prog]*ssa.Block, f.NumBlocks()) | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 		f.Logf("genssa %s\n", f.Name) | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 		progToBlock[s.pp.next] = f.Blocks[0] | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	if thearch.Use387 { | 
					
						
							| 
									
										
										
										
											2016-07-26 11:51:33 -07:00
										 |  |  | 		s.SSEto387 = map[int16]int16{} | 
					
						
							| 
									
										
										
										
											2016-08-10 11:44:57 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-30 14:34:16 -07:00
										 |  |  | 	s.ScratchFpMem = e.scratchFpMem | 
					
						
							| 
									
										
										
										
											2016-07-26 11:51:33 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												[dev.debug] cmd/compile: better DWARF with optimizations on
Debuggers use DWARF information to find local variables on the
stack and in registers. Prior to this CL, the DWARF information for
functions claimed that all variables were on the stack at all times.
That's incorrect when optimizations are enabled, and results in
debuggers showing data that is out of date or complete gibberish.
After this CL, the compiler is capable of representing variable
locations more accurately, and attempts to do so. Due to limitations of
the SSA backend, it's not possible to be completely correct.
There are a number of problems in the current design. One of the easier
to understand is that variable names currently must be attached to an
SSA value, but not all assignments in the source code actually result
in machine code. For example:
  type myint int
  var a int
  b := myint(int)
and
  b := (*uint64)(unsafe.Pointer(a))
don't generate machine code because the underlying representation is the
same, so the correct value of b will not be set when the user would
expect.
Generating the more precise debug information is behind a flag,
dwarflocationlists. Because of the issues described above, setting the
flag may not make the debugging experience much better, and may actually
make it worse in cases where the variable actually is on the stack and
the more complicated analysis doesn't realize it.
A number of changes are included:
- Add a new pseudo-instruction, RegKill, which indicates that the value
in the register has been clobbered.
- Adjust regalloc to emit RegKills in the right places. Significantly,
this means that phis are mixed with StoreReg and RegKills after
regalloc.
- Track variable decomposition in ssa.LocalSlots.
- After the SSA backend is done, analyze the result and build location
lists for each LocalSlot.
- After assembly is done, update the location lists with the assembled
PC offsets, recompose variables, and build DWARF location lists. Emit the
list as a new linker symbol, one per function.
- In the linker, aggregate the location lists into a .debug_loc section.
TODO:
- currently disabled for non-X86/AMD64 because there are no data tables.
go build -toolexec 'toolstash -cmp' -a std succeeds.
With -dwarflocationlists false:
before: f02812195637909ff675782c0b46836a8ff01976
after:  06f61e8112a42ac34fb80e0c818b3cdb84a5e7ec
benchstat -geomean  /tmp/220352263 /tmp/621364410
completed   15 of   15, estimated time remaining 0s (eta 3:52PM)
name        old time/op       new time/op       delta
Template          199ms ± 3%        198ms ± 2%     ~     (p=0.400 n=15+14)
Unicode          96.6ms ± 5%       96.4ms ± 5%     ~     (p=0.838 n=15+15)
GoTypes           653ms ± 2%        647ms ± 2%     ~     (p=0.102 n=15+14)
Flate             133ms ± 6%        129ms ± 3%   -2.62%  (p=0.041 n=15+15)
GoParser          164ms ± 5%        159ms ± 3%   -3.05%  (p=0.000 n=15+15)
Reflect           428ms ± 4%        422ms ± 3%     ~     (p=0.156 n=15+13)
Tar               123ms ±10%        124ms ± 8%     ~     (p=0.461 n=15+15)
XML               228ms ± 3%        224ms ± 3%   -1.57%  (p=0.045 n=15+15)
[Geo mean]        206ms             377ms       +82.86%
name        old user-time/op  new user-time/op  delta
Template          292ms ±10%        301ms ±12%     ~     (p=0.189 n=15+15)
Unicode           166ms ±37%        158ms ±14%     ~     (p=0.418 n=15+14)
GoTypes           962ms ± 6%        963ms ± 7%     ~     (p=0.976 n=15+15)
Flate             207ms ±19%        200ms ±14%     ~     (p=0.345 n=14+15)
GoParser          246ms ±22%        240ms ±15%     ~     (p=0.587 n=15+15)
Reflect           611ms ±13%        587ms ±14%     ~     (p=0.085 n=15+13)
Tar               211ms ±12%        217ms ±14%     ~     (p=0.355 n=14+15)
XML               335ms ±15%        320ms ±18%     ~     (p=0.169 n=15+15)
[Geo mean]        317ms             583ms       +83.72%
name        old alloc/op      new alloc/op      delta
Template         40.2MB ± 0%       40.2MB ± 0%   -0.15%  (p=0.000 n=14+15)
Unicode          29.2MB ± 0%       29.3MB ± 0%     ~     (p=0.624 n=15+15)
GoTypes           114MB ± 0%        114MB ± 0%   -0.15%  (p=0.000 n=15+14)
Flate            25.7MB ± 0%       25.6MB ± 0%   -0.18%  (p=0.000 n=13+15)
GoParser         32.2MB ± 0%       32.2MB ± 0%   -0.14%  (p=0.003 n=15+15)
Reflect          77.8MB ± 0%       77.9MB ± 0%     ~     (p=0.061 n=15+15)
Tar              27.1MB ± 0%       27.0MB ± 0%   -0.11%  (p=0.029 n=15+15)
XML              42.7MB ± 0%       42.5MB ± 0%   -0.29%  (p=0.000 n=15+15)
[Geo mean]       42.1MB            75.0MB       +78.05%
name        old allocs/op     new allocs/op     delta
Template           402k ± 1%         398k ± 0%   -0.91%  (p=0.000 n=15+15)
Unicode            344k ± 1%         344k ± 0%     ~     (p=0.715 n=15+14)
GoTypes           1.18M ± 0%        1.17M ± 0%   -0.91%  (p=0.000 n=15+14)
Flate              243k ± 0%         240k ± 1%   -1.05%  (p=0.000 n=13+15)
GoParser           327k ± 1%         324k ± 1%   -0.96%  (p=0.000 n=15+15)
Reflect            984k ± 1%         982k ± 0%     ~     (p=0.050 n=15+15)
Tar                261k ± 1%         259k ± 1%   -0.77%  (p=0.000 n=15+15)
XML                411k ± 0%         404k ± 1%   -1.55%  (p=0.000 n=15+15)
[Geo mean]         439k              755k       +72.01%
name        old text-bytes    new text-bytes    delta
HelloSize         694kB ± 0%        694kB ± 0%   -0.00%  (p=0.000 n=15+15)
name        old data-bytes    new data-bytes    delta
HelloSize        5.55kB ± 0%       5.55kB ± 0%     ~     (all equal)
name        old bss-bytes     new bss-bytes     delta
HelloSize         133kB ± 0%        133kB ± 0%     ~     (all equal)
name        old exe-bytes     new exe-bytes     delta
HelloSize        1.04MB ± 0%       1.04MB ± 0%     ~     (all equal)
Change-Id: I991fc553ef175db46bb23b2128317bbd48de70d8
Reviewed-on: https://go-review.googlesource.com/41770
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2017-07-21 18:30:19 -04:00
										 |  |  | 	if Ctxt.Flag_locationlists { | 
					
						
							| 
									
										
										
										
											2018-01-29 17:01:41 -05:00
										 |  |  | 		if cap(f.Cache.ValueToProgAfter) < f.NumValues() { | 
					
						
							|  |  |  | 			f.Cache.ValueToProgAfter = make([]*obj.Prog, f.NumValues()) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		valueToProgAfter = f.Cache.ValueToProgAfter[:f.NumValues()] | 
					
						
							|  |  |  | 		for i := range valueToProgAfter { | 
					
						
							|  |  |  | 			valueToProgAfter[i] = nil | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
											  
											
												[dev.debug] cmd/compile: better DWARF with optimizations on
Debuggers use DWARF information to find local variables on the
stack and in registers. Prior to this CL, the DWARF information for
functions claimed that all variables were on the stack at all times.
That's incorrect when optimizations are enabled, and results in
debuggers showing data that is out of date or complete gibberish.
After this CL, the compiler is capable of representing variable
locations more accurately, and attempts to do so. Due to limitations of
the SSA backend, it's not possible to be completely correct.
There are a number of problems in the current design. One of the easier
to understand is that variable names currently must be attached to an
SSA value, but not all assignments in the source code actually result
in machine code. For example:
  type myint int
  var a int
  b := myint(int)
and
  b := (*uint64)(unsafe.Pointer(a))
don't generate machine code because the underlying representation is the
same, so the correct value of b will not be set when the user would
expect.
Generating the more precise debug information is behind a flag,
dwarflocationlists. Because of the issues described above, setting the
flag may not make the debugging experience much better, and may actually
make it worse in cases where the variable actually is on the stack and
the more complicated analysis doesn't realize it.
A number of changes are included:
- Add a new pseudo-instruction, RegKill, which indicates that the value
in the register has been clobbered.
- Adjust regalloc to emit RegKills in the right places. Significantly,
this means that phis are mixed with StoreReg and RegKills after
regalloc.
- Track variable decomposition in ssa.LocalSlots.
- After the SSA backend is done, analyze the result and build location
lists for each LocalSlot.
- After assembly is done, update the location lists with the assembled
PC offsets, recompose variables, and build DWARF location lists. Emit the
list as a new linker symbol, one per function.
- In the linker, aggregate the location lists into a .debug_loc section.
TODO:
- currently disabled for non-X86/AMD64 because there are no data tables.
go build -toolexec 'toolstash -cmp' -a std succeeds.
With -dwarflocationlists false:
before: f02812195637909ff675782c0b46836a8ff01976
after:  06f61e8112a42ac34fb80e0c818b3cdb84a5e7ec
benchstat -geomean  /tmp/220352263 /tmp/621364410
completed   15 of   15, estimated time remaining 0s (eta 3:52PM)
name        old time/op       new time/op       delta
Template          199ms ± 3%        198ms ± 2%     ~     (p=0.400 n=15+14)
Unicode          96.6ms ± 5%       96.4ms ± 5%     ~     (p=0.838 n=15+15)
GoTypes           653ms ± 2%        647ms ± 2%     ~     (p=0.102 n=15+14)
Flate             133ms ± 6%        129ms ± 3%   -2.62%  (p=0.041 n=15+15)
GoParser          164ms ± 5%        159ms ± 3%   -3.05%  (p=0.000 n=15+15)
Reflect           428ms ± 4%        422ms ± 3%     ~     (p=0.156 n=15+13)
Tar               123ms ±10%        124ms ± 8%     ~     (p=0.461 n=15+15)
XML               228ms ± 3%        224ms ± 3%   -1.57%  (p=0.045 n=15+15)
[Geo mean]        206ms             377ms       +82.86%
name        old user-time/op  new user-time/op  delta
Template          292ms ±10%        301ms ±12%     ~     (p=0.189 n=15+15)
Unicode           166ms ±37%        158ms ±14%     ~     (p=0.418 n=15+14)
GoTypes           962ms ± 6%        963ms ± 7%     ~     (p=0.976 n=15+15)
Flate             207ms ±19%        200ms ±14%     ~     (p=0.345 n=14+15)
GoParser          246ms ±22%        240ms ±15%     ~     (p=0.587 n=15+15)
Reflect           611ms ±13%        587ms ±14%     ~     (p=0.085 n=15+13)
Tar               211ms ±12%        217ms ±14%     ~     (p=0.355 n=14+15)
XML               335ms ±15%        320ms ±18%     ~     (p=0.169 n=15+15)
[Geo mean]        317ms             583ms       +83.72%
name        old alloc/op      new alloc/op      delta
Template         40.2MB ± 0%       40.2MB ± 0%   -0.15%  (p=0.000 n=14+15)
Unicode          29.2MB ± 0%       29.3MB ± 0%     ~     (p=0.624 n=15+15)
GoTypes           114MB ± 0%        114MB ± 0%   -0.15%  (p=0.000 n=15+14)
Flate            25.7MB ± 0%       25.6MB ± 0%   -0.18%  (p=0.000 n=13+15)
GoParser         32.2MB ± 0%       32.2MB ± 0%   -0.14%  (p=0.003 n=15+15)
Reflect          77.8MB ± 0%       77.9MB ± 0%     ~     (p=0.061 n=15+15)
Tar              27.1MB ± 0%       27.0MB ± 0%   -0.11%  (p=0.029 n=15+15)
XML              42.7MB ± 0%       42.5MB ± 0%   -0.29%  (p=0.000 n=15+15)
[Geo mean]       42.1MB            75.0MB       +78.05%
name        old allocs/op     new allocs/op     delta
Template           402k ± 1%         398k ± 0%   -0.91%  (p=0.000 n=15+15)
Unicode            344k ± 1%         344k ± 0%     ~     (p=0.715 n=15+14)
GoTypes           1.18M ± 0%        1.17M ± 0%   -0.91%  (p=0.000 n=15+14)
Flate              243k ± 0%         240k ± 1%   -1.05%  (p=0.000 n=13+15)
GoParser           327k ± 1%         324k ± 1%   -0.96%  (p=0.000 n=15+15)
Reflect            984k ± 1%         982k ± 0%     ~     (p=0.050 n=15+15)
Tar                261k ± 1%         259k ± 1%   -0.77%  (p=0.000 n=15+15)
XML                411k ± 0%         404k ± 1%   -1.55%  (p=0.000 n=15+15)
[Geo mean]         439k              755k       +72.01%
name        old text-bytes    new text-bytes    delta
HelloSize         694kB ± 0%        694kB ± 0%   -0.00%  (p=0.000 n=15+15)
name        old data-bytes    new data-bytes    delta
HelloSize        5.55kB ± 0%       5.55kB ± 0%     ~     (all equal)
name        old bss-bytes     new bss-bytes     delta
HelloSize         133kB ± 0%        133kB ± 0%     ~     (all equal)
name        old exe-bytes     new exe-bytes     delta
HelloSize        1.04MB ± 0%       1.04MB ± 0%     ~     (all equal)
Change-Id: I991fc553ef175db46bb23b2128317bbd48de70d8
Reviewed-on: https://go-review.googlesource.com/41770
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2017-07-21 18:30:19 -04:00
										 |  |  | 	} | 
					
						
							| 
									
										
											  
											
												cmd/compile: adjust locationlist lifetimes
A statement like
  foo = bar + qux
might compile to
  AX := AX + BX
resulting in a regkill for AX before this instruction.
The buggy behavior is to kill AX "at" this instruction,
before it has executed.  (Code generation of no-instruction
values like RegKills applies their effects at the
next actual instruction emitted).
However, bar is still associated with AX until after the
instruction executes, so the effect of the regkill must
occur at the boundary between this instruction and the
next.  Similarly, the new value bound to AX is not visible
until this instruction executes (and in the case of values
that require multiple instructions in code generation, until
all of them have executed).
The ranges are adjusted so that a value's start occurs
at the next following instruction after its evaluation,
and the end occurs after (execution of) the first
instruction following the end of the lifetime as a value.
(Notice the asymmetry; the entire value must be finished
before it is visible, but execution of a single instruction
invalidates.  However, the value *is* visible before that
next instruction executes).
The test was adjusted to make it insensitive to the result
numbering for variables printed by gdb, since that is not
relevant to the test and makes the differences introduced
by small changes larger than necessary/useful.
The test was also improved to present variable probes
more intuitively, and also to allow explicit indication
of "this variable was optimized out"
Change-Id: I39453eead8399e6bb05ebd957289b112d1100c0e
Reviewed-on: https://go-review.googlesource.com/74090
Run-TryBot: David Chase <drchase@google.com>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2017-10-27 17:02:11 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-04-05 11:11:39 -04:00
										 |  |  | 	// If the very first instruction is not tagged as a statement, | 
					
						
							|  |  |  | 	// debuggers may attribute it to previous function in program. | 
					
						
							|  |  |  | 	firstPos := src.NoXPos | 
					
						
							|  |  |  | 	for _, v := range f.Entry.Values { | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		if v.Pos.IsStmt() == src.PosIsStmt { | 
					
						
							|  |  |  | 			firstPos = v.Pos | 
					
						
							|  |  |  | 			v.Pos = firstPos.WithDefaultStmt() | 
					
						
							| 
									
										
										
										
											2018-04-05 11:11:39 -04:00
										 |  |  | 			break | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	// Emit basic blocks | 
					
						
							|  |  |  | 	for i, b := range f.Blocks { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 		s.bstart[b.ID] = s.pp.next | 
					
						
							| 
									
										
										
										
											2018-03-27 16:11:10 -04:00
										 |  |  | 		s.pp.nextLive = LivenessInvalid | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		s.lineRunStart = nil | 
					
						
							| 
									
										
										
										
											2018-03-27 16:11:10 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 		// Emit values in block | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 		thearch.SSAMarkMoves(&s, b) | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 		for _, v := range b.Values { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 			x := s.pp.next | 
					
						
							| 
									
										
											  
											
												cmd/compile: reduce debugger-worsening line number churn
Reuse block head or preceding instruction's line number for
register allocator's spill, fill, copy, rematerialization
instructionsl; and also for phi, and for no-src-pos
instructions.  Assembler creates same line number tables
for copy-predecessor-line and for no-src-pos,
but copy-predecessor produces better-looking assembly
language output with -S and with GOSSAFUNC, and does not
require changes to tests of existing assembly language.
Split "copyInto" into two cases, one for register allocation,
one for otherwise.  This caused the test score line change
count to increase by one, which may reflect legitimately
useful information preserved.  Without any special treatment
for copyInto, the change count increases by 21 more, from
51 to 72 (i.e., quite a lot).
There is a test; using two naive "scores" for line number
churn, the old numbering is 2x or 4x worse.
Fixes #18902.
Change-Id: I0a0a69659d30ee4e5d10116a0dd2b8c5df8457b1
Reviewed-on: https://go-review.googlesource.com/36207
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-02-02 14:51:15 -05:00
										 |  |  | 			s.DebugFriendlySetPosFrom(v) | 
					
						
							| 
									
										
										
										
											2018-03-27 16:11:10 -04:00
										 |  |  | 			// Attach this safe point to the next | 
					
						
							|  |  |  | 			// instruction. | 
					
						
							|  |  |  | 			s.pp.nextLive = s.livenessMap.Get(v) | 
					
						
							| 
									
										
										
										
											2017-03-07 12:31:59 -08:00
										 |  |  | 			switch v.Op { | 
					
						
							|  |  |  | 			case ssa.OpInitMem: | 
					
						
							|  |  |  | 				// memory arg needs no code | 
					
						
							|  |  |  | 			case ssa.OpArg: | 
					
						
							|  |  |  | 				// input args need no code | 
					
						
							|  |  |  | 			case ssa.OpSP, ssa.OpSB: | 
					
						
							|  |  |  | 				// nothing to do | 
					
						
							|  |  |  | 			case ssa.OpSelect0, ssa.OpSelect1: | 
					
						
							|  |  |  | 				// nothing to do | 
					
						
							|  |  |  | 			case ssa.OpGetG: | 
					
						
							|  |  |  | 				// nothing to do when there's a g register, | 
					
						
							|  |  |  | 				// and checkLower complains if there's not | 
					
						
							| 
									
										
										
										
											2017-04-19 11:19:53 -07:00
										 |  |  | 			case ssa.OpVarDef, ssa.OpVarLive, ssa.OpKeepAlive: | 
					
						
							| 
									
										
										
										
											2017-03-09 18:32:17 -08:00
										 |  |  | 				// nothing to do; already used by liveness | 
					
						
							| 
									
										
										
										
											2017-04-19 11:19:53 -07:00
										 |  |  | 			case ssa.OpVarKill: | 
					
						
							|  |  |  | 				// Zero variable if it is ambiguously live. | 
					
						
							|  |  |  | 				// After the VARKILL anything this variable references | 
					
						
							|  |  |  | 				// might be collected. If it were to become live again later, | 
					
						
							|  |  |  | 				// the GC will see references to already-collected objects. | 
					
						
							|  |  |  | 				// See issue 20029. | 
					
						
							|  |  |  | 				n := v.Aux.(*Node) | 
					
						
							|  |  |  | 				if n.Name.Needzero() { | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 					if n.Class() != PAUTO { | 
					
						
							| 
									
										
										
										
											2017-04-19 11:19:53 -07:00
										 |  |  | 						v.Fatalf("zero of variable which isn't PAUTO %v", n) | 
					
						
							|  |  |  | 					} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 					if n.Type.Size()%int64(Widthptr) != 0 { | 
					
						
							| 
									
										
										
										
											2017-04-19 11:19:53 -07:00
										 |  |  | 						v.Fatalf("zero of variable not a multiple of ptr size %v", n) | 
					
						
							|  |  |  | 					} | 
					
						
							|  |  |  | 					thearch.ZeroAuto(s.pp, n) | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-03-07 12:31:59 -08:00
										 |  |  | 			case ssa.OpPhi: | 
					
						
							|  |  |  | 				CheckLoweredPhi(v) | 
					
						
							| 
									
										
											  
											
												cmd/compile: don't lower OpConvert
Currently, each architecture lowers OpConvert to an arch-specific
OpXXXconvert. This is silly because OpConvert means the same thing on
all architectures and is logically a no-op that exists only to keep
track of conversions to and from unsafe.Pointer. Furthermore, lowering
it makes it harder to recognize in other analyses, particularly
liveness analysis.
This CL eliminates the lowering of OpConvert, leaving it as the
generic op until code generation time.
The main complexity here is that we still need to register-allocate
OpConvert operations. Currently, each arch's lowered OpConvert
specifies all GP registers in its register mask. Ideally, OpConvert
wouldn't affect value homing at all, and we could just copy the home
of OpConvert's source, but this can potentially home an OpConvert in a
LocalSlot, which neither regalloc nor stackalloc expect. Rather than
try to disentangle this assumption from regalloc and stackalloc, we
continue to register-allocate OpConvert, but teach regalloc that
OpConvert can be allocated to any allocatable GP register.
For #24543.
Change-Id: I795a6aee5fd94d4444a7bafac3838a400c9f7bb6
Reviewed-on: https://go-review.googlesource.com/108496
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2018-04-02 16:08:09 -04:00
										 |  |  | 			case ssa.OpConvert: | 
					
						
							|  |  |  | 				// nothing to do; no-op conversion for liveness | 
					
						
							|  |  |  | 				if v.Args[0].Reg() != v.Reg() { | 
					
						
							|  |  |  | 					v.Fatalf("OpConvert should be a no-op: %s; %s", v.Args[0].LongString(), v.LongString()) | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-03-07 12:31:59 -08:00
										 |  |  | 			default: | 
					
						
							|  |  |  | 				// let the backend handle it | 
					
						
							| 
									
										
										
										
											2018-04-05 11:11:39 -04:00
										 |  |  | 				// Special case for first line in function; move it to the start. | 
					
						
							|  |  |  | 				if firstPos != src.NoXPos { | 
					
						
							|  |  |  | 					s.SetPos(firstPos) | 
					
						
							|  |  |  | 					firstPos = src.NoXPos | 
					
						
							|  |  |  | 				} | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 				thearch.SSAGenValue(&s, v) | 
					
						
							| 
									
										
										
										
											2017-03-07 12:31:59 -08:00
										 |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												[dev.debug] cmd/compile: better DWARF with optimizations on
Debuggers use DWARF information to find local variables on the
stack and in registers. Prior to this CL, the DWARF information for
functions claimed that all variables were on the stack at all times.
That's incorrect when optimizations are enabled, and results in
debuggers showing data that is out of date or complete gibberish.
After this CL, the compiler is capable of representing variable
locations more accurately, and attempts to do so. Due to limitations of
the SSA backend, it's not possible to be completely correct.
There are a number of problems in the current design. One of the easier
to understand is that variable names currently must be attached to an
SSA value, but not all assignments in the source code actually result
in machine code. For example:
  type myint int
  var a int
  b := myint(int)
and
  b := (*uint64)(unsafe.Pointer(a))
don't generate machine code because the underlying representation is the
same, so the correct value of b will not be set when the user would
expect.
Generating the more precise debug information is behind a flag,
dwarflocationlists. Because of the issues described above, setting the
flag may not make the debugging experience much better, and may actually
make it worse in cases where the variable actually is on the stack and
the more complicated analysis doesn't realize it.
A number of changes are included:
- Add a new pseudo-instruction, RegKill, which indicates that the value
in the register has been clobbered.
- Adjust regalloc to emit RegKills in the right places. Significantly,
this means that phis are mixed with StoreReg and RegKills after
regalloc.
- Track variable decomposition in ssa.LocalSlots.
- After the SSA backend is done, analyze the result and build location
lists for each LocalSlot.
- After assembly is done, update the location lists with the assembled
PC offsets, recompose variables, and build DWARF location lists. Emit the
list as a new linker symbol, one per function.
- In the linker, aggregate the location lists into a .debug_loc section.
TODO:
- currently disabled for non-X86/AMD64 because there are no data tables.
go build -toolexec 'toolstash -cmp' -a std succeeds.
With -dwarflocationlists false:
before: f02812195637909ff675782c0b46836a8ff01976
after:  06f61e8112a42ac34fb80e0c818b3cdb84a5e7ec
benchstat -geomean  /tmp/220352263 /tmp/621364410
completed   15 of   15, estimated time remaining 0s (eta 3:52PM)
name        old time/op       new time/op       delta
Template          199ms ± 3%        198ms ± 2%     ~     (p=0.400 n=15+14)
Unicode          96.6ms ± 5%       96.4ms ± 5%     ~     (p=0.838 n=15+15)
GoTypes           653ms ± 2%        647ms ± 2%     ~     (p=0.102 n=15+14)
Flate             133ms ± 6%        129ms ± 3%   -2.62%  (p=0.041 n=15+15)
GoParser          164ms ± 5%        159ms ± 3%   -3.05%  (p=0.000 n=15+15)
Reflect           428ms ± 4%        422ms ± 3%     ~     (p=0.156 n=15+13)
Tar               123ms ±10%        124ms ± 8%     ~     (p=0.461 n=15+15)
XML               228ms ± 3%        224ms ± 3%   -1.57%  (p=0.045 n=15+15)
[Geo mean]        206ms             377ms       +82.86%
name        old user-time/op  new user-time/op  delta
Template          292ms ±10%        301ms ±12%     ~     (p=0.189 n=15+15)
Unicode           166ms ±37%        158ms ±14%     ~     (p=0.418 n=15+14)
GoTypes           962ms ± 6%        963ms ± 7%     ~     (p=0.976 n=15+15)
Flate             207ms ±19%        200ms ±14%     ~     (p=0.345 n=14+15)
GoParser          246ms ±22%        240ms ±15%     ~     (p=0.587 n=15+15)
Reflect           611ms ±13%        587ms ±14%     ~     (p=0.085 n=15+13)
Tar               211ms ±12%        217ms ±14%     ~     (p=0.355 n=14+15)
XML               335ms ±15%        320ms ±18%     ~     (p=0.169 n=15+15)
[Geo mean]        317ms             583ms       +83.72%
name        old alloc/op      new alloc/op      delta
Template         40.2MB ± 0%       40.2MB ± 0%   -0.15%  (p=0.000 n=14+15)
Unicode          29.2MB ± 0%       29.3MB ± 0%     ~     (p=0.624 n=15+15)
GoTypes           114MB ± 0%        114MB ± 0%   -0.15%  (p=0.000 n=15+14)
Flate            25.7MB ± 0%       25.6MB ± 0%   -0.18%  (p=0.000 n=13+15)
GoParser         32.2MB ± 0%       32.2MB ± 0%   -0.14%  (p=0.003 n=15+15)
Reflect          77.8MB ± 0%       77.9MB ± 0%     ~     (p=0.061 n=15+15)
Tar              27.1MB ± 0%       27.0MB ± 0%   -0.11%  (p=0.029 n=15+15)
XML              42.7MB ± 0%       42.5MB ± 0%   -0.29%  (p=0.000 n=15+15)
[Geo mean]       42.1MB            75.0MB       +78.05%
name        old allocs/op     new allocs/op     delta
Template           402k ± 1%         398k ± 0%   -0.91%  (p=0.000 n=15+15)
Unicode            344k ± 1%         344k ± 0%     ~     (p=0.715 n=15+14)
GoTypes           1.18M ± 0%        1.17M ± 0%   -0.91%  (p=0.000 n=15+14)
Flate              243k ± 0%         240k ± 1%   -1.05%  (p=0.000 n=13+15)
GoParser           327k ± 1%         324k ± 1%   -0.96%  (p=0.000 n=15+15)
Reflect            984k ± 1%         982k ± 0%     ~     (p=0.050 n=15+15)
Tar                261k ± 1%         259k ± 1%   -0.77%  (p=0.000 n=15+15)
XML                411k ± 0%         404k ± 1%   -1.55%  (p=0.000 n=15+15)
[Geo mean]         439k              755k       +72.01%
name        old text-bytes    new text-bytes    delta
HelloSize         694kB ± 0%        694kB ± 0%   -0.00%  (p=0.000 n=15+15)
name        old data-bytes    new data-bytes    delta
HelloSize        5.55kB ± 0%       5.55kB ± 0%     ~     (all equal)
name        old bss-bytes     new bss-bytes     delta
HelloSize         133kB ± 0%        133kB ± 0%     ~     (all equal)
name        old exe-bytes     new exe-bytes     delta
HelloSize        1.04MB ± 0%       1.04MB ± 0%     ~     (all equal)
Change-Id: I991fc553ef175db46bb23b2128317bbd48de70d8
Reviewed-on: https://go-review.googlesource.com/41770
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2017-07-21 18:30:19 -04:00
										 |  |  | 			if Ctxt.Flag_locationlists { | 
					
						
							| 
									
										
											  
											
												cmd/compile: adjust locationlist lifetimes
A statement like
  foo = bar + qux
might compile to
  AX := AX + BX
resulting in a regkill for AX before this instruction.
The buggy behavior is to kill AX "at" this instruction,
before it has executed.  (Code generation of no-instruction
values like RegKills applies their effects at the
next actual instruction emitted).
However, bar is still associated with AX until after the
instruction executes, so the effect of the regkill must
occur at the boundary between this instruction and the
next.  Similarly, the new value bound to AX is not visible
until this instruction executes (and in the case of values
that require multiple instructions in code generation, until
all of them have executed).
The ranges are adjusted so that a value's start occurs
at the next following instruction after its evaluation,
and the end occurs after (execution of) the first
instruction following the end of the lifetime as a value.
(Notice the asymmetry; the entire value must be finished
before it is visible, but execution of a single instruction
invalidates.  However, the value *is* visible before that
next instruction executes).
The test was adjusted to make it insensitive to the result
numbering for variables printed by gdb, since that is not
relevant to the test and makes the differences introduced
by small changes larger than necessary/useful.
The test was also improved to present variable probes
more intuitively, and also to allow explicit indication
of "this variable was optimized out"
Change-Id: I39453eead8399e6bb05ebd957289b112d1100c0e
Reviewed-on: https://go-review.googlesource.com/74090
Run-TryBot: David Chase <drchase@google.com>
Reviewed-by: Cherry Zhang <cherryyz@google.com>
											
										 
											2017-10-27 17:02:11 -04:00
										 |  |  | 				valueToProgAfter[v.ID] = s.pp.next | 
					
						
							| 
									
										
											  
											
												[dev.debug] cmd/compile: better DWARF with optimizations on
Debuggers use DWARF information to find local variables on the
stack and in registers. Prior to this CL, the DWARF information for
functions claimed that all variables were on the stack at all times.
That's incorrect when optimizations are enabled, and results in
debuggers showing data that is out of date or complete gibberish.
After this CL, the compiler is capable of representing variable
locations more accurately, and attempts to do so. Due to limitations of
the SSA backend, it's not possible to be completely correct.
There are a number of problems in the current design. One of the easier
to understand is that variable names currently must be attached to an
SSA value, but not all assignments in the source code actually result
in machine code. For example:
  type myint int
  var a int
  b := myint(int)
and
  b := (*uint64)(unsafe.Pointer(a))
don't generate machine code because the underlying representation is the
same, so the correct value of b will not be set when the user would
expect.
Generating the more precise debug information is behind a flag,
dwarflocationlists. Because of the issues described above, setting the
flag may not make the debugging experience much better, and may actually
make it worse in cases where the variable actually is on the stack and
the more complicated analysis doesn't realize it.
A number of changes are included:
- Add a new pseudo-instruction, RegKill, which indicates that the value
in the register has been clobbered.
- Adjust regalloc to emit RegKills in the right places. Significantly,
this means that phis are mixed with StoreReg and RegKills after
regalloc.
- Track variable decomposition in ssa.LocalSlots.
- After the SSA backend is done, analyze the result and build location
lists for each LocalSlot.
- After assembly is done, update the location lists with the assembled
PC offsets, recompose variables, and build DWARF location lists. Emit the
list as a new linker symbol, one per function.
- In the linker, aggregate the location lists into a .debug_loc section.
TODO:
- currently disabled for non-X86/AMD64 because there are no data tables.
go build -toolexec 'toolstash -cmp' -a std succeeds.
With -dwarflocationlists false:
before: f02812195637909ff675782c0b46836a8ff01976
after:  06f61e8112a42ac34fb80e0c818b3cdb84a5e7ec
benchstat -geomean  /tmp/220352263 /tmp/621364410
completed   15 of   15, estimated time remaining 0s (eta 3:52PM)
name        old time/op       new time/op       delta
Template          199ms ± 3%        198ms ± 2%     ~     (p=0.400 n=15+14)
Unicode          96.6ms ± 5%       96.4ms ± 5%     ~     (p=0.838 n=15+15)
GoTypes           653ms ± 2%        647ms ± 2%     ~     (p=0.102 n=15+14)
Flate             133ms ± 6%        129ms ± 3%   -2.62%  (p=0.041 n=15+15)
GoParser          164ms ± 5%        159ms ± 3%   -3.05%  (p=0.000 n=15+15)
Reflect           428ms ± 4%        422ms ± 3%     ~     (p=0.156 n=15+13)
Tar               123ms ±10%        124ms ± 8%     ~     (p=0.461 n=15+15)
XML               228ms ± 3%        224ms ± 3%   -1.57%  (p=0.045 n=15+15)
[Geo mean]        206ms             377ms       +82.86%
name        old user-time/op  new user-time/op  delta
Template          292ms ±10%        301ms ±12%     ~     (p=0.189 n=15+15)
Unicode           166ms ±37%        158ms ±14%     ~     (p=0.418 n=15+14)
GoTypes           962ms ± 6%        963ms ± 7%     ~     (p=0.976 n=15+15)
Flate             207ms ±19%        200ms ±14%     ~     (p=0.345 n=14+15)
GoParser          246ms ±22%        240ms ±15%     ~     (p=0.587 n=15+15)
Reflect           611ms ±13%        587ms ±14%     ~     (p=0.085 n=15+13)
Tar               211ms ±12%        217ms ±14%     ~     (p=0.355 n=14+15)
XML               335ms ±15%        320ms ±18%     ~     (p=0.169 n=15+15)
[Geo mean]        317ms             583ms       +83.72%
name        old alloc/op      new alloc/op      delta
Template         40.2MB ± 0%       40.2MB ± 0%   -0.15%  (p=0.000 n=14+15)
Unicode          29.2MB ± 0%       29.3MB ± 0%     ~     (p=0.624 n=15+15)
GoTypes           114MB ± 0%        114MB ± 0%   -0.15%  (p=0.000 n=15+14)
Flate            25.7MB ± 0%       25.6MB ± 0%   -0.18%  (p=0.000 n=13+15)
GoParser         32.2MB ± 0%       32.2MB ± 0%   -0.14%  (p=0.003 n=15+15)
Reflect          77.8MB ± 0%       77.9MB ± 0%     ~     (p=0.061 n=15+15)
Tar              27.1MB ± 0%       27.0MB ± 0%   -0.11%  (p=0.029 n=15+15)
XML              42.7MB ± 0%       42.5MB ± 0%   -0.29%  (p=0.000 n=15+15)
[Geo mean]       42.1MB            75.0MB       +78.05%
name        old allocs/op     new allocs/op     delta
Template           402k ± 1%         398k ± 0%   -0.91%  (p=0.000 n=15+15)
Unicode            344k ± 1%         344k ± 0%     ~     (p=0.715 n=15+14)
GoTypes           1.18M ± 0%        1.17M ± 0%   -0.91%  (p=0.000 n=15+14)
Flate              243k ± 0%         240k ± 1%   -1.05%  (p=0.000 n=13+15)
GoParser           327k ± 1%         324k ± 1%   -0.96%  (p=0.000 n=15+15)
Reflect            984k ± 1%         982k ± 0%     ~     (p=0.050 n=15+15)
Tar                261k ± 1%         259k ± 1%   -0.77%  (p=0.000 n=15+15)
XML                411k ± 0%         404k ± 1%   -1.55%  (p=0.000 n=15+15)
[Geo mean]         439k              755k       +72.01%
name        old text-bytes    new text-bytes    delta
HelloSize         694kB ± 0%        694kB ± 0%   -0.00%  (p=0.000 n=15+15)
name        old data-bytes    new data-bytes    delta
HelloSize        5.55kB ± 0%       5.55kB ± 0%     ~     (all equal)
name        old bss-bytes     new bss-bytes     delta
HelloSize         133kB ± 0%        133kB ± 0%     ~     (all equal)
name        old exe-bytes     new exe-bytes     delta
HelloSize        1.04MB ± 0%       1.04MB ± 0%     ~     (all equal)
Change-Id: I991fc553ef175db46bb23b2128317bbd48de70d8
Reviewed-on: https://go-review.googlesource.com/41770
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2017-07-21 18:30:19 -04:00
										 |  |  | 			} | 
					
						
							| 
									
										
											  
											
												cmd/compile: reimplement location list generation
Completely redesign and reimplement location list generation to be more
efficient, and hopefully not too hard to understand.
RegKills are gone. Instead of using the regalloc's liveness
calculations, redo them using the Ops' clobber information. Besides
saving a lot of Values, this avoids adding RegKills to blocks that would
be empty otherwise, which was messing up optimizations. This does mean
that it's much harder to tell whether the generation process is buggy
(there's nothing to cross-check it with), and there may be disagreements
with GC liveness. But the performance gain is significant, and it's nice
not to be messing with earlier compiler phases.
The intermediate representations are gone. Instead of producing
ssa.BlockDebugs, then dwarf.LocationLists, and then finally real
location lists, go directly from the SSA to a (mostly) real location
list. Because the SSA analysis happens before assembly, it stores
encoded block/value IDs where PCs would normally go. It would be easier
to do the SSA analysis after assembly, but I didn't want to retain the
SSA just for that.
Generation proceeds in two phases: first, it traverses the function in
CFG order, storing the state of the block at the beginning and end. End
states are used to produce the start states of the successor blocks. In
the second phase, it traverses in program text order and produces the
location lists. The processing in the second phase is redundant, but
much cheaper than storing the intermediate representation. It might be
possible to combine the two phases somewhat to take advantage of cases
where the CFG matches the block layout, but I haven't tried.
Location lists are finalized by adding a base address selection entry,
translating each encoded block/value ID to a real PC, and adding the
terminating zero entry. This probably won't work on OSX, where dsymutil
will choke on the base address selection. I tried emitting CU-relative
relocations for each address, and it was *very* bad for performance --
it uses more memory storing all the relocations than it does for the
actual location list bytes. I think I'm going to end up synthesizing the
relocations in the linker only on OSX, but TBD.
TestNexting needs updating: with more optimizations working, the
debugger doesn't stop on the continue (line 88) any more, and the test's
duplicate suppression kicks in. Also, dx and dy live a little longer
now, but they have the correct values.
Change-Id: Ie772dfe23a4e389ca573624fac4d05401ae32307
Reviewed-on: https://go-review.googlesource.com/89356
Run-TryBot: Heschi Kreinick <heschi@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-10-26 15:40:17 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 			if logProgs { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 				for ; x != s.pp.next; x = x.Link { | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 					progToValue[x] = v | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 				} | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
											  
											
												cmd/compile: reimplement location list generation
Completely redesign and reimplement location list generation to be more
efficient, and hopefully not too hard to understand.
RegKills are gone. Instead of using the regalloc's liveness
calculations, redo them using the Ops' clobber information. Besides
saving a lot of Values, this avoids adding RegKills to blocks that would
be empty otherwise, which was messing up optimizations. This does mean
that it's much harder to tell whether the generation process is buggy
(there's nothing to cross-check it with), and there may be disagreements
with GC liveness. But the performance gain is significant, and it's nice
not to be messing with earlier compiler phases.
The intermediate representations are gone. Instead of producing
ssa.BlockDebugs, then dwarf.LocationLists, and then finally real
location lists, go directly from the SSA to a (mostly) real location
list. Because the SSA analysis happens before assembly, it stores
encoded block/value IDs where PCs would normally go. It would be easier
to do the SSA analysis after assembly, but I didn't want to retain the
SSA just for that.
Generation proceeds in two phases: first, it traverses the function in
CFG order, storing the state of the block at the beginning and end. End
states are used to produce the start states of the successor blocks. In
the second phase, it traverses in program text order and produces the
location lists. The processing in the second phase is redundant, but
much cheaper than storing the intermediate representation. It might be
possible to combine the two phases somewhat to take advantage of cases
where the CFG matches the block layout, but I haven't tried.
Location lists are finalized by adding a base address selection entry,
translating each encoded block/value ID to a real PC, and adding the
terminating zero entry. This probably won't work on OSX, where dsymutil
will choke on the base address selection. I tried emitting CU-relative
relocations for each address, and it was *very* bad for performance --
it uses more memory storing all the relocations than it does for the
actual location list bytes. I think I'm going to end up synthesizing the
relocations in the linker only on OSX, but TBD.
TestNexting needs updating: with more optimizations working, the
debugger doesn't stop on the continue (line 88) any more, and the test's
duplicate suppression kicks in. Also, dx and dy live a little longer
now, but they have the correct values.
Change-Id: Ie772dfe23a4e389ca573624fac4d05401ae32307
Reviewed-on: https://go-review.googlesource.com/89356
Run-TryBot: Heschi Kreinick <heschi@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-10-26 15:40:17 -04:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 		// Emit control flow instructions for block | 
					
						
							|  |  |  | 		var next *ssa.Block | 
					
						
							| 
									
										
										
										
											2016-09-09 13:11:07 -07:00
										 |  |  | 		if i < len(f.Blocks)-1 && Debug['N'] == 0 { | 
					
						
							| 
									
										
										
										
											2016-02-22 23:19:00 -08:00
										 |  |  | 			// If -N, leave next==nil so every block with successors | 
					
						
							| 
									
										
										
										
											2016-02-26 16:32:01 -08:00
										 |  |  | 			// ends in a JMP (except call blocks - plive doesn't like | 
					
						
							|  |  |  | 			// select{send,recv} followed by a JMP call).  Helps keep | 
					
						
							|  |  |  | 			// line numbers for otherwise empty blocks. | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 			next = f.Blocks[i+1] | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 		x := s.pp.next | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:31 -07:00
										 |  |  | 		s.SetPos(b.Pos) | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 		thearch.SSAGenBlock(&s, b, next) | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 		if logProgs { | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 			for ; x != s.pp.next; x = x.Link { | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 				progToBlock[x] = b | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												[dev.debug] cmd/compile: better DWARF with optimizations on
Debuggers use DWARF information to find local variables on the
stack and in registers. Prior to this CL, the DWARF information for
functions claimed that all variables were on the stack at all times.
That's incorrect when optimizations are enabled, and results in
debuggers showing data that is out of date or complete gibberish.
After this CL, the compiler is capable of representing variable
locations more accurately, and attempts to do so. Due to limitations of
the SSA backend, it's not possible to be completely correct.
There are a number of problems in the current design. One of the easier
to understand is that variable names currently must be attached to an
SSA value, but not all assignments in the source code actually result
in machine code. For example:
  type myint int
  var a int
  b := myint(int)
and
  b := (*uint64)(unsafe.Pointer(a))
don't generate machine code because the underlying representation is the
same, so the correct value of b will not be set when the user would
expect.
Generating the more precise debug information is behind a flag,
dwarflocationlists. Because of the issues described above, setting the
flag may not make the debugging experience much better, and may actually
make it worse in cases where the variable actually is on the stack and
the more complicated analysis doesn't realize it.
A number of changes are included:
- Add a new pseudo-instruction, RegKill, which indicates that the value
in the register has been clobbered.
- Adjust regalloc to emit RegKills in the right places. Significantly,
this means that phis are mixed with StoreReg and RegKills after
regalloc.
- Track variable decomposition in ssa.LocalSlots.
- After the SSA backend is done, analyze the result and build location
lists for each LocalSlot.
- After assembly is done, update the location lists with the assembled
PC offsets, recompose variables, and build DWARF location lists. Emit the
list as a new linker symbol, one per function.
- In the linker, aggregate the location lists into a .debug_loc section.
TODO:
- currently disabled for non-X86/AMD64 because there are no data tables.
go build -toolexec 'toolstash -cmp' -a std succeeds.
With -dwarflocationlists false:
before: f02812195637909ff675782c0b46836a8ff01976
after:  06f61e8112a42ac34fb80e0c818b3cdb84a5e7ec
benchstat -geomean  /tmp/220352263 /tmp/621364410
completed   15 of   15, estimated time remaining 0s (eta 3:52PM)
name        old time/op       new time/op       delta
Template          199ms ± 3%        198ms ± 2%     ~     (p=0.400 n=15+14)
Unicode          96.6ms ± 5%       96.4ms ± 5%     ~     (p=0.838 n=15+15)
GoTypes           653ms ± 2%        647ms ± 2%     ~     (p=0.102 n=15+14)
Flate             133ms ± 6%        129ms ± 3%   -2.62%  (p=0.041 n=15+15)
GoParser          164ms ± 5%        159ms ± 3%   -3.05%  (p=0.000 n=15+15)
Reflect           428ms ± 4%        422ms ± 3%     ~     (p=0.156 n=15+13)
Tar               123ms ±10%        124ms ± 8%     ~     (p=0.461 n=15+15)
XML               228ms ± 3%        224ms ± 3%   -1.57%  (p=0.045 n=15+15)
[Geo mean]        206ms             377ms       +82.86%
name        old user-time/op  new user-time/op  delta
Template          292ms ±10%        301ms ±12%     ~     (p=0.189 n=15+15)
Unicode           166ms ±37%        158ms ±14%     ~     (p=0.418 n=15+14)
GoTypes           962ms ± 6%        963ms ± 7%     ~     (p=0.976 n=15+15)
Flate             207ms ±19%        200ms ±14%     ~     (p=0.345 n=14+15)
GoParser          246ms ±22%        240ms ±15%     ~     (p=0.587 n=15+15)
Reflect           611ms ±13%        587ms ±14%     ~     (p=0.085 n=15+13)
Tar               211ms ±12%        217ms ±14%     ~     (p=0.355 n=14+15)
XML               335ms ±15%        320ms ±18%     ~     (p=0.169 n=15+15)
[Geo mean]        317ms             583ms       +83.72%
name        old alloc/op      new alloc/op      delta
Template         40.2MB ± 0%       40.2MB ± 0%   -0.15%  (p=0.000 n=14+15)
Unicode          29.2MB ± 0%       29.3MB ± 0%     ~     (p=0.624 n=15+15)
GoTypes           114MB ± 0%        114MB ± 0%   -0.15%  (p=0.000 n=15+14)
Flate            25.7MB ± 0%       25.6MB ± 0%   -0.18%  (p=0.000 n=13+15)
GoParser         32.2MB ± 0%       32.2MB ± 0%   -0.14%  (p=0.003 n=15+15)
Reflect          77.8MB ± 0%       77.9MB ± 0%     ~     (p=0.061 n=15+15)
Tar              27.1MB ± 0%       27.0MB ± 0%   -0.11%  (p=0.029 n=15+15)
XML              42.7MB ± 0%       42.5MB ± 0%   -0.29%  (p=0.000 n=15+15)
[Geo mean]       42.1MB            75.0MB       +78.05%
name        old allocs/op     new allocs/op     delta
Template           402k ± 1%         398k ± 0%   -0.91%  (p=0.000 n=15+15)
Unicode            344k ± 1%         344k ± 0%     ~     (p=0.715 n=15+14)
GoTypes           1.18M ± 0%        1.17M ± 0%   -0.91%  (p=0.000 n=15+14)
Flate              243k ± 0%         240k ± 1%   -1.05%  (p=0.000 n=13+15)
GoParser           327k ± 1%         324k ± 1%   -0.96%  (p=0.000 n=15+15)
Reflect            984k ± 1%         982k ± 0%     ~     (p=0.050 n=15+15)
Tar                261k ± 1%         259k ± 1%   -0.77%  (p=0.000 n=15+15)
XML                411k ± 0%         404k ± 1%   -1.55%  (p=0.000 n=15+15)
[Geo mean]         439k              755k       +72.01%
name        old text-bytes    new text-bytes    delta
HelloSize         694kB ± 0%        694kB ± 0%   -0.00%  (p=0.000 n=15+15)
name        old data-bytes    new data-bytes    delta
HelloSize        5.55kB ± 0%       5.55kB ± 0%     ~     (all equal)
name        old bss-bytes     new bss-bytes     delta
HelloSize         133kB ± 0%        133kB ± 0%     ~     (all equal)
name        old exe-bytes     new exe-bytes     delta
HelloSize        1.04MB ± 0%       1.04MB ± 0%     ~     (all equal)
Change-Id: I991fc553ef175db46bb23b2128317bbd48de70d8
Reviewed-on: https://go-review.googlesource.com/41770
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2017-07-21 18:30:19 -04:00
										 |  |  | 	if Ctxt.Flag_locationlists { | 
					
						
							| 
									
										
											  
											
												cmd/compile: reimplement location list generation
Completely redesign and reimplement location list generation to be more
efficient, and hopefully not too hard to understand.
RegKills are gone. Instead of using the regalloc's liveness
calculations, redo them using the Ops' clobber information. Besides
saving a lot of Values, this avoids adding RegKills to blocks that would
be empty otherwise, which was messing up optimizations. This does mean
that it's much harder to tell whether the generation process is buggy
(there's nothing to cross-check it with), and there may be disagreements
with GC liveness. But the performance gain is significant, and it's nice
not to be messing with earlier compiler phases.
The intermediate representations are gone. Instead of producing
ssa.BlockDebugs, then dwarf.LocationLists, and then finally real
location lists, go directly from the SSA to a (mostly) real location
list. Because the SSA analysis happens before assembly, it stores
encoded block/value IDs where PCs would normally go. It would be easier
to do the SSA analysis after assembly, but I didn't want to retain the
SSA just for that.
Generation proceeds in two phases: first, it traverses the function in
CFG order, storing the state of the block at the beginning and end. End
states are used to produce the start states of the successor blocks. In
the second phase, it traverses in program text order and produces the
location lists. The processing in the second phase is redundant, but
much cheaper than storing the intermediate representation. It might be
possible to combine the two phases somewhat to take advantage of cases
where the CFG matches the block layout, but I haven't tried.
Location lists are finalized by adding a base address selection entry,
translating each encoded block/value ID to a real PC, and adding the
terminating zero entry. This probably won't work on OSX, where dsymutil
will choke on the base address selection. I tried emitting CU-relative
relocations for each address, and it was *very* bad for performance --
it uses more memory storing all the relocations than it does for the
actual location list bytes. I think I'm going to end up synthesizing the
relocations in the linker only on OSX, but TBD.
TestNexting needs updating: with more optimizations working, the
debugger doesn't stop on the continue (line 88) any more, and the test's
duplicate suppression kicks in. Also, dx and dy live a little longer
now, but they have the correct values.
Change-Id: Ie772dfe23a4e389ca573624fac4d05401ae32307
Reviewed-on: https://go-review.googlesource.com/89356
Run-TryBot: Heschi Kreinick <heschi@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-10-26 15:40:17 -04:00
										 |  |  | 		e.curfn.Func.DebugInfo = ssa.BuildFuncDebug(Ctxt, f, Debug_locationlist > 1, stackOffset) | 
					
						
							|  |  |  | 		bstart := s.bstart | 
					
						
							|  |  |  | 		// Note that at this moment, Prog.Pc is a sequence number; it's | 
					
						
							|  |  |  | 		// not a real PC until after assembly, so this mapping has to | 
					
						
							|  |  |  | 		// be done later. | 
					
						
							|  |  |  | 		e.curfn.Func.DebugInfo.GetPC = func(b, v ssa.ID) int64 { | 
					
						
							|  |  |  | 			switch v { | 
					
						
							|  |  |  | 			case ssa.BlockStart.ID: | 
					
						
							| 
									
										
										
										
											2018-02-25 20:08:22 +09:00
										 |  |  | 				return bstart[b].Pc | 
					
						
							| 
									
										
											  
											
												cmd/compile: reimplement location list generation
Completely redesign and reimplement location list generation to be more
efficient, and hopefully not too hard to understand.
RegKills are gone. Instead of using the regalloc's liveness
calculations, redo them using the Ops' clobber information. Besides
saving a lot of Values, this avoids adding RegKills to blocks that would
be empty otherwise, which was messing up optimizations. This does mean
that it's much harder to tell whether the generation process is buggy
(there's nothing to cross-check it with), and there may be disagreements
with GC liveness. But the performance gain is significant, and it's nice
not to be messing with earlier compiler phases.
The intermediate representations are gone. Instead of producing
ssa.BlockDebugs, then dwarf.LocationLists, and then finally real
location lists, go directly from the SSA to a (mostly) real location
list. Because the SSA analysis happens before assembly, it stores
encoded block/value IDs where PCs would normally go. It would be easier
to do the SSA analysis after assembly, but I didn't want to retain the
SSA just for that.
Generation proceeds in two phases: first, it traverses the function in
CFG order, storing the state of the block at the beginning and end. End
states are used to produce the start states of the successor blocks. In
the second phase, it traverses in program text order and produces the
location lists. The processing in the second phase is redundant, but
much cheaper than storing the intermediate representation. It might be
possible to combine the two phases somewhat to take advantage of cases
where the CFG matches the block layout, but I haven't tried.
Location lists are finalized by adding a base address selection entry,
translating each encoded block/value ID to a real PC, and adding the
terminating zero entry. This probably won't work on OSX, where dsymutil
will choke on the base address selection. I tried emitting CU-relative
relocations for each address, and it was *very* bad for performance --
it uses more memory storing all the relocations than it does for the
actual location list bytes. I think I'm going to end up synthesizing the
relocations in the linker only on OSX, but TBD.
TestNexting needs updating: with more optimizations working, the
debugger doesn't stop on the continue (line 88) any more, and the test's
duplicate suppression kicks in. Also, dx and dy live a little longer
now, but they have the correct values.
Change-Id: Ie772dfe23a4e389ca573624fac4d05401ae32307
Reviewed-on: https://go-review.googlesource.com/89356
Run-TryBot: Heschi Kreinick <heschi@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-10-26 15:40:17 -04:00
										 |  |  | 			case ssa.BlockEnd.ID: | 
					
						
							| 
									
										
										
										
											2018-02-25 20:08:22 +09:00
										 |  |  | 				return e.curfn.Func.lsym.Size | 
					
						
							| 
									
										
											  
											
												cmd/compile: reimplement location list generation
Completely redesign and reimplement location list generation to be more
efficient, and hopefully not too hard to understand.
RegKills are gone. Instead of using the regalloc's liveness
calculations, redo them using the Ops' clobber information. Besides
saving a lot of Values, this avoids adding RegKills to blocks that would
be empty otherwise, which was messing up optimizations. This does mean
that it's much harder to tell whether the generation process is buggy
(there's nothing to cross-check it with), and there may be disagreements
with GC liveness. But the performance gain is significant, and it's nice
not to be messing with earlier compiler phases.
The intermediate representations are gone. Instead of producing
ssa.BlockDebugs, then dwarf.LocationLists, and then finally real
location lists, go directly from the SSA to a (mostly) real location
list. Because the SSA analysis happens before assembly, it stores
encoded block/value IDs where PCs would normally go. It would be easier
to do the SSA analysis after assembly, but I didn't want to retain the
SSA just for that.
Generation proceeds in two phases: first, it traverses the function in
CFG order, storing the state of the block at the beginning and end. End
states are used to produce the start states of the successor blocks. In
the second phase, it traverses in program text order and produces the
location lists. The processing in the second phase is redundant, but
much cheaper than storing the intermediate representation. It might be
possible to combine the two phases somewhat to take advantage of cases
where the CFG matches the block layout, but I haven't tried.
Location lists are finalized by adding a base address selection entry,
translating each encoded block/value ID to a real PC, and adding the
terminating zero entry. This probably won't work on OSX, where dsymutil
will choke on the base address selection. I tried emitting CU-relative
relocations for each address, and it was *very* bad for performance --
it uses more memory storing all the relocations than it does for the
actual location list bytes. I think I'm going to end up synthesizing the
relocations in the linker only on OSX, but TBD.
TestNexting needs updating: with more optimizations working, the
debugger doesn't stop on the continue (line 88) any more, and the test's
duplicate suppression kicks in. Also, dx and dy live a little longer
now, but they have the correct values.
Change-Id: Ie772dfe23a4e389ca573624fac4d05401ae32307
Reviewed-on: https://go-review.googlesource.com/89356
Run-TryBot: Heschi Kreinick <heschi@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-10-26 15:40:17 -04:00
										 |  |  | 			default: | 
					
						
							| 
									
										
										
										
											2018-02-25 20:08:22 +09:00
										 |  |  | 				return valueToProgAfter[v].Pc | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 			} | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 	// Resolove branchers, and relax DefaultStmt into NotStmt | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | 	for _, br := range s.Branches { | 
					
						
							|  |  |  | 		br.P.To.Val = s.bstart[br.B.ID] | 
					
						
							| 
									
										
											  
											
												cmd/compile: assign and preserve statement boundaries.
A new pass run after ssa building (before any other
optimization) identifies the "first" ssa node for each
statement. Other "noise" nodes are tagged as being never
appropriate for a statement boundary (e.g., VarKill, VarDef,
Phi).
Rewrite, deadcode, cse, and nilcheck are modified to move
the statement boundaries forward whenever possible if a
boundary-tagged ssa value is removed; never-boundary nodes
are ignored in this search (some operations involving
constants are also tagged as never-boundary and also ignored
because they are likely to be moved or removed during
optimization).
Code generation treats all nodes except those explicitly
marked as statement boundaries as "not statement" nodes,
and floats statement boundaries to the beginning of each
same-line run of instructions found within a basic block.
Line number html conversion was modified to make statement
boundary nodes a bit more obvious by prepending a "+".
The code in fuse.go that glued together the value slices
of two blocks produced a result that depended on the
former capacities (not lengths) of the two slices.  This
causes differences in the 386 bootstrap, and also can
sometimes put values into an order that does a worse job
of preserving statement boundaries when values are removed.
Portions of two delve tests that had caught problems were
incorporated into ssa/debug_test.go.  There are some
opportunities to do better with optimized code, but the
next-ing is not lying or overly jumpy.
Over 4 CLs, compilebench geomean measured binary size
increase of 3.5% and compile user time increase of 3.8%
(this is after optimization to reuse a sparse map instead
of creating multiple maps.)
This CL worsens the optimized-debugging experience with
Delve; we need to work with the delve team so that
they can use the is_stmt marks that we're emitting now.
The reference output changes from time to time depending
on other changes in the compiler, sometimes better,
sometimes worse.
This CL now includes a test ensuring that 99+% of the lines
in the Go command itself (a handy optimized binary) include
is_stmt markers.
Change-Id: I359c94e06843f1eb41f9da437bd614885aa9644a
Reviewed-on: https://go-review.googlesource.com/102435
Run-TryBot: David Chase <drchase@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Austin Clements <austin@google.com>
											
										 
											2018-03-23 22:46:06 -04:00
										 |  |  | 		if br.P.Pos.IsStmt() != src.PosIsStmt { | 
					
						
							|  |  |  | 			br.P.Pos = br.P.Pos.WithNotStmt() | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-28 22:51:01 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 	if logProgs { | 
					
						
							| 
									
										
										
										
											2017-11-20 16:47:08 -05:00
										 |  |  | 		filename := "" | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 		for p := pp.Text; p != nil; p = p.Link { | 
					
						
							| 
									
										
										
										
											2017-11-20 16:47:08 -05:00
										 |  |  | 			if p.Pos.IsKnown() && p.InnermostFilename() != filename { | 
					
						
							|  |  |  | 				filename = p.InnermostFilename() | 
					
						
							|  |  |  | 				f.Logf("# %s\n", filename) | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 			var s string | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 			if v, ok := progToValue[p]; ok { | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 				s = v.String() | 
					
						
							| 
									
										
										
										
											2017-07-21 18:00:22 -04:00
										 |  |  | 			} else if b, ok := progToBlock[p]; ok { | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 				s = b.String() | 
					
						
							|  |  |  | 			} else { | 
					
						
							|  |  |  | 				s = "   " // most value and branch strings are 2-3 characters long | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2017-11-20 16:47:08 -05:00
										 |  |  | 			f.Logf(" %-6s\t%.5d (%s)\t%s\n", s, p.Pc, p.InnermostLineNumber(), p.InstructionString()) | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	if f.HTMLWriter != nil { | 
					
						
							|  |  |  | 		var buf bytes.Buffer | 
					
						
							|  |  |  | 		buf.WriteString("<code>") | 
					
						
							|  |  |  | 		buf.WriteString("<dl class=\"ssa-gen\">") | 
					
						
							|  |  |  | 		filename := "" | 
					
						
							|  |  |  | 		for p := pp.Text; p != nil; p = p.Link { | 
					
						
							|  |  |  | 			// Don't spam every line with the file name, which is often huge. | 
					
						
							|  |  |  | 			// Only print changes, and "unknown" is not a change. | 
					
						
							|  |  |  | 			if p.Pos.IsKnown() && p.InnermostFilename() != filename { | 
					
						
							|  |  |  | 				filename = p.InnermostFilename() | 
					
						
							|  |  |  | 				buf.WriteString("<dt class=\"ssa-prog-src\"></dt><dd class=\"ssa-prog\">") | 
					
						
							|  |  |  | 				buf.WriteString(html.EscapeString("# " + filename)) | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 				buf.WriteString("</dd>") | 
					
						
							|  |  |  | 			} | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 
 | 
					
						
							|  |  |  | 			buf.WriteString("<dt class=\"ssa-prog-src\">") | 
					
						
							|  |  |  | 			if v, ok := progToValue[p]; ok { | 
					
						
							|  |  |  | 				buf.WriteString(v.HTML()) | 
					
						
							|  |  |  | 			} else if b, ok := progToBlock[p]; ok { | 
					
						
							|  |  |  | 				buf.WriteString("<b>" + b.HTML() + "</b>") | 
					
						
							|  |  |  | 			} | 
					
						
							|  |  |  | 			buf.WriteString("</dt>") | 
					
						
							|  |  |  | 			buf.WriteString("<dd class=\"ssa-prog\">") | 
					
						
							|  |  |  | 			buf.WriteString(fmt.Sprintf("%.5d <span class=\"l%v line-number\">(%s)</span> %s", p.Pc, p.InnermostLineNumber(), p.InnermostLineNumberHTML(), html.EscapeString(p.InstructionString()))) | 
					
						
							|  |  |  | 			buf.WriteString("</dd>") | 
					
						
							| 
									
										
										
										
											2015-08-10 12:15:52 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 		buf.WriteString("</dl>") | 
					
						
							|  |  |  | 		buf.WriteString("</code>") | 
					
						
							|  |  |  | 		f.HTMLWriter.WriteColumn("genssa", "genssa", "ssa-prog", buf.String()) | 
					
						
							| 
									
										
										
										
											2015-07-31 14:37:15 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | 	defframe(&s, e) | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 	if Debug['f'] != 0 { | 
					
						
							|  |  |  | 		frame(0) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: rearrange fields between ssa.Func, ssa.Cache, and ssa.Config
This makes ssa.Func, ssa.Cache, and ssa.Config fulfill
the roles laid out for them in CL 38160.
The only non-trivial change in this CL is how cached
values and blocks get IDs. Prior to this CL, their IDs were
assigned as part of resetting the cache, and only modified
IDs were reset. This required knowing how many values and
blocks were modified, which required a tight coupling between
ssa.Func and ssa.Config. To eliminate that coupling,
we now zero values and blocks during reset,
and assign their IDs when they are used.
Since unused values and blocks have ID == 0,
we can efficiently find the last used value/block,
to avoid zeroing everything.
Bulk zeroing is efficient, but not efficient enough
to obviate the need to avoid zeroing everything every time.
As a happy side-effect, ssa.Func.Free is no longer necessary.
DebugHashMatch and friends now belong in func.go.
They have been left in place for clarity and review.
I will move them in a subsequent CL.
Passes toolstash -cmp. No compiler performance impact.
No change in 'go test cmd/compile/internal/ssa' execution time.
Change-Id: I2eb7af58da067ef6a36e815a6f386cfe8634d098
Reviewed-on: https://go-review.googlesource.com/38167
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-03-15 11:15:13 -07:00
										 |  |  | 	f.HTMLWriter.Close() | 
					
						
							|  |  |  | 	f.HTMLWriter = nil | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | func defframe(s *SSAGenState, e *ssafn) { | 
					
						
							|  |  |  | 	pp := s.pp | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	frame := Rnd(s.maxarg+e.stksize, int64(Widthreg)) | 
					
						
							|  |  |  | 	if thearch.PadFrame != nil { | 
					
						
							|  |  |  | 		frame = thearch.PadFrame(frame) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Fill in argument and frame size. | 
					
						
							|  |  |  | 	pp.Text.To.Type = obj.TYPE_TEXTSIZE | 
					
						
							|  |  |  | 	pp.Text.To.Val = int32(Rnd(e.curfn.Type.ArgWidth(), int64(Widthreg))) | 
					
						
							|  |  |  | 	pp.Text.To.Offset = frame | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Insert code to zero ambiguously live variables so that the | 
					
						
							|  |  |  | 	// garbage collector only sees initialized values when it | 
					
						
							|  |  |  | 	// looks for pointers. | 
					
						
							|  |  |  | 	p := pp.Text | 
					
						
							|  |  |  | 	var lo, hi int64 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Opaque state for backend to use. Current backends use it to | 
					
						
							|  |  |  | 	// keep track of which helper registers have been zeroed. | 
					
						
							|  |  |  | 	var state uint32 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Iterate through declarations. They are sorted in decreasing Xoffset order. | 
					
						
							|  |  |  | 	for _, n := range e.curfn.Func.Dcl { | 
					
						
							|  |  |  | 		if !n.Name.Needzero() { | 
					
						
							|  |  |  | 			continue | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 		if n.Class() != PAUTO { | 
					
						
							|  |  |  | 			Fatalf("needzero class %d", n.Class()) | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		if n.Type.Size()%int64(Widthptr) != 0 || n.Xoffset%int64(Widthptr) != 0 || n.Type.Size() == 0 { | 
					
						
							|  |  |  | 			Fatalf("var %L has size %d offset %d", n, n.Type.Size(), n.Xoffset) | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		if lo != hi && n.Xoffset+n.Type.Size() >= lo-int64(2*Widthreg) { | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | 			// Merge with range we already have. | 
					
						
							|  |  |  | 			lo = n.Xoffset | 
					
						
							|  |  |  | 			continue | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// Zero old range | 
					
						
							|  |  |  | 		p = thearch.ZeroRange(pp, p, frame+lo, hi-lo, &state) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 		// Set new range. | 
					
						
							|  |  |  | 		lo = n.Xoffset | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 		hi = lo + n.Type.Size() | 
					
						
							| 
									
										
										
										
											2017-04-20 10:31:39 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Zero final range. | 
					
						
							|  |  |  | 	thearch.ZeroRange(pp, p, frame+lo, hi-lo, &state) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | type FloatingEQNEJump struct { | 
					
						
							|  |  |  | 	Jump  obj.As | 
					
						
							|  |  |  | 	Index int | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-25 16:55:42 -07:00
										 |  |  | func (s *SSAGenState) oneFPJump(b *ssa.Block, jumps *FloatingEQNEJump) { | 
					
						
							| 
									
										
										
										
											2017-03-20 08:01:28 -07:00
										 |  |  | 	p := s.Prog(jumps.Jump) | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 	p.To.Type = obj.TYPE_BRANCH | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 	p.Pos = b.Pos | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | 	to := jumps.Index | 
					
						
							| 
									
										
										
										
											2017-03-20 08:01:28 -07:00
										 |  |  | 	s.Branches = append(s.Branches, Branch{p, b.Succs[to].Block()}) | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-22 10:27:30 -07:00
										 |  |  | func (s *SSAGenState) FPJump(b, next *ssa.Block, jumps *[2][2]FloatingEQNEJump) { | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 	switch next { | 
					
						
							| 
									
										
										
										
											2016-04-28 16:52:47 -07:00
										 |  |  | 	case b.Succs[0].Block(): | 
					
						
							| 
									
										
										
										
											2017-03-25 16:55:42 -07:00
										 |  |  | 		s.oneFPJump(b, &jumps[0][0]) | 
					
						
							|  |  |  | 		s.oneFPJump(b, &jumps[0][1]) | 
					
						
							| 
									
										
										
										
											2016-04-28 16:52:47 -07:00
										 |  |  | 	case b.Succs[1].Block(): | 
					
						
							| 
									
										
										
										
											2017-03-25 16:55:42 -07:00
										 |  |  | 		s.oneFPJump(b, &jumps[1][0]) | 
					
						
							|  |  |  | 		s.oneFPJump(b, &jumps[1][1]) | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2017-03-25 16:55:42 -07:00
										 |  |  | 		s.oneFPJump(b, &jumps[1][0]) | 
					
						
							|  |  |  | 		s.oneFPJump(b, &jumps[1][1]) | 
					
						
							| 
									
										
										
										
											2017-03-20 08:01:28 -07:00
										 |  |  | 		q := s.Prog(obj.AJMP) | 
					
						
							| 
									
										
										
										
											2017-07-21 12:00:38 -04:00
										 |  |  | 		q.Pos = b.Pos | 
					
						
							| 
									
										
										
										
											2015-08-18 14:39:26 -04:00
										 |  |  | 		q.To.Type = obj.TYPE_BRANCH | 
					
						
							| 
									
										
										
										
											2016-04-28 16:52:47 -07:00
										 |  |  | 		s.Branches = append(s.Branches, Branch{q, b.Succs[1].Block()}) | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-06-24 14:37:17 -05:00
										 |  |  | func AuxOffset(v *ssa.Value) (offset int64) { | 
					
						
							|  |  |  | 	if v.Aux == nil { | 
					
						
							|  |  |  | 		return 0 | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 	n, ok := v.Aux.(*Node) | 
					
						
							|  |  |  | 	if !ok { | 
					
						
							|  |  |  | 		v.Fatalf("bad aux type in %s\n", v.LongString()) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if n.Class() == PAUTO { | 
					
						
							| 
									
										
										
										
											2016-06-24 14:37:17 -05:00
										 |  |  | 		return n.Xoffset | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return 0 | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | // AddAux adds the offset in the aux fields (AuxInt and Aux) of v to a. | 
					
						
							|  |  |  | func AddAux(a *obj.Addr, v *ssa.Value) { | 
					
						
							|  |  |  | 	AddAux2(a, v, v.AuxInt) | 
					
						
							| 
									
										
										
										
											2015-10-21 13:13:56 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | func AddAux2(a *obj.Addr, v *ssa.Value, offset int64) { | 
					
						
							| 
									
										
										
										
											2016-05-06 10:11:41 -07:00
										 |  |  | 	if a.Type != obj.TYPE_MEM && a.Type != obj.TYPE_ADDR { | 
					
						
							| 
									
										
										
										
											2016-04-29 09:02:27 -07:00
										 |  |  | 		v.Fatalf("bad AddAux addr %v", a) | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// add integer offset | 
					
						
							| 
									
										
										
										
											2015-10-21 13:13:56 -07:00
										 |  |  | 	a.Offset += offset | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 
 | 
					
						
							|  |  |  | 	// If no additional symbol offset, we're done. | 
					
						
							|  |  |  | 	if v.Aux == nil { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	// Add symbol's offset from its base register. | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 	switch n := v.Aux.(type) { | 
					
						
							|  |  |  | 	case *obj.LSym: | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 		a.Name = obj.NAME_EXTERN | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		a.Sym = n | 
					
						
							|  |  |  | 	case *Node: | 
					
						
							|  |  |  | 		if n.Class() == PPARAM || n.Class() == PPARAMOUT { | 
					
						
							|  |  |  | 			a.Name = obj.NAME_PARAM | 
					
						
							|  |  |  | 			a.Sym = n.Orig.Sym.Linksym() | 
					
						
							|  |  |  | 			a.Offset += n.Xoffset | 
					
						
							|  |  |  | 			break | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 		a.Name = obj.NAME_AUTO | 
					
						
							| 
									
										
										
										
											2017-04-21 07:51:41 -07:00
										 |  |  | 		a.Sym = n.Sym.Linksym() | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 		a.Offset += n.Xoffset | 
					
						
							| 
									
										
										
										
											2015-06-19 21:02:28 -07:00
										 |  |  | 	default: | 
					
						
							|  |  |  | 		v.Fatalf("aux in %s not implemented %#v", v, v.Aux) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-11-02 21:28:13 -08:00
										 |  |  | // extendIndex extends v to a full int width. | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | // panic using the given function if v does not fit in an int (only on 32-bit archs). | 
					
						
							| 
									
										
										
										
											2017-02-06 13:40:19 -08:00
										 |  |  | func (s *state) extendIndex(v *ssa.Value, panicfn *obj.LSym) *ssa.Value { | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 	size := v.Type.Size() | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 	if size == s.config.PtrSize { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		return v | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 	if size > s.config.PtrSize { | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 		// truncate 64-bit indexes on 32-bit pointer archs. Test the | 
					
						
							|  |  |  | 		// high word and branch to out-of-bounds failure if it is not 0. | 
					
						
							|  |  |  | 		if Debug['B'] == 0 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 			hi := s.newValue1(ssa.OpInt64Hi, types.Types[TUINT32], v) | 
					
						
							|  |  |  | 			cmp := s.newValue2(ssa.OpEq32, types.Types[TBOOL], hi, s.constInt32(types.Types[TUINT32], 0)) | 
					
						
							| 
									
										
										
										
											2016-05-25 09:49:28 -04:00
										 |  |  | 			s.check(cmp, panicfn) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return s.newValue1(ssa.OpTrunc64to32, types.Types[TINT], v) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	// Extend value to the required size | 
					
						
							|  |  |  | 	var op ssa.Op | 
					
						
							|  |  |  | 	if v.Type.IsSigned() { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 		switch 10*size + s.config.PtrSize { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		case 14: | 
					
						
							|  |  |  | 			op = ssa.OpSignExt8to32 | 
					
						
							|  |  |  | 		case 18: | 
					
						
							|  |  |  | 			op = ssa.OpSignExt8to64 | 
					
						
							|  |  |  | 		case 24: | 
					
						
							|  |  |  | 			op = ssa.OpSignExt16to32 | 
					
						
							|  |  |  | 		case 28: | 
					
						
							|  |  |  | 			op = ssa.OpSignExt16to64 | 
					
						
							|  |  |  | 		case 48: | 
					
						
							|  |  |  | 			op = ssa.OpSignExt32to64 | 
					
						
							|  |  |  | 		default: | 
					
						
							|  |  |  | 			s.Fatalf("bad signed index extension %s", v.Type) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} else { | 
					
						
							| 
									
										
										
										
											2017-04-21 18:44:34 -07:00
										 |  |  | 		switch 10*size + s.config.PtrSize { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | 		case 14: | 
					
						
							|  |  |  | 			op = ssa.OpZeroExt8to32 | 
					
						
							|  |  |  | 		case 18: | 
					
						
							|  |  |  | 			op = ssa.OpZeroExt8to64 | 
					
						
							|  |  |  | 		case 24: | 
					
						
							|  |  |  | 			op = ssa.OpZeroExt16to32 | 
					
						
							|  |  |  | 		case 28: | 
					
						
							|  |  |  | 			op = ssa.OpZeroExt16to64 | 
					
						
							|  |  |  | 		case 48: | 
					
						
							|  |  |  | 			op = ssa.OpZeroExt32to64 | 
					
						
							|  |  |  | 		default: | 
					
						
							|  |  |  | 			s.Fatalf("bad unsigned index extension %s", v.Type) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	return s.newValue1(op, types.Types[TINT], v) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile/internal/ssa: redo how sign extension is handled
For integer types less than a machine register, we have to decide
what the invariants are for the high bits of the register.  We used
to set the high bits to the correct extension (sign or zero, as
determined by the type) of the low bits.
This CL makes the compiler ignore the high bits of the register
altogether (they are junk).
On this plus side, this means ops that generate subword results don't
have to worry about correctly extending them.  On the minus side,
ops that consume subword arguments have to deal with the input
registers not being correctly extended.
For x86, this tradeoff is probably worth it.  Almost all opcodes
have versions that use only the correct subword piece of their
inputs.  (The one big exception is array indexing.)  Not many opcodes
can correctly sign extend on output.
For other architectures, the tradeoff is probably not so clear, as
they don't have many subword-safe opcodes (e.g. 16-bit compare,
ignoring the high 16/48 bits).  Fortunately we can decide whether
we do this per-architecture.
For the machine-independent opcodes, we pretend that the "register"
size is equal to the type width, so sign extension is immaterial.
Opcodes that care about the signedness of the input (e.g. compare,
right shift) have two different variants.
Change-Id: I465484c5734545ee697afe83bc8bf4b53bd9df8d
Reviewed-on: https://go-review.googlesource.com/12600
Reviewed-by: Josh Bleecher Snyder <josharian@gmail.com>
											
										 
											2015-07-23 14:35:02 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-06-15 15:26:47 -07:00
										 |  |  | // CheckLoweredPhi checks that regalloc and stackalloc correctly handled phi values. | 
					
						
							|  |  |  | // Called during ssaGenValue. | 
					
						
							|  |  |  | func CheckLoweredPhi(v *ssa.Value) { | 
					
						
							|  |  |  | 	if v.Op != ssa.OpPhi { | 
					
						
							|  |  |  | 		v.Fatalf("CheckLoweredPhi called with non-phi value: %v", v.LongString()) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if v.Type.IsMemory() { | 
					
						
							|  |  |  | 		return | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	f := v.Block.Func | 
					
						
							|  |  |  | 	loc := f.RegAlloc[v.ID] | 
					
						
							|  |  |  | 	for _, a := range v.Args { | 
					
						
							|  |  |  | 		if aloc := f.RegAlloc[a.ID]; aloc != loc { // TODO: .Equal() instead? | 
					
						
							| 
									
										
										
										
											2017-08-17 12:23:34 -07:00
										 |  |  | 			v.Fatalf("phi arg at different location than phi: %v @ %s, but arg %v @ %s\n%s\n", v, loc, a, aloc, v.Block.Func) | 
					
						
							| 
									
										
										
										
											2016-06-15 15:26:47 -07:00
										 |  |  | 		} | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-07-03 13:40:03 -07:00
										 |  |  | // CheckLoweredGetClosurePtr checks that v is the first instruction in the function's entry block. | 
					
						
							|  |  |  | // The output of LoweredGetClosurePtr is generally hardwired to the correct register. | 
					
						
							|  |  |  | // That register contains the closure pointer on closure entry. | 
					
						
							|  |  |  | func CheckLoweredGetClosurePtr(v *ssa.Value) { | 
					
						
							|  |  |  | 	entry := v.Block.Func.Entry | 
					
						
							|  |  |  | 	if entry != v.Block || entry.Values[0] != v { | 
					
						
							| 
									
										
										
										
											2016-07-21 10:37:59 -07:00
										 |  |  | 		Fatalf("in %s, badly placed LoweredGetClosurePtr: %v %v", v.Block.Func.Name, v.Block, v) | 
					
						
							| 
									
										
										
										
											2016-07-03 13:40:03 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | // AutoVar returns a *Node and int64 representing the auto variable and offset within it | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | // where v should be spilled. | 
					
						
							| 
									
										
										
										
											2016-03-12 14:07:40 -08:00
										 |  |  | func AutoVar(v *ssa.Value) (*Node, int64) { | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | 	loc := v.Block.Func.RegAlloc[v.ID].(ssa.LocalSlot) | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 	if v.Type.Size() > loc.Type.Size() { | 
					
						
							| 
									
										
										
										
											2016-01-04 13:34:54 -08:00
										 |  |  | 		v.Fatalf("spill/restore type %s doesn't fit in slot type %s", v.Type, loc.Type) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2015-11-02 08:10:26 -08:00
										 |  |  | 	return loc.N.(*Node), loc.Off | 
					
						
							| 
									
										
										
										
											2015-05-12 11:06:44 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2015-05-27 14:52:22 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | func AddrAuto(a *obj.Addr, v *ssa.Value) { | 
					
						
							|  |  |  | 	n, off := AutoVar(v) | 
					
						
							|  |  |  | 	a.Type = obj.TYPE_MEM | 
					
						
							| 
									
										
										
										
											2017-04-21 07:51:41 -07:00
										 |  |  | 	a.Sym = n.Sym.Linksym() | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	a.Reg = int16(thearch.REGSP) | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 	a.Offset = n.Xoffset + off | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PPARAM || n.Class() == PPARAMOUT { | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 		a.Name = obj.NAME_PARAM | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		a.Name = obj.NAME_AUTO | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | func (s *SSAGenState) AddrScratch(a *obj.Addr) { | 
					
						
							| 
									
										
										
										
											2016-10-04 13:00:21 -07:00
										 |  |  | 	if s.ScratchFpMem == nil { | 
					
						
							|  |  |  | 		panic("no scratch memory available; forgot to declare usesScratch for Op?") | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 	a.Type = obj.TYPE_MEM | 
					
						
							|  |  |  | 	a.Name = obj.NAME_AUTO | 
					
						
							| 
									
										
										
										
											2017-04-21 07:51:41 -07:00
										 |  |  | 	a.Sym = s.ScratchFpMem.Sym.Linksym() | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	a.Reg = int16(thearch.REGSP) | 
					
						
							| 
									
										
										
										
											2016-10-03 12:26:25 -07:00
										 |  |  | 	a.Offset = s.ScratchFpMem.Xoffset | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-03-29 00:53:26 +02:00
										 |  |  | // Call returns a new CALL instruction for the SSA value v. | 
					
						
							|  |  |  | // It uses PrepareCall to prepare the call. | 
					
						
							| 
									
										
										
										
											2017-03-10 18:34:41 -08:00
										 |  |  | func (s *SSAGenState) Call(v *ssa.Value) *obj.Prog { | 
					
						
							| 
									
										
										
										
											2018-03-29 00:53:26 +02:00
										 |  |  | 	s.PrepareCall(v) | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	p := s.Prog(obj.ACALL) | 
					
						
							|  |  |  | 	if sym, ok := v.Aux.(*obj.LSym); ok { | 
					
						
							|  |  |  | 		p.To.Type = obj.TYPE_MEM | 
					
						
							|  |  |  | 		p.To.Name = obj.NAME_EXTERN | 
					
						
							|  |  |  | 		p.To.Sym = sym | 
					
						
							|  |  |  | 	} else { | 
					
						
							|  |  |  | 		// TODO(mdempsky): Can these differences be eliminated? | 
					
						
							|  |  |  | 		switch thearch.LinkArch.Family { | 
					
						
							| 
									
										
										
										
											2018-03-29 00:55:53 +02:00
										 |  |  | 		case sys.AMD64, sys.I386, sys.PPC64, sys.S390X, sys.Wasm: | 
					
						
							| 
									
										
										
										
											2018-03-29 00:53:26 +02:00
										 |  |  | 			p.To.Type = obj.TYPE_REG | 
					
						
							|  |  |  | 		case sys.ARM, sys.ARM64, sys.MIPS, sys.MIPS64: | 
					
						
							|  |  |  | 			p.To.Type = obj.TYPE_MEM | 
					
						
							|  |  |  | 		default: | 
					
						
							|  |  |  | 			Fatalf("unknown indirect call family") | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		p.To.Reg = v.Args[0].Reg() | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	return p | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | // PrepareCall prepares to emit a CALL instruction for v and does call-related bookkeeping. | 
					
						
							|  |  |  | // It must be called immediately before emitting the actual CALL instruction, | 
					
						
							|  |  |  | // since it emits PCDATA for the stack map at the call (calls are safe points). | 
					
						
							|  |  |  | func (s *SSAGenState) PrepareCall(v *ssa.Value) { | 
					
						
							| 
									
										
										
										
											2018-03-22 12:04:51 -04:00
										 |  |  | 	idx := s.livenessMap.Get(v) | 
					
						
							|  |  |  | 	if !idx.Valid() { | 
					
						
							| 
									
										
										
										
											2018-02-26 20:48:53 -05:00
										 |  |  | 		// typedmemclr and typedmemmove are write barriers and | 
					
						
							|  |  |  | 		// deeply non-preemptible. They are unsafe points and | 
					
						
							|  |  |  | 		// hence should not have liveness maps. | 
					
						
							|  |  |  | 		if sym, _ := v.Aux.(*obj.LSym); !(sym == typedmemclr || sym == typedmemmove) { | 
					
						
							|  |  |  | 			Fatalf("missing stack map index for %v", v.LongString()) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-09 18:32:17 -08:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-10 18:34:41 -08:00
										 |  |  | 	if sym, _ := v.Aux.(*obj.LSym); sym == Deferreturn { | 
					
						
							|  |  |  | 		// Deferred calls will appear to be returning to | 
					
						
							|  |  |  | 		// the CALL deferreturn(SB) that we are about to emit. | 
					
						
							|  |  |  | 		// However, the stack trace code will show the line | 
					
						
							|  |  |  | 		// of the instruction byte before the return PC. | 
					
						
							|  |  |  | 		// To avoid that being an unrelated instruction, | 
					
						
							|  |  |  | 		// insert an actual hardware NOP that will have the right line number. | 
					
						
							|  |  |  | 		// This is different from obj.ANOP, which is a virtual no-op | 
					
						
							|  |  |  | 		// that doesn't make it into the instruction stream. | 
					
						
							| 
									
										
										
										
											2017-03-22 16:43:42 -07:00
										 |  |  | 		thearch.Ginsnop(s.pp) | 
					
						
							| 
									
										
										
										
											2017-03-10 18:34:41 -08:00
										 |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 	if sym, ok := v.Aux.(*obj.LSym); ok { | 
					
						
							| 
									
										
											  
											
												cmd/compile: improve coverage of nowritebarrierrec check
The current go:nowritebarrierrec checker has two problems that limit
its coverage:
1. It doesn't understand that systemstack calls its argument, which
means there are several cases where we fail to detect prohibited write
barriers.
2. It only observes calls in the AST, so calls constructed during
lowering by SSA aren't followed.
This CL completely rewrites this checker to address these issues.
The current checker runs entirely after walk and uses visitBottomUp,
which introduces several problems for checking across systemstack.
First, visitBottomUp itself doesn't understand systemstack calls, so
the callee may be ordered after the caller, causing the checker to
fail to propagate constraints. Second, many systemstack calls are
passed a closure, which is quite difficult to resolve back to the
function definition after transformclosure and walk have run. Third,
visitBottomUp works exclusively on the AST, so it can't observe calls
created by SSA.
To address these problems, this commit splits the check into two
phases and rewrites it to use a call graph generated during SSA
lowering. The first phase runs before transformclosure/walk and simply
records systemstack arguments when they're easy to get. Then, it
modifies genssa to record static call edges at the point where we're
lowering to Progs (which is the latest point at which position
information is conveniently available). Finally, the second phase runs
after all functions have been lowered and uses a direct BFS walk of
the call graph (combining systemstack calls with static calls) to find
prohibited write barriers and construct nice error messages.
Fixes #22384.
For #22460.
Change-Id: I39668f7f2366ab3c1ab1a71eaf25484d25349540
Reviewed-on: https://go-review.googlesource.com/72773
Run-TryBot: Austin Clements <austin@google.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-10-22 16:36:27 -04:00
										 |  |  | 		// Record call graph information for nowritebarrierrec | 
					
						
							|  |  |  | 		// analysis. | 
					
						
							|  |  |  | 		if nowritebarrierrecCheck != nil { | 
					
						
							|  |  |  | 			nowritebarrierrecCheck.recordCall(s.pp.curfn, sym, v.Pos) | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2017-03-10 18:34:41 -08:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2018-03-29 00:53:26 +02:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 08:58:36 -07:00
										 |  |  | 	if s.maxarg < v.AuxInt { | 
					
						
							|  |  |  | 		s.maxarg = v.AuxInt | 
					
						
							| 
									
										
										
										
											2017-03-10 18:34:41 -08:00
										 |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | // fieldIdx finds the index of the field referred to by the ODOT node n. | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | func fieldIdx(n *Node) int { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 	t := n.Left.Type | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ODOT and friends to use Sym, not Right
The Node type ODOT and its variants all represent a selector, with a
simple name to the right of the dot.  Before this change this was
represented by using an ONAME Node in the Right field.  This ONAME node
served no useful purpose.  This CL changes these Node types to store the
symbol in the Sym field instead, thus not requiring allocating a Node
for each selector.
When compiling x/tools/go/types this CL eliminates nearly 5000 calls to
newname and reduces the total number of Nodes allocated by about 6.6%.
It seems to cut compilation time by 1 to 2 percent.
Getting this right was somewhat subtle, and I added two dubious changes
to produce the exact same output as before.  One is to ishairy in
inl.go: the ONAME node increased the cost of ODOT and friends by 1, and
I retained that, although really ODOT is not more expensive than any
other node.  The other is to varexpr in walk.go: because the ONAME in
the Right field of an ODOT has no class, varexpr would always return
false for an ODOT, although in fact for some ODOT's it seemingly ought
to return true; I added an && false for now.  I will send separate CLs,
that will break toolstash -cmp, to clean these up.
This CL passes toolstash -cmp.
Change-Id: I4af8a10cc59078c436130ce472f25abc3a9b2f80
Reviewed-on: https://go-review.googlesource.com/20890
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
											
										 
											2016-03-18 16:52:30 -07:00
										 |  |  | 	f := n.Sym | 
					
						
							| 
									
										
										
										
											2016-03-30 14:56:08 -07:00
										 |  |  | 	if !t.IsStruct() { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 		panic("ODOT's LHS is not a struct") | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-03-14 12:45:18 -07:00
										 |  |  | 	var i int | 
					
						
							| 
									
										
										
										
											2016-03-17 01:32:18 -07:00
										 |  |  | 	for _, t1 := range t.Fields().Slice() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ODOT and friends to use Sym, not Right
The Node type ODOT and its variants all represent a selector, with a
simple name to the right of the dot.  Before this change this was
represented by using an ONAME Node in the Right field.  This ONAME node
served no useful purpose.  This CL changes these Node types to store the
symbol in the Sym field instead, thus not requiring allocating a Node
for each selector.
When compiling x/tools/go/types this CL eliminates nearly 5000 calls to
newname and reduces the total number of Nodes allocated by about 6.6%.
It seems to cut compilation time by 1 to 2 percent.
Getting this right was somewhat subtle, and I added two dubious changes
to produce the exact same output as before.  One is to ishairy in
inl.go: the ONAME node increased the cost of ODOT and friends by 1, and
I retained that, although really ODOT is not more expensive than any
other node.  The other is to varexpr in walk.go: because the ONAME in
the Right field of an ODOT has no class, varexpr would always return
false for an ODOT, although in fact for some ODOT's it seemingly ought
to return true; I added an && false for now.  I will send separate CLs,
that will break toolstash -cmp, to clean these up.
This CL passes toolstash -cmp.
Change-Id: I4af8a10cc59078c436130ce472f25abc3a9b2f80
Reviewed-on: https://go-review.googlesource.com/20890
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
											
										 
											2016-03-18 16:52:30 -07:00
										 |  |  | 		if t1.Sym != f { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 			i++ | 
					
						
							|  |  |  | 			continue | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2016-03-28 09:40:53 -07:00
										 |  |  | 		if t1.Offset != n.Xoffset { | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 			panic("field offset doesn't match") | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return i | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-09-09 21:08:46 -07:00
										 |  |  | 	panic(fmt.Sprintf("can't find field in expr %v\n", n)) | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-04-03 12:43:27 +01:00
										 |  |  | 	// TODO: keep the result of this function somewhere in the ODOT Node | 
					
						
							| 
									
										
										
										
											2016-01-11 21:05:33 -08:00
										 |  |  | 	// so we don't have to recompute it each time we need it. | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | // ssafn holds frontend information about a function that the backend is processing. | 
					
						
							|  |  |  | // It also exports a bunch of compiler services for the ssa backend. | 
					
						
							|  |  |  | type ssafn struct { | 
					
						
							| 
									
										
										
										
											2017-03-30 14:34:16 -07:00
										 |  |  | 	curfn        *Node | 
					
						
							|  |  |  | 	strings      map[string]interface{} // map from constant string to data symbols | 
					
						
							|  |  |  | 	scratchFpMem *Node                  // temp for floating point register / memory moves on some architectures | 
					
						
							|  |  |  | 	stksize      int64                  // stack size for current frame | 
					
						
							|  |  |  | 	stkptrsize   int64                  // prefix of stack containing pointers | 
					
						
							| 
									
										
										
										
											2018-07-24 10:39:00 +03:00
										 |  |  | 	log          bool                   // print ssa debug to the stdout | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | // StringData returns a symbol (a *types.Sym wrapped in an interface) which | 
					
						
							| 
									
										
										
										
											2015-07-24 11:28:12 -07:00
										 |  |  | // is the data component of a global string constant containing s. | 
					
						
							| 
									
										
										
										
											2016-08-30 19:11:19 -07:00
										 |  |  | func (e *ssafn) StringData(s string) interface{} { | 
					
						
							|  |  |  | 	if aux, ok := e.strings[s]; ok { | 
					
						
							|  |  |  | 		return aux | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	if e.strings == nil { | 
					
						
							|  |  |  | 		e.strings = make(map[string]interface{}) | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-11-08 09:43:56 +01:00
										 |  |  | 	data := stringsym(e.curfn.Pos, s) | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 	e.strings[s] = data | 
					
						
							|  |  |  | 	return data | 
					
						
							| 
									
										
										
										
											2015-05-27 14:52:22 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (e *ssafn) Auto(pos src.XPos, t *types.Type) ssa.GCNode { | 
					
						
							|  |  |  | 	n := tempAt(pos, e.curfn, t) // Note: adds new auto to e.curfn.Func.Dcl list | 
					
						
							| 
									
										
										
										
											2015-08-24 02:16:19 -07:00
										 |  |  | 	return n | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitString(name ssa.LocalSlot) (ssa.LocalSlot, ssa.LocalSlot) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	ptrType := types.NewPtr(types.Types[TUINT8]) | 
					
						
							|  |  |  | 	lenType := types.Types[TINT] | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 		// Split this string up into two separate variables. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		p := e.splitSlot(&name, ".ptr", 0, ptrType) | 
					
						
							|  |  |  | 		l := e.splitSlot(&name, ".len", ptrType.Size(), lenType) | 
					
						
							|  |  |  | 		return p, l | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// Return the two parts of the larger variable. | 
					
						
							| 
									
										
										
										
											2016-04-29 09:02:27 -07:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: ptrType, Off: name.Off}, ssa.LocalSlot{N: n, Type: lenType, Off: name.Off + int64(Widthptr)} | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitInterface(name ssa.LocalSlot) (ssa.LocalSlot, ssa.LocalSlot) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							| 
									
										
										
										
											2018-02-27 13:46:03 -08:00
										 |  |  | 	u := types.Types[TUINTPTR] | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	t := types.NewPtr(types.Types[TUINT8]) | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 		// Split this interface up into two separate variables. | 
					
						
							|  |  |  | 		f := ".itab" | 
					
						
							| 
									
										
										
										
											2016-04-01 13:36:24 -07:00
										 |  |  | 		if n.Type.IsEmptyInterface() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 			f = ".type" | 
					
						
							|  |  |  | 		} | 
					
						
							| 
									
										
										
										
											2018-02-27 13:46:03 -08:00
										 |  |  | 		c := e.splitSlot(&name, f, 0, u) // see comment in plive.go:onebitwalktype1. | 
					
						
							|  |  |  | 		d := e.splitSlot(&name, ".data", u.Size(), t) | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		return c, d | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// Return the two parts of the larger variable. | 
					
						
							| 
									
										
										
										
											2018-02-27 13:46:03 -08:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: u, Off: name.Off}, ssa.LocalSlot{N: n, Type: t, Off: name.Off + int64(Widthptr)} | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitSlice(name ssa.LocalSlot) (ssa.LocalSlot, ssa.LocalSlot, ssa.LocalSlot) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							| 
									
										
										
										
											2018-04-24 13:39:51 -07:00
										 |  |  | 	ptrType := types.NewPtr(name.Type.Elem()) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	lenType := types.Types[TINT] | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 		// Split this slice up into three separate variables. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		p := e.splitSlot(&name, ".ptr", 0, ptrType) | 
					
						
							|  |  |  | 		l := e.splitSlot(&name, ".len", ptrType.Size(), lenType) | 
					
						
							|  |  |  | 		c := e.splitSlot(&name, ".cap", ptrType.Size()+lenType.Size(), lenType) | 
					
						
							|  |  |  | 		return p, l, c | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// Return the three parts of the larger variable. | 
					
						
							| 
									
										
										
										
											2016-04-29 09:02:27 -07:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: ptrType, Off: name.Off}, | 
					
						
							|  |  |  | 		ssa.LocalSlot{N: n, Type: lenType, Off: name.Off + int64(Widthptr)}, | 
					
						
							|  |  |  | 		ssa.LocalSlot{N: n, Type: lenType, Off: name.Off + int64(2*Widthptr)} | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitComplex(name ssa.LocalSlot) (ssa.LocalSlot, ssa.LocalSlot) { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							| 
									
										
										
										
											2017-04-28 00:19:49 +00:00
										 |  |  | 	s := name.Type.Size() / 2 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	var t *types.Type | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	if s == 8 { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		t = types.Types[TFLOAT64] | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} else { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		t = types.Types[TFLOAT32] | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 		// Split this complex up into two separate variables. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		r := e.splitSlot(&name, ".real", 0, t) | 
					
						
							|  |  |  | 		i := e.splitSlot(&name, ".imag", t.Size(), t) | 
					
						
							|  |  |  | 		return r, i | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// Return the two parts of the larger variable. | 
					
						
							| 
									
										
										
										
											2016-04-29 09:02:27 -07:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: t, Off: name.Off}, ssa.LocalSlot{N: n, Type: t, Off: name.Off + s} | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitInt64(name ssa.LocalSlot) (ssa.LocalSlot, ssa.LocalSlot) { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	var t *types.Type | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 	if name.Type.IsSigned() { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		t = types.Types[TINT32] | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 	} else { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		t = types.Types[TUINT32] | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 		// Split this int64 up into two separate variables. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		if thearch.LinkArch.ByteOrder == binary.BigEndian { | 
					
						
							|  |  |  | 			return e.splitSlot(&name, ".hi", 0, t), e.splitSlot(&name, ".lo", t.Size(), types.Types[TUINT32]) | 
					
						
							|  |  |  | 		} | 
					
						
							|  |  |  | 		return e.splitSlot(&name, ".hi", t.Size(), t), e.splitSlot(&name, ".lo", 0, types.Types[TUINT32]) | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	// Return the two parts of the larger variable. | 
					
						
							| 
									
										
										
										
											2017-03-17 13:35:36 -07:00
										 |  |  | 	if thearch.LinkArch.ByteOrder == binary.BigEndian { | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 		return ssa.LocalSlot{N: n, Type: t, Off: name.Off}, ssa.LocalSlot{N: n, Type: types.Types[TUINT32], Off: name.Off + 4} | 
					
						
							| 
									
										
										
										
											2016-10-18 23:50:40 +02:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: t, Off: name.Off + 4}, ssa.LocalSlot{N: n, Type: types.Types[TUINT32], Off: name.Off} | 
					
						
							| 
									
										
											  
											
												[dev.ssa] cmd/compile: decompose 64-bit integer on ARM
Introduce dec64 rules to (generically) decompose 64-bit integer on
32-bit architectures. 64-bit integer is composed/decomposed with
Int64Make/Hi/Lo ops, as for complex types.
The idea of dealing with Add64 is the following:
(Add64 (Int64Make xh xl) (Int64Make yh yl))
->
(Int64Make
	(Add32withcarry xh yh (Select0 (Add32carry xl yl)))
	(Select1 (Add32carry xl yl)))
where Add32carry returns a tuple (flags,uint32). Select0 and Select1
read the first and the second component of the tuple, respectively.
The two Add32carry will be CSE'd.
Similarly for multiplication, Mul32uhilo returns a tuple (hi, lo).
Also add support of KeepAlive, to fix build after merge.
Tests addressed_ssa.go, array_ssa.go, break_ssa.go, chan_ssa.go,
cmp_ssa.go, ctl_ssa.go, map_ssa.go, and string_ssa.go in
cmd/compile/internal/gc/testdata passed.
Progress on SSA for ARM. Still not complete.
Updates #15365.
Change-Id: I7867c76785a456312de5d8398a6b3f7ca5a4f7ec
Reviewed-on: https://go-review.googlesource.com/23213
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2016-05-18 18:14:36 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitStruct(name ssa.LocalSlot, i int) ssa.LocalSlot { | 
					
						
							| 
									
										
										
										
											2016-03-31 21:24:10 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							|  |  |  | 	st := name.Type | 
					
						
							|  |  |  | 	ft := st.FieldType(i) | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 	var offset int64 | 
					
						
							|  |  |  | 	for f := 0; f < i; f++ { | 
					
						
							|  |  |  | 		offset += st.FieldType(f).Size() | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
										
										
											2016-03-31 21:24:10 -07:00
										 |  |  | 		// Note: the _ field may appear several times.  But | 
					
						
							|  |  |  | 		// have no fear, identically-named but distinct Autos are | 
					
						
							|  |  |  | 		// ok, albeit maybe confusing for a debugger. | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		return e.splitSlot(&name, "."+st.FieldName(i), offset, ft) | 
					
						
							| 
									
										
										
										
											2016-03-31 21:24:10 -07:00
										 |  |  | 	} | 
					
						
							| 
									
										
										
										
											2016-04-29 09:02:27 -07:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: ft, Off: name.Off + st.FieldOff(i)} | 
					
						
							| 
									
										
										
										
											2016-03-31 21:24:10 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) SplitArray(name ssa.LocalSlot) ssa.LocalSlot { | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	n := name.N.(*Node) | 
					
						
							|  |  |  | 	at := name.Type | 
					
						
							|  |  |  | 	if at.NumElem() != 1 { | 
					
						
							|  |  |  | 		Fatalf("bad array size") | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2018-04-24 13:39:51 -07:00
										 |  |  | 	et := at.Elem() | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	if n.Class() == PAUTO && !n.Addrtaken() { | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 		return e.splitSlot(&name, "[0]", 0, et) | 
					
						
							| 
									
										
										
										
											2016-10-30 21:10:03 -07:00
										 |  |  | 	} | 
					
						
							|  |  |  | 	return ssa.LocalSlot{N: n, Type: et, Off: name.Off} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) DerefItab(it *obj.LSym, offset int64) *obj.LSym { | 
					
						
							| 
									
										
											  
											
												cmd/compile: de-virtualize interface calls
With this change, code like
    h := sha1.New()
    h.Write(buf)
    sum := h.Sum()
gets compiled into static calls rather than
interface calls, because the compiler is able
to prove that 'h' is really a *sha1.digest.
The InterCall re-write rule hits a few dozen times
during make.bash, and hundreds of times during all.bash.
The most common pattern identified by the compiler
is a constructor like
    func New() Interface { return &impl{...} }
where the constructor gets inlined into the caller,
and the result is used immediately. Examples include
{sha1,md5,crc32,crc64,...}.New, base64.NewEncoder,
base64.NewDecoder, errors.New, net.Pipe, and so on.
Some existing benchmarks that change on darwin/amd64:
Crc64/ISO4KB-8        2.67µs ± 1%    2.66µs ± 0%  -0.36%  (p=0.015 n=10+10)
Crc64/ISO1KB-8         694ns ± 0%     690ns ± 1%  -0.59%  (p=0.001 n=10+10)
Adler32KB-8            473ns ± 1%     471ns ± 0%  -0.39%  (p=0.010 n=10+9)
On architectures like amd64, the reduction in code size
appears to contribute more to benchmark improvements than just
removing the indirect call, since that branch gets predicted
accurately when called in a loop.
Updates #19361
Change-Id: I57d4dc21ef40a05ec0fbd55a9bb0eb74cdc67a3d
Reviewed-on: https://go-review.googlesource.com/38139
Run-TryBot: Philip Hofer <phofer@umich.edu>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2017-03-13 15:03:17 -07:00
										 |  |  | 	return itabsym(it, offset) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | // splitSlot returns a slot representing the data of parent starting at offset. | 
					
						
							|  |  |  | func (e *ssafn) splitSlot(parent *ssa.LocalSlot, suffix string, offset int64, t *types.Type) ssa.LocalSlot { | 
					
						
							|  |  |  | 	s := &types.Sym{Name: parent.N.(*Node).Sym.Name + suffix, Pkg: localpkg} | 
					
						
							| 
									
										
										
										
											2017-03-21 20:58:00 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-10-20 11:56:31 +01:00
										 |  |  | 	n := &Node{ | 
					
						
							|  |  |  | 		Name: new(Name), | 
					
						
							|  |  |  | 		Op:   ONAME, | 
					
						
							|  |  |  | 		Pos:  parent.N.(*Node).Pos, | 
					
						
							|  |  |  | 	} | 
					
						
							| 
									
										
										
										
											2017-03-21 20:58:00 -07:00
										 |  |  | 	n.Orig = n | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
											
												cmd/compile: factor out Pkg, Sym, and Type into package types
- created new package cmd/compile/internal/types
- moved Pkg, Sym, Type to new package
- to break cycles, for now we need the (ugly) types/utils.go
  file which contains a handful of functions that must be installed
  early by the gc frontend
- to break cycles, for now we need two functions to convert between
  *gc.Node and *types.Node (the latter is a dummy type)
- adjusted the gc's code to use the new package and the conversion
  functions as needed
- made several Pkg, Sym, and Type methods functions as needed
- renamed constructors typ, typPtr, typArray, etc. to types.New,
  types.NewPtr, types.NewArray, etc.
Passes toolstash-check -all.
Change-Id: I8adfa5e85c731645d0a7fd2030375ed6ebf54b72
Reviewed-on: https://go-review.googlesource.com/39855
Reviewed-by: Matthew Dempsky <mdempsky@google.com>
											
										 
											2017-04-04 17:54:02 -07:00
										 |  |  | 	s.Def = asTypesNode(n) | 
					
						
							| 
									
										
										
										
											2017-04-27 15:17:57 -07:00
										 |  |  | 	asNode(s.Def).Name.SetUsed(true) | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n.Sym = s | 
					
						
							|  |  |  | 	n.Type = t | 
					
						
							| 
									
										
										
										
											2017-04-25 18:14:12 -07:00
										 |  |  | 	n.SetClass(PAUTO) | 
					
						
							| 
									
										
										
										
											2017-02-27 19:56:38 +02:00
										 |  |  | 	n.SetAddable(true) | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | 	n.Esc = EscNever | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | 	n.Name.Curfn = e.curfn | 
					
						
							|  |  |  | 	e.curfn.Func.Dcl = append(e.curfn.Func.Dcl, n) | 
					
						
							| 
									
										
										
										
											2017-04-27 16:37:25 -07:00
										 |  |  | 	dowidth(t) | 
					
						
							| 
									
										
										
										
											2017-07-21 18:28:06 -04:00
										 |  |  | 	return ssa.LocalSlot{N: n, Type: t, Off: 0, SplitOf: parent, SplitOffset: offset} | 
					
						
							| 
									
										
											  
											
												cmd/compile: better job of naming compound types
Compound AUTO types weren't named previously.  That was because live
variable analysis (plive.go) doesn't handle spilling to compound types.
It can't handle them because there is no valid place to put VARDEFs when
regalloc is spilling compound types.
compound types = multiword builtin types: complex, string, slice, and
interface.
Instead, we split named AUTOs into individual one-word variables.  For
example, a string s gets split into a byte ptr s.ptr and an integer
s.len.  Those two variables can be spilled to / restored from
independently.  As a result, live variable analysis can handle them
because they are one-word objects.
This CL will change how AUTOs are described in DWARF information.
Consider the code:
func f(s string, i int) int {
    x := s[i:i+5]
    g()
    return lookup(x)
}
The old compiler would spill x to two consecutive slots on the stack,
both named x (at offsets 0 and 8).  The new compiler spills the pointer
of x to a slot named x.ptr.  It doesn't spill x.len at all, as it is a
constant (5) and can be rematerialized for the call to lookup.
So compound objects may not be spilled in their entirety, and even if
they are they won't necessarily be contiguous.  Such is the price of
optimization.
Re-enable live variable analysis tests.  One test remains disabled, it
fails because of #14904.
Change-Id: I8ef2b5ab91e43a0d2136bfc231c05d100ec0b801
Reviewed-on: https://go-review.googlesource.com/21233
Run-TryBot: Keith Randall <khr@golang.org>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
											
										 
											2016-03-28 11:25:17 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (e *ssafn) CanSSA(t *types.Type) bool { | 
					
						
							|  |  |  | 	return canSSAType(t) | 
					
						
							| 
									
										
										
										
											2015-09-18 22:58:10 -07:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Line(pos src.XPos) string { | 
					
						
							| 
									
										
										
										
											2016-12-09 17:15:05 -08:00
										 |  |  | 	return linestr(pos) | 
					
						
							| 
									
										
										
										
											2016-01-14 16:02:23 -08:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | // Log logs a message from the compiler. | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Logf(msg string, args ...interface{}) { | 
					
						
							| 
									
										
										
										
											2016-09-14 10:01:05 -07:00
										 |  |  | 	if e.log { | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | 		fmt.Printf(msg, args...) | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Log() bool { | 
					
						
							| 
									
										
										
										
											2016-01-29 14:44:15 -05:00
										 |  |  | 	return e.log | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | // Fatal reports a compiler error and exits. | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Fatalf(pos src.XPos, msg string, args ...interface{}) { | 
					
						
							| 
									
										
										
										
											2016-12-09 17:15:05 -08:00
										 |  |  | 	lineno = pos | 
					
						
							| 
									
										
										
										
											2018-07-03 11:34:38 -04:00
										 |  |  | 	nargs := append([]interface{}{e.curfn.funcname()}, args...) | 
					
						
							|  |  |  | 	Fatalf("'%s': "+msg, nargs...) | 
					
						
							| 
									
										
										
										
											2015-06-12 11:01:13 -07:00
										 |  |  | } | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-10-26 17:34:06 -04:00
										 |  |  | // Warnl reports a "warning", which is usually flag-triggered | 
					
						
							|  |  |  | // logging output for the benefit of tests. | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Warnl(pos src.XPos, fmt_ string, args ...interface{}) { | 
					
						
							| 
									
										
										
										
											2016-12-09 17:15:05 -08:00
										 |  |  | 	Warnl(pos, fmt_, args...) | 
					
						
							| 
									
										
										
										
											2015-10-26 17:34:06 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Debug_checknil() bool { | 
					
						
							| 
									
										
										
										
											2015-10-26 17:34:06 -04:00
										 |  |  | 	return Debug_checknil != 0 | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) UseWriteBarrier() bool { | 
					
						
							| 
									
										
										
										
											2017-02-05 23:43:31 -05:00
										 |  |  | 	return use_writebarrier | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-03-17 07:49:22 -07:00
										 |  |  | func (e *ssafn) Syslook(name string) *obj.LSym { | 
					
						
							| 
									
										
										
										
											2017-04-15 08:00:08 -07:00
										 |  |  | 	switch name { | 
					
						
							|  |  |  | 	case "goschedguarded": | 
					
						
							|  |  |  | 		return goschedguarded | 
					
						
							|  |  |  | 	case "writeBarrier": | 
					
						
							|  |  |  | 		return writeBarrier | 
					
						
							| 
									
										
										
										
											2017-10-26 12:33:04 -04:00
										 |  |  | 	case "gcWriteBarrier": | 
					
						
							|  |  |  | 		return gcWriteBarrier | 
					
						
							| 
									
										
										
										
											2017-04-15 08:00:08 -07:00
										 |  |  | 	case "typedmemmove": | 
					
						
							|  |  |  | 		return typedmemmove | 
					
						
							|  |  |  | 	case "typedmemclr": | 
					
						
							|  |  |  | 		return typedmemclr | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | 	Fatalf("unknown Syslook func %v", name) | 
					
						
							|  |  |  | 	return nil | 
					
						
							| 
									
										
										
										
											2016-10-13 06:57:00 -04:00
										 |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2017-10-24 17:10:02 -04:00
										 |  |  | func (e *ssafn) SetWBPos(pos src.XPos) { | 
					
						
							|  |  |  | 	e.curfn.Func.setWBPos(pos) | 
					
						
							|  |  |  | } | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
											  
											
												cmd/compile: change ssa.Type into *types.Type
When package ssa was created, Type was in package gc.
To avoid circular dependencies, we used an interface (ssa.Type)
to represent type information in SSA.
In the Go 1.9 cycle, gri extricated the Type type from package gc.
As a result, we can now use it in package ssa.
Now, instead of package types depending on package ssa,
it is the other way.
This is a more sensible dependency tree,
and helps compiler performance a bit.
Though this is a big CL, most of the changes are
mechanical and uninteresting.
Interesting bits:
* Add new singleton globals to package types for the special
  SSA types Memory, Void, Invalid, Flags, and Int128.
* Add two new Types, TSSA for the special types,
  and TTUPLE, for SSA tuple types.
  ssa.MakeTuple is now types.NewTuple.
* Move type comparison result constants CMPlt, CMPeq, and CMPgt
  to package types.
* We had picked the name "types" in our rules for the handy
  list of types provided by ssa.Config. That conflicted with
  the types package name, so change it to "typ".
* Update the type comparison routine to handle tuples and special
  types inline.
* Teach gc/fmt.go how to print special types.
* We can now eliminate ElemTypes in favor of just Elem,
  and probably also some other duplicated Type methods
  designed to return ssa.Type instead of *types.Type.
* The ssa tests were using their own dummy types,
  and they were not particularly careful about types in general.
  Of necessity, this CL switches them to use *types.Type;
  it does not make them more type-accurate.
  Unfortunately, using types.Type means initializing a bit
  of the types universe.
  This is prime for refactoring and improvement.
This shrinks ssa.Value; it now fits in a smaller size class
on 64 bit systems. This doesn't have a giant impact,
though, since most Values are preallocated in a chunk.
name        old alloc/op      new alloc/op      delta
Template         37.9MB ± 0%       37.7MB ± 0%  -0.57%  (p=0.000 n=10+8)
Unicode          28.9MB ± 0%       28.7MB ± 0%  -0.52%  (p=0.000 n=10+10)
GoTypes           110MB ± 0%        109MB ± 0%  -0.88%  (p=0.000 n=10+10)
Flate            24.7MB ± 0%       24.6MB ± 0%  -0.66%  (p=0.000 n=10+10)
GoParser         31.1MB ± 0%       30.9MB ± 0%  -0.61%  (p=0.000 n=10+9)
Reflect          73.9MB ± 0%       73.4MB ± 0%  -0.62%  (p=0.000 n=10+8)
Tar              25.8MB ± 0%       25.6MB ± 0%  -0.77%  (p=0.000 n=9+10)
XML              41.2MB ± 0%       40.9MB ± 0%  -0.80%  (p=0.000 n=10+10)
[Geo mean]       40.5MB            40.3MB       -0.68%
name        old allocs/op     new allocs/op     delta
Template           385k ± 0%         386k ± 0%    ~     (p=0.356 n=10+9)
Unicode            343k ± 1%         344k ± 0%    ~     (p=0.481 n=10+10)
GoTypes           1.16M ± 0%        1.16M ± 0%  -0.16%  (p=0.004 n=10+10)
Flate              238k ± 1%         238k ± 1%    ~     (p=0.853 n=10+10)
GoParser           320k ± 0%         320k ± 0%    ~     (p=0.720 n=10+9)
Reflect            957k ± 0%         957k ± 0%    ~     (p=0.460 n=10+8)
Tar                252k ± 0%         252k ± 0%    ~     (p=0.133 n=9+10)
XML                400k ± 0%         400k ± 0%    ~     (p=0.796 n=10+10)
[Geo mean]         428k              428k       -0.01%
Removing all the interface calls helps non-trivially with CPU, though.
name        old time/op       new time/op       delta
Template          178ms ± 4%        173ms ± 3%  -2.90%  (p=0.000 n=94+96)
Unicode          85.0ms ± 4%       83.9ms ± 4%  -1.23%  (p=0.000 n=96+96)
GoTypes           543ms ± 3%        528ms ± 3%  -2.73%  (p=0.000 n=98+96)
Flate             116ms ± 3%        113ms ± 4%  -2.34%  (p=0.000 n=96+99)
GoParser          144ms ± 3%        140ms ± 4%  -2.80%  (p=0.000 n=99+97)
Reflect           344ms ± 3%        334ms ± 4%  -3.02%  (p=0.000 n=100+99)
Tar               106ms ± 5%        103ms ± 4%  -3.30%  (p=0.000 n=98+94)
XML               198ms ± 5%        192ms ± 4%  -2.88%  (p=0.000 n=92+95)
[Geo mean]        178ms             173ms       -2.65%
name        old user-time/op  new user-time/op  delta
Template          229ms ± 5%        224ms ± 5%  -2.36%  (p=0.000 n=95+99)
Unicode           107ms ± 6%        106ms ± 5%  -1.13%  (p=0.001 n=93+95)
GoTypes           696ms ± 4%        679ms ± 4%  -2.45%  (p=0.000 n=97+99)
Flate             137ms ± 4%        134ms ± 5%  -2.66%  (p=0.000 n=99+96)
GoParser          176ms ± 5%        172ms ± 8%  -2.27%  (p=0.000 n=98+100)
Reflect           430ms ± 6%        411ms ± 5%  -4.46%  (p=0.000 n=100+92)
Tar               128ms ±13%        123ms ±13%  -4.21%  (p=0.000 n=100+100)
XML               239ms ± 6%        233ms ± 6%  -2.50%  (p=0.000 n=95+97)
[Geo mean]        220ms             213ms       -2.76%
Change-Id: I15c7d6268347f8358e75066dfdbd77db24e8d0c1
Reviewed-on: https://go-review.googlesource.com/42145
Run-TryBot: Josh Bleecher Snyder <josharian@gmail.com>
TryBot-Result: Gobot Gobot <gobot@golang.org>
Reviewed-by: Keith Randall <khr@golang.org>
											
										 
											2017-04-28 14:12:28 -07:00
										 |  |  | func (n *Node) Typ() *types.Type { | 
					
						
							| 
									
										
										
										
											2015-10-22 14:22:38 -07:00
										 |  |  | 	return n.Type | 
					
						
							|  |  |  | } | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | func (n *Node) StorageClass() ssa.StorageClass { | 
					
						
							|  |  |  | 	switch n.Class() { | 
					
						
							|  |  |  | 	case PPARAM: | 
					
						
							|  |  |  | 		return ssa.ClassParam | 
					
						
							|  |  |  | 	case PPARAMOUT: | 
					
						
							|  |  |  | 		return ssa.ClassParamOut | 
					
						
							|  |  |  | 	case PAUTO: | 
					
						
							|  |  |  | 		return ssa.ClassAuto | 
					
						
							|  |  |  | 	default: | 
					
						
							| 
									
										
										
										
											2018-08-23 05:06:47 +00:00
										 |  |  | 		Fatalf("untranslatable storage class for %v: %s", n, n.Class()) | 
					
						
							| 
									
										
										
										
											2017-09-18 14:53:56 -07:00
										 |  |  | 		return 0 | 
					
						
							|  |  |  | 	} | 
					
						
							|  |  |  | } |