Commits · c932173773948ba14c341e279e449f2e7d2fde77 · Roger Ferrer / llvm-epi-0.8

Jun 30, 2011
- Fix a small thinko for constant i64 lock/orq optimization where we · c9321737
  Eric Christopher authored Jun 30, 2011
  
  we didn't have an opcode for 64-bit constant or expressions. Fixes rdar://9692967 llvm-svn: 134121
  c9321737
May 20, 2011
- Re-commit 131641 with fixes; de-pseudoize MOVSX16rr8 and friends. · 91f1d247
  Stuart Hastings authored May 20, 2011
  
  rdar://problem/8614450 llvm-svn: 131746
  91f1d247
May 17, 2011
- Update comment. · 56a42ebf
  Eric Christopher authored May 17, 2011
  
  llvm-svn: 131459
  56a42ebf
- Support XOR and AND optimization with no return value. · a1d9e295
  Eric Christopher authored May 17, 2011
  
  Finishes off rdar://8470697 llvm-svn: 131458
  a1d9e295
- Couple less magic numbers. · abfe3131
  Eric Christopher authored May 17, 2011
  
  llvm-svn: 131457
  abfe3131
- Make this code a little less magic number laden. · eb47a2a1
  Eric Christopher authored May 17, 2011
  
  llvm-svn: 131456
  eb47a2a1
May 11, 2011
- Turn this into a table, this will make more sense shortly. · 2a9dbbbb
  Eric Christopher authored May 11, 2011
  
  Part of rdar://8470697 llvm-svn: 131200
  2a9dbbbb
- Optimize atomic lock or that doesn't use the result value. · 4a34e61e
  Eric Christopher authored May 10, 2011
  
  Next up: xor and and. Part of rdar://8470697 llvm-svn: 131171
  4a34e61e
Apr 23, 2011
- Silence an overzealous uninitialized variable warning from GCC. · 3db05465
  Benjamin Kramer authored Apr 23, 2011
  
  llvm-svn: 130053
  3db05465
Apr 22, 2011

X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X &... · 4c816247

Benjamin Kramer authored Apr 22, 2011

X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X & (C2 >> C1)) & C1. (Part of PR5039)

This tends to happen a lot with bitfield code generated by clang. A simple example for x86_64 is
uint64_t foo(uint64_t x) { return (x&1) << 42; }
which used to compile into bloated code:
shlq $42, %rdi ## encoding: [0x48,0xc1,0xe7,0x2a]
movabsq $4398046511104, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x00,0x04,0x00,0x00]
andq %rdi, %rax ## encoding: [0x48,0x21,0xf8]
ret ## encoding: [0xc3]

with this patch we can fold the immediate into the and:
andq $1, %rdi ## encoding: [0x48,0x83,0xe7,0x01]
movq %rdi, %rax ## encoding: [0x48,0x89,0xf8]
shlq $42, %rax ## encoding: [0x48,0xc1,0xe0,0x2a]
ret ## encoding: [0xc3]

It's possible to save another byte by using 'andl' instead of 'andq' but I currently see no way of doing
that without making this code even more complicated. See the TODOs in the code.

llvm-svn: 129990

4c816247

Feb 16, 2011
- Swap VT and DebugLoc operands of getExtLoad() for consistency with · 81c43060
  Stuart Hastings authored Feb 16, 2011
  
  other getNode() methods. Radar 9002173. llvm-svn: 125665
  81c43060
Feb 13, 2011

Enhance ComputeMaskedBits to know that aligned frameindexes · 46c01a30

Chris Lattner authored Feb 13, 2011

have their low bits set to zero.  This allows us to optimize
out explicit stack alignment code like in stack-align.ll:test4 when
it is redundant.

Doing this causes the code generator to start turning FI+cst into
FI|cst all over the place, which is general goodness (that is the
canonical form) except that various pieces of the code generator
don't handle OR aggressively.  Fix this by introducing a new
SelectionDAG::isBaseWithConstantOffset predicate, and using it
in places that are looking for ADD(X,CST).  The ARM backend in
particular was missing a lot of addressing mode folding opportunities
around OR.

llvm-svn: 125470

46c01a30

Jan 27, 2011
- lib/Target/X86/X86ISelDAGToDAG.cpp: __main should be WINCALL64 on Win64. · f3e20b9f
  NAKAMURA Takumi authored Jan 27, 2011
  
  CALL64 marks %xmm* as dead. llvm-svn: 124354
  f3e20b9f
Jan 16, 2011

fix PR8514, a bug where the "heroic" transformation of shift/and · 35a2e65b

Chris Lattner authored Jan 16, 2011

into and/shift would cause nodes to move around and a dangling pointer
to happen.  The code tried to avoid this with a HandleSDNode, but 
got the details wrong.

llvm-svn: 123578

35a2e65b

Jan 14, 2011
- 'HiReg' is written but never read. Nuke its · b5241b2b
  Ted Kremenek authored Jan 14, 2011
  
  declaration and its assignments. Found by clang static analyzer. llvm-svn: 123486
  b5241b2b
Jan 06, 2011

PR8918 - When used with MinGW64, LLVM generates a "calll __main" at the · 81d40711

Bill Wendling authored Jan 06, 2011

beginning of the "main" function. The assembler complains about the invalid
suffix for the 'call' instruction. The right instruction is "callq __main".
Patch by KS Sreeram!

llvm-svn: 122933

81d40711

Dec 21, 2010
- rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for · 3e5fbd74
  Chris Lattner authored Dec 21, 2010
  
  something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310
  3e5fbd74
Dec 05, 2010

it turns out that when ".with.overflow" intrinsics were added to the X86 · 364bb0a0

Chris Lattner authored Dec 05, 2010

backend that they were all implemented except umul.  This one fell back
to the default implementation that did a hi/lo multiply and compared the
top.  Fix this to check the overflow flag that the 'mul' instruction
sets, so we can avoid an explicit test.  Now we compile:

void *func(long count) {
      return new int[count];
}

into:

__Z4funcl:                              ## @_Z4funcl
	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
	seto	%cl                     ## encoding: [0x0f,0x90,0xc1]
	testb	%cl, %cl                ## encoding: [0x84,0xc9]
	movq	$-1, %rdi               ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff]
	cmoveq	%rax, %rdi              ## encoding: [0x48,0x0f,0x44,0xf8]
	jmp	__Znam                  ## TAILCALL

instead of:

__Z4funcl:                              ## @_Z4funcl
	movl	$4, %ecx                ## encoding: [0xb9,0x04,0x00,0x00,0x00]
	movq	%rdi, %rax              ## encoding: [0x48,0x89,0xf8]
	mulq	%rcx                    ## encoding: [0x48,0xf7,0xe1]
	testq	%rdx, %rdx              ## encoding: [0x48,0x85,0xd2]
	movq	$-1, %rdi               ## encoding: [0x48,0xc7,0xc7,0xff,0xff,0xff,0xff]
	cmoveq	%rax, %rdi              ## encoding: [0x48,0x0f,0x44,0xf8]
	jmp	__Znam                  ## TAILCALL

Other than the silly seto+test, this is using the o bit directly, so it's going in the right
direction.

llvm-svn: 120935

364bb0a0

Oct 27, 2010

Use a MemIntrinsicSDNode for ISD::PREFETCH, which touches · e660f4d0

Dale Johannesen authored Oct 26, 2010

memory, so a MachineMemOperand is useful (not propagated
into the MachineInstr yet).  No functional change except
for dump output.

llvm-svn: 117413

e660f4d0

Oct 06, 2010
- Use #NAME# to have the CMOV multiclass define things with the same names as before · 1a1c6001
  Chris Lattner authored Oct 05, 2010
  
  (e.g. CMOVBE16rr instead of CMOVBErr16). llvm-svn: 115705
  1a1c6001
- switch CMOVBE to the multipattern: · 0067ee02
  Chris Lattner authored Oct 05, 2010
  
  21 insertions(+), 53 deletions(-) Moar change coming before I switch the rest. llvm-svn: 115697
  0067ee02
Sep 22, 2010
- Temporarily work around new address lowering while I figure out what · c1b3e072
  Eric Christopher authored Sep 22, 2010
  
  needs to happen for darwin. llvm-svn: 114577
  c1b3e072
- reimplement elf TLS support in terms of addressing modes, eliminating SegmentBaseAddress. · 8a236b63
  Chris Lattner authored Sep 22, 2010
  
  llvm-svn: 114529
  8a236b63
- convert the last 4 X86ISD nodes that should have memoperands to have them. · a5156c30
  Chris Lattner authored Sep 22, 2010
  
  llvm-svn: 114523
  a5156c30
- give X86ISD::FNSTCW16m a memoperand, since it touches memory. It only · ed85da56
  Chris Lattner authored Sep 22, 2010
  
  can access the stack due to how it is generated though. llvm-svn: 114522
  ed85da56
- give FP_TO_INT16_IN_MEM and friends a memoperand. They are only · 78f518b7
  Chris Lattner authored Sep 22, 2010
  
  used with stack slots, but hey, lets be safe. llvm-svn: 114521
  78f518b7
- give VZEXT_LOAD a memory operand, it now works with segment registers. · 54e53295
  Chris Lattner authored Sep 22, 2010
  
  llvm-svn: 114515
  54e53295
- revert r114386 now that address modes work correctly, we get a nice · 07827ba9
  Chris Lattner authored Sep 22, 2010
  
  call through gs-relative memory now. llvm-svn: 114510
  07827ba9
- give LCMPXCHG_DAG[8] a memory operand, allowing it to work with addrspace 256/257 · e479e964
  Chris Lattner authored Sep 21, 2010
  
  llvm-svn: 114508
  e479e964
- reimplement support for GS and FS relative address space matching · d58d7c19
  Chris Lattner authored Sep 21, 2010
  
  by having X86DAGToDAGISel::SelectAddr get passed in the parent node of the operand match (the load/store/atomic op) and having it get the address space from that, instead of having special FS/GS addr mode operations that require duplicating the entire instruction set to support. This makes FS and GS relative accesses *far* more predictable and work much better. It also simplifies the X86 backend a bit, more to come. There is still a pending issue with nodes like ISD::PREFETCH and X86ISD::FLD, which really should be MemSDNode's but aren't. llvm-svn: 114491
  d58d7c19
Sep 21, 2010

fix a long standing wart: all the ComplexPattern's were being · 0e023ea0

Chris Lattner authored Sep 21, 2010

passed the root of the match, even though only a few patterns
actually needed this (one in X86, several in ARM [which should
be refactored anyway], and some in CellSPU that I don't feel 
like detangling).   Instead of requiring all ComplexPatterns to
take the dead root, have targets opt into getting the root by
putting SDNPWantRoot on the ComplexPattern.

llvm-svn: 114471

0e023ea0

even though I'm about to rip it out, simplify the address mode stuff · c6d8839a
Chris Lattner authored Sep 21, 2010
```
llvm-svn: 114468
```
c6d8839a
propagate MachinePointerInfo through various uses of the old · 3d178ed4
Chris Lattner authored Sep 21, 2010
```
SelectionDAG::getExtLoad overload, and eliminate it.

llvm-svn: 114446
```
3d178ed4
fix rdar://8453210, a crash handling a call through a GS relative load. · bb0a1c44
Chris Lattner authored Sep 21, 2010
```
For now, just disable folding the load into the call.

llvm-svn: 114386
```
bb0a1c44

Sep 04, 2010
- zap dead code. · 65b48b5d
  Chris Lattner authored Sep 04, 2010
  
  llvm-svn: 113073
  65b48b5d
Sep 03, 2010
- Don't call Predicate_* from X86 target. · 08aede25
  Jakob Stoklund Olesen authored Sep 03, 2010
  
  llvm-svn: 112921
  08aede25
Aug 25, 2010
- Remove dead recursive function. Yay for clang -Wunused-function. · f1f2133a
  Benjamin Kramer authored Aug 25, 2010
  
  llvm-svn: 112060
  f1f2133a
Aug 05, 2010
- PR7814: Truncates cannot be ignored for signed comparisons. · 39d0f57c
  Eli Friedman authored Aug 04, 2010
  
  llvm-svn: 110268
  39d0f57c
Jul 09, 2010

Change LEA to have 5 operands for its memory operand, just · f469307c

Chris Lattner authored Jul 08, 2010

like all other instructions, even though a segment is not
allowed.  This resolves a bunch of gross hacks in the 
encoder and makes LEA more consistent with the rest of the
instruction set.

No functionality change.

llvm-svn: 107934

f469307c

Jul 08, 2010
- Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. · 1c349f18
  Evan Cheng authored Jul 07, 2010
  
  llvm-svn: 107820
  1c349f18