- Feb 10, 2011
-
Douglas Gregor authored
AST/PCH files more lazy:
- Don't preload all of the file source-location entries when reading the AST file. Instead, load them lazily, when needed.
- Only look up header-search information (whether a header was already #import'd, how many times it's been included, etc.) when it's needed by the preprocessor, rather than pre-populating it.

Previously, we would pre-load all of the file source-location entries, which also populated the header-search information structure. This was a relatively minor performance issue, since we would end up stat()'ing all of the headers stored within an AST/PCH file when the AST/PCH file was loaded. In the normal PCH use case, the stat()s were cached, so the cost (preloading ~860 source-location entries in the Cocoa.h case) was relatively low.

However, the recent optimization that replaced stat+open with open+fstat turned this into a major problem, since the preloading of source-location entries would now end up opening those files. Worse, those files wouldn't be closed until the file manager was destroyed, so just opening a Cocoa.h PCH file would hold on to ~860 file descriptors, and it was easy to blow through the process's limit on the number of open file descriptors.

By eliminating the preloading of these files, we neither open nor stat the headers stored in the PCH/AST file until they're actually needed for something. Concretely, with a trivial program that uses a chained PCH including a Cocoa PCH, we went from

    *** HeaderSearch Stats:
      835 files tracked.
      364 #import/#pragma once files.
      823 included exactly once.
      6 max times a file is included.
      3 #include/#include_next/#import.
      0 #includes skipped due to the multi-include optimization.
      1 framework lookups.
      0 subframework lookups.
    *** Source Manager Stats:
      835 files mapped, 3 mem buffers mapped.
      37460 SLocEntry's allocated, 11215575B of Sloc address space used.
      62 bytes of files mapped, 0 files with line #'s computed.

to

    *** HeaderSearch Stats:
      4 files tracked.
      1 #import/#pragma once files.
      3 included exactly once.
      2 max times a file is included.
      3 #include/#include_next/#import.
      0 #includes skipped due to the multi-include optimization.
      1 framework lookups.
      0 subframework lookups.
    *** Source Manager Stats:
      3 files mapped, 3 mem buffers mapped.
      37460 SLocEntry's allocated, 11215575B of Sloc address space used.
      62 bytes of files mapped, 0 files with line #'s computed.

for the same program.

llvm-svn: 125286
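A minimal sketch of the lazy-open pattern described above, in generic C++: paths are recorded cheaply at load time, and a descriptor is opened only on first use. The LazyFileTable name and layout here are illustrative assumptions, not Clang's actual FileManager/SourceManager code.

    #include <cstdio>
    #include <string>
    #include <unordered_map>
    #include <vector>

    class LazyFileTable {
      std::vector<std::string> Paths;          // recorded at AST-load time
      std::unordered_map<size_t, FILE *> Open; // materialized on demand
    public:
      explicit LazyFileTable(std::vector<std::string> P)
          : Paths(std::move(P)) {}

      // Old behavior, conceptually: open every file up front, holding one
      // descriptor per header (~860 for a Cocoa.h PCH). New behavior: open
      // a file only when some client actually asks for it.
      FILE *get(size_t Index) {
        auto It = Open.find(Index);
        if (It != Open.end())
          return It->second;
        FILE *F = std::fopen(Paths[Index].c_str(), "rb");
        Open.emplace(Index, F);
        return F;
      }

      ~LazyFileTable() {
        for (auto &KV : Open)
          if (KV.second)
            std::fclose(KV.second);
      }
    };

With this shape, the descriptor count tracks what was actually used rather than the number of headers stored in the PCH.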
-
Roman Divacky authored
is specified in the FreeBSD linker driver. llvm-svn: 125285
-
David Greene authored
[AVX] Implement 256-bit vector lowering for EXTRACT_VECTOR_ELT. llvm-svn: 125284
-
Roman Divacky authored
llvm-svn: 125283
-
Roman Divacky authored
llvm-svn: 125282
-
Ken Dyck authored
character units. llvm-svn: 125281
-
Ken Dyck authored
r125156. llvm-svn: 125280
-
Che-Liang Chiou authored
llvm-svn: 125279
-
NAKAMURA Takumi authored
Unixen and Cygwin do not need it. llvm-svn: 125277
-
NAKAMURA Takumi authored
llvm-svn: 125275
-
NAKAMURA Takumi authored
llvm-svn: 125274
-
NAKAMURA Takumi authored
llvm-svn: 125273
-
NAKAMURA Takumi authored
llvm-svn: 125272
-
Chris Lattner authored
gep to explicit addressing, we know that none of the intermediate computation overflows.

This could use review: it seems that the shifts certainly wouldn't overflow, but could the intermediate adds overflow if there is a negative index?

Previously the testcase would instcombine to:

    define i1 @test(i64 %i) {
      %p1.idx.mask = and i64 %i, 4611686018427387903
      %cmp = icmp eq i64 %p1.idx.mask, 1000
      ret i1 %cmp
    }

now we get:

    define i1 @test(i64 %i) {
      %cmp = icmp eq i64 %i, 1000
      ret i1 %cmp
    }

llvm-svn: 125271
-
Chris Lattner authored
for NSW/NUW binops to follow the pattern of exact binops. This allows someone to use Builder.CreateAdd(x, y, "tmp", MaybeNUW); llvm-svn: 125270
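A hedged usage sketch of the API shape described above; the emitAdd wrapper and the MaybeNUW flag are hypothetical, and the include path is the one used in current LLVM trees. The trailing HasNUW/HasNSW booleans on CreateAdd mirror the isExact parameter on the exact binops.

    #include "llvm/IR/IRBuilder.h"
    using namespace llvm;

    // Emit an add, optionally tagging it no-unsigned-wrap, without choosing
    // between CreateAdd and CreateNUWAdd at the call site.
    Value *emitAdd(IRBuilder<> &Builder, Value *X, Value *Y, bool MaybeNUW) {
      return Builder.CreateAdd(X, Y, "tmp", /*HasNUW=*/MaybeNUW,
                               /*HasNSW=*/false);
    }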
-
Greg Clayton authored
llvm-svn: 125269
-
John McCall authored
linkage into Decl.cpp. Disable this logic for extern "C" functions, because the operative rule there is weaker. Fixes rdar://problem/8898466 llvm-svn: 125268
-
Chris Lattner authored
exact/nsw/nuw shifts and have instcombine infer them when it can prove that the relevant properties are true for a given shift without them. Also, a variety of refactoring to use the new patternmatch logic thrown in for good luck. I believe that this takes care of a bunch of related code quality issues attached to PR8862. llvm-svn: 125267
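A minimal sketch of the kind of inference described, under stated assumptions: it is illustrative rather than the actual InstCombine code, and maybeMarkExact with its KnownZero parameter is hypothetical. An lshr is exact precisely when the bits it shifts out are known zero, so a pass that can prove that may set the flag.

    #include "llvm/ADT/APInt.h"
    #include "llvm/IR/Instructions.h"
    #include "llvm/IR/PatternMatch.h"
    using namespace llvm;
    using namespace PatternMatch;

    void maybeMarkExact(BinaryOperator &Shr, const APInt &KnownZero) {
      ConstantInt *ShAmtC;
      if (!match(&Shr, m_LShr(m_Value(), m_ConstantInt(ShAmtC))))
        return;
      uint64_t ShAmt = ShAmtC->getZExtValue();
      // The low ShAmt bits are discarded by the shift; if they are all
      // known zero, no information is lost and the shift is exact.
      APInt LowBits = APInt::getLowBitsSet(KnownZero.getBitWidth(), ShAmt);
      if ((KnownZero & LowBits) == LowBits)
        Shr.setIsExact(true);
    }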
-
Chris Lattner authored
optimizations to be much more aggressive in the face of exact/nsw/nuw div and shifts. For example, these (which are the same except that the first is an 'exact' sdiv):

    define i1 @sdiv_icmp4_exact(i64 %X) nounwind {
      %A = sdiv exact i64 %X, -5  ; X/-5 == 0 --> x == 0
      %B = icmp eq i64 %A, 0
      ret i1 %B
    }

    define i1 @sdiv_icmp4(i64 %X) nounwind {
      %A = sdiv i64 %X, -5  ; X/-5 == 0 --> x == 0
      %B = icmp eq i64 %A, 0
      ret i1 %B
    }

compile down to:

    define i1 @sdiv_icmp4_exact(i64 %X) nounwind {
      %1 = icmp eq i64 %X, 0
      ret i1 %1
    }

    define i1 @sdiv_icmp4(i64 %X) nounwind {
      %X.off = add i64 %X, 4
      %1 = icmp ult i64 %X.off, 9
      ret i1 %1
    }

(In the non-exact case, sdiv truncates toward zero, so X/-5 == 0 exactly when X is in the range (-5, 5); adding 4 maps that range onto [0, 9), hence the unsigned compare.) This happens when you do something like:

    (ptr1-ptr2) == 42

where the pointers are pointers to non-unit types.

llvm-svn: 125266
-
Chris Lattner authored
conversions". :) llvm-svn: 125265
-
Chris Lattner authored
and generally tidying things up. Only very trivial functionality changes, like now doing (-1 - A) -> (~A) for vectors too.

    InstCombineAddSub.cpp | 296 +++++++++++++++++++++-----------------------
    1 file changed, 126 insertions(+), 170 deletions(-)

llvm-svn: 125264
-
Chris Lattner authored
are shifting out since they do require them to be zeros. Similarly for the NUW/NSW bits of shl. llvm-svn: 125263
-
Ted Kremenek authored
llvm-svn: 125262
-
Ted Kremenek authored
This reduces memory usage of the analyzer on sqlite by another 5%. llvm-svn: 125260
-
Evan Cheng authored
After 3-addressifying a two-address instruction, update the register maps; add a missing check when considering whether it's profitable to commute. rdar://8977508. llvm-svn: 125259
-
Johnny Chen authored
and a helper method UnalignedSupport(). llvm-svn: 125258
-
Eric Christopher authored
llvm-svn: 125257
-
Bill Wendling authored
llvm-svn: 125256
-
Caroline Tice authored
input reader. Always make sure the input reader stack is not empty before trying to get the top element from the stack. llvm-svn: 125255
-
Cameron Zwarich authored
Natural Loop Information
Loop Pass Manager
  Canonicalize natural loops
Scalar Evolution Analysis
Loop Pass Manager
  Induction Variable Users
  Canonicalize natural loops
  Induction Variable Users
  Loop Strength Reduction

into this:

Scalar Evolution Analysis
Loop Pass Manager
  Canonicalize natural loops
  Induction Variable Users
  Loop Strength Reduction

This fixes <rdar://problem/8869639>. I also filed PR9184 on doing this sort of thing automatically, but it seems easier to just change the ordering of the passes if this is the only case.

llvm-svn: 125254
-
Ted Kremenek authored
This is a hack because we really should only search in the 'include/clang/StaticAnalyzer' directory if we are in 'lib/StaticAnalyzer'. My CMake knowledge is limited, so I appeal to anyone with more expertise. llvm-svn: 125252
-
Ted Kremenek authored
Split 'include/clang/StaticAnalyzer' into 'include/clang/StaticAnalyzer/Core' and 'include/clang/StaticAnalyzer/Checkers'. This layout matches lib/StaticAnalyzer, which corresponds to two StaticAnalyzer libraries. llvm-svn: 125251
-
Devang Patel authored
llvm-svn: 125250
-
Devang Patel authored
llvm-svn: 125249
-
Jim Grosbach authored
When matching operands for a candidate opcode match in the auto-generated AsmMatcher, check each operand against the expected operand match class. Previously, operands were classified independently of the opcode being handled, which led to difficulties when operand match classes were more complicated than simple subclass relationships. llvm-svn: 125245
-
Johnny Chen authored
of the CPSR during the course of executing an opcode, and modified SelectInstrSet() to update this variable instead of the original m_inst_cpsr, which should be the cached copy of the CPSR at the beginning of executing the opcode. llvm-svn: 125244
-
Jakob Stoklund Olesen authored
Loop splitting is better handled by the more generic global region splitting based on the edge bundle graph. llvm-svn: 125243
-
Johnny Chen authored
and a helper method ALUWritePC(Context&, uint32_t). llvm-svn: 125241
-
Greg Clayton authored
indirect forms, deals with empty DW_AT_comp_dir attributes, and fixups for handling other signed integer types. llvm-svn: 125240
-
Douglas Gregor authored
I have another way to achieve the same goal. llvm-svn: 125239
-