a15ad0d310
We had an optimization in Crankshaft where we would call into the megamorphic handler stub directly if an inline cache was already found to be megamorphic when it hit the optimizing compiler. This way we could avoid the dispatch overhead when we know that there's no point in checking for the other states anyways. However we somehow missed to port this optimization to TurboFan. Now this change introduces support to call into LoadIC_Megamorphic and KeyedLoadIC_Megamorphic directly (plus the trampoline versions), which saves quite a lot of overhead for the cases where the map/name pair is found in the megamorphic stub cache, and it's quite a simple change. We can later extend this to also handle the StoreIC and KeyedStoreIC cases if that turns out to be beneficial. This improves the score on the Octane/TypeScript test by around ~2% and the TypeScript test in the web-tooling-benchmark by around ~4%. On the ARES-6 Air test the steady state mean improves by 2-4%, and on the ARES-6 ML test the steady state mean seems to also improve by 1-2%, but that might be within noise. On a micro-benchmark that just runs `o.x` in a hot loop on a set of 9 different objects, which all have `x` as the first property and are all in fast mode, we improve by around ~30%, and are now almost on par with JavaScriptCore. Bug: v8:6344, v8:6936 Change-Id: Iaa4c6e34c37e78da217ee75f32f6acc95a834250 Reviewed-on: https://chromium-review.googlesource.com/1215623 Reviewed-by: Jaroslav Sevcik <jarin@chromium.org> Reviewed-by: Jakob Kummerow <jkummerow@chromium.org> Commit-Queue: Benedikt Meurer <bmeurer@chromium.org> Cr-Commit-Position: refs/heads/master@{#55803} |
||
---|---|---|
.. | ||
arm | ||
arm64 | ||
ia32 | ||
mips | ||
mips64 | ||
ppc | ||
regalloc | ||
s390 | ||
x64 | ||
branch-elimination-unittest.cc | ||
bytecode-analysis-unittest.cc | ||
checkpoint-elimination-unittest.cc | ||
code-assembler-unittest.cc | ||
code-assembler-unittest.h | ||
common-operator-reducer-unittest.cc | ||
common-operator-unittest.cc | ||
compiler-test-utils.h | ||
constant-folding-reducer-unittest.cc | ||
control-equivalence-unittest.cc | ||
control-flow-optimizer-unittest.cc | ||
dead-code-elimination-unittest.cc | ||
diamond-unittest.cc | ||
effect-control-linearizer-unittest.cc | ||
graph-reducer-unittest.cc | ||
graph-reducer-unittest.h | ||
graph-trimmer-unittest.cc | ||
graph-unittest.cc | ||
graph-unittest.h | ||
instruction-selector-unittest.cc | ||
instruction-selector-unittest.h | ||
instruction-sequence-unittest.cc | ||
instruction-sequence-unittest.h | ||
instruction-unittest.cc | ||
int64-lowering-unittest.cc | ||
js-call-reducer-unittest.cc | ||
js-create-lowering-unittest.cc | ||
js-intrinsic-lowering-unittest.cc | ||
js-operator-unittest.cc | ||
js-typed-lowering-unittest.cc | ||
linkage-tail-call-unittest.cc | ||
live-range-builder.h | ||
load-elimination-unittest.cc | ||
loop-peeling-unittest.cc | ||
machine-operator-reducer-unittest.cc | ||
machine-operator-unittest.cc | ||
node-cache-unittest.cc | ||
node-matchers-unittest.cc | ||
node-properties-unittest.cc | ||
node-test-utils.cc | ||
node-test-utils.h | ||
node-unittest.cc | ||
opcodes-unittest.cc | ||
persistent-unittest.cc | ||
schedule-unittest.cc | ||
scheduler-rpo-unittest.cc | ||
scheduler-unittest.cc | ||
simplified-lowering-unittest.cc | ||
simplified-operator-reducer-unittest.cc | ||
simplified-operator-unittest.cc | ||
state-values-utils-unittest.cc | ||
typed-optimization-unittest.cc | ||
typer-unittest.cc | ||
value-numbering-reducer-unittest.cc | ||
zone-stats-unittest.cc |