Commits · 5.1.0 · Runtimes / vixl

Dec 01, 2020

Jacob Bramley authored Nov 06, 2020

Notably:

- Make the simulator stack configurable, and make the guard regions
  functional.
- Fix simulation of frint and SVE while instructions.
- Add some missing mov aliases.

Change-Id: Icaa5c7fc597af007b32b023273483e09efcd91a1

b97550be

Nov 24, 2020

Fix ldapr test · 2ed1d317

Martyn Capewell authored Nov 19, 2020

The pointer to the buffer used for the ldapr tests could be potentially aligned
to the zero-initialised region, causing the test to fail. Fix this by
initialising the rest of the buffer to the correct value.

Change-Id: Ie0d63c58fc4dccdb8a0da1c3da1661875f208b1d

2ed1d317

Nov 20, 2020
- Add missing aliases for mov instruction with immediate · bb4ed214
  Mateusz Kalinowski authored Nov 20, 2020
```
Change-Id: I63a4f52a716008e615a93bd30600d14dbb76bac0
```
  bb4ed214
Nov 12, 2020

[sve] Fix while simulation corner case · 78de9512

Martyn Capewell authored Oct 28, 2020

The simulation of while was computing differences using 64-bit variables, which
didn't handle the overflow cases correctly for W-sized registers. Fix and add a
regression test.

Change-Id: I681f6ecb137d52830a58816fa34545342e6eb6bf

78de9512

Nov 10, 2020
- Remove "dummy" from test and tools · acdea50a
  Martyn Capewell authored Nov 10, 2020
```
Change-Id: I5a237ee9225fd1c8f758498d72658ffbf1a90a61
```
  acdea50a
- Remove use of "dummy" · d42989cf
  Martyn Capewell authored Nov 10, 2020
```
Replace "dummy" with "placeholder".

Change-Id: I9af7f56c93b49c1a6428414601af1dec475ca3b8
```
  d42989cf
Nov 05, 2020

Make the stack size configurable. · c4ef66e9

Jacob Bramley authored Oct 30, 2020

Also fix the implementation of stack guards, and make these configurable
too.

Change-Id: Ic62ee326ed725616322ca8fa26d38b9a089d5043

c4ef66e9

Fix FPRoundInt's handling of INT64_MAX. · f73036ba

Jacob Bramley authored Nov 04, 2020

This fixes a bug caused by an implicit conversion from `int64_t` to
`double`, as well as several related warnings (from recent versions of
Clang) in the "frint" tests.

Change-Id: Ie5dccbf7a86c5e3a608570bd0ffc566bf3813380

f73036ba

Oct 30, 2020

Don't simulate invalid logical immediate instructions · a26a26cb

Martyn Capewell authored Sep 23, 2020

Logical immediate instructions using an immediate encoding that should be
reserved were being simulated. Prevent this by asserting the immediate is
valid.

Change-Id: Ic0388f3f941914d9fae92e0961c2ebf10c3a5d0b

a26a26cb

Oct 20, 2020

Fix undefined behaviour in Halve · 6f755e6a

Martyn Capewell authored Sep 23, 2020

The Halve function relied on shifting a signed value right. Replace with a
well-defined bitfield extraction.

Change-Id: I980d64ee92b68b5d4e213a2f338d810ec3999df7

6f755e6a

Oct 15, 2020
- Simplify the command-line disassembly example UI. · d02f4372
  Jacob Bramley authored Oct 15, 2020
```
Change-Id: I8461bc4c7b50d487334f3a022b614047f132ad61
```
  d02f4372
Oct 14, 2020
- Add a command-line disassembly example. · e522c6e6
  Jacob Bramley authored Oct 14, 2020
```
Change-Id: Icaff12d06d4cee6852f16a419365b61a53dc1ffe
```
  e522c6e6
Oct 07, 2020

Add email address for bug reporting · 46ce31f6

Martyn Capewell authored Sep 23, 2020

Provide an email contact address in the README for bug reports.

Change-Id: Iad9e8c2f135758da12597ab18fc8ea8d72006174

46ce31f6

Sep 23, 2020

Define values for unallocated prefetch modes · 63ceff5b

Martyn Capewell authored Sep 23, 2020

The assembler API provides prefetch functions that accept an enumeration for
the prefetch mode, but that relies on C++ undefined behaviour when used with
integer mode specifiers absent from the enumeration. This patch adds those
unallocated values.

Change-Id: I03fce9158deed2dda43ed35e5c6cbc93a4c3f9af

63ceff5b

Sep 10, 2020

Fix lint.py error regex. · 4c550938

Jacob Bramley authored Sep 10, 2020

The regex didn't match when a prefix was used (i.e. through test.py).
Also, if `re.search()` failed, the whole process would hang. This patch
avoids that by explicitly catching a failed match.

Change-Id: Ib52f88c392c9b1d040aef55b60703dc66f11b824

4c550938

Aug 14, 2020

Optimise single handler tables in the decoder. · e66b0909

Martyn Capewell authored Jul 30, 2020

Previously, the decoder applied a mask-and-value optimisation only if a decoding
table had two entries in it, with an explicit "otherwise" case. This patch makes
the optimisation also work when the case is implicit.

Change-Id: Iacbf96940e1c078fdcb4ad162f483b0d690b4108

e66b0909

Aug 13, 2020

[sve] Disallow dup with shift on byte-sized lanes · 8ed83527

Martyn Capewell authored Aug 11, 2020

The Arm ARM states dup-immediate with shift on byte-sized lanes is an undefined
encoding, so add assertions in the assembler to disallow this, and handle
the encoding appropriately in the disassembler and simulator.

Change-Id: Ibce08d2577e2b780dbb7e4fa478d2f8d18e99a39

8ed83527

Aug 06, 2020

Fix infinite loops in some native tests. · 32a7cd97

Jacob Bramley authored Aug 06, 2020

Some tests assume rely on `sve_vl_in_bits()` to inform code generation.
We still generate SVE code on platforms without SVE, we just don't run
it, so we need to pick a valid (non-zero) default.

Change-Id: I25b457f63c54cccfc299128857244b2504127755

32a7cd97

Aug 03, 2020

Remove undefined behaviour in add/sub immediate · b8da04db

Martyn Capewell authored Aug 03, 2020

The macro assembler support for add/sub immediate could attempt to negate the
minimum int64_t value, which is undefined behaviour. Check the value before
negation.

Change-Id: If5c12515fbef490318a90930e7a362876b0e6dc5

b8da04db

Jul 29, 2020

Revert optimisation for add/sub immediates · f84b4643

Martyn Capewell authored Jul 29, 2020

Removed due to some cases showing worse performance.

Reverts:
960606b6
f3f5d246

Change-Id: I5b9585d112424d97e372bd264d084cb9caf92b5f

f84b4643

Jul 24, 2020

Fix initialisation order for ID register fields. · aaf02c54

Jacob Bramley authored Jul 22, 2020

Making the constructor `constexpr` gives these "constant" rather than
"dynamic" initialisation, which guarantees that they'll be initialised
before any other dynamically-initialised static variables.

This makes it safe to make static variable initialisers dependent on the
result of `CPUFeatures::InferFromOS()` and similar functions.

Change-Id: Ib65e92273c7ac03d07e17cbe844b598ef3bc60ac

aaf02c54

Jul 23, 2020

Fix add/sub immediate for min-int case · f3f5d246

Martyn Capewell authored Jul 21, 2020

960606b6 removed a check for the encodability of a value within an add/sub
instruction, which allowed negation of min-int to pass through undetected.
This caused incorrect code generation, as the value wasn't inverted, but the
operation was.

Change-Id: I9c436a42a67aabca4873972c173ea104ef36f40e

f3f5d246

Jul 16, 2020

Add missing aliases for SVE 0.0 moves. · f48172ba
Jacob Bramley authored Jul 13, 2020
```
Change-Id: I608c610da7de42328ed3984dabb56cf1401d7a15
```
f48172ba
Fix and enable CanTakeSVEMovprfx. · b9616b36
Jacob Bramley authored Jun 30, 2020
```
Change-Id: I9afc03fb9e11546b9e6caf04497339bf45b285b6
```
b9616b36

Support more than 64 CPU features. · 8c4ceb6a

Jacob Bramley authored Jul 08, 2020

After recent patches, we have exactly 64 CPU features. This patch makes
the mechanism flexible so that we can support more features in the
future.

Several operators on CPUFeatures had to be re-written as part of this,
so this patch replaces the default-argument implementations with a more
flexible template-based approach, which can accept more than four
features. Existing usage remains unaffected.

Change-Id: If91a3adb62669aa827464e857a90eb93a64db7a6

8c4ceb6a

Fix CPUFeature iterator behaviour. · caa40eec

Jacob Bramley authored Jul 08, 2020

The `++` operators should return iterators, not values.

This also updates tests to match, and makes wider use of C++11
range-based `for` loops, where they simplify code.

Change-Id: I2c8ef422e851d6b16c8de2890ae16fc69817a738

caa40eec

Add an example that dumps CPU feature information. · 28ff5975

Jacob Bramley authored Jul 08, 2020

For debugging purposes, it's useful to see what CPU features VIXL sees.

Change-Id: I6a501ee8c11e50252db713d7d295d84db0f2aee2

28ff5975

Add support for AT_HWCAP2. · 31d432b2
Jacob Bramley authored Jul 08, 2020
```
Change-Id: I3c893a6c1e3b25756999025a21ae310e5b3e199c
```
31d432b2

Add CPUFeatures up to Armv8.6. · 3d8d3942

Jacob Bramley authored Jul 06, 2020

This adds support for all relevant features described in the latest
Armv8.6 XML.

Note that this removes the CPUFeatures::All() part of the
`API_CPUFeatures_format` test. It added little value to the test, and
was a burden to update when new features are added.

Change-Id: I276a0970be94c3adf2d0100874df0b82c7424a9b

3d8d3942

Jul 13, 2020

Emit pairs of add/sub for larger immediates · 960606b6

Martyn Capewell authored Jul 01, 2020 and

Jacob Bramley committed Jul 13, 2020

For immediates between 12 and 24 bits in size, a pair of add or sub instructions
can be used instead of mov, avoiding the need to allocate a temporary.

Change-Id: I114b4667dcc1bda094652e01d88069d012249dca

960606b6

Use segments in SVE indexed fmul simulation · 4635261c

Martyn Capewell authored Jul 02, 2020

The value used for the second operand in indexed multiplies differs for each
segment (128-bit part) of a vector, but the simulator wasn't doing this for
FP multiplies. Fix and update the tests.

Change-Id: I9cc37ebef9d216243a23bedebea256826e1016cb

4635261c

Jul 06, 2020

Fix numerous issues related to CAS* instructions. · 3eb24e95

Jacob Bramley authored Jul 03, 2020

1. There was no test for 64-bit CASP.

2. The tests had some faulty code for obtaining aligned pointers. The
   natural alignment is sufficient anyway, so this patch removes the
   broken alignment code, and varies the addresses used to strengthen
   the test slightly.

   For the new CASP test, this patch uses the C++11 `alignas` specifier.

3. The simulation of CASP variants accessed memory in the wrong order.
   With this patch, the first-specified register in each pair accesses
   the lowest address.

4. We now check that `rs` and `rt` have the same format. Likewise for
   `rs2` and `rt2` in the CASP variants.

5. Register trace is improved: the `rs` (and `rs2`) update is traced as
   a memory read so we should suppress the log on the register write.
   This is what we do for normal loads.

Change-Id: I213c4b3de32305a8072fdc45357b67cbbf85ba9c

3eb24e95

Make assembler more strict about SVE prefetch arguments · 102e7a5e

Martyn Capewell authored Jul 02, 2020 and

Jacob Bramley committed Jul 06, 2020

Add assertions to the assembler to prevent the use of unsupported addressing
modes for prfb/h/w/d.

Change-Id: Ie12991eb2e29661eb266fc495e9164246371d10e

102e7a5e

Jul 03, 2020

Use PgLow8 rather than Pg<12, 10>. · ebc3b8f5

Jacob Bramley authored Jul 03, 2020

This is just a clean-up. We have the helper, so we should use it.

Change-Id: I8ee2c7929aef6ad737d7079eee62ffe3f7618857

ebc3b8f5

Always assert that 'pg' does not have a lane size. · 7b5819c3

Jacob Bramley authored Jun 17, 2020

We did this for PgLow8, but not for 4-bit 'pg' fields.

In practice, we plan to relax this in the future, permitting lane sizes
where they match the rest of the instruction, but this patch makes our
checks consistent in the meantime.

Change-Id: Ie791027f217eabab305dbd22b8c0e77926c9d3b8

7b5819c3

Jul 02, 2020

Disallow x31/xzr for SVE prefetch scalar offset register · ecca4b1c

Martyn Capewell authored Jul 02, 2020

The architecture disallows rm = x31/xzr for prefetch, so assert this in the
assembler.

Change-Id: I26e14688bde624d38eee40167fb3ada88acaaec7

ecca4b1c

Fix simulation of FCMNE. · 4606adc3

Jacob Bramley authored Jul 02, 2020

FCMNE can return true when the comparison is unordered.

Change-Id: Ic1fa9a83cd9bde23faf2b13b69d3a7e9d1426a12

4606adc3

Require an immediate (0.0) for compare-with-zero instructions. · 5a5e71f3

Jacob Bramley authored Jul 02, 2020

This matches conventions elsewhere in the API, and allows for immediate
synthesis. Immediate synthesis is not included in this patch.

Change-Id: If4bdc9cfd9d4bb83a9c015ef363291c1ff08a64a

5a5e71f3

Prefer to use 'rd' as a scratch. · a8461cf9

Jacob Bramley authored Jun 22, 2020

This is generally useful, but in particular reduces scratch register
pressure in code sequences using ComputeAddress. For example:

    MemOperand addr(...);
    UseScratchRegisterScope temps(&masm);
    Register computed = temps.AcquireX();
    __ ComputeAddress(computed, addr);

Before this patch, that sequence usually required two scratch registers;
one for `computed`, and one for immediate synthesis inside
`ComputeAddress`. With this patch, the same code sequence only needs one
scratch register.

Change-Id: I9c93e6cab51bdacf36046d4d770dc81d1a65a34c

a8461cf9

Fix CPURegister::GetArchitecturalName(). · 32f8fe13

Jacob Bramley authored Jul 02, 2020

The `code_` field is a `uint8_t`, which is treated by stream formatters
as a `char`. This caused strange output from error messages in test
failure.

Change-Id: I16302e6bbd8977bb376d28c7b7cb2091f9891aba

32f8fe13