I got a 30-day trial of the ARM version of clang and re-ran my benchmarks.

TL;DR - Code built with ARM clang does not run significantly faster than code built with FOSS clang, but it is slightly smaller. GCC -O3 still wins on speed, though.

I ran some comparisons of GCC and clang with various optimization settings. Screenshot attached.

TL;DR - A 4-year-old version of GCC beats a new clang.

