I wrote up my GCC vs Clang results, along with some charts, over at this GitHub repo:

@emeb The label for ARM Clang reads "Load %" in the first graph and "Bytes" in the second graph.

