Reference: https://c9x.me/compile/bib/ubc.pdf#page=4
Both the parent comment and the referenced paper fail to mention the out-of-bounds access of d[16]. At best, the paper says:
> The compiler assumed that no out-of-bounds access to d would happen, and from that derived that k is at most 15 after the access
Here is my analysis. By unrolling the loop and tracing the statements and values, we get:
k = 0; dd = d[k];
k is 0; k < 16 is true; loop body; ++k; k is 1; dd = d[k];
k is 1; k < 16 is true; loop body; ++k; k is 2; dd = d[k];
k is 2; k < 16 is true; loop body; ++k; k is 3; dd = d[k];
k is 3; k < 16 is true; loop body; ++k; k is 4; dd = d[k];
k is 4; k < 16 is true; loop body; ++k; k is 5; dd = d[k];
k is 5; k < 16 is true; loop body; ++k; k is 6; dd = d[k];
k is 6; k < 16 is true; loop body; ++k; k is 7; dd = d[k];
k is 7; k < 16 is true; loop body; ++k; k is 8; dd = d[k];
k is 8; k < 16 is true; loop body; ++k; k is 9; dd = d[k];
k is 9; k < 16 is true; loop body; ++k; k is 10; dd = d[k];
k is 10; k < 16 is true; loop body; ++k; k is 11; dd = d[k];
k is 11; k < 16 is true; loop body; ++k; k is 12; dd = d[k];
k is 12; k < 16 is true; loop body; ++k; k is 13; dd = d[k];
k is 13; k < 16 is true; loop body; ++k; k is 14; dd = d[k];
k is 14; k < 16 is true; loop body; ++k; k is 15; dd = d[k];
k is 15; k < 16 is true; loop body; ++k; k is 16; dd = d[k]; OUT OF BOUNDS!
As long as we enter the loop, the loop must eventually execute undefined behavior. Furthermore, every instance of testing `k < 16` is true before we hit UB. Therefore it can be simplified to true without loss of functionality, because after we hit UB, we are allowed to do absolutely anything. In my ancestor post where I said that any mistake, no matter how small, can have unbounded consequences, I fully mean it and believe it.Please stop blaming the compiler. The problem is buggy code. Either fix the code, or fix the language specification so that wild reads either return an arbitrary value or crashes cleanly at that instruction.
Note that we cannot change the spec to give definite behavior to writing out of bounds, because it is always possible to overwrite something critical like a return address or an instruction, and then it is literally undefined behavior and anything can happen.
> I mean thats just sort of nuts, how do you loop over an array then in an UB free manner?
The code is significantly transformed, but the nasty behavior can be prevented by designing code that does not read out of bounds! The trick is that the test `k < 16` must be false before any attempt to read/write `d[k]`. Which 99.99% of programmers get right, especially by writing a loop in the standard way and not in the obtuse way demonstrated in the referenced code. The obvious and correct implementation is:
for (int k = 0; k < 16; k++) {
int dd = d[k];
satd += dd < 0 ? -dd : dd;
}
The fact that the SPEC code chose to load `d[k]` before checking that `k` is still in bounds is an overly clever, counterproductive "jumping the gun" tactic. Putting assignment statements into indexing expressions is also needless obfuscation (which I untangled in the unrolled analysis). according to my understanding UB mechanics says "k < 16" is always true because its used as an index into d[16]. obviously my understanding is wrong because of your correct code allows it, so I'm just looking for the takeaway here. why is yours not UB when original is? Is there some magic going on that the compiler knows that the value 16 will be used as an index?
is the compiler doing a loop unroll and study that parallels your analysis?
aside from trying to understand this, it also seems a bit nuts that these c guys remediated that behavior by allowing an out of bounds index access with their change (?). so are they saying somehow that in this one case, the out of bounds should be allowed?
Regarding the wrong code, I already wrote (and you can confirm in my unrolled analysis):
> every instance of testing `k < 16` is true before we hit UB. Therefore it can be simplified to true without loss of functionality, because after we hit UB, we are allowed to do absolutely anything.
Regarding the correct code, the key insight is that when k is 16, the test `k < 16` is false, so we break out of the loop and avoid reading out of bounds. Always check bounds before indexing the array, not after!
This article is a helpful introduction: https://blog.regehr.org/archives/213
> is the compiler doing a loop unroll and study that parallels your analysis?
Yes, compilers do all sorts of crazy stuff from bounds analyses to loop induction transformations. Personally, I've seen GCC eliminate my second loop variable in an implementation of the TEA cipher because it figured out that the first variable can be derived from the second, and it adjusted the bounds.
> it also seems a bit nuts that these c guys remediated that behavior by allowing an out of bounds index access with their change
You are right, it's insane that they decided to make a special case for one piece of wrong code (SPEC), probably because it's too popular to fail. That's a social/political problem, not a technical one.
seems like gcc is being pragmatic, at least with the version 12 something I'm running, it doesn't remove the k<16 test/expression, loop terminates, and with enough -W's I see the warning. still, even to detect this I have to use something other than -O0 which is news to me lol