Unit stride access of blocked Coefficients #713

michalhabera · 2024-09-11T11:27:25Z

While running some benchmarks with @jhale, we noticed a performance regression introduced in #645.

More specifically, before that PR any blocked Coefficient (e.g. a vector-valued material property) was first copied into a temporary outside the quadrature loop, so that it was accessed with unit stride in the hot loop; see the logic removed in #645.

For higher-order quadrature rules (but otherwise simple scalar trees), this could lead to a >30% performance regression.
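To make the access pattern concrete, here is a minimal sketch of the two variants. It is not the code FFCx generated before or after #645; the sizes `NDOFS`, `BS`, `NQ`, the table `phi`, and both function names are hypothetical and chosen only for illustration.

```c
/* Hypothetical sizes: 3 nodes, block size 3, 4 quadrature points. */
enum { NDOFS = 3, BS = 3, NQ = 4 };

/* Strided variant: component c of node i is read as w[BS * i + c],
   so the innermost loop walks w with stride BS. */
void eval_strided(const double *w, const double *phi, double *out)
{
  for (int q = 0; q < NQ; ++q)
    for (int c = 0; c < BS; ++c)
    {
      double acc = 0.0;
      for (int i = 0; i < NDOFS; ++i)
        acc += w[BS * i + c] * phi[q * NDOFS + i];
      out[q * BS + c] = acc;
    }
}

/* Temporary variant: de-interleave w once, outside the quadrature loop,
   so the innermost loop reads tmp with unit stride. */
void eval_unit_stride(const double *w, const double *phi, double *out)
{
  double tmp[BS * NDOFS];
  for (int i = 0; i < NDOFS; ++i)
    for (int c = 0; c < BS; ++c)
      tmp[c * NDOFS + i] = w[BS * i + c];

  for (int q = 0; q < NQ; ++q)
    for (int c = 0; c < BS; ++c)
    {
      double acc = 0.0;
      for (int i = 0; i < NDOFS; ++i)
        acc += tmp[c * NDOFS + i] * phi[q * NDOFS + i];
      out[q * BS + c] = acc;
    }
}
```

Both functions compute the same values; only the memory access pattern in the hot loop differs. The copy costs `BS * NDOFS` loads once per cell, while the strided reads recur for every quadrature point, which is presumably why the effect grows with the quadrature degree.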

IgorBaratta · 2024-09-11T12:10:33Z

I can add the logic back, but I think that the best place to do this is when packing coefficients in dolfinx.
Otherwise we repeat the packing step (pack coefficients in a given order, then pack them again in a different order).

More specifically, I think FFCx kernels should receive data in blocked XXXX...YYYY...ZZZZ... format rather than interleaved XYZXYZXYZ... format.
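As an illustration of the two layouts, the following sketch converts interleaved data into blocked data. The function name `pack_blocked` and its signature are hypothetical; this is not the dolfinx packing routine, only the index mapping it would need to apply.

```c
#include <stddef.h>

/* Interleaved (XYZXYZ...): component c of node i is at in[bs * i + c].
   Blocked (XXX...YYY...ZZZ...): component c of node i is at out[n * c + i].
   n is the number of nodes, bs the block size. */
void pack_blocked(const double *in, double *out, size_t n, size_t bs)
{
  for (size_t i = 0; i < n; ++i)
    for (size_t c = 0; c < bs; ++c)
      out[n * c + i] = in[bs * i + c];
}
```

With two nodes and block size 3, the input `x0 y0 z0 x1 y1 z1` becomes `x0 x1 y0 y1 z0 z1`, so a kernel looping over the nodes of one component reads consecutive entries.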

IgorBaratta · 2024-09-11T12:24:17Z

When I introduced temporaries to avoid strided memory accesses, it was intended as a short-term solution, as it only works with vector elements.

michalhabera · 2024-09-11T12:43:16Z

Agreed that this should be done in dolfinx when packing coefficients. I do not remember whether coefficient packing is templated on the block size; if it is not, there could be a small performance drawback from packing in a loop whose bound is not a compile-time constant.
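One possible way to give the compiler a constant block size without templates is sketched below in C. It assumes nothing about how dolfinx actually packs coefficients; `pack_impl` and `pack_dispatch` are hypothetical names, and whether the compiler really specializes each case depends on inlining and optimization settings.

```c
#include <stddef.h>

/* Generic repacking loop. When this is inlined at a call site that
   passes a literal bs, the compiler may treat the inner trip count
   as a constant and unroll it. */
static inline void pack_impl(const double *in, double *out,
                             size_t n, size_t bs)
{
  for (size_t i = 0; i < n; ++i)
    for (size_t c = 0; c < bs; ++c)
      out[n * c + i] = in[bs * i + c];
}

/* Dispatch common block sizes to call sites with a literal bs;
   anything else falls back to the runtime-sized loop. */
void pack_dispatch(const double *in, double *out, size_t n, size_t bs)
{
  switch (bs)
  {
  case 1: pack_impl(in, out, n, 1); break;
  case 2: pack_impl(in, out, n, 2); break;
  case 3: pack_impl(in, out, n, 3); break;
  default: pack_impl(in, out, n, bs); break;
  }
}
```

The result is identical for every branch; the switch only changes what the compiler knows about the loop bound.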

I'd suggest keeping this issue open, since it will also require changes to the coefficient access in FFCx.

michalhabera added generated-code performance labels Sep 11, 2024
