[firedrake] cached kernels

Lawrence Mitchell lawrence.mitchell at imperial.ac.uk
Thu Nov 12 14:46:45 GMT 2015


On 12/11/15 14:45, Eike Mueller wrote:
> Hi Lawrence,
> 
> ok, problem solved. If I use the in-place Thomas algorithm for the
> lowest order tridiagonal system instead of LAPACK's LU solver
> routines I get excellent memory throughput (average 3.4GB/s per
> core, so about peak for the full node).
> 
> The time per iteration drops significantly from 0.44s to 0.24s
> (compared to 0.35s for the PETSc solver with hypre preconditioner),
> so this was really a change worth implementing!

Ah, nice!
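
For readers of the archive, here is a minimal sketch of an in-place Thomas
solve for a single tridiagonal system. It is not the code discussed above:
the function name and the argument layout are assumptions made for
illustration, and it assumes the matrix needs no pivoting (for example, it
is diagonally dominant). The solve is one forward sweep and one backward
sweep over the arrays, overwriting the diagonal and the right-hand side
rather than allocating separate factor storage.

```python
def thomas_solve_inplace(lower, diag, upper, rhs):
    """Solve a tridiagonal system in place, without pivoting.

    Hypothetical layout (an assumption, not taken from the thread):
    lower[i] multiplies x[i-1] in row i (lower[0] is unused),
    diag[i] multiplies x[i], and upper[i] multiplies x[i+1]
    (upper[n-1] is unused). diag and rhs are overwritten; on
    return rhs holds the solution.
    """
    n = len(diag)
    # Forward elimination: remove the sub-diagonal entry of each row.
    for i in range(1, n):
        w = lower[i] / diag[i - 1]
        diag[i] -= w * upper[i - 1]
        rhs[i] -= w * rhs[i - 1]
    # Back substitution, reusing rhs as the solution vector.
    rhs[n - 1] /= diag[n - 1]
    for i in range(n - 2, -1, -1):
        rhs[i] = (rhs[i] - upper[i] * rhs[i + 1]) / diag[i]
    return rhs


# Small check: the matrix [[2,-1,0],[-1,2,-1],[0,-1,2]] applied to
# [1, 2, 3] gives [0, 0, 4], so solving should recover [1, 2, 3].
lower = [0.0, -1.0, -1.0]
diag = [2.0, 2.0, 2.0]
upper = [-1.0, -1.0, 0.0]
x = thomas_solve_inplace(lower, diag, upper, [0.0, 0.0, 4.0])
# x is approximately [1.0, 2.0, 3.0]
```

Because diag is overwritten, a caller that solves against the same matrix
repeatedly would need to keep a copy of it or store the eliminated
coefficients once and reuse them.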

Thanks,

Lawrence
