For parallel computing I have some experience with openMP in Fortran and it is indeed quite fast. Any example with coarrays? For example, a simple value function iteration with coarrays? I've googled it but found nothing
learn Fortran MPI, which increases speed tenfold on the basis of openMP