Great to see the package evolving step by step.
I would like to use the library in two related variants.
1. Shared-memory partitioned matrices without MPI
2. Shared-memory partitioned matrices with MPI
The first variant is directly related to #188, but is also very interesting for threaded software (e.g. threaded assembly variants in FEM). The second should help maximize on-node performance for MPI-parallel software by using threads on each node of a cluster.
If I understand the idea correctly, making variant 1 work should be straightforward: I start from the `DebugArray` implementation and write a custom type, e.g. `ThreadedArray`, for which the dispatches are implemented correctly. Is that right? If that is the way to go, is there an easy way to compose this new `ThreadedArray` with `MPIArray`, or should I start differently from the outset?