The Batch independent multioutput GP tutorial shows how to to set different kernel parameters for each output. For $N$ inputs and $M$ outputs this costs $\mathcal{O}(MN^3)$ as we need to invert $M$ Gram matrices, each $N \times N$. Is there a way to use a shared kernel for all models and therefore only have one gram matrix to invert at cost $\mathcal{O}(N^3)$? I suspect there is a way to do this by setting the batch_shape parameter for the kernel and mean, but I cant figure it out. Any help would be greatly appreciated. Thanks!