My use case is evaluating different model architectures in one long-running REPL session (in Main) to compare their performance on tasks.
When using Reactant+Enzyme, calling GC.gc() does not reduce VRAM usage, and I have to restart the whole Julia REPL to release the VRAM in use. With LuxCUDA+Zygote, by contrast, I can run GC.gc(); CUDA.reclaim() to release VRAM immediately.
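
For reference, here is a minimal sketch of the CUDA-side cleanup described above. It assumes only that CUDA.jl is loaded and a GPU is available; the array `x` is a hypothetical stand-in for a model's GPU-resident state, not my actual model.

```julia
using CUDA

# Hypothetical stand-in for model parameters living on the GPU (assumption)
x = CUDA.rand(Float32, 1024, 1024)
CUDA.memory_status()   # print GPU memory usage while the array is alive

# Drop the only reference so the array becomes unreachable
x = nothing

GC.gc()                # run Julia's garbage collector so the CuArray gets finalized
CUDA.reclaim()         # ask CUDA.jl to hand cached pool memory back to the driver
CUDA.memory_status()   # print usage again to compare
```

The two calls do different jobs: GC.gc() only frees the Julia-side objects, and CUDA.reclaim() is what asks the CUDA.jl memory pool to return its cached memory.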
Would it be possible to support an equivalent way of releasing VRAM with Reactant? If not, is there another way to solve this, such as explicitly marking objects as freeable by hand?