-
-
Notifications
You must be signed in to change notification settings - Fork 75
Open
Labels
Description
Hello,
I'm investigating a potential use of Juice framework for deep reinforcement learning (I'm also learning the RL and deep learning as I go, so apologies for potentially newbie questions). RL requires simultaneous learning and using the net for predictions. I've found several issues which I'm not sure are design decisions or implementation shortcuts:
- It looks like if I configure the net for minibatch training, I can't then use it to make predictions on just one input. I get this exception when I try to.
Solver
has anetwork()
method, comment for which says that "This is the recommended method to get a usable trained network." However, you can't callforward()
on it, since it requires amut
ref.
I can probably work around 1 (like artificially creating a batch by replicating a single input vector) and 2 (by using mut_network()
), but it doesn't look right.
Is this something that can (should?) be fixed in the implementation? I'm happy to provide PRs (but will likely require technical guidance).
Thank you!