Have not found Monte Carlo Sampling in the code

Hi, 
 Thanks for releasing the code for active-qa. 
  After browsing the code, I did not find Monte-Carlo Sampling in the training stage. It seems that each training instance consists of only one  「query, reformulated_query, reward」 tuple.  Therefore, the reward is the same for each token in one reformulated query. 
I don't know whether the suspicion is right. If it is right,  what will model perform with or without Monte-Carlo sampling? Maybe using only one instance for Monte Carlo sampling is like the relation between stochastic gradient descent and gradient descent?
Thank you


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Have not found Monte Carlo Sampling in the code #21

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Have not found Monte Carlo Sampling in the code #21

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions