Replies: 1 comment 6 replies
-
Dear, I think that CQL is very simple algorithm. In fact, it only defines a regularizer to make Q-value conservative. In my opinion, it can be defined as a common component, which can be used in various other algorithms. I prefer to command discrete BCQ as a benchmark. |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
@findmyway
My plans for this week are the following:
Beta Was this translation helpful? Give feedback.
All reactions