Skip to content

关于DDPG算法 #208

@zhenbin-li

Description

@zhenbin-li

def choose_action(self, s):
s = s[np.newaxis, :] # single state
return self.sess.run(self.a, feed_dict={S: s})[0] # single action

你好,想问问这个函数返回值最后[0]是返回的什么东西呢

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions