def choose_action(self, s): s = s[np.newaxis, :] # single state return self.sess.run(self.a, feed_dict={S: s})[0] # single action 你好,想问问这个函数返回值最后[0]是返回的什么东西呢