X_train
```python
X_train = tf.placeholder(tf.int32, [n_batch_train, 2, n_ctx, 2])
xmb[:, :, :, 1] = np.arange(n_vocab+n_special, n_vocab+n_special+n_ctx)
```

Why is there a channel of additional tokens?
Problem solved! That part of xmb holds the inputs for the learned positional encoding. See huggingface/pytorch-openai-transformer-lm#12 (comment)
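For anyone who finds this later: here is a minimal sketch of what that second channel is for, written in TF2 eager style instead of the repo's TF1 placeholders. All the sizes below are made up, and `we` stands in for the learned embedding matrix. Position ids are offset past the token vocabulary so tokens and positions share one embedding table; summing the lookup over the last index axis gives token embedding + position embedding, which is (as far as I can tell) what the repo's `embed` function does.

```python
import numpy as np
import tensorflow as tf

# Illustrative sizes only -- the real values come from the dataset and the
# BPE vocabulary in finetune-transformer-lm.
n_batch_train = 8
n_vocab = 40000    # token vocabulary (made-up size)
n_special = 3      # start / delimiter / classify tokens
n_ctx = 77         # max sequence length
n_embd = 768

# Channel 0 carries token ids; channel 1 carries position ids, offset past
# the token vocabulary so positions are just extra rows of the same
# embedding matrix.
xmb = np.zeros((n_batch_train, 2, n_ctx, 2), dtype=np.int32)
xmb[:, :, :, 1] = np.arange(n_vocab + n_special, n_vocab + n_special + n_ctx)

# One embedding matrix covering tokens, special tokens, and positions
# (randomly initialized here; in training it is a learned variable).
we = tf.random.normal([n_vocab + n_special + n_ctx, n_embd])

e = tf.gather(we, tf.constant(xmb))  # [n_batch, 2, n_ctx, 2, n_embd]
h = tf.reduce_sum(e, axis=3)         # [n_batch, 2, n_ctx, n_embd]: token + position
```

So the model never has a separate positional-embedding table: the positions are just extra "tokens" whose embeddings get added to the word embeddings by the sum over the last axis.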