
Entailment pr #39

Open · wants to merge 6 commits into master
Conversation

vyskoto4 (Contributor)

The entailment model implementation is not 100% final; I had to replace some hardcoded code with the SequenceSplit layer, and the performance is not as good as before.

```python
model.add_node(RepeatVector(spad), name='Wh_n_cross_e', input='Wh_n')
model.add_node(TimeDistributedDense(N, W_regularizer=l2(l2reg)), name='WY', input='Y')
model.add_node(Activation('tanh'), name='M', inputs=['Wh_n_cross_e', 'WY'], merge_mode='sum')
model.add_node(TimeDistributedDense(1, activation='softmax'), name='alpha', input='M')
```
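For readers unfamiliar with the old Keras Graph API, the computation these four nodes perform (scoring each premise timestep against the repeated final hypothesis state, in the spirit of attention-based entailment models) can be sketched in plain NumPy. All names and sizes below are illustrative assumptions, not values taken from the PR; the weights are random stand-ins for trained layer parameters.

```python
import numpy as np

spad, N = 5, 8                      # hypothetical premise length and hidden width
rng = np.random.default_rng(0)

Y = rng.normal(size=(spad, N))      # premise hidden states, one row per timestep
Wh_n = rng.normal(size=(N,))        # transformed final hypothesis state

W_Y = rng.normal(size=(N, N))       # stand-in for the TimeDistributedDense(N) weights
w_alpha = rng.normal(size=(N, 1))   # stand-in for the TimeDistributedDense(1) weights

# RepeatVector(spad): broadcast Wh_n across all premise timesteps
Wh_n_cross_e = np.tile(Wh_n, (spad, 1))

# The 'sum' merge followed by the tanh activation: M = tanh(Y W_Y + Wh_n ⊗ e)
M = np.tanh(Y @ W_Y + Wh_n_cross_e)

# One raw score per timestep; with a linear activation these stay unnormalized
alpha = M @ w_alpha
print(alpha.shape)                  # (5, 1): one attention score per premise word
```

The key point of the discussion below is the last node: a softmax over that trailing size-1 dimension would be degenerate.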
Member

softmax on 1 dim?

Contributor (Author)

OK, it was a typo; I replaced it with a linear activation.
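For context on why the reviewer flagged this: a softmax taken over a dimension of size 1 always outputs exactly 1.0, so the per-timestep attention scores would all be constant. The sketch below (plain NumPy, not code from the PR) demonstrates this, and shows that a meaningful distribution requires normalizing across the time axis instead.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax along the given axis
    e = np.exp(x - np.max(x, axis=axis, keepdims=True))
    return e / np.sum(e, axis=axis, keepdims=True)

# Per-timestep scores with shape (timesteps, 1), as TimeDistributedDense(1)
# would produce: softmax over the size-1 last axis is degenerate.
scores = np.array([[0.3], [-1.2], [2.5]])
print(softmax(scores, axis=-1).ravel())   # [1. 1. 1.] -- every entry is 1.0

# Normalizing across the time axis instead yields a proper distribution:
print(softmax(scores, axis=0).ravel())    # sums to 1 over the three timesteps
```

A linear activation in the dense layer keeps the raw scores, leaving any normalization to a later step.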

pasky (Member) commented Sep 27, 2016

Thanks a lot for the update! I'm a lot happier, I think; a bit wary about the ugly-looking code in rnn_input, but I understand the necessity, and we'll be able to throw away that complexity again with the Keras 1.0 port. So it's okay. :)

But I'm still a bit leery about the ptscorer business. I think the basic sanity check is:

  • What happens when we use a different model with the rte task? (E.g. re the sigmoid -> linear activation change. But I don't understand that anyway?)
  • What happens when we use the model with a different task?
    Are these cases both okay?

The other problem I see is that we use completely different ptscoring strategies when the to_n is involved; I don't understand that. I think we discussed just making the output dimension a parameter of mlp_ptscorer? Would that be inferior to the current solution?
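To make the suggestion concrete, here is a hypothetical sketch of what a pair scorer with a configurable output dimension might look like. Everything below is an illustrative assumption: the function name, the sum/product merge of the pair embeddings, and the random stand-in weights are not taken from the repository's actual mlp_ptscorer.

```python
import numpy as np

def mlp_ptscorer_sketch(e0, e1, hidden_dim, output_dim=1, rng=None):
    """Hypothetical pair scorer with a parameterized output width
    (1 score for ranking tasks, n_classes for an RTE-style task).
    Weights are random here; in a real model they would be trained."""
    rng = rng or np.random.default_rng(0)
    # merge the two sentence embeddings by elementwise sum and product
    merged = np.concatenate([e0 + e1, e0 * e1])
    W1 = rng.normal(size=(merged.size, hidden_dim))
    W2 = rng.normal(size=(hidden_dim, output_dim))
    h = np.tanh(merged @ W1)        # one hidden MLP layer
    return h @ W2                   # shape (output_dim,)

e0 = np.ones(8)
e1 = np.arange(8, dtype=float)
print(mlp_ptscorer_sketch(e0, e1, hidden_dim=16, output_dim=3).shape)  # (3,)
```

The design point under discussion is only the `output_dim` parameter: with it, one scorer can serve both the ranking tasks and the multi-class entailment task, instead of maintaining two separate scoring strategies.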

vyskoto4 (Contributor, Author) commented Oct 3, 2016

  • The sigmoid -> linear change in the RTE task was made only because of the new model, as the accuracy reached was best with these settings. I also tried RNN, AVG (and maybe CNN), and their performance seemed to be the same as before.
  • I did not test it with a different task. That is on my todo list.
  • I'll test the modified mlp_scorer as soon as I get some GPU time.

EDIT:

  • I deleted the custom scorer, and the mlp_scorer is now being used. I did not notice any significant performance changes after this.
  • Now testing the new (OK, it's actually old by now) model on anssel/wang. I did not run a full eval, but the best validation MRR during training was 0.845.
