Network does not converge, bad captions #9
Hi, I have also faced this problem. Let's work together to solve it. My mail: [email protected]. Waiting for your answer. |
It's been a while since I worked on this repo. I'll try to retrain it and reproduce this error sometime next week and see if something needs to change. Meanwhile, @PavlosMelissinos and @MikhailovSergei, if you were able to debug this, feel free to update and send a pull request. |
OK, will try too. |
Hello, do you have the Flickr_30k.trainimages.txt and Flickr_30k.testimages.txt files? I can't find these files anywhere, and they can't be downloaded from the official site. I have the images; I just need these files. |
Hello, |
Hi, I am glad to receive your comment. I have changed the batch size: I set it to 1500 instead of 32 in caption_generator.py and train_model.py. After 43-45 epochs it works a little better. Please let me know about your results and whether you find any better approaches. |
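For anyone trying the same change, here is a minimal, self-contained sketch of how a batch_size setting typically feeds a Keras fit_generator loop like the one in train_model.py. The toy generator, feature shapes, and dataset size are illustrative assumptions, not the repo's actual code.

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense

batch_size = 1500   # raised from the default 32, as suggested above
num_samples = 6000  # assumed dataset size, for illustration only

def batch_generator(batch_size):
    # Stand-in for the repo's caption generator: yields (features, targets)
    # batches forever, as Keras generators are expected to do.
    while True:
        x = np.random.rand(batch_size, 4096)  # e.g. CNN image features
        y = np.random.rand(batch_size, 10)    # e.g. next-word distribution
        yield x, y

model = Sequential([Dense(10, activation="softmax", input_shape=(4096,))])
model.compile(loss="categorical_crossentropy", optimizer="rmsprop")

# A larger batch_size means fewer (but bigger) steps per epoch.
model.fit_generator(
    batch_generator(batch_size),
    steps_per_epoch=num_samples // batch_size,
    epochs=1,
)
```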
@MikhailovSergei @lopezlaura It actually depends on the dataset. Different datasets will ideally require us to tune the hyperparameters to get optimal captions; it's unusual to be able to reuse the same hyperparameters. Things that you can try:
If it helps you improve your model, do post your results here for others to refer to. |
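As a concrete starting point for the tuning suggested above, here is a sketch of the usual knobs for a Keras captioning model like this one; every name and value below is an illustrative assumption, not a setting taken from this repo.

```python
from keras.optimizers import Adam, RMSprop

# Typical knobs to sweep; the values here are guesses to start from.
HYPERPARAMS = {
    "batch_size": 512,      # larger batches smooth a noisy loss curve
    "epochs": 50,
    "learning_rate": 1e-3,  # lower it if the loss diverges after a few epochs
    "embedding_dim": 128,   # word-embedding size
    "lstm_units": 256,      # caption-decoder width
}

def build_optimizer(name="rmsprop", lr=1e-3):
    # Lets you swap optimizers without touching the rest of the training code.
    if name == "adam":
        return Adam(lr=lr)
    return RMSprop(lr=lr)

optimizer = build_optimizer("adam", HYPERPARAMS["learning_rate"])
```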
So what batch_size is better for Flickr 8k? |
I am facing the same issue while using Flickr8k: the captions are not making any sense, and particular words get repeated in every sentence. Somehow, it works better on a subset of 100 images than on the entire dataset. I have tried changing the batch size, but it didn't help. Could you give any suggestions? |
After I trained the model, it gave me the following result:
You can see the loss is high and the accuracy is low. Meanwhile, when I run test_model, all of the output sentences are the same. I want to know where to change the learning rate and which optimization algorithm would be better. BTW, can you share your weight file with me? My email address is [email protected] |
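On the learning-rate question: in a Keras script like train_model.py, both the optimizer and the learning rate live in the model.compile(...) call. A minimal sketch with a stand-in model (the real captioning model is built elsewhere in the repo, which may well compile with a plain string like optimizer='rmsprop'):

```python
from keras.models import Sequential
from keras.layers import Dense
from keras.optimizers import RMSprop

# Stand-in model so the snippet runs on its own.
model = Sequential([Dense(10, activation="softmax", input_shape=(100,))])

# Passing an optimizer object instead of the string "rmsprop" exposes the
# learning-rate knob; 1e-4 is below Keras's 1e-3 default, to damp divergence.
model.compile(
    loss="categorical_crossentropy",
    optimizer=RMSprop(lr=1e-4),
    metrics=["accuracy"],
)
```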
Changing the batch size can improve accuracy. Try it with 1024. |
I am a university student. Can you share the model.save file with me? I want to see the effect. |
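For whoever receives a shared file like the model.save mentioned above, loading it in Keras is straightforward; the filename here is the one the commenters use, and the path is assumed:

```python
from keras.models import load_model

# load_model restores architecture, weights, and optimizer state together.
model = load_model("model.save")
model.summary()  # quick sanity check of the restored network
```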
But if we take a batch size of 1024, it will overfit. |
@MikhailovSergei How much GPU memory does a batch size of 1024 need? |
Can you share the model.save file with me? My network doesn't converge either. Thanks. |
My network doesn't converge either, so maybe this is a bug. :( |
Do you have any other "image to caption" project? If you have one that runs on Windows 10, can you give it to me?
|
@b10112157 Sorry, I have no other image caption projects, and no Windows 10 image caption projects. But for this one, the TensorBoard screenshot is the following: |
Can you share your best weight file?
|
@b10112157 Thanks for your kind help. This is my best weight and model file (epochs=50, batch_size=32): https://drive.google.com/open?id=1DlfecYfiPlViFCh1h9Op_6puaTAKwN0N |
What is the accuracy, and at which epoch is it best?
|
@b10112157 As shown in the TensorBoard screenshot above, the best loss is 5.502 (5th step), and the accuracy at that best loss is 0.3267. |
My GPU is a GTX 1060 6GB. Training with batch size 1024 gives an error, but batch size 512 is OK.
… @army3401 A batch size of 1024 needs ~4.2GB of GPU memory. This is my testing on a single K80 GPU:
|
@b10112157 Thanks. I am trying batch size 1024, and now the loss curve is apparently better than with batch size 32. So maybe the small batch size of 32 causes the loss to oscillate. |
Can you share your batch 1024 weight file? With a batch size of 1024 I had an OOM error, so my batch 512 run around epoch 15x is my best, but the accuracy is only 0.6.
|
@b10112157 This is the whole model file for batch size 1024: https://drive.google.com/open?id=1rK5OkeCAb_kJLKR6EKlVqd_HzlZrjrYn But I sampled and tested some pictures just now, and the captions are bad. For example: |
it"s model okay? mymail :[email protected]
can u contact me?
|
@ShixiangWan Hey, dear. |
Wow, that's great! I have the same problem, and I added a BN layer to stabilize the loss, but the best model's loss is 4.7 and the accuracy is 0.37. Did you just adjust the batch size to 1024? |
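For reference, a sketch of where a BatchNormalization layer might sit in the image branch of a captioning model like this one; the input size assumes 4096-dim CNN features, and all layer widths are illustrative:

```python
from keras.models import Sequential
from keras.layers import Activation, BatchNormalization, Dense

image_branch = Sequential([
    Dense(256, input_shape=(4096,)),  # project CNN features down
    BatchNormalization(),             # normalize pre-activations to steady the loss
    Activation("relu"),
])
```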
I don't think setting batch_size to 32 will make the training converge. I made the following settings:
|
Hello,
I've followed your instructions and started training the network. The loss reaches its minimum value after about 5 epochs and then it starts to diverge again.
After 50 epochs, the generated captions of the best epoch (5th or 6th) look like this:
Any idea what's wrong?
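One standard way to keep the weights from that best epoch (rather than from the diverged final one) is a Keras ModelCheckpoint; a minimal sketch with an assumed filename:

```python
from keras.callbacks import ModelCheckpoint

# Overwrites the file only when the monitored quantity improves, so the
# saved weights always come from the best epoch seen so far.
checkpoint = ModelCheckpoint(
    "best_weights.h5",
    monitor="val_loss",   # or "loss" if no validation split is used
    save_best_only=True,
)
# Then pass it to training: model.fit_generator(..., callbacks=[checkpoint])
```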