
Is there any way to train "big data" using the Transformer? #275

Open
zqp2009happy opened this issue Apr 1, 2020 · 1 comment
Labels: question (Further information is requested)

Comments

@zqp2009happy

It seems that the Transformer reads the training data into memory, so it easily runs into an OOM error with "big" training data, e.g. 10 GB (about 50 million text pairs). Is there a solution for this problem?

@gpengzhi added the question label on Apr 28, 2020
@ZhitingHu
Member

By "Transformer" do you mean the example code under examples/ or the transformer modules in the library?

The transformer modules are independent of how you manage the training data (in memory or on disk), as long as you pass them a data minibatch at each iteration.

The transformer example code does load the whole training set into memory beforehand (code here). To avoid this, you may want to use the Texar data module, which loads data sequentially. Here is an example use of the Texar data module.
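As a library-agnostic sketch of the "load data sequentially" idea (this is not Texar's own data API, which the linked example shows), something like the following streams source/target pairs from disk and yields fixed-size minibatches, so only one batch of the 10 GB corpus is held in memory at a time. The file paths, whitespace tokenization, and vocab lookup are hypothetical placeholders:

```python
from typing import Dict, Iterator, List, Tuple


def encode(line: str, vocab: Dict[str, int]) -> List[int]:
    """Map a whitespace-tokenized line to token ids, with an <unk> fallback."""
    return [vocab.get(tok, vocab["<unk>"]) for tok in line.split()]


def stream_minibatches(
    src_path: str,
    tgt_path: str,
    vocab: Dict[str, int],
    batch_size: int = 64,
) -> Iterator[Tuple[List[List[int]], List[List[int]]]]:
    """Yield (source, target) minibatches while reading both files line by line,
    so only one batch of examples is in memory at any time."""
    src_batch: List[List[int]] = []
    tgt_batch: List[List[int]] = []
    with open(src_path, encoding="utf-8") as fs, open(tgt_path, encoding="utf-8") as ft:
        for src_line, tgt_line in zip(fs, ft):
            src_batch.append(encode(src_line, vocab))
            tgt_batch.append(encode(tgt_line, vocab))
            if len(src_batch) == batch_size:
                yield src_batch, tgt_batch
                src_batch, tgt_batch = [], []
    if src_batch:  # final partial batch
        yield src_batch, tgt_batch
```

Each yielded minibatch can then be padded, converted to tensors, and passed to the transformer modules for one training step, as described above; the Texar data module handles these steps for you.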
