[Feature]: Adding an encoder-decoder model where encoder and decoder are both transformers #3596

Open
chelseagzr opened this issue Jan 14, 2025 · 0 comments
Labels
feature A new feature


Problem statement

Encoder-decoder models in which both the encoder and the decoder are transformers have proven highly effective for seq2seq tasks such as translation. It would be very helpful if Flair supported this type of model.

Solution

It would be nice if a user could easily create an encoder-decoder model by specifying the name of the encoder (e.g., xlm-roberta-base) and the name of the decoder (e.g., GPT2). Developers could use the EncoderDecoderModel class from the transformers library to facilitate this.
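
To make the request concrete, here is a rough sketch of what the underlying transformers call could look like. It is not a proposed Flair API: the helper name `build_encoder_decoder` is hypothetical, and the default checkpoint ids (`xlm-roberta-base`, `gpt2`) are only the examples mentioned above. Reusing GPT-2's end-of-text token for padding is an assumption made for illustration, since GPT-2 ships without a pad token.

```python
def build_encoder_decoder(encoder_name="xlm-roberta-base", decoder_name="gpt2"):
    """Hypothetical helper: pair a pretrained encoder with a pretrained decoder.

    transformers is imported lazily so that defining the helper does not
    require the library (or any model download) up front.
    """
    from transformers import AutoTokenizer, EncoderDecoderModel

    # Loads both checkpoints; the decoder is configured as a decoder with
    # cross-attention layers, which start out randomly initialized.
    model = EncoderDecoderModel.from_encoder_decoder_pretrained(
        encoder_name, decoder_name
    )

    # Encoder and decoder use different vocabularies, so each gets its own tokenizer.
    encoder_tokenizer = AutoTokenizer.from_pretrained(encoder_name)
    decoder_tokenizer = AutoTokenizer.from_pretrained(decoder_name)

    # Assumption: GPT-2 has no pad token, so reuse its end-of-text token.
    if decoder_tokenizer.pad_token is None:
        decoder_tokenizer.pad_token = decoder_tokenizer.eos_token

    # Generation and loss computation need these ids set on the combined config.
    model.config.decoder_start_token_id = decoder_tokenizer.bos_token_id
    model.config.pad_token_id = decoder_tokenizer.pad_token_id

    return model, encoder_tokenizer, decoder_tokenizer
```

Calling `build_encoder_decoder()` downloads both checkpoints. Because the cross-attention weights are newly initialized, the combined model would still need fine-tuning on a seq2seq dataset before it produces useful output, which is the training loop Flair would have to provide.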

Additional Context

No response
