Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SMILES chemistry recognition #9

Open
zbetmen1 opened this issue Jan 11, 2021 · 0 comments
Open

SMILES chemistry recognition #9

zbetmen1 opened this issue Jan 11, 2021 · 0 comments

Comments

@zbetmen1
Copy link

Hi Jianshu, great work as always 😃 . I have two questions:

  • How can I obtain SMILES data set you have trained the network on?
  • How do you deal with SMILES ambiguity in targets?
    • Basically for the same chemical compound there are often many ways to encode the chemical in SMILES. What I'm wondering is how are images aligned with targets? For example, if network starts decoding from left to right it might produce one sequence of tokens, but if it starts decoding right to left it might produce other sequence of tokens. The thing is, both sequence of tokens, which are presumably different, may be completely correct. This is not the case when decoding Latex as the starting point is clear and images and targets are naturally aligned.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant