Training is not converged on syn-balanced and enzymes datasets #36

Open
yyygou opened this issue May 31, 2024 · 3 comments

yyygou commented May 31, 2024

Hi, I am running into a problem while training a model on the syn-balanced and enzymes datasets.

In both cases, training appears to run indefinitely without improving, and the output looks like this:

Validation. Epoch 87. Acc: 0.5000. P: 0.5000. R: 1.0000. AUROC: 0.5000. AP: 0.5000.
TN: 0. FP: 2048. FN: 0. TP: 2048
Saving ckpt/model.pt
600it [00:00, 958.17it/s]. Training acc: 0.5000 000
600it [00:00, 980.59it/s] Batch 88394. Loss: 3.2000. Training acc: 0.5000
Batch 88999. Loss: 3.2000. Training acc: 0.5000

Everything remains unchanged except for the batch number.
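
For reference, the confusion matrix in the log (TN: 0, FN: 0) means every validation example was predicted positive, which is why accuracy and precision both sit at 0.5 and recall at 1.0 on a balanced set. The sketch below recomputes those metrics from the four counts. The function name `metrics_from_confusion` is made up for illustration and is not part of the repository's code. The AUROC and AP of 0.5 would also be consistent with the model emitting the same score for every example, though the log alone does not prove that.

```python
def metrics_from_confusion(tn, fp, fn, tp):
    """Recompute basic classification metrics from confusion-matrix counts.

    Hypothetical helper for illustration; not taken from the training code.
    """
    total = tn + fp + fn + tp
    accuracy = (tp + tn) / total
    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Fraction of examples the model labelled positive.
    predicted_positive = (tp + fp) / total
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "predicted_positive": predicted_positive,
    }


# Counts copied from the validation line in the log above.
m = metrics_from_confusion(tn=0, fp=2048, fn=0, tp=2048)
print(m)
```

A `predicted_positive` value of 1.0 indicates the model has collapsed to a single output, so the unchanged numbers are a symptom of that collapse rather than of the evaluation code.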

Is there something wrong with the settings or configuration?

yyygou changed the title from "training is not converged on syn-balanced and enzymes datasets" to "Training is not converged on syn-balanced and enzymes datasets" on May 31, 2024

fyulingi commented Jun 7, 2024

Hello, I'm having some environment configuration issues and cannot run the training code. Could you please share your Python version and your package versions? Thanks a lot!


yyygou commented Jun 8, 2024

> Hello, I'm having some environment configuration issues and cannot run the training code. Could you please share your Python version and your package versions? Thanks a lot!

Hi, I am using Python 3.6 on Windows, and I installed the packages in the following order:

  1. numpy-1.17.1
  2. matplotlib
  3. networkx
    3.1 deepsnap-0.1.2
  4. torch
  5. scipy-1.0.1
  6. pandas
  7. seaborn
  8. scikit_learn
  9. torch-cluster-1.5.3
  10. torch_scatter-2.0.3
  11. torch-sparse-0.6.0
  12. torch-spline-conv-1.2.0
  13. pytest-runner-5.3.2
  14. numba-0.53.1
  15. torch-geometric-1.4.3
  16. test_tube-0.7.5
  17. tqdm-4.43.0

If a version number is missing above, please use the version suggested in requirements.txt.
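
To compare environments more easily, a small script along these lines could print the interpreter and package versions in one go. This is only a sketch: the helper names `installed_version` and `environment_report` are made up for illustration, and the distribution names in `PACKAGES` are my assumption of how the packages listed above are registered with pip (for example `scikit-learn` and `test-tube`), so adjust them if a lookup reports "not installed" for something you do have.

```python
import platform

try:  # Python 3.8+ ships importlib.metadata in the standard library
    from importlib.metadata import PackageNotFoundError, version as _version

    def installed_version(name):
        """Return the installed version string, or None if not found."""
        try:
            return _version(name)
        except PackageNotFoundError:
            return None

except ImportError:  # older interpreters such as Python 3.6
    import pkg_resources  # provided by setuptools

    def installed_version(name):
        """Return the installed version string, or None if not found."""
        try:
            return pkg_resources.get_distribution(name).version
        except pkg_resources.DistributionNotFound:
            return None


# Assumed pip distribution names for the packages listed in this comment.
PACKAGES = [
    "numpy", "matplotlib", "networkx", "deepsnap", "torch", "scipy",
    "pandas", "seaborn", "scikit-learn", "torch-cluster", "torch-scatter",
    "torch-sparse", "torch-spline-conv", "pytest-runner", "numba",
    "torch-geometric", "test-tube", "tqdm",
]


def environment_report(packages=PACKAGES):
    """Build one line for the interpreter and one line per package."""
    lines = ["Python {} on {}".format(platform.python_version(), platform.system())]
    for name in packages:
        lines.append("{}: {}".format(name, installed_version(name) or "not installed"))
    return lines


report = environment_report()
print("\n".join(report))
```

Pasting the printed report into a comment gives others an exact list to diff against their own setup.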


YiKangOY commented Jun 26, 2024

I also encountered the same problem, but sometimes the accuracy gets stuck at some other number, like 0.33 or 0.667. @yyygou, may I ask whether you have resolved this issue?
