Thanks to Clip-SalGan
```bash
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
pip install Pillow
pip install matplotlib
pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git
```
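After installation, a quick sanity check can confirm that PyTorch, CUDA, and CLIP work together. This is a minimal sketch; the `ViT-B/32` backbone is only an assumption for the check, not necessarily the variant this repo uses:

```python
# Environment sanity check: PyTorch + CUDA + CLIP.
# "ViT-B/32" is an assumption for this check; the repo may load a different backbone.
import torch
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
print("PyTorch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())

model, preprocess = clip.load("ViT-B/32", device=device)
print("CLIP loaded; input resolution:", model.visual.input_resolution)
```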
- Download the dataset and put it in the cloned repo so that it has the following layout:
```
your_cloned_repo/
|
|___ saliency/
|    |
|    |___ image/
|    |    |
|    |    |___ image1.png ...
|    |
|    |___ map/
|    |    |
|    |    |___ map1.png ...
|    |
|    |___ fixation/
|    |    |
|    |    |___ fixation1.png ...
|    |
|    |___ readme.txt
|    |
|    |___ text最终版本.xlsx
|
|___ other_dirs/
```
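Before running anything, it may help to confirm the dataset landed in the expected folders. The helper below is hypothetical, not part of the repo; the directory names come from the layout above:

```python
# Hypothetical helper (not part of the repo): checks that the dataset was unpacked
# into the saliency/ layout shown above.
from pathlib import Path

def check_dataset(root="saliency"):
    root = Path(root)
    for sub in ("image", "map", "fixation"):
        folder = root / sub
        n = len(list(folder.glob("*.png"))) if folder.is_dir() else 0
        print(f"{folder}: {'OK' if n else 'MISSING'} ({n} .png files)")

check_dataset()
```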
- Write the image path and its text description in `apply_model.py`, for example:

```python
image_paths = ['saliency/image/000000001306_0.png']
text_options = {
    "sal": "A table with chairs, a counter with stools and a big window",
    "nonsal": "A television was mounted on the kitchen wall.",
    "general": "A room with a TV, a table, a bar and floor-to-ceiling windows",
    "pure": ""
}
```
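For orientation, the snippet below shows how CLIP turns one of these prompts into a text embedding. This is only an illustration of the CLIP API, not necessarily how `apply_model.py` consumes `text_options`, and the `ViT-B/32` backbone is again an assumption:

```python
# Illustration of CLIP text encoding only; apply_model.py may wire this differently.
import torch
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)  # backbone choice is an assumption

prompt = "A table with chairs, a counter with stools and a big window"  # text_options["sal"]
tokens = clip.tokenize([prompt]).to(device)
with torch.no_grad():
    text_features = model.encode_text(tokens)  # shape (1, 512) for ViT-B/32
print(text_features.shape)
```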
- Execute `python apply_model.py --help` to see the available options.
- Execute `python apply_model.py --model [model] --text [text_type]` to predict the saliency map (a visualization sketch follows below).
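Once a prediction has been written to disk, PIL and matplotlib (installed above) can display it next to the input image. This is a sketch; the output filename is an assumption, so adjust the path to whatever `apply_model.py` actually saves:

```python
# Sketch: show the input image next to a predicted saliency map.
# "predicted_saliency.png" is a hypothetical output name; adjust to the real one.
import matplotlib.pyplot as plt
from PIL import Image

img = Image.open("saliency/image/000000001306_0.png")
sal = Image.open("predicted_saliency.png")

fig, axes = plt.subplots(1, 2, figsize=(8, 4))
axes[0].imshow(img); axes[0].set_title("input"); axes[0].axis("off")
axes[1].imshow(sal, cmap="gray"); axes[1].set_title("predicted saliency"); axes[1].axis("off")
plt.tight_layout()
plt.show()
```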
- In `train.py`, adjust the following code to fine-tune the hyperparameters:
```python
batch_size = 16
optimizer_G = torch.optim.Adam(generator.parameters(), lr=0.0002, betas=(0.5, 0.999))
optimizer_D = torch.optim.Adam(discriminator.parameters(), lr=0.0002, betas=(0.5, 0.999))
num_epochs = 50
```
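For orientation, a generic SalGAN-style adversarial step using these two optimizers might look like the sketch below. The actual loop (losses, CLIP conditioning, checkpointing) lives in `train.py`, and the `discriminator(images, maps)` signature is an assumption:

```python
# Generic adversarial training step, for orientation only; see train.py for the real loop.
import torch
import torch.nn as nn

bce = nn.BCELoss()  # assumes both networks end in a sigmoid

def train_step(images, gt_maps, generator, discriminator, optimizer_G, optimizer_D):
    real_lbl = torch.ones(images.size(0), 1, device=images.device)
    fake_lbl = torch.zeros(images.size(0), 1, device=images.device)

    # Discriminator: real saliency maps vs. generated ones
    # (discriminator(images, maps) is an assumed signature; adjust to the actual model).
    optimizer_D.zero_grad()
    fake_maps = generator(images)
    d_loss = bce(discriminator(images, gt_maps), real_lbl) \
           + bce(discriminator(images, fake_maps.detach()), fake_lbl)
    d_loss.backward()
    optimizer_D.step()

    # Generator: fool the discriminator and stay close to the ground-truth map
    optimizer_G.zero_grad()
    g_loss = bce(discriminator(images, fake_maps), real_lbl) + bce(fake_maps, gt_maps)
    g_loss.backward()
    optimizer_G.step()
    return d_loss.item(), g_loss.item()
```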
- Then run `train.py`; the model will be trained and its weights saved.
- If you want finer-grained control over how the training, validation, and test sets are split, refer to the comments in `Data_Utils.py` and `get_data_part.py` (a minimal splitting sketch is shown after this list).
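As a rough illustration of what such a split involves, here is a hypothetical 80/10/10 split over the image folder; the repo's actual splitting logic is the one in `Data_Utils.py` and `get_data_part.py`:

```python
# Hypothetical 80/10/10 split sketch; the real logic is in Data_Utils.py / get_data_part.py.
import random
from pathlib import Path

paths = sorted(Path("saliency/image").glob("*.png"))
random.seed(0)  # fixed seed so the split is reproducible
random.shuffle(paths)

n = len(paths)
train = paths[: int(0.8 * n)]
val   = paths[int(0.8 * n): int(0.9 * n)]
test  = paths[int(0.9 * n):]
print(len(train), len(val), len(test))
```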