Skip to content

Przekopiowac parametry (pomijajac 4 kanal) z papera i sprawdzic wyniki #158

@rojberr

Description

@rojberr

4.5 Classification from aerial images
VHR aerial imagery is more commonly available than ALS data, and it is important to evaluate how the two modalities compare. Therefore, we also train and evaluate a basic image classifier on aerial imagery. We adopt the experimental context reported by creators of TreeSatAI for their own baseline [5]: a ResNet18 [34] encoder with a batch size of 32, optimized with Adam with a cyclic learning rate (0.00005-0.001 range, half-period of 13630 steps). The model is pretrained on ImageNet and we adapt data processing accordingly: images are resized to 224 × 224 pixels, the model is adapted to receive the additional infrared channel by duplicating the weights for the red channel, and color channels are normalized using ImageNet statistics. We use common data augmentations: random rotations, horizontal and vertical flips, color jitter (p=0.5), random cropping (p=0.5), channel dropout (p=0.05).
Training is parallelized as before with 3 GPUS, and we scale the learning rate range linearly to account for the effective batch size. Since the model is pretrained, it converges in a few training epochs and then quickly plateaus. This narrows the window of good models, and therefore we validate the model more frequently, i.e., every tenth of the training set. Training takes 4 minutes per epoch on average.

Punkt 4.5 z publikacji: https://arxiv.org/html/2404.12064v2

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions