Dead wood image tile classification on KV260

some intro

Each image tile is assigned to one of four classes: debris, forest, water, or other.

Experimental Environment Setup

also some intro

Host Machine

The Docker image below is only for CUDA-enabled host machines.

docker pull tumbgd/vai-pt-cuda

Target Machine

Xilinx Kria KV260

Usage

On host machine

Download the dataset here, then unzip it.

Generate the h5 dataset (0-h5.py)
Train the model (1-train.py)
Prune the trained model (2-prune.py)
Quantize the pruned model (3-quantize.py)

Compile the quantized, pruned model for on-board deployment:

vai_c_xir -x ./quantize_result/Model_int.xmodel -a /opt/vitis_ai/compiler/arch/DPUCZDX8G/KV260/arch.json -o dwc_ob -n dwc_ob
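The host-side steps above can be chained from a single driver script. The following is only a sketch: it assumes the numbered scripts in the repository (0-h5.py to 3-quantize.py) run without command-line arguments, which the README does not state, and the helper names `build_compile_command`, `build_host_commands` and `run_host_pipeline` are hypothetical. The `vai_c_xir` flags and paths are copied unchanged from the command above. By default it performs a dry run and only prints the commands.

```python
import shlex
import subprocess
import sys

# Script names come from the repository; calling them without arguments
# is an assumption of this sketch.
PIPELINE_SCRIPTS = ["0-h5.py", "1-train.py", "2-prune.py", "3-quantize.py"]


def build_compile_command(
    xmodel="./quantize_result/Model_int.xmodel",
    arch="/opt/vitis_ai/compiler/arch/DPUCZDX8G/KV260/arch.json",
    out_dir="dwc_ob",
    net_name="dwc_ob",
):
    """Return the vai_c_xir invocation from the README as an argv list."""
    return [
        "vai_c_xir",
        "-x", xmodel,    # quantized model produced by the quantization step
        "-a", arch,      # DPU architecture description for the KV260
        "-o", out_dir,   # output directory
        "-n", net_name,  # name of the compiled network
    ]


def build_host_commands(python=sys.executable):
    """List every host-side command in the order given in the README."""
    commands = [[python, script] for script in PIPELINE_SCRIPTS]
    commands.append(build_compile_command())
    return commands


def run_host_pipeline(dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    commands = build_host_commands()
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    run_host_pipeline(dry_run=True)
```

Calling `run_host_pipeline(dry_run=False)` inside the Docker container would execute the steps in sequence and stop at the first failure.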

On target machine

Please follow the README in the onboard directory.

Performances

Model size/accuracy trade-off

Model	Accuracy	# parameters	Size
non-opt	86.0%	205.28k	806kB
opt	85.2%	76.65k (62.6% smaller)	334kB (58.6% smaller)
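The relative reductions in the table can be re-derived from the listed values. This is a small sketch; the function name `relative_reduction` is hypothetical, and because it works from the rounded table entries, the parameter reduction may differ in the last digit from the 62.6% reported above.

```python
def relative_reduction(before, after):
    """Return how much smaller `after` is than `before`, in percent."""
    return (before - after) / before * 100.0


# Values copied from the table (parameters in thousands, size in kB).
param_reduction = relative_reduction(205.28, 76.65)
size_reduction = relative_reduction(806, 334)
accuracy_drop = 86.0 - 85.2  # percentage points

print(f"parameter reduction: {param_reduction:.2f}%")
print(f"size reduction:      {size_reduction:.2f}%")
print(f"accuracy drop:       {accuracy_drop:.1f} percentage points")
```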

Inference speed (FPS)

Device & Model	Inference Speed (FPS)
opt on KV260	211.07
non-opt on KV260	87.71
non-opt on laptop (CPU)	255.59
non-opt on laptop (GPU)	308.75
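To compare the entries, it can help to express them as speed-up ratios and per-tile latencies. The sketch below uses only the FPS values from the table; the helper names `speedup` and `latency_ms` are hypothetical, and treating latency as the reciprocal of FPS assumes tiles are processed one at a time without batching or pipelining, which the README does not specify.

```python
# FPS values copied from the table above.
FPS = {
    "opt on KV260": 211.07,
    "non-opt on KV260": 87.71,
    "non-opt on laptop (CPU)": 255.59,
    "non-opt on laptop (GPU)": 308.75,
}


def speedup(candidate, baseline):
    """Throughput of `candidate` relative to `baseline`."""
    return FPS[candidate] / FPS[baseline]


def latency_ms(name):
    """Approximate time per tile in milliseconds (assumes no batching)."""
    return 1000.0 / FPS[name]


print(f"opt vs. non-opt on KV260: {speedup('opt on KV260', 'non-opt on KV260'):.2f}x")
print(f"opt on KV260 vs. laptop CPU: {speedup('opt on KV260', 'non-opt on laptop (CPU)'):.2f}x")
for name in FPS:
    print(f"{name}: {latency_ms(name):.2f} ms per tile")
```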

Remaining Questions

Set up VART on Ubuntu 22.04 (currently running on pre-built images with a poor GUI).
- A lower VART version seems to work, but it supports far fewer functions.
bias_corr is None. This does not seem to cause an error (accuracy hardly drops), but why is it None rather than 0?
ReLU should be supported according to the Xilinx documentation, but in practice it is not.
- Maybe torch.nn.ReLU is not supported but torch.nn.functional.relu is. This still needs to be tried.
- Or it is simply not supported. In that case, try all possible activation functions to avoid multiple subgraphs. Our aim is to run the model entirely on a single DPU subgraph to avoid data copies between the DPU and the CPU.
