TimNiklasWitte 20cab55ebb expand readme
2022-03-31 10:16:28 +02:00
2022-03-31 10:16:28 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 21:20:13 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 09:10:17 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 09:17:51 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 09:17:51 +02:00
2022-03-30 09:17:51 +02:00
2022-03-30 09:17:51 +02:00
2022-03-30 21:20:13 +02:00
2022-03-31 10:16:28 +02:00
2022-03-30 09:17:51 +02:00
2022-03-30 17:01:33 +02:00
2022-03-30 09:17:51 +02:00

Colorization of Grey Images by applying a Convolutional Autoencoder on the Jetson Nano

by Dennis Konkol and Tim Niklas Witte

This repository contains an pretrainied convolutional autoencoder for colorization of grey images. The live camera stream will be colorizatized in real time. The architecture of the ANN is optimized to run on the Jetson Nano. It has 300.000 parameters. In total, 10 FPS can be archived on this embedded GPU.

Example Video

In entire documentation of this project in form of a paper is ./Paper_GPU_colorization.pdf. All corresponding files including .tex file and figures are stored in ./Paper.

This project was created as a part of the GPU programming course of Mario Porrmann in the winter term 2021/22 of the Osnabrück University.

Requirements

  • TensorFlow 2
  • OpenCV 3.3.1
  • CSI camera plugged it (see code of live_recolor[_plot].py)

Model

 Model: "autoencoder"
_______________________________________________________________
 Layer (type)                Output Shape            Param #   
===============================================================
 encoder (Encoder)           multiple                148155    
                                                               
 decoder (Decoder)           multiple                150145    
                                                                 
===============================================================
Total params: 298,302
Trainable params: 297,210
Non-trainable params: 1,092
_______________________________________________________________

Hyperparameters

optimizer = Adam
learning rate = 0.0001
loss function = mean squared error
batch size = 32

Usage

Training

Run Training.py to start the training of the model on the imagenet2012 dataset. Each epoch the weights are stored into ./saved_models. Besides, in ./test_logs are the corresponding trainings statistics (train and test loss and also a batch of colorized test images) logged. Note that, the imagenet2012 dataset is stored in ./imagenet as described in this blog article.

python3 Training.py

Live colorization

The launch of live_recolor_plot.py opens a window as shown in the GIF at the start of this README. Note that, the CSI camera must be plugged in.

python3 live_recolor.py

It has the following structure:

(1) | (2) | (3) | (4)

(1) = live RGB camera image
(2) = live grey camera image
(3) = live colorized image

To get also displayed a loss plot (mean squared error between (1) and (3)), run live_recolor_plot.py instead. The loss plot is presented right from (3).

python3 live_recolor_plot.py

Pretrainied Model

The model was runned for 13 epochs on the imagnet2012 dataset and its weights are stored in ./saved_models. Note that, grey images must have a shape of (256,256,1). The following code will load the model and colorized an image:

autoencoder = Autoencoder()
autoencoder.build((1, 256, 256, 1)) # need a batch size
autoencoder.load_weights("./saved_models/trainied_weights_epoch_12")
autoencoder.summary()

grey_img = ... # grey_img.shape = (256,256,1)
grey_img = np.expand_dims(grey_img, axis=0) # add batch dim
colorized_img = autoencoder(grey_img)
Description
No description provided
Readme 390 MiB
Languages
Python 61.4%
TeX 38.6%