Skip to content

Latest commit

 

History

History
17 lines (11 loc) · 1.38 KB

README.md

File metadata and controls

17 lines (11 loc) · 1.38 KB

Strt2Ftprnt

Predicting building footprints given a street network using deep computer vision.

A gif animation featuring 3 images (input - ground truth - output) displaying learning progress of the DNN

The model predicts the existence (or non-existence) of building footprint for each pixel through a sigmoid output (far right), this animation depicts the progress the model shows during training from random predictions to pretty accurate ones.

The following are samples from the model's prediction of building footprints (far right) given various starting street network images (far left) and a ground truth image (middle):

An image showing a sample from the model's prediction of building footprints given various starting street network images

The models could quickly learn to predict the position of the various building masses relative to the streets and relative to each other, but as could be seen through these images, it fails to predict other details such as courtyard, shaft positions, it only manages to learn those if it overfits the dataset.

A hyperparameter search (bayesian search) was done using Weights&Biases Sweeps prior to training: The best performing hyperparameter combinations are visualized here ..