Enhancing building segmentation by deep multiview classification for advancing sustainable urban development

Sally El Hajjar, Hassan Kassem, Fahed Abdallah, Hichem Omrani

Research output: Contribution to journal › Article › peer-review

Abstract

Accurate building segmentation plays a crucial role in a wide range of applications such as urban planning, monitoring, and mapping. Various deep learning models have been employed for building segmentation; however, these models analyze images from a single view. To address the limitations of single-view building segmentation models, we propose a novel multi-view U-Net deep model that incorporates multiple views of the images. We employ two pre-trained convolutional neural network architectures, MobileNetV2 and ResNet50, to extract features representing two different views of our images. By fusing these features, the proposed method captures complementary information, leading to enhanced segmentation accuracy. To further improve the model's performance, we incorporate skip connections and up-convolutional layers to ensure fine-grained feature propagation. Experimental results on a large building dataset demonstrate a significant improvement in segmentation accuracy (91%) over state-of-the-art methods, highlighting the effectiveness of our multiview fusion approach. The results also underline the benefits of creating different views through the concept proposed in this paper. This research has the potential to redefine the landscape of building segmentation in applications such as urban planning and mapping. We also tested the method on a large study area at city scale (Belval–Luxembourg). This test demonstrates the method's capability and efficiency in segmenting satellite images covering a large area and reinforces its potential for real-world applications.

Original language: English
Article number: 108421
Journal: Journal of Building Engineering
Volume: 83
Publication status: Published - 15 Apr 2024

Keywords

  • Deep multiview building segmentation
  • Encoder–Decoder
  • Pix2pix
  • Skip connection
  • U-Net model
