Comparison of State-of-the-Art and Traditional Networks for Image Segmentation on the ISPRS Potsdam Dataset

Comparison of State-of-the-Art and Traditional Networks for Image Segmentation on the ISPRS Potsdam Dataset

🎲 Final_Project 🎲

🌌 Introduction

This project focuses on deep learning methods and provides scripts and notebooks that help build, train, and analyze models efficiently. The structure includes various components such as configuration settings, model creation tools, and utilities for model training and visualization.

πŸ“‚ Project Structure

πŸ“‚00-Proposal: Contains the project proposal and related documents.

πŸ“‚01-Presentation: Includes materials for the final presentation, such as:

πŸ“‚DL_Presentation_Template.pptx: PowerPoint presentation template used for showcasing the project.

πŸ“‚02-Report: Folder for the final report detailing the project and results.

πŸ“‚03-Code: This folder includes all the essential code for the project.

  • πŸ“‹ configs.py: Contains configuration settings for the project (e.g., hyperparameters, file paths, etc.).
  • πŸ“‹ fmain.py: Main script to execute specific functionalities of the project.
  • πŸ“‹ main.py: The primary file to launch the project’s main tasks.
  • πŸ“‹ tools: Additional tools and utilities used in the project.
  • πŸ“‹ README.md: This document, providing an overview of the project.
  • πŸ“‹ requirements.txt: A list of required packages and dependencies for the project.
  • πŸ“‚ bases
    • πŸ“‹ init .py
    • πŸ“‹ postdam_dataset.py
    • πŸ“‹ utils.py
    • πŸ“‹ wandbhelper.py
  • πŸ“‚ nets
    • πŸ“‹ init .py
    • πŸ“‹ feature_extractor.py
    • πŸ“‹ pertrain_net.py
    • πŸ“‹ tansunet.py
    • πŸ“‹ unet_attention_model.py
    • πŸ“‹ unet_Kan.py
    • πŸ“‹ unet_Mamba.py
    • πŸ“‹ unet_model.py
    • πŸ“‹ unet_parts.py
  • πŸ“‚ tests
    • πŸ“‹ EDA.ipynb
    • πŸ“‹ test_datagernerator.ipynb
    • πŸ“‹ test_dataset.py
    • πŸ“‹ test_model.ipynb
    • πŸ“‹ tooltest.ipynb
    • πŸ“‹ unet.ipynb
  • πŸ“‚ tools
    • πŸ“‹ callback.py
    • πŸ“‹ datagernerator.py
    • πŸ“‹ datasplitor.py
    • πŸ“‹ training_loop.py

Authors

  • Menghua, Xie
  • Leung, Yiu Chung

Dataset

ISPRS Postdam Dataset

this dataset include 37 pictures per type(IRRG, Digital surface model, label). each image 600060003 My Figure

Model(s)

we used unet transunet, umamba and ukan as the figure showing. My Figure

Results

Regarding the effects of the sources of data No improvement for adding DSM for training Unet Noticeable improvement for large network, e.g. UKan, TransUnet

Regarding loss functions The experiments in UKan showed that there is no significant difference between HybridLoss, CrossEntropy, FocalLoss The failure in MultiClassDiceLoss might be due to the class imbalance in the dataset

Regarding the effects of learning rate For UKan & Unet, there is a significant improvement in training miou but not in validation miou A sign for overfitting, 1e-3 should be a better learning rate