Dense Attention-Enhanced U-Net for Complex Image Segmentation Tasks

Authors

  • Shahariaj Mohammad Sajid Truman State University
  • Nazmul Shahadat

DOI:

https://doi.org/10.32473/flairs.39.1.141802

Abstract

Accurate image segmentation in real-world and medical domains remains challenging due to complex object structures, scale variation, and the need for precise boundary localization. Conventional U-Net architectures often struggle with limited multi-scale feature fusion and poor preservation of fine-grained details. We propose a unified enhanced U-Net framework that integrates multi-kernel encoder blocks, an Atrous Spatial Pyramid Pooling bottleneck, densely connected decoder stages, and attention mechanisms to improve contextual modeling and boundary reconstruction. Deep supervision is employed to stabilize training and strengthen feature propagation. Evaluations across both medical (brain tumor) and real-world infrastructure (pothole detection) demonstrate consistent improvements over standard U-Net variants, achieving higher Dice and IoU scores. The results highlight the effectiveness and generality of dense multi-scale attention-based architectures for complex segmentation tasks in healthcare and intelligent transportation systems.

Downloads

Published

06-05-2026

How to Cite

Sajid, S. M., & Shahadat, N. (2026). Dense Attention-Enhanced U-Net for Complex Image Segmentation Tasks. The International FLAIRS Conference Proceedings, 39(1). https://doi.org/10.32473/flairs.39.1.141802