The accurate semantic segmentation of individual tree crowns within remotely sensed data is crucial for scientific endeavours such as forest management, biodiversity studies, and carbon sequestration quantification. However, precise segmentation remains challenging due to complexities in the forest canopy, including shadows, intricate backgrounds, scale variations, and subtle spectral differences among tree species. While deep learning models improve accuracy by learning hierarchical features, they often fail to effectively capture fine-grained details and long-range dependencies within complex forest canopies.
This seminar introduces PerceptiveNet, a novel model that incorporates a Logarithmic Gabor-implemented convolutional layer alongside a backbone designed to extract salient features while capturing extensive context and spatial information through a wider receptive field. The presentation will explore the impact of Log-Gabor, Gabor, and standard convolutional layers on semantic segmentation performance, providing a comprehensive analysis of experimental findings. An ablation study will assess the contributions of individual layers and their interactions to overall model effectiveness. Furthermore, PerceptiveNet will be evaluated as a backbone within a hybrid CNN-Transformer model, demonstrating how improved feature representation and long-range dependency modelling enhance segmentation accuracy.