The receptive field refers to the area of the input data that a particular neuron in a convolutional neural network (CNN) is sensitive to. The size of the receptive field is determined by the size of the filter that the neuron uses to convolve the input data.
Theoretical receptive field: Provides a basic estimate of the input area impacting the neuron's activation, assuming that there are no interactions between neurons, treating filters independently.
Effective receptive field: Takes into account the actual influence of surrounding pixels on the neuron's response.
For example, if a neuron has a 3x3 filter, then its receptive field will be a 3x3 patch of the input data.
Adding more layers => increase receptive field size.
Sub-sampling and Dilated Convolutions.
Integrates Spatial Pyramid Matching (SPM) [39] into the CNN to output one fixed-size dimensional feature vector.
The ASPP is the SPP module with dilated convolution operation.
Improve dilated convolution to obtain more comprehensive spatial coverage of feature maps.