Abstract:Crop canopy temperature can often be acquired using the thermal imager. Non-contact and non-destructive automated detection can be expected to achieve for crop water stress status. Automatic image alignment can be used to treat the fuzzy edge distribution, strong noise, as well as shape and texture information lacking in thermal infrared images, according to the information complementarity between visible light and thermal infrared images. The automated extraction can be realized on the crop canopy temperature. This study aims to solve the problems of differences in the radiation, shape, and texture between visible light images and thermal infrared images, leading to the low align images of different modalities. Multimodal image registration was also proposed to integrate the improved brain storm optimization (BSO) and Powell algorithm. Firstly, the original visible light image was downsampled and cropped, according to the normalized cross-correlation value. The area with the most similarity region was obtained in the thermal infrared image under the same resolution; Then, the target area was extracted from the cropped image. The target area image and the original thermal infrared image were decomposed by wavelet transform, where the multilayered low-frequency information was retained; Thirdly, the primitive affine transformation matrix was obtained by the image moments in the low-resolution layer; At the same time, the global search was used to optimize the affine transform matrix in the low-resolution layer using the improved BSO; Fourthly, the optimization was used as the initial point of the Powell algorithm. The optimization was performed in the high-resolution layer; Lastly, the optimization in the previous step was input into the Powell algorithm again. The original image layer was optimized again to obtain the final affine transformation matrix. The original BSO optimization was improved for the optimal affine transformation matrix in the image alignment task. The specific improvements included the following five aspects: The BSO population distribution was initialized using a chaotic mapping function; The mutation range of new individual was modified; The number of K-means clusters was dynamically adjusted in the BSO by the elbow; The chaotic local search was incorporated into the strategy of individual variation; and the probability parameters were dynamically adjusted, according to the different BSO in the early and late stages. Mutual information (MI), normalized mutual information (NMI), root mean square error (RMSE) and mean structure similarity index measure (MSSIM) were taken as the evaluation indexes. A comparison was made with Powell optimization, genetic algorithm (GA) and BSO_Powell algorithm. Specifically, MI indexes were improved by 0.054 2, 0.076 9, 0.040 5, respectively; NMI indexes were improved by 0.015 9, 0.023 1, 0.052 7, respectively; RMSE indexes were reduced by 15.02, 13.03, 27.08, respectively; and MSSIM indexes were improved by 0.0523, 0.0488, 0.1224, respectively, in greenhouse data; In field data, MI indexes were improved by 0.064 2, 0.066 7, 0.035 5, respectively; NMI indexes were improved by 0.007 7, 0.0125, 0.0124, respectively; RMSE indexes were reduced by 14.06, 10.57, 15.40, respectively; and MSSIM indexes were improved by 0.047 1, 0.038 1, 0.042 9, respectively. The strong robustness can accurately achieved in the multimodal image registration tasks for potatoes under complex environments.