Image Registration and Its Applications

In many computer vision applications (e.g., object tracking and medical imaging), two or more images of the same object or scene, taken from different viewpoints, at different times, or under different conditions, need to be aligned. Image registration algorithms transform one image (the target, or checked, image) so that it is geometrically aligned with another (the reference image). This alignment is required in applications such as image fusion, stereo vision, object tracking, and medical image analysis.

What is Image Registration?

Image registration is the process of applying spatial transformations to align a set of images to a common frame of reference, usually one image chosen from the set. It is an important step in image processing tasks where data from different sources must be combined. Two properties characterize the registration process:

It applies a spatial transformation (3-dimensional for volumetric data) to each image in the set, relative to the image chosen as the reference.
It is typically the most time-consuming step of the algorithm's execution, and the result of the registration cannot be determined in advance.

3d-image registration — Volume Tweening Network (VTN) for 3D moving image registration. Each subnetwork is responsible for finding the deformation field between the fixed image and the moving image – Source

Image registration is frequently used to align images from diverse camera sources in medical and satellite imaging. It can be realized in two ways:

Image-to-Image Registration: multiple images are aligned so that matching pixels that represent the same scene point can be determined.
Image-to-Map Registration: the input image is warped to match the map information of a base image while keeping its original spatial resolution.

How to Implement Image Registration?

Image registration methods can be classified into two groups: area-based and feature-based methods. Area-based approaches are preferred when the images lack prominent details and the distinguishing information is carried by gray levels or colors rather than by clear shapes and structures.

Feature-based image registration is typically carried out in 4 steps:

Feature detection: Distinctive objects (edges, contours, line boundaries, corners, etc.) are detected in both the reference and checked images, either manually by a domain expert or automatically.
Feature matching: The correspondence between the features detected in the reference and target images is established. Matching is based on the image content around the features or on the symbolic description of the control point set.
Determining the transformation model: The type and parameters of the mapping functions that align the checked image with the reference image are estimated.
Image resampling and transformation: The checked image is transformed by applying the mapping functions.
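The third and fourth steps can be sketched in a few lines. The example below is a minimal illustration, not a reference implementation: it assumes the matched control points are already available and that a similarity transform (scale, rotation, translation) is an adequate model. The function names `estimate_similarity` and `apply_similarity` and the sample coordinates are made up for this sketch. Points are treated as complex numbers so that the least-squares solution has a closed form.

```python
import cmath
import math

def estimate_similarity(src, dst):
    """Least-squares fit of q = a * p + b, with points as complex numbers.

    a encodes scale and rotation (a = scale * exp(i * angle)), b the translation.
    """
    p = [complex(x, y) for x, y in src]
    q = [complex(x, y) for x, y in dst]
    p_mean = sum(p) / len(p)
    q_mean = sum(q) / len(q)
    num = sum((qk - q_mean) * (pk - p_mean).conjugate() for pk, qk in zip(p, q))
    den = sum(abs(pk - p_mean) ** 2 for pk in p)
    a = num / den
    b = q_mean - a * p_mean
    return a, b

def apply_similarity(a, b, points):
    """Map (x, y) points through the transform z -> a * z + b."""
    out = []
    for x, y in points:
        z = a * complex(x, y) + b
        out.append((z.real, z.imag))
    return out

# Control points in the checked image (assumed output of detection and matching).
checked_pts = [(0.0, 0.0), (10.0, 0.0), (10.0, 5.0), (0.0, 5.0)]

# Synthetic reference points: rotate by 30 degrees, scale by 1.5, shift by (4, -2).
true_a = 1.5 * cmath.exp(1j * math.radians(30))
true_b = complex(4.0, -2.0)
reference_pts = apply_similarity(true_a, true_b, checked_pts)

# Step 3: estimate the mapping. Step 4: apply it to the checked points.
a, b = estimate_similarity(checked_pts, reference_pts)
scale = abs(a)
angle_deg = math.degrees(cmath.phase(a))
aligned = apply_similarity(a, b, checked_pts)
print(f"scale={scale:.3f}, rotation={angle_deg:.1f} deg, shift=({b.real:.1f}, {b.imag:.1f})")
```

In a full pipeline the same mapping would be applied to every pixel coordinate of the checked image, with interpolation used to resample intensities; that part is omitted here.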

3d medical image registration — Image Registration with Registration Field and Spatial Transform – Source

Computer Vision Techniques for Image Registration

Here we present common techniques for image registration and their advantages/drawbacks:

Pixel-Based Method

This method applies a cross-correlation statistical methodology for image registration. It is based on pattern matching, which finds the location and orientation of a template or pattern in an image. Cross-correlation is a measure of similarity or a match metric.

The 2-dimensional cross-correlation function calculates the similarity between the reference and the checked image for each candidate translation. Where the template matches the image, the cross-correlation reaches its maximum.

The main drawbacks of the correlation approach are the high computational complexity and the flatness of the similarity maximum (due to the self-similarity of the images). The method can be improved by pre-processing or by applying edge or vector correlation.
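As a rough illustration of the idea, the sketch below slides a small template over an image and keeps the translation with the highest normalized cross-correlation. It is written in plain Python on nested lists to stay dependency-free; the helper names `ncc` and `match_template` and the pixel values are invented for the example, and only translations are searched (no rotation).

```python
import math

def ncc(patch, template):
    """Normalized cross-correlation of two equally sized 2D lists, in [-1, 1]."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b))
    return num / den if den > 0 else 0.0

def match_template(image, template):
    """Exhaustively test every translation; return the best (row, col) and its score."""
    th, tw = len(template), len(template[0])
    best_score, best_pos = -2.0, None
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best_score:
                best_score, best_pos = score, (r, c)
    return best_pos, best_score

image = [
    [1, 2, 1, 0, 3, 1],
    [0, 1, 9, 4, 1, 2],
    [2, 1, 3, 8, 0, 1],
    [1, 0, 2, 1, 2, 0],
]
template = [[9, 4], [3, 8]]

position, score = match_template(image, template)
print(position, round(score, 3))
```

The nested loops over every translation make the computational cost mentioned above visible: the work grows with the image size multiplied by the template size.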

Contour-Based Image Registration

This method uses robust statistical features to match image feature points. Color image segmentation is used to extract regions of interest from the images.

To produce the contour of an image, the mean of a given set of colors is computed. During the segmentation process, each RGB pixel in the image is classified as either having a color within a specified range of that mean or not, with the Euclidean distance used as the measure of similarity.

contour based image registration — Contour-based image registration from multiple CT scans (contours marked manually) – Source

These two sets of pixels are coded as a binary (black and white) image. A Gaussian filter is used to suppress noise, and the contour of the image is then extracted. The accuracy of the contour method is satisfactory, but a drawback is that it is manual and slow.
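The segmentation and contour steps described above might look roughly as follows. This is a simplified sketch under several assumptions: the sample colors, the distance threshold of 60, and the helper names `segment_by_color` and `contour` are all illustrative, the Gaussian smoothing step is left out, and the contour is taken to be every foreground pixel that touches the background.

```python
import math

def segment_by_color(image, target_rgb, max_dist):
    """Binary mask: 1 where a pixel's RGB lies within max_dist of target_rgb."""
    return [[1 if math.dist(px, target_rgb) <= max_dist else 0 for px in row]
            for row in image]

def contour(mask):
    """Foreground pixels with a 4-neighbor that is background or outside the image."""
    h, w = len(mask), len(mask[0])
    points = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c]:
                continue
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                if not (0 <= rr < h and 0 <= cc < w) or mask[rr][cc] == 0:
                    points.append((r, c))
                    break
    return points

# Synthetic 5x5 RGB image: a reddish 3x3 region on a dark background.
RED, BACKGROUND = (200, 30, 30), (20, 20, 20)
image = [[BACKGROUND] * 5 for _ in range(5)]
for r in range(1, 4):
    for c in range(1, 4):
        image[r][c] = RED

# Mean of a set of sample colors picked from the region of interest (assumed input).
samples = [(210, 25, 35), (190, 35, 25)]
mean_color = tuple(sum(channel) / len(samples) for channel in zip(*samples))

mask = segment_by_color(image, mean_color, max_dist=60)
outline = contour(mask)
print(len(outline), "contour pixels")
```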

Point-Mapping Method

This is the most common method for registering two images with unknown misalignment. It utilizes image features produced from a feature extraction algorithm/process. The fundamental goal of feature extraction is to filter out redundant information.

Features that are present in both images and are tolerant of local distortions are chosen. Once features have been detected in each image, they are matched.

point mapping image registration — Point Mapping (Multimodal) Image Registration – Source

Control points for point matching are crucial in this strategy. Examples of control points are corners, points of locally greatest curvature, contour lines, lines of intersection, centers of frames with locally maximum curvature, and centers of gravity of closed-boundary areas.

A limitation of this approach is its dependence on how distinctive the frame content is. Registration features should be detected in distinctive parts of the image, but frames may lack this property because their selection is usually not based on an evaluation of their content.

Feature-Based Registration

The feature-based matching method can be used when local structural information is more significant than the information carried by the image intensities. Image features produced by a feature extraction technique are used for registration: key features (such as corners, edges, or interest points) are detected and matched between the images, and the transformation parameters are then computed from these matches.

feature-based image registration — Image Registration done by feature extraction, image transformation, and similarity measurement – Source

This method can handle changes in scale, translation, and rotation, but it could fail in cases of large deformations or occlusions.
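To make the matching step concrete, the sketch below pairs feature descriptors from two images by mutual nearest neighbor: a pair is kept only if each descriptor is the other's closest match. The descriptors are made-up 3-element vectors, and the mutual-nearest-neighbor rule is one common choice assumed here for illustration, not something prescribed by the method itself.

```python
import math

def nearest(descriptor, candidates):
    """Index of the candidate descriptor closest to `descriptor` (Euclidean)."""
    return min(range(len(candidates)), key=lambda j: math.dist(descriptor, candidates[j]))

def mutual_matches(desc_ref, desc_chk):
    """Keep (i, j) only when i and j are each other's nearest neighbors."""
    matches = []
    for i, d in enumerate(desc_ref):
        j = nearest(d, desc_chk)
        if nearest(desc_chk[j], desc_ref) == i:
            matches.append((i, j))
    return matches

# Hypothetical descriptors for three features in each image.
desc_ref = [(0.9, 0.1, 0.0), (0.1, 0.8, 0.1), (0.0, 0.2, 0.9)]
desc_chk = [(0.05, 0.25, 0.85), (0.85, 0.15, 0.05), (0.7, 0.3, 0.2)]

matches = mutual_matches(desc_ref, desc_chk)
print(matches)
```

Reference feature 1 has no counterpart in the checked image (for example, because it is occluded), so it is left unmatched. The surviving pairs would then be passed to the transformation estimation step.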

Advanced Image Registration Methods

Intensity-Based Registration: It compares the pixel intensity values of the reference and checked images to compute the optimal transformation parameters. It can handle a wide range of transformations, including nonlinear distortions, but it’s sensitive to noise and may require additional computation.
Mutual Information Registration: It calculates the statistical dependency between pixel intensities of two images, looking for a transformation that maximizes mutual information. It’s effective for registering images with multiple contrasts and modalities, but it’s computationally intensive.
Deep Learning-Based Registration: It applies convolutional neural networks (CNNs) to learn the transformation directly from image pairs. It can handle complex transformations and large datasets, but it requires training data and is computationally expensive to train.
Optical Flow Registration: It estimates the motion of pixels between consecutive frames by solving an optical flow equation. It is widely used in video analysis and motion tracking, but it may fail in complex scenes and is sensitive to illumination changes.
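Of the methods above, mutual information is easy to illustrate on a toy example. The sketch below estimates it from the joint histogram of two tiny images; the images and the function name `mutual_information` are invented for the example, and real implementations usually bin intensities and interpolate rather than count exact values.

```python
import math
from collections import Counter

def mutual_information(img_a, img_b):
    """Mutual information (in bits) from the joint histogram of pixel pairs."""
    a = [v for row in img_a for v in row]
    b = [v for row in img_b for v in row]
    n = len(a)
    joint = Counter(zip(a, b))
    count_a, count_b = Counter(a), Counter(b)
    mi = 0.0
    for (va, vb), count in joint.items():
        p_ab = count / n
        mi += p_ab * math.log2(p_ab / ((count_a[va] / n) * (count_b[vb] / n)))
    return mi

# Reference image: dark left half, bright right half.
reference = [[0, 0, 1, 1] for _ in range(4)]
# Second "modality": same structure, unrelated intensity values (5 and 2).
aligned = [[5, 5, 2, 2] for _ in range(4)]
# Same second image, misaligned by one column.
shifted = [[5, 5, 5, 2] for _ in range(4)]

mi_aligned = mutual_information(reference, aligned)
mi_shifted = mutual_information(reference, shifted)
print(round(mi_aligned, 3), round(mi_shifted, 3))
```

The intensities of the two images never coincide, yet the score is highest when the structures line up, which is why this measure suits multi-modal registration.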

Image Registration Deep Learning — Deep Learning FlowNet architecture – Source

Applications of Image Registration

Image Fusion

The task of image fusion is to combine two or more registered images into a new image that is more informative than the originals. It is particularly significant in medical imaging, since it produces images better suited to human visual perception. A simple image fusion technique is to take the average of two input images, but this reduces the contrast of features.

A better approach is Laplacian pyramid-based image fusion, but it comes at the cost of introducing blocking artifacts. Better fusion results can be achieved with methods based on the Wavelet Transform of each of the source images.
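The contrast loss from averaging is easy to see on a toy example. In the sketch below, which assumes the two inputs are already registered, a bright feature is visible in only one of them; the pixel values and the simple max-minus-min contrast measure are illustrative choices.

```python
def average_fusion(img_a, img_b):
    """Pixel-wise mean of two registered images of equal size."""
    return [[(a + b) / 2 for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(img_a, img_b)]

def contrast(img):
    """Crude contrast measure: brightest minus darkest pixel."""
    flat = [v for row in img for v in row]
    return max(flat) - min(flat)

# A bright feature appears in the first image only.
img_a = [[10, 10, 10], [10, 200, 10], [10, 10, 10]]
img_b = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]

fused = average_fusion(img_a, img_b)
print(contrast(img_a), contrast(fused))
```

The feature survives in the fused image, but its contrast against the background is halved, which is the motivation for the multi-scale methods mentioned above.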

Object Tracking

An object tracking algorithm follows the movement of an object and tries to estimate (predict) its position in a video. An example of such an algorithm is the centroid tracker. It stores the last known bounding boxes, receives a new set of bounding boxes for the current frame, and then matches old and new boxes so that the distance between the centroids of matched objects is as small as possible.
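A minimal sketch of the matching step in a centroid tracker is shown below. It uses a greedy strategy (repeatedly pairing the closest remaining centroids), which is one simple variant chosen for illustration; the `max_distance` cutoff and the box coordinates are assumptions of this example.

```python
import math

def centroid(box):
    """Center of an (x1, y1, x2, y2) bounding box."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2, (y1 + y2) / 2)

def match_boxes(prev_boxes, new_boxes, max_distance=50.0):
    """Greedily pair previous and new boxes by centroid distance.

    Returns {index in prev_boxes: index in new_boxes}.
    """
    pairs = sorted(
        (math.dist(centroid(p), centroid(n)), i, j)
        for i, p in enumerate(prev_boxes)
        for j, n in enumerate(new_boxes)
    )
    used_prev, used_new, matches = set(), set(), {}
    for distance, i, j in pairs:
        if distance > max_distance:
            break  # remaining pairs are even farther apart
        if i in used_prev or j in used_new:
            continue
        matches[i] = j
        used_prev.add(i)
        used_new.add(j)
    return matches

prev_boxes = [(0, 0, 10, 10), (100, 100, 120, 120)]
new_boxes = [(102, 103, 122, 123), (3, 1, 13, 11), (300, 300, 310, 310)]

matches = match_boxes(prev_boxes, new_boxes)
print(matches)
```

The third new box is too far from every known object, so it stays unmatched and would be registered as a new object by the tracker.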

When the same scene is captured by different sensors, object tracking requires the heterogeneous images to be correctly registered in advance, using cross-modal image registration. Recent deep learning approaches use neural networks with large numbers of parameters to predict feature points.

Multiple Object Tracking (MOT) vs. General Object Detection

Medical Imagery

Medical image registration tries to find the spatial transformation that best aligns the underlying anatomical structures. It is used in many clinical applications such as image reconstruction, image guidance, motion tracking, segmentation, dose accumulation, etc. Medical image registration is a broad topic and can be considered from different points of view.

From an input image perspective, registration methods can be divided into unimodal, multimodal, inter-patient, and intra-patient registration. From a deformation model point of view, registration methods can be divided into rigid, affine, and deformable methods. From a region of interest (ROI) perspective, registration methods can be grouped according to anatomical sites, such as brain registration, lung registration, etc.
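The difference between the rigid and affine models can be shown on a handful of points. In the sketch below (2D for brevity, with made-up matrices and coordinates), a rigid transform preserves distances, while an affine transform with scaling and shear does not. Deformable models, which assign a separate displacement to every pixel or voxel, are not shown.

```python
import math

def apply_affine(matrix, offset, points):
    """Apply p -> M p + t to a list of (x, y) points."""
    (a, b), (c, d) = matrix
    tx, ty = offset
    return [(a * x + b * y + tx, c * x + d * y + ty) for x, y in points]

def rotation_matrix(angle_deg):
    """2x2 rotation matrix, the linear part of a rigid transform."""
    t = math.radians(angle_deg)
    return ((math.cos(t), -math.sin(t)), (math.sin(t), math.cos(t)))

def side_lengths(points):
    """Distances between consecutive points of a closed polygon."""
    return [math.dist(points[i], points[(i + 1) % len(points)])
            for i in range(len(points))]

triangle = [(0.0, 0.0), (4.0, 0.0), (4.0, 3.0)]

# Rigid: rotation by 90 degrees plus translation.
rigid = apply_affine(rotation_matrix(90), (10.0, 0.0), triangle)
# Affine: adds scaling and shear on top of translation.
affine = apply_affine(((2.0, 0.5), (0.0, 1.0)), (10.0, 0.0), triangle)

print([round(s, 3) for s in side_lengths(triangle)])
print([round(s, 3) for s in side_lengths(rigid)])
print([round(s, 3) for s in side_lengths(affine)])
```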

image registration affine alignment — Image Registration by Multiple MRI Brain Scans with affine transformation alignment – Source

Limitations of Image Registration

Image registration has certain limitations, such as:

Feature Selection: The choice of features (key points) used for registration can significantly impact the results. Choosing inappropriate or insufficient features can lead to poor registration performance.
Noise Sensitivity: Image registration is sensitive to noise in the images. Noisy data can cause errors in the calculation of transformation parameters and degrade the registration.
Limited Applicability: Image registration techniques are designed for certain types of image transformation, e.g. rigid (translation, rotation) or smooth (deformable) transformations, and may perform poorly outside them.
Sensitivity to Initial Guess: Many registration algorithms refine an initial estimate of the transformation, and their accuracy depends heavily on the quality of this initial guess. Inaccurate initialization can lead to poor results.
Illumination (Viewpoint) Changes: Registration methods may fail to cope when images have significant changes in lighting conditions or viewpoints.
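The sensitivity to the initial guess can be reproduced with a deliberately small 1D example. The sketch below aligns two signals by a local search over integer shifts, using the sum of squared differences as the cost; the signals, the circular shift, and the function names are all constructed for illustration.

```python
def cost(ref, chk, shift):
    """Sum of squared differences after circularly shifting chk by `shift`."""
    n = len(ref)
    return sum((ref[i] - chk[(i + shift) % n]) ** 2 for i in range(n))

def local_search(ref, chk, start):
    """Move to the better neighboring shift until no neighbor improves the cost."""
    n = len(ref)
    shift = start
    while True:
        best = min(((shift - 1) % n, (shift + 1) % n), key=lambda s: cost(ref, chk, s))
        if cost(ref, chk, best) >= cost(ref, chk, shift):
            return shift
        shift = best

ref = [0, 0, 0, 9, 0, 0, 0, 0, 4, 0, 0, 0]
# chk is ref moved by two samples, so the correct shift is 2.
chk = [ref[(j - 2) % len(ref)] for j in range(len(ref))]

good_start = local_search(ref, chk, start=3)
bad_start = local_search(ref, chk, start=8)
print(good_start, cost(ref, chk, good_start))
print(bad_start, cost(ref, chk, bad_start))
```

Starting near the true shift, the search reaches the exact alignment. Starting farther away, it settles on a shift where the small peak overlaps the large one, a local minimum with a nonzero cost.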

Summary

Image registration is an important technique for the integration, fusion, and evaluation of data from multiple sources (sensors). It has many applications in computer vision, medical imaging, and remote sensing.

Registration of images with complicated nonlinear distortions, multi-modal registration, and registration of occluded images remain the hardest cases, and methods that handle them contribute to the robustness of computer vision in the most demanding use cases.
