|
(11) | EP 4 148 658 A1 |
| (12) | EUROPEAN PATENT APPLICATION |
|
|
|
|
|||||||||||||||||||||||||||
| (54) | METHOD AND SYSTEM FOR SUPER-RESOLUTION RECONSTRUCTION OF HETEROGENEOUS STEREOSCOPIC IMAGES |
| (57) The invention discloses a method and system for super-resolution reconstruction of
heterogeneous stereoscopic images, which comprises :(1) acquires a stereoscopic image
pair at a certain moment compressed by an asymmetrical compression method in a binocular
vision system, and the stereoscopic image pair includes two views, one of which is
a high-resolution image and the other is a low-resolution image; (2) narrows the search
area of the high-resolution image and the low-resolution image; (3) determines the
high-frequency information of the high-resolution image and the low-resolution image
from the search regions of the high-resolution image and the low-resolution image,
respectively;(4) pastes the high-frequency information of the high-resolution image
to the high-frequency information of the low-resolution image through parametric transformation.
The advantages are :(1) the prior knowledge of high-resolution images in heterogeneous
stereo images can be directly utilized to recover the high-frequency information of
the low-resolution images; (2) the spatial relativity and viewpoint relativity of
heterogeneous stereoscopic images can be used for finding the corresponding sub-blocks
for the low-resolution images from the high-resolution images to efficiently compensate
the heterogeneity between stereoscopic images.
|
FIELD OF INVENTION
BACKGROUND
SUMMARY OF INVENTION
acquire a stereoscopic image pair at a certain moment compressed by an asymmetric compression method in the binocular vision system. The heterogeneous stereoscopic image pair includes two views, one of which is a high-resolution image, and the other is a low-resolution image;
narrow the search area of the high-resolution image and the search area of the low-resolution image;
determine the high-frequency information of the high-resolution image from the search region of the high-resolution image and the high-frequency information of the low-resolution image from the search region of the low-resolution image, respectively;
paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image through parametric transformation.
convovle the high-resolution image and the low-resolution image with a Gaussian filter to obtain a denoised high-resolution image and a denoised low-resolution image;
calculate the raw edges of the high-resolution image and the low-resolution image according to the gradient strength G and the directionθ of the denoised high-resolution image and the gradient strength G and the direction θ of the denoised low-resolution image respectively;
use the non-maximum suppression method to sparse the raw edges of the high-resolution image and the raw edges of the low-resolution image to obtain the sparsed edges thereof;
suppress the weak pixels of the sparsed edges according to the preset high and low thresholds to obtain ideal edges and determine the search area of the high-resolution image and the search area of the low-resolution image according to the ideal edges, respectively.
divide the search area of the high-resolution image and the corresponding area of
the low-resolution image that matches the search area of the high-resolution image
into N sub-blocks,
and
, respectively;
use five gradient operators to extract gradient feature vectors of each sub-block,
siand ti; wherein the five gradient operators are expressed as:
the gradient feature vectors extracted by the five gradient operators are denoted as fx , fy , fxx , fyy , and fxy;
calculate the feature differences D, of the sub-blocks according to the extracted
gradient feature vectors; the feature difference D, is given by:
where
arrange all the feature differences in the order of size to obtain a sequence; the sub-block corresponding to the feature difference in the middle position of the sequence is the sub-block of high-frequency information.
acquire cost function which is given by:
Ti is the transformation parameter;
is the index of the stereoscopic image pair. Eu(ti,si,Hi) denotes the Euclidean distance, which can be represented by:
where P(ti) is the intensity of the sub-block ti in low-resolution image. S(si,Ti) is the intensity of the sub-block Si in high resolution image;
calculate the transformation parameter Ti according to the cost function, which comprises:
acquire the isomorphic map Hmi for rectifying the planes in the image, wherein the three vectors of Hmi are denoted as h1,h2,h3. The target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
and paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image according to the transformation parameter Ti.
an acquisition module configured to acquire a stereoscopic image pair at a certain moment compressed by an asymmetric compression method in a binocular vision system. The stereoscopic image pair includes two views, one of which is a high-resolution image and the other is a low-resolution image.
a narrowing module configured to narrow the search area of a high-resolution image and the search area of a low-resolution image.
a determination module configured to determine the high-frequency information of the high-resolution image from the search region of the high-resolution image and the high-frequency information of the low-resolution image from the search region of the low-resolution image, respectively.
a processing module configured to paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image through parameter transformation.
a denoising module configured to convolve the high-resolution image and the low-resolution image with the Gaussian filter to obtain adenoised high-resolution image and a denoised low-resolution image;
a first calculation module configured to calculate the gradient intensity G and direction θ of the denoised high-resolution image and the denoised low-resolution image respectively;
a second calculation module configured to calculate the raw edges of the high-resolution image and the low-resolution image according to the gradient strength G and the direction θ of the denoised high-resolution image and the gradient strength G and the direction θ of the denoised low-resolution image;
an edge sparse module configured to sparse the raw edges of the high-resolution image and the low-resolution image using a non-maximum suppression method to obtain the sparsed edges thereof;
a suppression module configured to suppress the weak pixels of the sparsed edges according to the preset high and low thresholds to obtain ideal edges and determine the search area of the high-resolution image and the search area of the low-resolution image according to the ideal edges, respectively.
a segmentation module configured to divide the search area of the high-resolution
image and the corresponding area of the low-resolution image that matches the search
area of the high-resolution image into N sub-blocks,
and
, respectively;
an extraction module configured to extract gradient feature vectors of each sub-block,
by using five gradient operators , wherein the five gradient operators are expressed
as:
the gradient feature vectors extracted by the five gradient operators are denoted as fx, fy, fxx, fyy, and fxy;
a feature difference calculation module configured to calculate the feature difference
Di according to the extracted gradient feature vectors, wherein the feature difference
Di is given by:
where
a high-frequency information confirmation module configured to configured to arrange all the feature differences in the order of size to obtain a sequence; the sub-block corresponding to the feature difference in the middle position of the sequence is the sub-block of high-frequency information.
a function acquisition module configured to obtain a cost function; the cost function
is given by
Ti is the transformation parameter;
is the index of the stereoscopic image pair. Eu(ti,si,Hi) denotes the Euclidean distance, which can be represented by:
Where P(t1) is the intensity of the sub-block ti in the low-resolution image. S(si,Ti) is the intensity of the sub-block si in the high resolution image;
a parameter transformation calculation module configured to calculate the transformation parameter Ti according to the cost function, which comprises:
acquire the isomorphic map, Hmi , used for rectifying the planes in the image, wherein the three vectors of Hmi are denoted as h1,h2,h3,and the target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
a pasting module configured to paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image according to the transformation parameter Ti.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG.1 is the framework of the super-resolution reconstruction method for heterogeneous stereoscopic images.
FIG.2 is the schematic diagram of edge detection algorithm
EMBODIMENT ONE
1. Edge detection
2. High resolution patches search
3. The nearest neighbor region estimation
acquire the isomorphic map, Hmi ,used for rectifying the planes in the image. The three vectors of Hmi are denoted as h1,h2,h3. The target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
an acquisition module configured to acquire a stereoscopic image pair at a certain moment compressed by an asymmetric compression method in a binocular vision system; the stereoscopic image pair includes two views, one of which is a high-resolution image and the other is a low-resolution image;
a narrowing module configured to narrow the search area of high-resolution image and the search area of the low-resolution image;
a determination module configured to determine the high-frequency information of the high-resolution image from the search region of the high-resolution image and the high-frequency information of the low-resolution image from the search region of the low-resolution image, respectively;
a processing module configured to paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image through parameter transformation.
a denoising module configured to convolve the high-resolution image and the low-resolution image with the Gaussian filter to obtain a denoised high-resolution image and a denoised low-resolution image;
a first calculation module configured to calculate the gradient intensity G and direction θ of the denoised high-resolution image and the denoised low-resolution image respectively;
a second calculation module configured to calculate the raw edges of the high-resolution image and the low-resolution image according to the gradient strength G and direction θ of the denoised high-resolution image and the gradient strength G and the direction θ of the denoised low-resolution image;
an edge sparse module configured to sparse the raw edges of the high-resolution image and the low-resolution image using the non-maximum suppression method to obtain the sparsed edges thereof;
a suppression module configured to suppress the weak pixels of the sparsed edges according to the preset high and low thresholds to obtain ideal edges, and determine the search area of the high-resolution image and the search area of the low-resolution image according to the ideal edges, respectively.
a segmentation module configured to divide the search area of the high-resolution
image and the correspondingly area of the low-resolution image that matches the search
area of the high-resolution image into N sub-blocks,
and
, respectively;
an extraction module configured to extract gradient feature vectors of each each sub-block
by using five gradient operators, wherein the five gradient operators are expressed
as:
a feature difference calculation module configured to calculate the feature difference
Di according to the extracted gradient feature vectors, where the extracted gradient
feature vectors are denoted as fy, fy, fxx, fyy, and fxy, and the feature difference is given by:
where
a high-frequency information confirmation module configured to arrange all the feature differences in the order of size to obtain a sequence; the sub-block corresponding to the feature difference in the middle position of the sequence is the sub-block of high-frequency information.
a function acquisition module configured to obtain a cost function; the cost function
is given by
Ti is the transformation parameter;
is the index of thestereoscopic image pair. Eu(ti,si,Hi) denotes the Euclidean distance, which can be represented by:
where P(ti) is the intensity of the sub-block ti in the low-resolution image. S(si,Ti) is the intensity of the sub-block si in the high resolution image;
a parameter transformation calculation module configured to calculate the transformation parameter Ti according to the cost function, which comprises:
acquire the isomorphic map Hmi for rectifying the planes in the scene, wherein the three vectors of Hmi are denoted as h1,h2,h3, the target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
a pasting module configured to paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image according to the transformation parameter Ti.
acquire a stereoscopic image pair at a certain moment compressed by an asymmetrical compression method in a binocular vision system, and the stereoscopic image pair includes two views, one of which is a high-resolution image, and the other is a low-resolution image;
narrow the search area of the high-resolution image and the search area of the low-resolution image;
determine the high-frequency information of the high-resolution image from the search region of the high-resolution image and the high-frequency information of the low-resolution image from the search region of the low-resolution image, respectively; and
paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image through parametric transformation.
convolve the high-resolution image and the low-resolution image with a Gaussian filter to obtain a denoised high-resolution image and a denoised low-resolution image;
calculate the gradient strength G and the direction θ of the denoised high-resolution image and the denoised low-resolution image, respectively;
calculate the raw edges of the high-resolution image and the raw edges ofthe low-resolution image according to the gradient strength G and the direction θ of the denoised high-resolution image and the gradient strength G and direction θ of the denoised low-resolution image respectively;
use a non-maximum suppression method to sparse the raw edges of the high-resolution image and the raw edges of the low-resolution image to obtain sparsed edges thereof; and
suppress the weak pixels of the sparsed edges according to the preset high and low thresholds to obtain ideal edges, and determine the search area of the high-resolution image and the search area of the low-resolution image according to the ideal edges, respectively.
divide the search area of the high-resolution image and the corresponding area of
the low-resolution image that matches the search area of the high-resolution image
into N sub-blocks,
and
, respectively;
use five gradient operators to extract gradient feature vectors of each sub-block
siand ti, wherein the five gradient operators are expressed as:
the gradient feature vectors extracted by the five gradient operators are denoted
as fx, fy, fxx, fyy, and fxy;
calculate the feature difference Di of the each sub-block sj and tj according to the extracted gradient feature vectors, where the feature difference
Di is given by:
where
arrange all the feature differences in the order of size to obtain a sequence; the sub-block corresponding to the feature difference in the middle position of the sequence is the sub-block with high-frequency information.
acquire a cost function which is given by:
Ti is the transformation parameter;
is the index of the stereoscopic image pair; Eu(ti,si,Hi) denotes the Euclidean distance, which can be represented by:
Where P(ti) is the intensity of the sub-block ti in the low-resolution image; S(si,Ti) is the intensity of the sub-block si in the high resolution image;
calculate the transformation parameter Ti according to the cost function, which comprises:
acquire the isomorphic map, Hmi , used for rectifying the sub-blocks in the image pair, wherein the three vectors
of Hmi are denoted as h1,h2,h3, and the target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image according to the transformation parameter Ti.
an acquisition module configured to acquire a stereoscopic image pair at a certain moment compressed by an asymmetric compression method in a binocular vision system; the stereoscopic image pair includes two views, one of which is a high-resolution image and the other is a low-resolution image;
a narrowing module configured to narrow the search area of high-resolution image and the search area of the low-resolution image;
a determination module configured to determine the high-frequency information of the high-resolution image from the search region of the high-resolution image and the high-frequency information of the low-resolution image from the search region of the low-resolution image, respectively;
a processing module configured to paste the high-frequency information of the high-resolution image to the high-frequency information of the low-resolution image through parameter transformation.
a denoising module configured to convolve the high-resolution image and the low-resolution image with the Gaussian filter to obtain a denoised high-resolution image and a denoised low-resolution image;
a first calculation module configured to calculate the gradient intensity G and the direction θ of the denoised high-resolution image and the denoised low-resolution image respectively;
a second calculation module configured to calculate the raw edges of the high-resolution image and the low-resolution image according to the gradient strength G and the directionθ of the denoised high-resolution image and the gradient strength G and the directionθ of the denoised low-resolution image, respectively;
an edge sparse module configured to sparse the raw edges of the high-resolution image and the low-resolution image using a non-maximum suppression method to obtain the sparsed edges thereof;
a suppression module configured to suppress the weak pixels of the sparsed edges according to the preset high and low thresholds to obtain ideal edges, and determine the search area of the high-resolution image and the search area of the low-resolution image according to the ideal edges, respectively.
a segmentation module configured to divide the search area of the high-resolution
image and the corresponding area of the low-resolution image that matches the search
area of the high-resolution image into N sub-blocks,
and
, respectively;
an extraction module configured to extract gradient feature vectors of each sub-block,
siand ti, by using five gradient operators, wherein the five gradient operators are expressed
as:
the gradient feature vectors extracted by the five gradient operators are denoted
as fy, fy, fxx, fyy, and fxy;
a feature difference calculation module configured to calculate the feature difference
Di of the each sub-block si and ti according to the extracted gradient feature vectors, where the feature difference
Di is given by:
where
a high-frequency information determination module configured to arrange all the feature differences in the order of size to obtain a sequence; the sub-block corresponding to the feature difference in the middle position of the sequence is the sub-block of high-frequency information.
a function acquisition module configured to obtain a cost function; the cost function
is given by:
Ti is the transformation parameter;
is the index of the stereoscopic image pair; Eu(ti,si,Hi) denotes the Euclidean distance, which can be represented by:
where P(ti) is the intensity of the sub-block ti in the low-resolution image; S(si,Ti) is the intensity of the sub-block si in the high resolution image;
a parameter transformation calculation module configured to calculate the transformation parameter Ti according to the cost function, which comprises:
acquire the isomorphic map Hmi for rectifying the planes in the image, wherein the three vectors of Hmi are denoted as h1,h2,h3, and the target sub-block
in the low-resolution image and the rectified source sub-block
in the high-resolution image are denoted as
assume (dx,dy) is the displacement vector of the rectified source sub-block
from the target space to the rectified space; the rectified source sub-block
is expressed by
the source sub-block si in the high resolution image is represented by
Ti is denoted by
a pasting module configured to paste the high-frequency information of the high-resolution
image to the high-frequency information of the low-resolution image according to the
transformation parameter Ti.