(11)EP 3 598 175 A1

(12)EUROPEAN PATENT APPLICATION
published in accordance with Art. 153(4) EPC

(43)Date of publication:
22.01.2020 Bulletin 2020/04

(21)Application number: 18768274.5

(22)Date of filing:  05.03.2018
(51)International Patent Classification (IPC): 
G01S 17/89(2020.01)
(86)International application number:
PCT/JP2018/008370
(87)International publication number:
WO 2018/168552 (20.09.2018 Gazette  2018/38)
(84)Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR
Designated Extension States:
BA ME
Designated Validation States:
KH MA MD TN

(30)Priority: 14.03.2017 JP 2017048587

(71)Applicant: Konica Minolta, Inc.
Tokyo 100-7015 (JP)

(72)Inventor:
  • TAKAMURA, Shunsuke
    Tokyo 100-7015 (JP)

(74)Representative: Gille Hrabal 
Brucknerstrasse 20
40593 Düsseldorf (DE)

  


(54)OBJECT DETECTION SYSTEM


(57) The present invention accurately extracts a moving object desired to be extracted, regardless of the background situation of that moving object. An object detection system (1) comprises a depth image detector that detects a depth image from an external environment, and a moving object extractor (2) that extracts the moving object desired to be extracted from the depth image. The moving object extractor registers in advance in a memory (2a), as a background, a depth image acquired while the moving object desired to be extracted is not present, and extracts only pixels whose current depth is nearer than the depth of the background as candidates for pixels corresponding to the moving object to be extracted (S2, S3). In addition, the moving object extractor makes a determination (S7) on the basis of the size of a cluster and the like to extract the moving object to be extracted from the pixel group composed of the extracted candidate pixels, and newly registers in the memory, as background depths, the depths of pixels included in the portions not extracted as the moving object to be extracted (S8).




Description

TECHNOLOGICAL FIELD



[0001] The present invention relates to an object detection system.

BACKGROUND ART



[0002] In recent years, object detection systems that detect an object in a depth image captured by a laser radar or the like have been used for purposes such as detecting an intruder or a vehicle in a monitored space. Here, a depth image is obtained by mapping the value of the depth (distance) to the object onto two-dimensional coordinates.

[0003] As a method of extracting pixels including a moving object from a depth image, extracting pixels whose depth differs from a previously acquired background (a depth image acquired while the moving object does not exist) is known, as described in Patent Document 1. A background depth map generator in Patent Document 1 generates a background depth map indicating information about the depth from a laser radar in a detection region to an object present in the background, on the basis of a measurement result of the laser radar. A moving object detector detects a moving object in the detection region from the difference between the depth information in the background depth map stored in advance in a background depth map memory and the measurement result of the laser radar.
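As an illustration only (not taken from Patent Document 1), a minimal sketch of this conventional background-difference extraction might look as follows; the array names and the noise tolerance value are assumptions:

```python
import numpy as np

def conventional_moving_pixels(current: np.ndarray,
                               background: np.ndarray,
                               tolerance: float = 0.05) -> np.ndarray:
    """Mask of pixels whose depth differs from the pre-acquired background.

    current, background: 2-D arrays of depth values (e.g. in metres).
    tolerance: assumed measurement-noise margin; any larger difference,
    whether nearer or farther, is treated as a moving-object pixel.
    """
    return np.abs(current - background) > tolerance
```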

PRIOR ART DOCUMENT


PATENT DOCUMENT



[0004] Patent Document 1: JP 2005-300259 A

SUMMARY OF THE INVENTION


PROBLEMS TO BE SOLVED BY THE INVENTION



[0005] However, the above-described method has a problem in that essentially unnecessary pixels are also extracted in the cases described below.

(1) CASE WHERE THERE IS REFLECTIVE OBJECT SUCH AS GLASS OR PUDDLE



[0006] In a case where TOF (time of flight: a method of measuring the time until emitted light is reflected by an object and returns) is used as the method of measuring depth, light emitted toward a totally reflecting surface such as glass or a puddle may be reflected there, reflected again by an object at the destination of that reflection, and then return. In that case, a depth image is obtained as if an object were present behind the glass or the puddle. This is a virtual image appearing at a position where no object actually exists, and thus is not a moving object pixel desired to be extracted. However, when a portion different in depth from the background is extracted, the virtual image is also extracted as moving object candidate pixels.

[0007] FIG. 10 shows a situation in which a person 11 and glass 12 are present in front of a depth image detection device 10, such as a laser radar, that detects a depth image from an external environment. As shown in FIG. 10, when reflected light returns along an optical path 14, it appears as if an object exists at a position where no object is actually present.

[0008] Thus, when an object present at a depth different from the background is extracted with conventional techniques, this virtual image 13 is also extracted.

(2) CASE WHERE THERE IS OBJECT, SUCH AS FENCE, THROUGH WHICH OTHER SIDE IS VISIBLE



[0009] In a case where there is an object, such as a fence, through which the other side is visible, a person or the like appearing behind the fence is also detected, because light passes through the fence, is reflected by the person behind it, and returns. Depending on the purpose, for example when it is desired to observe only movement inside (on the nearer side of) the fence, this is unnecessary information. Since the depth of such a pixel differs from that of the background (the fence), this pixel is also extracted with conventional techniques.

[0010] FIG. 11A and FIG. 11B show a situation in which there is a mesh fence 15 in front of the depth image detection device 10. In this case, when no person is present, light returns only from the fence 15 along an optical path 16 as shown in FIG. 11A; however, when the person 11 appears on the farther side 18 of the fence 15 as shown in FIG. 11B, light reflected by that person 11 also returns along an optical path 17. Here, in a case where it is desired to observe only the nearer side 19 of the fence 15 as seen from the depth image detection device 10, the information about the person 11 in FIG. 11B is unnecessary, but this person 11 is also extracted because conventional techniques extract any portion different from the background.

(3) CASE WHERE BACKGROUND CHANGES



[0011] This is a case where the background changes frequently, for example, a case where it is desired to monitor persons on a platform of a station. FIG. 4 to FIG. 7 show a situation in which there are a platform 20 of a station, a track 21, and a wall 22 in front of the depth image detection device 10. If, for example, the wall 22 is fixed as the background for data processing, the actual background behind a person changes because of a train 23 entering and leaving the station, and that changed portion is entirely extracted as pixels including a moving object.

[0012] When the train 23 is also set as part of the background for data processing, the wall 22 is extracted in a situation where the train 23 is absent, since conventional techniques extract any portion different from the background.

[0013] Therefore, conventional techniques cannot extract only the pixels including the moving object desired to be extracted, such as a person on the nearer side of the train.

[0014] In addition, since the train 23 is a moving object rather than a fixed object, is not always present in the detection target range, and involves depth errors due to differences between vehicles, variations in stopping position, and the like, it is difficult to set the train 23 as the background for data processing.

[0015] The present invention was made in view of the above problems in the conventional techniques, and has an object to, irrespective of a background situation of a moving object desired to be extracted, accurately extract the moving object to be extracted.

MEANS FOR SOLVING PROBLEMS



[0016] In order to solve the above problems, the invention according to claim 1 is an object detection system including:

a depth image detector that detects a depth image from an external environment; and

a moving object extractor that extracts a moving object desired to be extracted from the depth image,

wherein the moving object extractor registers in advance the depth image in a memory as a background while the moving object to be extracted does not exist, and extracts only a pixel whose current depth is present on a nearer side than a depth of the background as a candidate for a pixel corresponding to the moving object to be extracted.
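Purely as an illustrative sketch (not part of the claims), this nearer-side-only criterion can be contrasted with the conventional absolute-difference approach as follows; the array names are assumptions:

```python
import numpy as np

def candidate_pixels(current: np.ndarray, background: np.ndarray) -> np.ndarray:
    """Mask of candidate moving-object pixels.

    A pixel is a candidate only if its current depth is smaller than
    (i.e. nearer than) the background depth registered for the same
    coordinates; pixels that appear farther away than the background,
    such as virtual images behind glass or objects behind a fence,
    are not extracted.
    """
    return current < background
```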



[0017] According to the present invention, the above-described cases (1), (2), and (3) are solved respectively as will be described below.

The above-described (1) "CASE WHERE THERE IS REFLECTIVE OBJECT SUCH AS GLASS OR PUDDLE"



[0018] Since reflected-back light returns from a place farther away than the background (glass or a puddle), its depth is greater than that of the background. Therefore, if only the nearer side than the background is extracted, these pixels are not included.

The above-described (2) "CASE WHERE THERE IS OBJECT, SUCH AS FENCE, THROUGH WHICH OTHER SIDE IS VISIBLE"



[0019] The object behind the fence has a depth greater than that of the background (the fence). Therefore, if only the nearer side than the background is extracted, these pixels are not included.

The above-described (3) "CASE WHERE BACKGROUND CHANGES"



[0020] By acquiring the background while the changing portion of the background is present on the nearer side, a portion where the background later changes is not extracted. For example, in the case of monitoring a platform of a station as shown in FIG. 4 to FIG. 7, if the background is acquired and registered while the train 23 is at a stop, that portion (the wall 22) does not change to the nearer side and is not extracted even if the train 23 leaves thereafter. However, since the depth changes to the nearer side in a case where the person 11 stands in front of the wall 22, the person 11 is extracted. In this manner, only the person 11 on the platform 20 can be extracted.

[0021] The invention according to claim 2 is the object detection system according to claim 1, wherein the moving object extractor further makes a determination to extract, as the moving object to be extracted, a pixel from a pixel group composed of pixels of the candidates having been extracted, and for a portion not extracted as the moving object to be extracted, updates and registers a depth of the pixel in the memory as the depth of the background.

[0022] According to this invention, even in a case where it is difficult to acquire an optimum background in advance, particularly in the above-described case (3) where the background changes, the optimum background can be obtained by updating the depth of the background on the basis of the information being measured.

[0023] The invention according to claim 3 is the object detection system according to claim 2, wherein the moving object extractor makes the determination in accordance with a size of a detected moving object.

[0024] According to this invention, in the above-described case (3), consider a case where it is desired to extract the person 11 walking on the platform 20 of the station without setting the train 23 at the back of the platform 20 as an extraction target, as shown in FIG. 4 to FIG. 7. If only a moving object of a size smaller than or equal to that of the person 11 is set to be extracted as a "moving object desired to be extracted", the train 23 can be determined as not being a "moving object desired to be extracted" because of its size. In addition, size is a parameter that is easy for a user making settings to understand and to adjust.

[0025] The invention according to claim 4 is the object detection system according to any one of claims 1 to 3, wherein when determining whether a pixel is present on a nearer side than the background registered in the memory, the moving object extractor sets a threshold value of a smallest changing depth, and extracts only a pixel present on a nearer side than the depth of the background by more than or equal to the threshold value as a candidate for a pixel corresponding to the moving object to be extracted.

[0026] According to this invention, a pixel accidentally located on the nearer side than the background because of an error or noise can be prevented from being extracted erroneously.
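A minimal sketch of this minimum-change criterion, assuming depth values in metres and an illustrative threshold value:

```python
import numpy as np

def candidate_pixels_with_threshold(current: np.ndarray,
                                    background: np.ndarray,
                                    min_change: float = 0.1) -> np.ndarray:
    """Mask of candidate pixels robust to noise.

    A pixel becomes a candidate only if it is nearer than the background
    by at least `min_change` (the threshold of the smallest changing depth),
    so pixels lying slightly on the nearer side because of measurement
    error or noise are not extracted.
    """
    return (background - current) >= min_change
```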

ADVANTAGEOUS EFFECTS OF INVENTION



[0027] According to the present invention as described above, irrespective of a background situation of a moving object desired to be extracted, the moving object to be extracted can be extracted accurately.

BRIEF DESCRIPTION OF THE DRAWINGS



[0028] 

FIG. 1 is a block diagram of an object detection system according to an embodiment of the present invention.

FIG. 2 is a flowchart showing a computation procedure performed by the object detection system according to the embodiment of the present invention.

FIG. 3 is a flowchart subsequent to FIG. 2, showing a computation procedure performed by the object detection system according to the embodiment of the present invention.

FIG. 4 is a schematic view showing a situation in which a depth image detection device such as a laser radar is installed with a platform of a train station set as a detection target.

FIG. 5 is a schematic view showing a situation changed from FIG. 4.

FIG. 6 is a schematic view showing another situation changed from FIG. 4.

FIG. 7 is a schematic view showing still another situation changed from FIG. 4.

FIG. 8 is a schematic view obtained by adding a background depth image composed of a train portion to the same schematic view as FIG. 7.

FIG. 9 is a diagram showing a display example of most recent background depth images obtained by the object detection system according to an embodiment of the present invention and a background update stop button.

FIG. 10 is a schematic view showing a situation in which there are a person and glass in front of the depth image detection device such as a laser radar.

FIG. 11A is a schematic view showing a situation in which there is a mesh fence in front of the depth image detection device such as a laser radar.

FIG. 11B is a schematic view showing a situation in which there is a mesh fence in front of the depth image detection device such as a laser radar, and a person has appeared behind that fence.


EMBODIMENTS FOR CARRYING OUT THE INVENTION



[0029] Hereinafter, an embodiment of the present invention will be described with reference to the drawings. The following is an embodiment of the present invention, and does not limit the present invention.

[0030] As shown in FIG. 1, an object detection system 1 of the present embodiment includes the depth image detection device 10, such as a laser radar, as a depth image detector that detects a depth image from an external environment, and a moving object extractor 2 that extracts a moving object desired to be extracted from the depth image output by the depth image detection device 10. The moving object extractor 2 is implemented by a computer: it has a memory 2a and executes, by means of a CPU, the computation described below.

[0031] Prior to moving object detection (flows of FIG. 2 and FIG. 3), the moving object extractor 2 acquires a depth image by means of the depth image detection device 10 while there is no obstacle such as a person in an area where detection is desired to be performed, and registers the depth image in the memory 2a as a background.
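As an illustrative sketch only of this initial background registration, the following assumes a hypothetical `detector` object with an `acquire_depth_image()` method returning the depth image as a 2-D array, and a dictionary standing in for the memory 2a:

```python
import numpy as np

def register_background(detector, memory: dict) -> None:
    """Acquire a depth image while no obstacle is present and register it.

    `detector` and its acquire_depth_image() method are assumptions for
    illustration; `memory` stands in for the memory 2a.
    """
    background = np.asarray(detector.acquire_depth_image(), dtype=float)
    memory["background"] = background
```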

[0032] After registering the background, the moving object extractor 2 acquires a depth image output by the depth image detection device 10 (S1), as shown in FIG. 2. This depth image serves as the current depth image.

[0033] Next, the moving object extractor 2 determines whether or not the current depth in the depth image is on the nearer side than the depth of the background registered in the memory 2a (S2). That is, in step S2, the moving object extractor 2 compares the depth value of one pixel in the current depth image acquired from the depth image detection device 10 with the depth value of the pixel at the same coordinates in the background depth image, and determines whether or not the former is smaller.

[0034] In the case of YES in step S2, the moving object extractor 2 extracts the one pixel targeted for determination as a moving object candidate pixel (S3). The moving object candidate pixel is a candidate for a pixel corresponding to a moving object desired to be extracted.

[0035] When determining whether the depth is smaller in step S2, rather than making a direct comparison only, a threshold value for the smallest changing depth may be set, and only a pixel on the nearer side than the depth of the background by the threshold value or more may be extracted as a moving object candidate pixel. Accordingly, a pixel accidentally located on the nearer side than the background because of an error or noise can be prevented from being extracted erroneously.

[0036] Upon executing the above-described steps S2 and S3 for all the pixels (YES in S4), the moving object extractor 2 executes moving object candidate pixel processing R1 (the flow of FIG. 3).

[0037] As the moving object candidate pixel processing R1, the moving object extractor 2 divides moving object candidate pixels (pixel group) into clusters (S5) as shown in FIG. 3.
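The embodiment does not prescribe a particular clustering algorithm; as one possible sketch, connected candidate pixels can be grouped by connected-component labeling, here using scipy.ndimage.label with an assumed 8-connected neighbourhood:

```python
import numpy as np
from scipy import ndimage

def cluster_candidates(candidate_mask: np.ndarray):
    """Divide the moving-object candidate pixels into clusters (S5).

    Returns a label image (0 = not a candidate, 1..n = cluster id) and
    the number of clusters; 8-connectivity is an assumed choice.
    """
    structure = np.ones((3, 3), dtype=int)  # 8-connected neighbourhood
    labels, num_clusters = ndimage.label(candidate_mask, structure=structure)
    return labels, num_clusters
```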

[0038] Next, the moving object extractor 2 calculates the size of each cluster (S6). For example, the moving object extractor 2 calculates a vertical dimension, a horizontal dimension, a total area, and the like. Note that the "size" refers to actual dimensions, not an apparent (pixel) size.
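A minimal sketch of one way to estimate such actual dimensions; it assumes the sensor's horizontal and vertical angular resolutions per pixel are known (the values below are placeholders) and uses the small-angle approximation (size is roughly depth times subtended angle):

```python
import numpy as np

def cluster_size(labels: np.ndarray, depth: np.ndarray, cluster_id: int,
                 rad_per_px_h: float = 0.002, rad_per_px_v: float = 0.002):
    """Approximate real-world width and height of one cluster in metres.

    rad_per_px_h / rad_per_px_v are the assumed horizontal and vertical
    angular resolutions of the sensor per pixel.
    """
    rows, cols = np.nonzero(labels == cluster_id)
    mean_depth = float(depth[rows, cols].mean())
    height_m = (rows.max() - rows.min() + 1) * rad_per_px_v * mean_depth
    width_m = (cols.max() - cols.min() + 1) * rad_per_px_h * mean_depth
    return width_m, height_m
```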

[0039] The moving object extractor 2 determines whether or not the size calculated in step S6 is smaller than or equal to a predetermined threshold value for specifying a moving object to be extracted (S7).

[0040] In the case of YES in step S7, the moving object extractor 2 extracts that cluster as the moving object to be extracted, and executes extraction target moving object processing R2. The extraction target moving object processing R2 is, for example, processing such as issuing a notification if the purpose is to monitor for an intruder into a predetermined area, and its details are not particularly limited.

[0041] In the case of NO in step S7, the moving object extractor 2 updates and registers the depth of pixels in that cluster in the memory 2a as the depth of the background (S8).
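A minimal sketch, under the same assumed data structures as in the sketches above, of the decision in steps S7 and S8: clusters whose size is at most a threshold are extracted as moving objects, and the depths of all other clusters are written back into the background. The `size_of` callable and the `max_size` value are assumptions for illustration:

```python
import numpy as np

def classify_and_update(labels: np.ndarray, num_clusters: int,
                        depth: np.ndarray, background: np.ndarray,
                        size_of, max_size: float = 2.0):
    """Return ids of clusters extracted as moving objects (S7) and update
    the background depths for all other clusters (S8, in place).

    `size_of(cluster_id)` is assumed to return the cluster size used for
    the comparison (e.g. its height in metres); `max_size` is a threshold
    chosen larger than a person and smaller than a train vehicle.
    """
    extracted = []
    for cid in range(1, num_clusters + 1):
        mask = labels == cid
        if size_of(cid) <= max_size:
            extracted.append(cid)             # moving object to be extracted
        else:
            background[mask] = depth[mask]    # register as new background depth
    return extracted
```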

[0042] In the present embodiment, a cluster is determined to be the target moving object in a case where its size is smaller than a prescribed size.

[0043] Accordingly, in a case where it is desired to monitor a platform of a station as shown in FIG. 4 to FIG. 7, for example, if the threshold value is set to be larger than the size of a person walking on the platform 20 and smaller than the size of a train vehicle, a cluster of a large object such as the train 23 can be set as the background in step S8. When the train 23 is incorporated into the background in this manner, whether by update and registration or by initial registration, a background depth image 30 is obtained on a plane equivalent to the platform-side surface of the train as shown in FIG. 8. Thus, even if the train 23 thereafter enters and leaves as shown in FIG. 4 to FIG. 7, neither the train 23 nor the wall 22 will be extracted as a moving object desired to be extracted.

[0044] Note that the above method of determining a moving object from moving object candidate pixels is an example, and another method may be used. Also in that case, pixels of an object not determined as an extraction target by that other method are set as background pixels.

[0045] Description will be made further using a platform of a station as an example.

[0046] In the example of a platform of a station as shown in FIG. 4 to FIG. 7, assume that the background is acquired at first while neither the train 23 nor the person 11 is present (FIG. 4). Thereafter, when a depth image is acquired, and moving object candidate pixels present on the nearer side than the background are extracted from the depth image, depth pixels from the person 11 to the train 23 are extracted as moving object candidate pixels in the situation shown in FIG. 6.

[0047] When these candidates are then classified by size as in step S7, the cluster composed of the depth pixels up to the person 11 becomes an extraction target, whereas the cluster composed of the depth pixels up to the train 23 falls outside the extraction target and is registered as the background.

[0048] Then, in the next and subsequent processing cycles, depth pixels on the nearer side than the background depth image, which is now partly composed of the depth pixels up to the train 23, become moving object candidate pixels, and only the person 11 or the like present on the nearer side than the train 23 will be extracted as a moving object desired to be extracted.

[0049] If this is repeated several times, a background corresponding to the state where the train 23 exists is generated even if part of the train 23 is temporarily occluded by the person 11.

[0050] After the background has been completed, the update of the background may be stopped. Since it is not easy to determine automatically whether or not the background has been completed correctly, measures for having a user confirm this, in the following manner, may be taken.

[0051] That is, by means of the display 2b and an operation interface 2c, the moving object extractor 2 provides a facility for the user to confirm the most recent background depth images displayed on the display 2b and to input an instruction to stop updating the background. FIG. 9 shows a display example of the most recent background depth images and a background update stop button. In FIG. 9, the most recent background depth images 31a are plotted in a main display region 31, and depth auxiliary lines are also displayed; the black strip-like objects indicate the background depth images 31a. If, after checking this display, the user presses the background update stop button 32 by clicking a mouse or touching the screen, the update of the background is stopped.

[0052] According to the object detection system 1 of the present embodiment as described above, irrespective of a background situation of a moving object desired to be extracted, the moving object to be extracted can be extracted accurately.

INDUSTRIAL APPLICABILITY



[0053] The present invention can be utilized for detection of an object.

REFERENCE SIGNS LIST



[0054] 
1    object detection system
2    moving object extractor
2a   memory
10   depth image detection device (depth image detector)
11   person
12   glass
13   virtual image
15   fence
20   platform
21   track
22   wall
23   train
30   background depth image
31a  background depth image
32   background update stop button



Claims

1. An object detection system comprising:

a depth image detector that detects a depth image from an external environment; and

a moving object extractor that extracts a moving object desired to be extracted from the depth image,

wherein the moving object extractor registers in advance the depth image in a memory as a background while the moving object to be extracted does not exist, and extracts only a pixel whose current depth is present on a nearer side than a depth of the background as a candidate for a pixel corresponding to the moving object to be extracted.


 
2. The object detection system according to claim 1, wherein the moving object extractor further makes a determination to extract, as the moving object to be extracted, a pixel from a pixel group composed of pixels of the candidates having been extracted, and for a portion not extracted as the moving object to be extracted, updates and registers a depth of the pixel in the memory as the depth of the background.
 
3. The object detection system according to claim 2, wherein the moving object extractor makes the determination in accordance with a size of a detected moving object.
 
4. The object detection system according to any one of claims 1 to 3, wherein when determining whether a pixel is present on a nearer side than the background registered in the memory, the moving object extractor sets a threshold value of a smallest changing depth, and extracts only a pixel present on a nearer side than the depth of the background by more than or equal to the threshold value as a candidate for a pixel corresponding to the moving object to be extracted.
 




Drawing































Search report










Cited references

REFERENCES CITED IN THE DESCRIPTION



This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.

Patent documents cited in the description