(19)
(11)EP 3 046 324 B1

(12)EUROPEAN PATENT SPECIFICATION

(45)Mention of the grant of the patent:
26.06.2019 Bulletin 2019/26

(21)Application number: 16154382.2

(22)Date of filing:  09.07.2014
(51)International Patent Classification (IPC): 
H04N 7/18(2006.01)
H04N 7/14(2006.01)
H04N 5/272(2006.01)

(54)

METHOD FOR REGISTERING AND EXECUTING INSTRUCTIONS IN A VIDEO CAPTURING DEVICE

VERFAHREN ZUR REGISTRIERUNG UND AUSFÜHRUNG VON ANWEISUNGEN IN EINER VIDEOERFASSUNGSVORRICHTUNG

PROCÉDÉ PERMETTANT D'ENREGISTRER ET D'EXÉCUTER DES INSTRUCTIONS DANS UN DISPOSITIF DE CAPTURE VIDÉO


(84)Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

(43)Date of publication of application:
20.07.2016 Bulletin 2016/29

(62)Application number of the earlier application in accordance with Art. 76 EPC:
14176250.0 / 2966861

(73)Proprietor: Axis AB
223 69 Lund (SE)

(72)Inventors:
  • Rolf, Magnus
    216 19 Malmö (SE)
  • Hansson, Niklas
    242 30 Hörby (SE)
  • Adolfsson, Johan
    247 34 Södra Sandby (SE)

(74)Representative: AWA Sweden AB 
P.O. Box 5117
200 71 Malmö
200 71 Malmö (SE)


(56)References cited: : 
US-A1- 2008 129 498
  
  • "Teamviewer and Cheese", , 28 March 2012 (2012-03-28), pages 1-1, XP55191662, Retrieved from the Internet: URL:https://www.youtube.com/watch?v=wSd-sB fnmFE [retrieved on 2015-05-26]
  
Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 99(1) European Patent Convention).


Description

Technical field of the invention



[0001] The present invention relates to a method for registering and executing instructions in a video capturing device.

Background of the invention



[0002] Access control devices arranged to control access to specific areas has been around for some time. These access control devices may for instance be card readers, readers of biometric data, keypads, tag readers, audio capturing devices, video capturing devices, etc. Door stations enabling a visitor to contact a person in order to gain access to the area are usually mounted at entrances in office buildings, apartment buildings, condominiums, airports, campuses, logistic centres. In some applications one or a few door stations are connectable to a plurality of access granting indoor stations, e.g. one in each apartment. In other applications an access control centre is arranged to handle all or most access requests from a plurality of door stations. A door station may combine many features of the above mentioned access control devices. However, one important feature is to enable communication with a person enabled to allow access to the area. Early door stations included an audio capturing device and a speaker for voice communication with a person authorized to open the entrance. However, nowadays the door stations commonly include a video camera as well. The imagery from the video camera allows the person enabled to allow access to see the person requesting entrance.

[0003] Traditionally an access control device like a door station is included in an intercom system, see an example in Fig 1. In this traditional intercom a plurality of system specific indoor units 102, e.g. in the form of system specific video phones or telephones, are connected to a specialized intercom server 104. A system specific door station 106 is also connected to the intercom server 104 and to a lock mechanism 108 of a door 110. Moreover, the intercom system may include a control desk 112 enabling communication and control of the devices in the system. A person requesting entry via the door 110 connected to the door station 106 either presses a request button on the door station or inputs a request address via some input means on the door station. The request is then sent either to one or a plurality of predetermined indoor units and/or the control desk, in case of the request button being used, or to an indoor unit or the control desk being directly addressed by the user inputting a request address. At the indoor unit or the control desk a user or an operator may view the imagery captured by the door station and/or communicate verbally with the person at the door station and, if they find it appropriate, signal to the door station that the door should be unlocked and/or opened. One drawback with these traditional systems is that they require system specific indoor units, door stations, etc.

[0004] More and more systems including door stations for requesting access to areas are designed to enable use of general computers or smartphones running specific applications as indoor units or control desks for authorizing access to an area. However, these systems require the indoor unit or control desk to run an application and therefore these systems are restricted to use devices having specific operating systems.

[0005] The above is equally true for networked motion video cameras.

Summary of the invention



[0006] One object of the present invention is to provide a method allowing door station access systems and/or networked motion video cameras facilitating the use of various devices as authorized devices, e.g. as indoor units and/or desk units.

[0007] More specifically, the invention is defined in claim 1. The advantage of generating graphics of a received signal at the video capturing device and then overlaying this graphic onto the video stream is that low technical requirements may be put on the devices used in controlling a video capturing device, i.e. authorized devices. Moreover, general purpose devices may be used as without requiring particular applications formed for the specific video capturing device. Moreover, the graphical representation in the return stream makes it possible for the user of the controlling device, i.e. the authorized device, to see what is inputted and get a hint if he is making correct inputs and/or if the video capturing device is receiving the correct inputs.

[0008] According to some embodiments the instruction represented by the at least one signal is an operation requested by a user of the authorized device. Hence, the device used for controlling the video capturing device, i.e. the authorized device, may instruct the video capturing device to perform an operation without the authorized controlling device having any information of the operations or instructions.

[0009] In some embodiments said received at least one signal represents a symbol inputted by an operator of the authorized device. One advantage of this embodiment is that the authorized device may be simple and may not need to perform any intelligent operations as all logic for interpreting instructions from the user of the authorized device may be arranged in the video capturing device. The symbols may also be used for authorisation and/or identification of the user of the authorised device at the video capturing device.

[0010] According to some embodiments said superimposing of the graphical representation onto the captured video of the video stream includes changing pixel values for pixels forming the symbol in the video stream. One advantage is that the user of the authorised device get confirmation of each symbol inputted at the authorized device even if the authorized device is a simple device.

[0011] In some other embodiments said superimposing of the graphical representation onto the captured video of the video stream includes changing pixel values for pixels forming a visual representation of said requested operation. One advantage of this embodiment corresponds to the above advantages of enabling the user of the authorised device see the result of his input even when using a very simple authorised device.

[0012] In the method the received at least one signal and the received concluding signal may be in the form of DTMF-signals. An advantage of this feature being that many existing systems based on telephone devices may implement the method according to the invention.

[0013] Moreover, a communication session between the video capturing device and the authorized device may be a SIP-session (Session Initiation Protocol). One advantage of implementing SIP-sessions is that VoIP (Video over Internet Protocol) may be used. Further it enables software based clients, i.e. authorized devices. Moreover, it results in an open system that is enabled to operate and interact with a plurality of existing devices employing SIP.

[0014] According to some embodiments a method for controlling entry to an area, comprises said method for registering and executing instructions as described above. In the method the video capturing device is a door station, receiving, at the door station and before receiving at least one signal representing a first input made using the authorized device, an input representing a request for a communication session with an authorized device being authorized to grant entry to the area, sending to the authorized device, in response to receipt of said input representing a request, a request for setting up the communication session, and generating an entry control signal in response to the receipt of the at least one signal and the concluding signal from the authorized device. An advantage of controlling entry into areas using this method is that the method facilitates use in various different systems of entry control, both simple ones and advanced. Hence, a door station implementing the invention may be installed in existing systems without the existing systems being required to be exchanged or upgraded.

[0015] The entry control signal may be sent from the door station to a physical access controller controlling entry to said area. An advantage of this is that the door station may operate in a new access control system and simultaneously in a different, old, or existing entry control system.

[0016] In some embodiments the input that is representing a request for a communication session is generated by physical interaction with a device mounted in the door station.

[0017] The concluding signal received may be a confirmation signal received from the authorized device in response to a confirmation input made at the authorized device.

[0018] A further scope of applicability of the present invention will become apparent from the detailed description given below. However, it should be understood that the detailed description and specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the scope of the invention will become apparent to those skilled in the art from this detailed description. Hence, it is to be understood that this invention is not limited to the particular steps of the methods described as such method may vary. It is also to be understood that the terminology used herein is for purpose of describing particular embodiments only, and is not intended to be limiting. It must be noted that, as used in the specification and the appended claim, the articles "a," "an," "the," and "said" are intended to mean that there are one or more of the elements unless the context clearly dictates otherwise. Thus, for example, reference to "a sensor" or "the sensor" may include several sensors, and the like. Furthermore, the word "comprising" does not exclude other elements or steps.

Brief description of the drawings



[0019] Other features and advantages of the present invention will become apparent from the following detailed description of a presently preferred embodiment, with reference to the accompanying drawings, in which

Fig 1 is a block diagram of a traditional entry system based on an intercom system,

Fig 2 is a block diagram of a modern entry system based on various communication methods that at least partly includes a computer network communication,

Fig 3 is a block diagram schematically showing a video capturing device according to one embodiment of invention, in this particular figure the block diagram includes features of a door station,

Fig 4 is a flowchart of a method for registering and executing instructions in a video capturing device according to one embodiment of the invention, and

Fig 5 is a flowchart of a method for executing entry into a specific blocked area according to one embodiment of the invention.



[0020] Further, in the figures like reference characters designate like or corresponding parts throughout the several figures.

Detailed description of embodiments



[0021] A door station according to one embodiment of the present invention may be arranged to communicate using various types of communication systems or various combinations of communication systems. In Fig 2 some examples of the communication systems that may be implemented by themselves or in combination are described. The door station 106 is, as in the traditional intercom system, connected to a locking mechanism 108 controlling an entry to an area, e.g. via a door 110, a gate, turnstile, a sliding door, and other movable blocking device that may be put in an passage to prevent people from entering an area. The connection to the locking mechanism 108 may be a direct connection from the door station 106 to the locking mechanism 108, which is a design frequently used in traditional systems. However, the door station may alternatively be connected to an access controller, not shown, which is arranged in the protected area and communicates directly with the locking mechanism 108. The communication between the door station 106 and such an access controller may be using a computer network and an internet protocol. The door station 106 is arranged at an entry to the area and allows a user to request to enter into an area without any automatically readable credentials. The door station 106 may also be combined with a credential reader, i.e. a card reader, a keypad, a fingerprint reader, etc. A door station 106 may also be arranged for allowing a user to request to pass out from an area.

[0022] The request for entering into the area is sent from the door station 106 to a device, 202, 204, 206, 208, and 210 that is authorized to order the entry to the area to be opened so that the requesting person may enter into the area. Hereinafter, in the present description, the device authorized to order the entry to an area to be opened is referred to as authorized device. The authorized device may be a an analogue video telephone 202, a digital voice over internet protocol (VoIP) telephone 204 having video capabilities, a general purpose computer 206 provided with an application allowing it to allow access to the area, a smartphone 208, a mobile phone 208, a video managing server (VMS) 210 connected to the camera surveillance system. Some of the authorized devices is authorized to allow access to the area by hardware, e.g. by hardwired identity, by hard coded identity, by the position within a network, etc., others are authorized by means of the user entering authorization credentials giving the device the right to allow people pass through the passage.

[0023] The communication in the system may be based on a Session Initiation Protocol (SIP) connecting the door station 106 to an SIP server 212 via a connection 214. The communication may then be relayed via a computer network 216, e.g. the Internet, a LAN, a WAN, wired, wireless, etc., to another SIP server 218, which is setting up the final communication link to the authorized device via connection or via an analogue telephone connection 222. The SIP server being a central component of an Internet protocol private branch exchange (IP PBX) which is arranged to deal with the setup of SIP calls in the network 214, 220. The transmission of video and/or audio is generally performed using another protocol, e.g. real-time transport protocol (RTP), RTSP, SRTP, ZRTP, or similar.

[0024] Alternatively, the communication may be entirely computer network protocol based using RTP control protocol (RTCP) and RTP, RTSP, SRTP, SRTCP or ZRTP via the computer network 216, see line 224 in figure. Moreover, the communication with a video managing system may be performed using protocols of the video surveillance system, e.g. using any known API, depicted by line 226 in the figure. In one computer network protocol based implementation the communication between door station 106 and authorized device 202-210 is implemented as peer to peer communication.

[0025] Any single one or combination of these or other communications protocols or methods may be implemented in the door station 106.

[0026] Now referring to Fig 3, a door station 106 according to one embodiment of the present invention is depicted. The door station 106 includes a camera 302 arranged to capture moving images of a person in front of the door station 106, and an input device 304 including a request key, a plurality of function keys, and/or a numeric keypad. Further, the doorstation106 includes a microphone 306 for capturing sound from the vicinity of the door station 106, a speaker 308 arranged to present audio captured or recorded elsewhere and/or to present audio messages stored in the door station 106, an overlay device 310 arranged to generate a graphic representation of signals received from an authorized device 202-210 and overlay the generated graphic onto the video sent to the authorized device 202-210, a processing unit 312 arranged to implement functions and processes of the door station 106, a communication interface 314 arranged to connect the door station 106 to a communication channel, as disclosed above, for transmission of information to and reception of information from an authorized device 202-210, and a lock interface 316 arranged to send a signal to a lock blocking the passage of an entry in order to change the state of the lock from open to closed or from closed to open.

[0027] In one embodiment the signal intended for the lock blocking the passage of entry is sent via the communication interface 314 to an access controller (not shown). The signal is in the form of data sent using the network communication. Then the access controller interprets the communication and generates the signal sent to the passage for instructing the lock to open or to close.

[0028] The communication interface 314 includes a network transceiver. There are plenty of transceivers to select from that are known to the skilled person. Various communications protocols may be used, e.g. TCP, UDP, IP, SIP, RTP, RTCP, SIPS, RTSP, SRTP, HTTP, HTTPS, ZRTP, TLS, SRTCP.

[0029] The camera 302 may be any camera capturing video and is according to one embodiment arranged so that a person or an object positioned in front of the door station 106 is in the field of view of the camera 302 and, thus, is captured in the video.

[0030] The input device 304 may be a single key or button arranged to initiate a sending of a communication request to an authorized device, i.e. initiating sending of a request for a video call via SIP to an authorized device, initiating sending of a request of communication via RTP to an authorized device, etc. In one embodiment the input device 304 alternatively or additionally includes a plurality of keys. For example, the input device 304 may be a key pad that may be used to enter an identity to which the request of communication is to be sent, e.g. a telephone number, an apartment number, an office number, or a code identifying the intended recipient of the request.

[0031] The overlay device 310 is a video processing device arranged to translate a received signal or code into a graphical representation and put it on top of the video that is to be sent to the authorized device via the communication transceiver 314. It may be a dedicated device implementing the overlay functionality or it may be a process running on the processing unit 312. The dedicated device may be a hardwired logic circuit or it may be a programmable device. There may be various video sessions established from the door station 106 with different authorized devices or other devices. Different sessions may get different overlays, e.g. the session of a authorized device entering instructions may get an overlay of symbols or indications of the instruction while another device just displaying the video from the door station gets an video without any overlay. In other words, the overlay is session specific. The received signal that is to be translated into a graphical representation is received from the authorized device 202-210. For example, if a dual-tone multiple-frequency signal (DTMF signal) representing a person pushing the "4" key on the keypad of an authorized device 202-210, then the overlay device 310 generates a graphical representation of the digit 4 and put it on top of the video to be sent to the authorized device 202. The generation of the graphical representation may be performed in any known way and the superimposing of graphics onto a video stream, i.e. process of applying the graphic onto the outgoing video, may be performed in various ways known to the person skilled in the art. One subset of methods for superimposing graphics into a video stream includes replacing corresponding pixels in each frame with the overlay graphic, e.g. by writing the information into the memory addresses of the corresponding pixels.

[0032] The processing unit 312 may be a custom processing unit designed for the door station 106 or a general processing unit configured to control the door station 106 and its functionality. Further, it is configured to recognise and execute instructions received from authorized devices 202-210.

[0033] Now referring to Fig 4, a process 400 for registering and executing instructions in a video capturing device 106, such as a standalone video camera connected to a communication network or a video camera embedded in another device that is connected to a communication network, is described. An example of a device in which a video camera may be embedded in is a door station 106. The process 400 requires that there is an authorized device 202-210 connected to the same communication network 216 as the video capturing device 106. The authorized device 202-210 being a device authorized to control functionality and operation of the video capturing device 106. The process starts with the video capturing device 106 receiving a signal from an authorized device 202-210, step 402, where the signal represents a symbol, which in turn indicates an instruction or part of an instruction entered at the authorized device 202-210. A representation of the symbol is stored in a buffer of the video capturing device, step 404, and the overlay device 310, generates a graphical representation of the symbol, step 406, which it superimposes onto the video currently captured and streamed by the video capturing device 106, step 408. The video stream may be streamed to the authorized device 202-210. However, the video stream may alternatively be streamed to a device that is enabled to present the video visually to the user of the authorized device 202-210, e.g. a standalone network connected display, a computer, a smartphone, etc. It may be advantageous to use such separate display device if the authorized device 202-210 has no display capability and the display device is not authorized and/or is difficult to make an authorized device 202-210 due to the system implemented.

[0034] Another signal is received from the authorized device 202-210 after the reception of the initial signal, step 410. This signal is checked, step 412 in order to determine if it represents a concluding input made at the authorized device 202-210. The concluding signal may be defined by the system implementing a predetermined instruction length, i.e. the number of symbols in an instruction, as the last symbol in an instruction. For example in a system having a three symbol instruction length the video capturing device 106 identifies the third inputted symbol as the concluding signal when received. In such an implementation the concluding symbol also is buffered as it is part of the instruction. Alternatively, the concluding signal may be a signal generated and sent by the authorized device from a confirmation input by the operator of the authorized device. In this embodiment the person operating the authorized device have the opportunity to review the entire instruction inputted before the video capturing device executes the instruction. The confirmation signal is generated in response to the user of the authorized device 202-210 being content that the symbols presented in the video stream corresponds to the intended instruction. In a DTMF based signal system the confirmation input may for instance be the hash tag symbol # and, thus, in such embodiment the received signal is checked for a signal representing the hash tag symbol. The rest of the description relating to Fig 4, is based on the concluding signal being a confirmation signal.

[0035] If the signal received is yet another symbol, i.e. not a symbol representing a confirmation input performed at the authorized device 202-210, then the process returns to step 404 where the symbol is buffered together with the previously received symbol/s.

[0036] If the signal received is the confirmation signal representing the confirmation input, then the process proceeds to step 414. In step 414 the symbols in the buffer is translated into an instruction of the video capturing device 106. The instruction may comprise one or a plurality of symbols. If the symbols in the buffer does not correspond to any instruction known to the system an error message will be generated and overlaid onto the video stream. Then the instruction is executed and the operation or function requested from the authorized device 202-210 is performed, step 416.

[0037] The signal received from the authorized device 202-210 may also be information sent in a data packet over the computer network. For example may the signal be received as a DTMF signal via a SIP INFO packet, an RTP packet, etc.

[0038] This method of registering and executing instructions may be especially advantageous in an access system including a door station having a camera. In particular when the authorized device 202-210 is not provided with software applications customised for the door station 106 or when the authorized device 202-210 is a simple device enabled to send signals, e.g. DTMF, and receive the video captured by the camera 302 of the door station 106. As mentioned previously the video may not necessarily be presented on the same device as the one enabling manual input for generating signals to be sent to the door station 106 for providing instructions.

[0039] In Fig 5 there is showed an example of a method for controlling entry to an area by means of a door station 106 and an authorized device 202-210 using the method described in connection with Fig 4. A person that wants to enter into the specific area arrives at the door station 106 at which the person requests entry to the area by means of interacting with the door station 106, step 502. This interaction by the requesting person results in an input being received at the door station 106. The input may be the requesting person pushing a button on the door station 106, the requesting person keying in an identifier of an authorized device 202-210, etc. In response to the input resulting from step 502 a request for a communication session is sent to an authorized device 202-210, step 504. The request may be sent to a predetermined authorized device 202-210 or it may be sent to an authorized device 202-210 identified in connection with the input in step 502 by the person keying in an identity. Then the authorized device 202-210 accepts the request and the authorized device 202-210 and the door station 106 starts the communication session, step 506. The door station 106 is then registering and executing instructions as described in connection with Fig4, step 508. If the instruction executed is not an instruction to open the passage then the door station 106 executes the instruction in accordance with the processes defined in the door station 106, steps 510 and 512. However, if the instruction executed is an instruction to open the passage then the door station 106 generates an entry control signal instructing to allow entry into the area, steps 510 and 514. This signal may be sent directly from the door station 106 to a lock 108 arranged in the passage 110 or the signal may be sent to an access controller arranged to control the opening and locking of the passage 110.

[0040] To facilitate the understanding of the present invention an example situation will be described below. The first example relates to a person approaching the entry 110 or blocked passage 110 in order to enter a restricted area. The person approaches the door station 106 arranged in proximity to the blocked passage 110 and pushes a button 304 on the door station 106 in order to have someone unlock and open the blocked passage 110. The button 304 initiates a communication request in the form of a phone call using SIP. The authorized device 202-210, e.g. being a video enabled telephone, may be installed in an office within the perimeter of the restricted area or at a remote location. The answering of the video telephone sets up the communication and the image from the video camera 302 of the door station 106 is displayed on the video telephone 202-204. Now, the person at the door station 106 wanting to enter the area and the person controlling the video telephone 202-204 talks to each other and the person at the door station 106 states his errand. Then the person at the authorized device 202-210 decides if he would like to open the passage 110 or not for the person requesting entry into the area. Let us assume the person at the authorized device 202-210 decides to open the passage. In this example system the digit 1 represents an instruction to open the passage 110 and all instructions should be terminated using hash tag symbol #. The digit 1 is entered at the authorized device 202-210 and shortly thereafter the digit 1 is presented on the screen of the video telephone 202-204 on top of the image of the scene captured by the video camera. The digit is, as described earlier, overlaid at the door station 106 and thereby integrally incorporated into the video stream received at the video telephone 202-204. Thereby the user of the video telephone 202-204 is able to confirm the input made by looking at the display even if the video telephone 202-204 only is able to operate as a telephone and to receive video streams. When the user of the video telephone 202-204 sees on the screen that he has entered the digit 1 and that the door station 106 has received the digit 1 he simply pushes the hash tag key in order to have the door station 106 execute the instruction represented by the digit 1, which in this case was to unlock the passage 110. The door station 106 receives the confirmation signal, i.e. the hash tag, and generates a signal instructing the passage 110 to be unlocked. This signal is then sent to an access control device controlling the lock 108 of the passage 110.

[0041] Other instructions sent from the authorized device 202-210 may for example be an instruction making the door station 106 present possible commands, show the latest received instructions as a list of previous instructions, activate devices/accessories connected to I/O terminals on the door station 106, switch light source on or off, mute and unmute the speaker 308, control the direction of the camera, e.g. pan, tilt, and/or zoom, reboot the door station 106, etc.

[0042] The instruction for making the door station 106 present a list of possible instructions may be implemented to make the door station generate an overlay on the video stream to be sent from the door station 106 of a list of possible instructions and a short description of the instruction. Hence, the authorized device 202-210 is not required to be customised for a particular system but may serve a plurality of systems. Moreover, this facilitates the operation of various systems by the same operator as the operator easily may get a list of possible instructions presented from the door station 106 itself and therefore may operate different systems having different sets of instructions in a facilitated fashion.

[0043] In an alternative embodiment the video capturing device 106 such as a networked motion video camera is controllable from an authorized device 202-210 as described in connection with Fig 4. Thereby the motion video camera may be controlled, setup, and/or configured from a great variety of devices. Accordingly, the advantage of the device being possible to control from any device enable to send commands and receive video without the use of any special purpose applications makes a system implementing the invention more flexible in that many various devices having basic communication functions may be used for controlling the camera.


Claims

1. A method for registering and executing instructions in a video capturing device, comprising:

establishing a first video session between the video capturing device and a first device used in controlling the video capturing device,

receiving at a communication interface of the video capturing device at least one signal representing a first input made using the first device,

generating at the video capturing device a first graphical representation of the at least one received signal,

superimposing at the video capturing device the first graphical representation onto video captured by the video capturing device, wherein the superimposed first graphical representation is specific to the first video session in that it is not superimposed onto video in a second video session established between the video capturing device and a second device,

sending the captured video with the superimposed first graphical representation to the first device,

receiving at the video capturing device, after the at least one signal representing an input made at the first device has been received and the first graphical representation has been generated and superimposed onto the captured video and sent to the first device, a concluding signal representing a concluding input made using the first device,

translating, in response to said concluding input, the received at least one signal into an instruction executable by the video capturing device, and

executing the instruction resulting from the translation of the at least one signal.


 
2. The method of claim 1, further comprising:
sending the captured video to the second device without the superimposed first graphical representation.
 
3. The method of claim 1, further comprising:
sending the captured video to the second device with a superimposed second graphical representation which is different from the first graphical representation.
 
4. The method according to any one of claims 1-3, wherein the instruction represented by the at least one signal is an operation requested by a user of the first device.
 
5. The method according to any one of claims 1-4, wherein said received at least one signal represents a symbol inputted by an operator of the first device, wherein said superimposing of the first graphical representation onto the captured video includes changing pixel values for pixels forming the symbol in the captured video.
 
6. The method according to claim 4, wherein said superimposing of the first graphical representation onto the captured video includes changing pixel values for pixels forming a visual representation of said requested operation.
 
7. The method according to any one of claims 1-6, wherein the received at least one signal and the received concluding signal are in the form of DTMF-signals.
 
8. The method according to any one of claims 1-7, wherein the first video session is a SIP-session (Session Initiation Protocol).
 
9. The method according to any one of claims 1-8, wherein the video capturing device is embedded in a networked motion video camera.
 
10. A method for controlling entry to an area, comprising:

the method for registering and executing instructions according to any one of claims 1-9, wherein the video capturing device is embedded in a door station,

receiving, at the door station and before receiving the at least one signal representing the first input made using the first device, an input representing a request for the first video session with the first device being authorized to grant entry to the area,

sending to the first device, in response to receipt of said input representing the request, a request for setting up the first video session, and

generating an entry control signal in response to the receipt of the at least one signal and the concluding signal from the first device.


 
11. The method according to claim 10, wherein the entry control signal is sent from the door station to a physical access controller controlling entry to said area.
 
12. The method according to claim 10 or 11, wherein the input that is representing the request for the first video session is generated by physical interaction with a device mounted in the door station.
 
13. The method according to any one of claims 10-12, wherein the concluding signal received is a confirmation signal received from the first device in response to a confirmation input made at the first device.
 


Ansprüche

1. Verfahren zum Registrieren und Ausführen von Anweisungen in einer Videoaufnahmevorrichtung, umfassend:

Herstellen einer ersten Videositzung zwischen der Videoaufnahmevorrichtung und einer ersten Vorrichtung, die zum Steuern der Videoaufnahmevorrichtung verwendet wird,

Empfangen an einer Kommunikationsschnittstelle der Videoerfassungsvorrichtung mindestens eines Signals, das eine erste Eingabe repräsentiert, die unter Verwendung der ersten Vorrichtung gemacht wurde,

Erzeugen an der Videoaufnahmevorrichtung einer ersten grafischen Darstellung des mindestens einen empfangenen Signals,

Überlagern an der Videoaufnahmevorrichtung der ersten grafischen Darstellung auf das Video, das von der Videoaufnahmevorrichtung aufgenommen wurde, wobei die überlagerte erste grafische Darstellung insofern spezifisch für die erste Videositzung ist, als dass sie nicht auf Video in einer zweiten Videositzung überlagert wird, die zwischen der Videoaufnahmevorrichtung und einer zweiten Vorrichtung hergestellt wird,

Senden des aufgenommenen Videos mit der überlagerten ersten grafischen Darstellung an die erste Vorrichtung,

Empfangen an der Videoaufnahmevorrichtung, nachdem das mindestens eine Signal, das eine Eingabe repräsentiert, die an der Vorrichtung gemacht wurde, empfangen wurde und die erste grafische Darstellung erzeugt und auf das aufgenommene Video überlagert und an die erste Vorrichtung gesendet wurde, eines abschließenden Signals, das eine abschließende Eingabe repräsentiert, die unter Verwendung der ersten Vorrichtung gemacht wurde, Übersetzen, in Reaktion auf die abschließende Eingabe, des empfangenen mindestens einen Signals in eine Anweisung, die von der Videoaufnahmevorrichtung ausführbar ist, und

Ausführen der Anweisung, die aus der Übersetzung des mindestens einen Signals resultiert.


 
2. Verfahren nach Anspruch 1, ferner umfassend:
Senden des aufgenommenen Videos an die zweite Vorrichtung ohne die überlagerte grafische Darstellung.
 
3. Verfahren nach Anspruch 1, ferner umfassend:
Senden des aufgenommenen Videos an die zweite Vorrichtung mit einer überlagerten zweiten grafischen Darstellung, die sich von der ersten grafischen Darstellung unterscheidet.
 
4. Verfahren nach einem der Ansprüche 1 bis 3, wobei die Anweisung, die von dem mindestens einen Signal repräsentiert wird, eine Operation ist, die von einem Benutzer der ersten Vorrichtung angefordert wurde.
 
5. Verfahren nach einem der Ansprüche 1 bis 4, wobei das empfangene mindestens eine Signal ein Symbol repräsentiert, das von einem Bediener der ersten Vorrichtung eingegeben wurde, wobei das Überlagern der ersten grafischen Darstellung auf das aufgenommene Video Verändern der Pixelwerte für Pixel, die das Symbol in dem aufgenommenen Video bilden, beinhaltet.
 
6. Verfahren nach Anspruch 4, wobei das Überlagern der ersten grafischen Darstellung auf das aufgenommene Video Verändern von Pixelwerten für Pixel, die eine visuelle Darstellung der angeforderten Operation bilden, beinhaltet.
 
7. Verfahren nach einem der Ansprüche 1 bis 6, wobei das empfangene mindestens eine Signal und das empfangene abschließende Signal in der Form von DTMF-Signalen sind.
 
8. Verfahren nach einem der Ansprüche 1 bis 7, wobei die erste Videositzung eine SIP-Sitzung ist (Session Initiation Protocol).
 
9. Verfahren nach einem der Ansprüche 1 bis 8, wobei die Videoaufnahmevorrichtung in einer vernetzten Bewegungsvideokamera eingebettet ist.
 
10. Verfahren zum Kontrollieren von Zugang zu einem Bereich, umfassend:

das Verfahren zum Registrieren und Ausführen von Anweisungen gemäß einem der Ansprüche 1 bis 9, wobei die Videoaufnahmevorrichtung in einer Türstation eingebettet ist,

Empfangen, an der Türstation und vor Empfangen des mindestens einen Signals, das die erste Eingabe repräsentiert, die unter Verwendung der ersten Vorrichtung erfolgte, einer Eingabe, die eine Anforderungen nach der ersten Videositzung mit der ersten Vorrichtung, die autorisiert ist, Zugang zu dem Bereich zu gewähren, repräsentiert,

Senden an die erste Vorrichtung, in Reaktion auf Empfang der Eingabe, die die Anforderung repräsentiert, einer Anforderung zum Einrichten der ersten Videositzung, und

Erzeugen eines Zugangskontrollsignals in Reaktion auf den Empfang des mindestens einen Signals und des abschließenden Signals von der ersten Vorrichtung.


 
11. Verfahren nach Anspruch 10, wobei das Zugangskontrollsignal von der Türstation an eine physikalische Zugangssteuereinrichtung, die den Zugang zu dem Bereich kontrolliert, gesendet wird.
 
12. Verfahren nach Anspruch 10 oder 11, wobei die Eingabe, die die Anforderung nach der ersten Videositzung repräsentiert, von physikalischer Interaktion mit einer Vorrichtung erzeugt wird, die in der Türstation befestigt ist.
 
13. Verfahren nach einem der Ansprüche 10 bis 12, wobei das empfangene abschließende Signal ein Bestätigungssignal ist, das von der ersten Vorrichtung in Reaktion auf eine Bestätigungseingabe, die an der ersten Vorrichtung gemacht wurde, empfangen wurde.
 


Revendications

1. Procédé d'enregistrement et d'exécution d'instructions dans un dispositif de capture vidéo, comprenant de:

établir une première session vidéo entre le dispositif de capture vidéo et un premier dispositif utilisé pour commander le dispositif de capteur vidéo,

recevoir au niveau d'une interface de communication du dispositif de capteur vidéo au moins un signal représentant une première saisie faite en utilisant le premier dispositif,

généré au niveau du dispositif de capture vidéo une première représentation graphique d'au moins un signal reçu,

superposer au niveau du dispositif de capture vidéo la première représentation graphique sur une vidéo capturée par le dispositif de capture vidéo, dans lequel la première représentation graphique superposée est spécifique à la première session vidéo en ce qu'elle n'est pas superposée sur une vidéo dans une seconde session vidéo établie entre le dispositif de capteur vidéo et un second dispositif,

envoyer la vidéo capturée avec la première représentation graphique superposée au premier dispositif,

recevoir au niveau du dispositif de capture vidéo, après qu'au moins un signal représentant une saisie faite au niveau du premier dispositif a été reçu st que la première représentation graphique a été générée et superposée sur la vidéo capturée et envoyée au premier dispositif, un signal final représentant une saisie finale faite en utilisant le premier dispositif,

traduire, en réponse à ladite saisie finale, au moins un signal reçu en une instruction exécutable par le dispositif de capture vidéo et

exécuter l'instruction résultant de la traduction d'au moins un signal.


 
2. Procédé selon la revendication 1, comprenant en outre de:
envoyer la vidéo capturée au second dispositif sans la première représentation graphique superposée.
 
3. Procédé selon la revendication 1, comprenant en outre de:
envoyer la vidéo capturée au second dispositif avec une seconde représentation graphique superposée qui est différente de la première représentation graphique.
 
4. Procédé selon une quelconque des revendications 1-3, dans lequel l'instruction représentée par au moins un signal est une opération demandée par un utilisateur du premier dispositif.
 
5. Procédé selon une quelconque des revendications 1-4, dans lequel au moins un signal reçu représente un symbole saisi par un opérateur du premier dispositif, dans lequel ladite superposition de la première représentation graphique sur la vidéo capturée inclut de changer les valeurs de pixel pour des pixels formant le symbole dans la vidéo capturée.
 
6. Procédé selon la revendication 4, dans lequel ladite superposition de la première représentation graphique sur la vidéo capturée inclut de changer les valeurs de pixel pour des pixels formant une représentation visuelle de ladite opération demandée.
 
7. Procédé selon une quelconque des revendications 1-6, dans lequel au moins un signal reçu est le signal final reçu prend la forme de signaux DTMF.
 
8. Procédé selon une quelconque des revendications 1-7, dans lequel la première session vidéo est une session SIP (protocole d'amorce de session).
 
9. Procédé selon une quelconque des revendications 1-8, dans lequel le dispositif de capture vidéo est incorporé dans une caméra vidéo animée mise en réseau.
 
10. Procédé de commande d'une entrée dans une zone, comprenant:

le procédé d'enregistrement et d'exécution d'instructions selon une quelconque des revendications 1-9, dans lequel le dispositif de capture vidéo est incorporé dans une platine de rue,

recevoir, au niveau de la platine de rue et avant de recevoir au moins un signal représentant la première saisie faite en utilisant le premier dispositif,

une saisie représentant une demande pour la première session vidéo avec le premier dispositif étant autorisé à accorder l'entrée à la zone,

envoyer au premier dispositif, en réponse à la réception de ladite saisie représentant la demande, une demande d'établissement de la première session vidéo, et

générer un signal de contrôle d'entrée en réponse à la réception d'au moins un signal et du signal final provenant du premier dispositif.


 
11. Procédé selon la revendication 10, dans lequel le signal de contrôle d'entrée est envoyé de la platine de rue à un contrôleur d'accès physique contrôlant l'entrée dans ladite zone.
 
12. Procédé selon la revendication 10 ou 11, dans lequel la saisie qui représente la demande pour la première session vidéo est générée par interaction physique avec un dispositif monté dans la platine de rue.
 
13. Procédé selon une quelconque des revendications 10-12, dans lequel le signal final reçu est un signal de confirmation reçu depuis le premier dispositif en réponse à une saisie de confirmation faite au niveau du premier dispositif.
 




Drawing