(19)
(11) EP 3 432 182 B1

(12) EUROPEAN PATENT SPECIFICATION

(45) Mention of the grant of the patent:
15.04.2020 Bulletin 2020/16

(21) Application number: 18161355.5

(22) Date of filing: 13.03.2018
(51) International Patent Classification (IPC): 
G06F 21/36(2013.01)

(54)

SYSTEMS AND METHODS FOR SECURE, ACCESSIBLE AND USABLE CAPTCHA

SYSTEME UND VERFAHREN FÜR SICHERE, BARRIEREFREIE UND BENUTZBARE CAPTCHA

SYSTÈMES ET PROCÉDÉS POUR CAPTCHA SCURE, ACCESSIBLE ET UTILISABLE


(84) Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

(30) Priority: 17.07.2017 IN 201721025362

(43) Date of publication of application:
23.01.2019 Bulletin 2019/04

(73) Proprietor: Tata Consultancy Services Limited
Maharashtra (IN)

(72) Inventors:
  • JADHAV, Charudatta
    400093 Maharashtra, Mumbai (IN)
  • AGRAWAL, Sumeet
    400093 Maharashtra, Mumbai (IN)
  • PALADUGU, Madhu Priyatam Venkata
    400093 Maharashtra, Mumbai (IN)

(74) Representative: Goddar, Heinz J. 
Boehmert & Boehmert Anwaltspartnerschaft mbB Pettenkoferstrasse 22
80336 München
80336 München (DE)


(56) References cited: : 
US-A1- 2014 101 739
US-A1- 2015 271 166
US-A1- 2014 307 876
   
  • Matej Saric: "The Accessibility Demand for Audio CAPTCHA", , 25 December 2016 (2016-12-25), XP055489169, Retrieved from the Internet: URL:https://web.archive.org/web/2016122504 3132/https://captcha.com/articles/audio-ca ptcha.html [retrieved on 2018-06-29]
  • "A New Method for Telling Humans and Computers Apart Automatically", IP.COM JOURNAL, IP.COM INC., WEST HENRIETTA, NY, US, 20 October 2009 (2009-10-20), XP013134856, ISSN: 1533-0001
   
Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 99(1) European Patent Convention).


Description

Technical Field



[0001] The embodiments herein generally relate to data processing and information security, and more particularly to systems and methods for inclusive CAPTCHA that addresses users of all abilities and particularly optimizes tradeoff between security and accessibility.

Background



[0002] Completely Automated Public Turing Tests to Tell Computers and Humans Apart (CAPTCHA) is a computer program or system intended to distinguish human from machine input as a security measure. The purpose of the CAPTCHA is to protect web applications like website registrations, online polls, etc. from malicious programs (bots), dictionary attacks and web crawlers. Users are given a challenge to solve which is understandable by humans. Based on the response given, the system determines whether the user is human or not. CAPTCHAs are classified based on the type of content that is distorted. Some types of CAPTCHA are CAPTCHA based on text, CAPTCHA based on image, CAPTCHA based on audio and CAPTCHA based on puzzle. The state of art CAPTCHA imperatively involves a tradeoff between goals of security and accessibility. It has been observed that in order to provide accessible CAPTCHA for users with disabilities, protected applications may need to be compromised on security aspects. Since CAPTCHA is primarily targeted at information security, it is a challenge to manage security while ensuring usability and accessibility for users of all abilities.

[0003] Document US 2015/271166 A1 discloses a method utilizing a CAPTCHA to generate a human likeness score including blocks: a) receiving a user solution to the CAPTCHA; b) receiving a user interaction pattern descriptive of an interaction undertaken by the user, through a graphical interface of the CAPTCHA, to achieve the user solution; c) determining the accuracy of the user solution; d) comparing the user interaction pattern against an interaction model generated from interaction patterns of previous users; e) calculating the human likeness score based upon the determination of block c) and the comparison of block d), wherein the human likeness score lies within a continuum of human likeness scores.

[0004] Document Matej Saric: "The Accessibility Demand for Audio Captcha" discloses Image-based Captcha challenges are a staple of web anti-spam protection, and for a reason. When implemented properly, they are a minor speed-bump for human users, but keep various bots and automated scripts dumbfounded. If spammer mischief mandates such a "human interaction proof' in the first place, what could be easier and more natural for your visitors than simply looking at a picture and reading several characters. But this ease-of-use is based on a pretty big assumption: that all visitors have flawless vision.

[0005] Document US 2014/307876 A1 discloses systems and methods for generating and performing a three-dimensional audio CAPTCHA are provided. One exemplary system can include a decoy signal database storing a plurality of decoy signals. The system also can include a three-dimensional audio simulation engine for simulating the sounding of a target signal and at least one decoy signal in an acoustic environment and outputting a stereophonic audio signal based on the simulation. One exemplary method includes providing an audio prompt to a resource requesting entity. The audio prompt can have been generated based on a three-dimensional audio simulation of the sounding of a target signal containing an authentication key and at least one decoy signal in an acoustic environment. The method can include receiving a response to the audio prompt from the resource requesting entity and comparing the response to the authentication key.

[0006] Document US 2014/101739 A1 discloses a human interactive proof portal 140 may control access to an online data service 122. A communication interface 260 establishing a human interactive proof session 600 with a user device 110 accessing an online data service 122. The communication interface 260 may iteratively send an audio proof challenge set having multiple audio proof challenges each asking a semantic query to the user device 110 for presentation to a user. A processor 220 may provide access to the online data service 122 based in part on at least one proof response having a semantic reply indicating a human user.

SUMMARY



[0007] Embodiments of the present disclosure present technological improvements as solutions to one or more of the above-mentioned technical problems recognized by the inventors in conventional systems.

[0008] In an aspect, there is provided a processor implemented method for inclusive CAPTCHA comprising: in response to a user request for a webpage having CAPTCHA, creating a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios; randomly selecting a comprehension question based on the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file; and transmitting the webpage including the CAPTCHA. Again in response to a user input to the comprehension question, the method intelligently detects either a human input or a machine input based on a self-learning CAPTCHA decision module.

[0009] In another aspect, there is provided a system comprising: one or more hardware processors and one or more data storage devices (102) operatively coupled to the one or more processors for storing instructions configured for execution by the one or more processors, the instructions being comprised in: a CAPTCHA generating module configured to, in response to a user request for a webpage having CAPTCHA, create a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios; randomly select a comprehension question based on the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file; and transmit the webpage including the CAPTCHA. The instructions are further comprised in a self-learning CAPTCHA decision module configured to, in response to a user input to the comprehension question, intelligently detect either a human input or a machine input.

[0010] In yet another aspect, there is provided a computer program product comprising a non-transitory computer readable medium having a computer readable program embodied therein, wherein the computer readable program, when executed on a computing device, causes the computing device to: in response to a user request for a webpage having CAPTCHA, create a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios; randomly select a comprehension question based on the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file; and transmit the webpage including the CAPTCHA. Again in response to a user input to the comprehension question, the computing device intelligently detects either a human input or a machine input based on a self-learning CAPTCHA decision module.

[0011] In an embodiment of the present disclosure, the created media file is at least one of an aural form or a visual form.

[0012] In an embodiment of the present disclosure, the created media file is a combination of two or more media files of the same form but varying type or a combination of two or more media files of different forms, each combination corresponding to the selected theme.

[0013] In an embodiment of the present disclosure, the varying types of the two or more media files are associated with an environment and contextual information thereof.

[0014] In an embodiment of the present disclosure, the CAPTCHA generating module is further configured to, in response to a user input to the comprehension question, select and transmit at least one alternative comprehension question for the selected theme.

[0015] In an embodiment of the present disclosure, each comprehension question is associated with a plurality of solutions corresponding to the created media file for the selected theme.

[0016] In an embodiment of the present disclosure, the CAPTCHA generating module is further configured to present the selected comprehension question in at least one of visual or aural form.

[0017] In an embodiment of the present disclosure, the self-learning CAPTCHA decision module is configured to intelligently detect either a human input or a machine input by: intelligently comparing the user input to previously stored user inputs from a decision repository of the self-learning CAPTCHA decision module, considering one or more of spelling errors, incomplete responses, contextual metonyms, synonyms and variants thereof; detecting either a human input or a machine input based on the comparing and associated decision thereof; and updating the decision repository with the user input and associated decision based on the detected input.

[0018] It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the embodiments of the present disclosure, as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS



[0019] The embodiments herein will be better understood from the following detailed description with reference to the drawings, in which:

FIG.1 illustrates an exemplary block diagram of a system for providing inclusive CAPTCHA, in accordance with an embodiment of the present disclosure;

FIG.2 illustrates an exemplary architectural diagram of the system for providing inclusive CAPTCHA, in accordance with an embodiment of the present disclosure;

FIG.3 is an exemplary flow diagram illustrating a computer implemented method for inclusive CAPTCHA, in accordance with an embodiment of the present disclosure;

FIG.4 illustrates a working flow diagram of the method for inclusive CAPTCHA, in accordance with an embodiment of the present disclosure;

FIG.5 illustrates a graphical representation of users with different abilities and number of successful and failed attempts using the inclusive CAPTCHA in accordance with an embodiment of the present disclosure;

FIG.6 illustrates a graphical representation of users with different abilities and number of attempts made to successfully complete the inclusive CAPTCHA in accordance with an embodiment of the present disclosure;

FIG.7 illustrates a graphical representation of users with different abilities and time taken to complete the inclusive CAPTCHA in accordance with an embodiment of the present disclosure;

FIG.8 illustrates a graphical representation of audio file wise analysis for main stream users of the inclusive CAPTCHA in accordance with an embodiment of the present disclosure; and

FIG.9 illustrates a graphical representation of audio file wise analysis for screen reader users of the inclusive CAPTCHA in accordance with an embodiment of the present disclosure.



[0020] It should be appreciated by those skilled in the art that any block diagram herein represent conceptual views of illustrative systems embodying the principles of the present subject matter. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudo code, and the like represent various processes which may be substantially represented in computer readable medium and so executed by a computing device or processor, whether or not such computing device or processor is explicitly shown.

DETAILED DESCRIPTION



[0021] Exemplary embodiments are described with reference to the accompanying drawings. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. Wherever convenient, the same reference numbers are used throughout the drawings to refer to the same or like parts. While examples and features of disclosed principles are described herein, modifications, adaptations, and other implementations are possible without departing from the spirit and scope of the disclosed embodiments. It is intended that the following detailed description be considered as exemplary only, with the true scope and spirit being indicated by the following claims.

[0022] Before setting forth the detailed explanation, it is noted that all of the discussion below, regardless of the particular implementation being described, is exemplary in nature, rather than limiting.

[0023] Completely Automated Public Turing Tests to Tell Computers and Humans Apart (CAPTCHA) is a challenge-response test which determines whether the user is a human or not. There are different types of CAPTCHA based on distortion introduced.
  1. 1) CAPTCHA based on text: A textual challenge based on reading text or other visual/aural perception tasks are presented to users. For example,
    • Flower, resting, lawyer and campsite: the word starting with "c " is?
    • What is 1 + six?
    • Which of sock, library, cake or red is a color?
    • The word in capitals from relieves, luxuriate or CAMPAIGN is?

    CAPTCHA based on text which have mostly arithmetic, logical or general knowledge based questions can be easily solved by computing engines, creating a security issue. These CAPTCHA are also arduous for cognitively- disabled users to solve.
  2. 2) CAPTCHA based on image: Image having combinations of distorted characters and obfuscation techniques or images of real-world objects like animals, people or landscapes are presented to users which they have to identify or read and retype. They are based on reading text or other visual-perception tasks. For example,
    • An image having 12 pets may be presented and user maybe asked to identify all images of cats.
    • User may be presented a picture divided into chunks and user has to combine or swap them to form a complete picture like a jigsaw puzzle.

    CAPTCHA based on visual perception are inaccessible to visually impaired users as they are designed to be unreadable by machines; common assistive technology tools such as screen readers also cannot interpret them. Sometimes these CAPTCHA have distorted text and images in such a way that it gets difficult for main stream users and low vision users as well to read, thus hampering usability. Also image based CAPTCHA which are in the form of puzzles operations like drag and drop, flipping of images, pointing with mouse may be difficult to perform for keyboard only users. They may also be a challenge for cognitively-disabled users.
  3. 3) CAPTCHA based on audio: Audio CAPTCHA was developed particularly for visually impaired users based on sound based systems. Users are generally required to listen to an audio and type into a textbox. They are based on aural-perception tasks. As per research literature, humans find audio CAPTCHA difficult to solve because of distortion interference present. Misinterpretation of letters like T and D, B and P which sound similar when distorted is common. Also, these audios may be easily decoded by current automated speech recognition (ASR) techniques raising a security concern. Audio CAPTCHA also face localization issues as the content in the audios may not be understandable to every user. Audio playback is linear. User relying on screen reader may have a challenge in understanding the difference between voice of the screen reader and the audio.


[0024] It may be noted from the state of art CAPTCHA that successful implementation of CAPTCHA that is accessible, secure and simultaneously usable is a challenge. Visually impaired, low vision users find image based CAPTCHA challenging as they are unable to visually perceive it thus making it inaccessible and un-usable. Security aspect of image based CAPTCHA may also be negotiated and broken with machine learning techniques unless accessibility is compromised. Audio CAPTCHA may cater to visually challenged users but may be difficult to solve and time consuming and have a low success rate as seen in research literature, thus further degrading usability and effecting accessibility. Also state of art audio CAPTCHA may be been broken by high-quality Automatic Speech Recognition (ASR) and noise removal systems. The present disclosure addresses particularly the tradeoff between accessibility and security seen in state of the art CAPTCHA and accordingly, the methods and systems of the present disclosure aim to provide an inclusive CAPTCHA that meets the orthogonal requirements of usability, accessibility and security while addressing users of all needs and hence being inclusive.

[0025] Referring now to the drawings, and more particularly to FIGS. 1 through 9, where similar reference characters denote corresponding features consistently throughout the figures, there are shown preferred embodiments and these embodiments are described in the context of the following exemplary system and method.

[0026] FIG.1 illustrates an exemplary block diagram of a system 100 for providing inclusive CAPTCHA, in accordance with an embodiment of the present disclosure. In an embodiment, the system 100 includes one or more processors 104, communication interface device(s) or input/output (I/O) interface(s) 106, and one or more data storage devices or memory 102 operatively coupled to the one or more processors 104. The one or more processors 104 that are hardware processors can be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, graphics controllers, logic circuitries, and/or any devices that manipulate signals based on operational instructions. Among other capabilities, the processor(s) are configured to fetch and execute computer-readable instructions stored in the memory. In an embodiment, the system 100 can be implemented in a variety of computing systems, such as laptop computers, notebooks, hand-held devices, workstations, mainframe computers, servers, a network cloud and the like.

[0027] The I/O interface device(s) 106 can include a variety of software and hardware interfaces, for example, a web interface, a graphical user interface, and the like and can facilitate multiple communications within a wide variety of networks N/W and protocol types, including wired networks, for example, LAN, cable, etc., and wireless networks, such as WLAN, cellular, or satellite. In an embodiment, the I/O interface device(s) can include one or more ports for connecting a number of devices to one another or to another server.

[0028] The memory 102 may include any computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes. In an embodiment, one or more modules (not shown) of the system 100 can be stored in the memory 102.

[0029] FIG.2 illustrates an exemplary architectural diagram of the system for providing inclusive CAPTCHA, in accordance with an embodiment of the present disclosure. The inclusive CAPTCHA of the present disclosure considers the orthogonal requirements of usability, security and accessibility for users with disabilities (visual, hear, motor, disability) and also main stream users to make the CAPTCHA practically inclusive. FIG.3 is an exemplary flow diagram illustrating a computer implemented method 200 for inclusive CAPTCHA, in accordance with an embodiment of the present disclosure. In an embodiment, the system 100 comprises one or more data storage devices or memory 102 operatively coupled to the one or more processors 104 and is configured to store instructions configured for execution of steps of the method 200 by the one or more processors 104.

[0030] The steps of the method 200 will now be explained in detail with reference to the components of the system 100 based on the architectural diagram of FIG.2. In an embodiment, the system 100 may comprise exemplary modules such a CAPTCHA generation module (not particularly illustrated), and a self-learning CAPTCHA decision module (not particularly illustrated). In an embodiment, the CAPTCHA generating module is configured to create in real-time, a media file, at step 202, in response to a user request for a webpage having CAPTCHA, wherein the created media file is characterized by distortion interference. In the context of the present disclosure, distortion interference involves overlap of media files. In accordance with the present disclosure, the created file may have an aural form, a visual form or a combination of both, such as audio-visual form, wherein the visual form may include videos or animations. However, for ease of explanation, certain exemplary embodiments may be explained with reference to audio files that may not be construed as limiting the scope and/or the applicability of the appended claims. In another embodiment, the created media file is a combination of two or more media files that are of the same form or of different forms. Again, when the two or more media files are of the same form, they may be of varying type. In accordance with the present disclosure, there may be a plurality of themes associated with real-world scenarios. For instance, the created media file may be an audio file exemplifying conversation in a school, traffic, railway station, zoo, park, and the like. The created media file corresponds to a selected theme from the plurality of themes. Each of the mentioned themes such as school, railway station, zoo, and the like may be associated with a library of media files. Again, in an embodiment, the varying types of the two or more media files are associated with an environment and associated contextual information. For instance, in case of the created media file being an audio file, each theme may be associated with a library of audio files wherein each library may comprise further sub-libraries of audio files pertaining to main dialogues, environmental noise, background noise and background speech in line with the theme of the environment which may be combined in real-time and presented to the user as a created media file. The level of possible combinations for each theme and the possible variants along with the distortion interference or overlap of the media files, for instance the described four types of audio files to create the media file enables addressing the security aspect.

[0031] In an embodiment, the CAPTCHA generating module is configured to select a comprehension question randomly, at step 204, for the created media file as the CAPTCHA. In an embodiment, a question bank may comprise all possible comprehension questions related to each theme. In an embodiment, the selected comprehension question is based on the selected theme, geography associated with a user requesting the webpage and context of the created media file; wherein geography associated with a user also pertains to the corresponding culture and language. FIG.4 illustrates a working flow diagram of the method for inclusive CAPTCHA, in accordance with an embodiment of the present disclosure. The CAPTCHA generating module first randomly selects a theme (a school in the illustration of FIG.4). In the exemplary embodiment as illustrated in FIG.4, the created media file is an audio file that is a combination in real-time of four types of audio files viz., main dialogues, background noise, environmental noise and background speech. This technique of combining media files (for instance, audio files, by say a real-time audio mixer module) makes it difficult for an ASR engine to interpret the CAPTCHA question. The dialogue in the created audio file may be in single language or multiple languages, grammatically correct /incorrect to make it tough for ASRs to break. As the created audio file presented to the user is a real-world scenario which a human experiences in day-to day life like the school in FIG.4, processing, extracting and interpreting information would require minimal mental effort. Also, the fact that the theme relates to a real-world scenario and the selected question is based on the context of the created media file facilitates accessibility for users of all abilities. Again, the created media (audio in the exemplary embodiment) file may also be customized considering the geography, culture and language of the user. As seen in FIG.4, a comprehension question presented to the user is "What is the teacher teaching". The question asked in the CAPTCHA is purely based on the context of the school theme and the created audio file, thereby reducing cognitive load on the user.

[0032] It may be noted that merging multiple media files makes it difficult for automated scripts to recognize audio or images and extract information accurately to respond to the CAPTCHA; whereas, being context related and associated with real-world scenarios, the CAPTCHA is easy for a human to comprehend.

[0033] In an embodiment, the CAPTCHA generating module is configured to transmit the webpage including the CAPTCHA, at step 206. In an embodiment, the selected comprehension question may be presented in a visual form, an aural form or a combination thereof such as audio-visual form.

[0034] In an embodiment, the self-learning CAPTCHA decision module is configured to, at step 208, in response to a user input to the comprehension question, intelligently detect either a human input or a machine input.

[0035] In an embodiment, in response to the user input to the comprehension question, the CAPTCHA generating module may select and transmit at least one alternative comprehension question for the selected theme. For instance, if the self-learning CAPTCHA decision module is unable to make a decision based on the user input to the comprehension question, the CAPTCHA generating module may present an alternative comprehension question for the selected theme.

[0036] Once the user answers the comprehensive question presented the self-learning CAPTCHA decision module determines its correctness and decides whether the access is made by a human or a machine. In an embodiment, the self-learning CAPTCHA decision module is configured to detect either a human input or a machine input by firstly intelligently comparing the user input to previously stored user inputs from a decision repository of the self-learning CAPTCHA decision module. The CAPTCHA decision module may use information retrieval technology strategies to compare the user input by the user with stored user inputs by considering one or more of spelling errors, incomplete responses, contextual metonyms, synonyms and variants thereof. For instance, if a response to a comprehension question is "tea", possible acceptable responses include "chai (Hindi language synonym), tea (actual answer), chaha (Marathi language synonym), tee (incorrect spelling), hot drink (metonym), cha (incomplete response)". This feature of the present disclosure ensures accessibility particularly to dyslexic users and users with learning disabilities. The step of detecting either a human input or a machine input is based on the comparing and associated decision thereof. In a self-learning manner, each user input and associated decision based on the detected input by the self-learning CAPTCHA decision module is dynamically updated in the decision repository to build a knowledge base that may be continually updated for improved decision accuracy.

[0037] In the exemplary example of FIG.4, an acceptable user response for the question asked may be "math / mathematics /sum/ plus/ add /addition" which only a human can interpret and answer. In another instance, if the same theme (school) is selected, the question asked may remain same ("What is teacher teaching") but if the main dialogue audio file selected by the CAPTCHA generating module was modified wherein the teacher was teaching English, the accepted answers may change to "english / angrezi / british / foreign". Thus, in an embodiment, each comprehension question may be associated with a plurality of solutions corresponding to the created media file for the selected theme making it more complicated for ASR and machine learning engines to crack the CAPTCHA.

[0038] In another exemplary example, there may be four types of audio files:

Background noise: traffic sound, car honk 2 times, car driving sound

Background speech: beggar sound, a person selling newspaper in the traffic

Environment noise: A person in a car which is moving traffic is having a conversation on the phone.

Main dialogues: Hi Srikanth. What's the time there? It is 10o'clock in the morning in India.

Comprehension questions and acceptable user inputs may be:

To whom was the person asking time? Srikanth / male / Srikant

What is the time in India as specified in the audio? 10am / 10 o'clock / dus

How many times did the car in the background honk? 2 times / twice / do / 2 / two

Which country is being referred in the audio? India / Bharat

Which part of the day is 10o'clock - Morning



[0039] In accordance with the present disclosure, usability with regards to the inclusive CAPTCHA may be further enhanced by enabling replay of the created media file. User interface (UI) may be further enhanced by enabling the user to listen and type at the same time using screen reader and providing keyboard accessibility. As the user clicks on 'play', the focus may automatically set on a text input field to enable the user to type as soon as the audio is heard. In an embodiment, a shortcut key may be implemented to replay the audio for the user while the focus remains on the text field, thus reducing the number of interaction clicks needed by the user to complete the CAPTCHA.

[0040] Evaluation study conducted on an exemplary embodiment of the present disclosure with audio files:
Procedure: A user study and security testing was conducted to evaluate the usability, accessibility and security of the inclusive CAPTCHA. A total of 119 participants took part in the user study where 24 participants were screen reader users (partially blind or completely blind) and 95 were non-disabled (main stream) users recruited by crowd sourcing. There was no restriction on the type of screen reader software used by the screen reader users and all the users were asked to use their own devices. The users were given a brief introduction about CAPTCHA and were required to fill a web-form which had the inclusive CAPTCHA incorporated in it. Through the web-form, the users were asked information like name, age, electronic mail, whether visually impaired. From a set of 10 audios files, a single audio file was selected at random and presented to the user. To take the edge of usability issue, the user was asked to fill a feedback questionnaire consisting of three questions where the users were asked to rate the inclusive CAPTCHA of the present disclosure between land 5 where 1 is the least and 5 is the highest rating.

[0041] Security Analysis: CAPTCHA is a mechanism adopted by several portals to protect their web applications from malicious programs such as bots. Powerful robots backed by machine learning algorithms have the capability to break the existing audio CAPTCHAs. However, the inclusive CAPTCHA of the present disclosure consists of a media file such as an audio file in the exemplary of FIG.4 which is a combination of multiple orthogonal speech conversations and different sounds. The inclusive CAPTCHA consisted of human spoken dialogues superimposed with background noise in form of human voices and environment noise relevant to the theme. These created audio CAPTCHA were not decodable by a tested state of the art general purpose Automatic Speech Recognition engine (Speech to text convertor). Table 1 below shows some of the transcripts obtained from the ASR.
Table 2: Comparison of ASR engine and actual audio file transcripts
Theme and audio file Transcripts obtained from ASR Actual scripts of the audio file
Temple Environment:audio_1 He couldn't refrain greenness Where are you going, Kiran?
I am going to the Temple, John.
Phone Conversation in Busy Street: audio_2 null Hi Tina, What's the time there?
It's 10'o clock morning in India
School Environment :audio_3 added on the line Newton anything but how do you live like Florida yeah family jewels Add the numbers and you will add like 43 plus 72
Train Station Environment: audio_4 null 78861 from Delhi to Raipur is arriving on platform number 6
Zoo Environment: audio_5 null Mummy, see the white elephant
Lift Environment:Audio_ 6 can you clarify for legal shelf 7th floor. Can u press the 4th floor? Sure. 4th Floor. Going up.
Grocery Store Environment:audio_7 Export gate by he how can it be quantify it first people please on my face What's the price of onions? Its 45 per kg. How can it be 45, it was 30 on Monday
Hotel Reception Environment: audio_8 null Here's your key. Your room number is 324. If you need anything, please dial 0 for the reception area.


[0042] The results clearly indicate that the transcripts obtained from ASR were not at all close to the actual scripts of the audio, thereby proving the inclusive CAPTCHA of the present disclosure is strong and difficult for a machine to decode.

[0043] Quantitative analysis: The inclusive CAPTCHA audio of the present disclosure has an average play time of 11 seconds. The usability and accessibility of the CAPTCHA was measured on the following metrics:
  • Time to complete one entire challenge (completion time)
  • Thinking time of the user (the time duration between the audio completion and before the user types his response)
  • Number of times the user heard the audio to solve the CAPTCHA


[0044] User Performance Analysis: With the assumption that a user is able to solve the CAPTCHA in first attempt it is recorded as a successful attempt. FIG.5 illustrates a graphical representation of users with different abilities and number of successful and failed attempts using the inclusive CAPTCHA in accordance with an embodiment of the present disclosure. Of 86 % (103 out of 119) successful attempts by all users, 84 % (20 out of 24) of the users using screen reader and 88 % of the non-disabled (main stream) users were able to complete the challenge in the first attempt; however all the users were able to complete the task (success rate is 100 % as shown in FIG.6 wherein a graphical representation of users with different abilities and number of attempts made to successfully complete the inclusive CAPTCHA in accordance with an embodiment of the present disclosure is illustrated.

[0045] The failed attempts in the FIG.5 represent the number of users who failed to solve the CAPTCHA in the first attempt. However they were able to complete the CAPTCHA task after two or more attempts as shown in FIG.6. As observed, there was no user who required a third attempt in the mainstream category while only one user in screen reader category needed a third attempt.

[0046] Response time analysis: Table 2 below shows a statistical measure for average response time taken to solve the CAPTCHA in the user study by all, mainstream and screen reader users.
Table 2: Time Response comparison for inclusive CAPTCHA
  All users Mainstream users Screen reader users
Avg. response time 31 seconds 30 seconds 38 seconds
Minimum time 8 seconds 8 seconds 14 seconds
Maximum time 142 seconds 130 seconds 142 seconds
To understand the overall distribution and variation for the response time of all the participants part of user study a box plot was included for measurement in addition to the average response time. FIG.7 illustrates a graphical representation, in the form of a box plot of users with different abilities and time taken to complete the inclusive CAPTCHA in accordance with an embodiment of the present disclosure. The plot clearly shows no significant difference in the completion time scores of the lower and upper quartile range for all and mainstream users however more number of screen reader users fall in the upper quartile range. Summarizing, screen reader users took more time to respond to the CAPTCHA (FIG.7) but 84% of the users got it right in the first attempt (with no background about the inclusive CAPTCHA) also the response time improves when used multiple times. The average time to complete the inclusive CAPTCHA test is 30 seconds with minimum of 8 seconds for main stream users and 38 seconds with minimum of 14 seconds for the screen reader users which is less than 65.64 seconds, the average time to complete a ReCAPTCHA test as known in the art, thus making it more usable.

[0047] Audio file-wise analysis: FIG.8 illustrates a graphical representation of audio file wise analysis for main stream users of the inclusive CAPTCHA in accordance with an embodiment of the present disclosure. As observed, more than 85% of the mainstream users have played the audio for one time or two times. In the FIG.8, Audio 8 has the highest number of users listening to the audio file one time. This clearly is indicative of how clear the audio sounds were to a human. FIG.9 illustrates a graphical representation of audio file wise analysis for screen reader users of the inclusive CAPTCHA in accordance with an embodiment of the present disclosure. As observed, more than 83 % of the screen reader users have played the audio for one time or two times. This clearly is indicative of how clear the audio sounds were to a screen reader user. Table 3 below present the results comparing the inclusive CAPTCHA with the standard image CAPTCHA.
Table 3: Comparing inclusive CAPTCHA and image CAPTCHA between mainstream users.
  Inclusive CAPTCHA Image CAPTCHA
Average time taken to complete the CAPTCHA challenge 33 seconds 22 seconds
Minimum time 8 seconds 1.62 seconds
Maximum time 130 seconds 39 seconds
Average typing time 6 seconds 8 seconds
Minimum 1.43 seconds 3 seconds
Maximum 31 seconds 22 seconds
The average time taken to complete the inclusive CAPTCHA of the present disclosure is on higher side however it was observed that average typing time taken for inclusive CAPTCHA is less than that of Image CAPTCHA because the user had read the questions, listened to the audio file and answered the question based on the audio file thus reducing the cognitive load of thinking of an answer to solve the CAPTCHA.

[0048] Qualitative Analysis: At the end, a feedback questionnaire of 3 questions was asked and the users were supposed to rate between land 5 where 1 is the least and 5 is the highest rating. Table 4 below summarizes the average score of the each question given by the users depicting a positive feedback about the overall experience in solving the inclusive CAPTCHA.
Table 4: Average score for the feedback questions asked.
  Inclusive CAPTCHA Image CAPTCHA
How easy was it to use the inclusive CAPTCHA? 4.31 4.62
How easy is it to understand the content in the audio file? 4.13 4.48
How easy was it to solve the CAPTCHA challenge? 4.27 4.62


[0049] Thus in accordance with the present disclosure, systems and methods described herein above provide an inclusive CAPTCHA based on real-world scenarios that users could relate to in day to day life thereby decreasing the cognitive load. Results of security test shows it was difficult to crack and decode by automated engines known in the art. Accessibility and usability tests showed positive responses, easy to use and had a task success rate of 100% and 83% of the screen reader users participated were successful in the first attempt without any initial training of the inclusive CAPTCHA of the present disclosure. Screen Reader users were able to complete the task with an average time of 38 seconds with minimum of 14 seconds which is less as compared to previous studies thus making the inclusive CAPTCHA truly secure, accessible and usable for users of all abilities.

[0050] The written description describes the subject matter herein to enable any person skilled in the art to make and use the embodiments of the present disclosure. The scope of the subject matter embodiments defined here may include other modifications that occur to those skilled in the art. Such other modifications are intended to be within the scope if they have similar elements that do not differ from the literal language of the claims or if they include equivalent elements with insubstantial differences from the literal language.

[0051] The scope of the subject matter embodiments defined here may include other modifications that occur to those skilled in the art. Such other modifications are intended to be within the scope if they have similar elements that do not differ from the literal language of the claims or if they include equivalent elements with insubstantial differences from the literal language.

[0052] It is, however to be understood that the scope of the protection is extended to such a program and in addition to a computer-readable means having a message therein; such computer-readable storage means contain program-code means for implementation of one or more steps of the method, when the program runs on a server or mobile device or any suitable programmable device. The hardware device can be any kind of device which can be programmed including e.g. any kind of computer like a server or a personal computer, or the like, or any combination thereof. The device may also include means which could be e.g. hardware means like e.g. an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or a combination of hardware and software means, e.g. an ASIC and an FPGA, or at least one microprocessor and at least one memory with software modules located therein. Thus, the means can include both hardware means and software means. The method embodiments described herein could be implemented in hardware and software. The device may also include software means. Alternatively, the embodiments of the present disclosure may be implemented on different hardware devices, e.g. using a plurality of CPUs.

[0053] The embodiments herein can comprise hardware and software elements. The embodiments that are implemented in software include but are not limited to, firmware, resident software, microcode, etc. The functions performed by various modules comprising the system of the present disclosure and described herein may be implemented in other modules or combinations of other modules. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can comprise, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The various modules described herein may be implemented as software and/or hardware modules and may be stored in any type of non-transitory computer readable medium or other storage device. Some nonlimiting examples of non-transitory computer-readable media include CDs, DVDs, BLU-RAY, flash memory, and hard disk drives.

[0054] Further, although process steps, method steps, techniques or the like may be described in a sequential order, such processes, methods and techniques may be configured to work in alternate orders. In other words, any sequence or order of steps that may be described does not necessarily indicate a requirement that the steps be performed in that order. The steps of processes described herein may be performed in any order practical. Further, some steps may be performed simultaneously.

[0055] The illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments. Also, the words "comprising," "having," "containing," and "including," and other similar forms are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items. It must also be noted that as used herein and in the appended claims, the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise.

[0056] It is intended that the disclosure and examples be considered as exemplary only, with a true scope and spirit of disclosed embodiments being indicated by the following claims.


Claims

1. A processor implemented method (200) for inclusive CAPTCHA, the method comprising:

in response to a user request for a webpage having CAPTCHA,

creating a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios (202);

randomly selecting a comprehension question for the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file (204);

transmitting the webpage including the CAPTCHA (206); and

in response to a user input to the comprehension question,

detecting either a human input or a machine input based on a self-learning CAPTCHA decision module (208) in an intelligent manner ,wherein detecting either a human input or a machine input based on a self-learning CAPTCHA decision module comprises:

comparing the user input to previously stored user inputs from a decision repository of the self-learning CAPTCHA decision module, considering one or more of spelling errors, incomplete responses, contextual metonyms, synonyms and variants thereof;

detecting either a human input or a machine input based on the comparing and associated decision thereof; and

dynamically updating the decision repository with the user input and associated decision based on the detected input.


 
2. The processor implemented method of claim 1, wherein the created media file is at least one of an aural form or a visual form.
 
3. The processor implemented method of claim 2, wherein the created media file is a combination of two or more media files of the same form but varying type or a combination of two or more media files of different forms, each combination corresponding to the selected theme.
 
4. The processor implemented method of claim 3, wherein the varying types of the two or more media files are associated with an environment and contextual information thereof.
 
5. The processor implemented method of claim 1, wherein the step of in response to a user input to the comprehension question further comprises selecting and transmitting at least one alternative comprehension question for the selected theme.
 
6. The processor implemented method of claim 1, wherein each comprehension question is associated with a plurality of solutions corresponding to the created media file for the selected theme.
 
7. The processor implemented method of claim 1, wherein the step of transmitting the webpage including the CAPTCHA comprises presenting the selected comprehension question in at least one of visual or aural form.
 
8. A system (100) for providing inclusive CAPTCHA comprising:

one or more hardware processors (104); and

one or more data storage devices (102) operatively coupled to the one or more processors (104) for storing instructions configured for execution by the one or more processors (104), the instructions being comprised in:

a CAPTCHA generating module configured to, in response to a user request for a webpage having CAPTCHA,

create a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios;

randomly select a comprehension question for the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file;

transmit the webpage including the CAPTCHA;

a self-learning CAPTCHA decision module configured to, in response to a user input to the comprehension question,
detect either a human input or a machine input in an intelligent manner, wherein the self-learning CAPTCHA decision module is configured to detect either a human input or a machine input by:

comparing the user input to previously stored user inputs from a decision repository of the self-learning CAPTCHA decision module, considering one or more of spelling errors, incomplete responses, contextual metonyms, synonyms and variants thereof;

detecting either a human input or a machine input based on the comparing and associated decision thereof; and

updating the decision repository with the user input and associated decision based on the detected input.


 
9. The system of claim 8, wherein the created media file is at least one of an aural form or a visual form and is a combination of two or more media files of the same form but varying type or a combination of two or more media files of different forms, each combination corresponding to the selected theme, and wherein the varying type of the two or more media files are associated with an environment and contextual information thereof
 
10. The system of claim 8, wherein the CAPTCHA generating module is further configured to, in response to a user input to the comprehension question, select and transmit at least one alternative comprehension question for the selected theme.
 
11. The system of claim 8, wherein each comprehension question is associated with a plurality of solutions corresponding to the created media file for the selected theme.
 
12. The system of claim 8, wherein the CAPTCHA generating module is further configured to present the selected comprehension question in at least one of visual or aural form.
 
13. A computer program product comprising a non-transitory computer readable medium having a computer readable program embodied therein, wherein the computer readable program, when executed on a computing device, causes the computing device to:

in response to a user request for a webpage having CAPTCHA,

create a media file in real-time, wherein the created media file is characterized by distortion interference and corresponds to a selected theme from a plurality of themes associated with real-world scenarios;

randomly select a comprehension question for the created media file as the CAPTCHA, the comprehension question being based on the selected theme, geography associated with a user requesting the webpage and context of the created media file; and

transmit the webpage including the CAPTCHA;

in response to a user input to the comprehension question,

detect either a human input or a machine input, wherein the self-learning CAPTCHA decision module is configured to detect either a human input or a machine input by:

comparing the user input to previously stored user inputs from a decision repository of the self-learning CAPTCHA decision module, considering one or more of spelling errors, incomplete responses, contextual metonyms, synonyms and variants thereof;

detecting either a human input or a machine input based on the comparing and associated decision thereof; and

updating the decision repository with the user input and associated decision based on the detected input.


 


Ansprüche

1. Ein prozessorimplementiertes Verfahren (200) für inklusives CAPTCHA, wobei das Verfahren umfasst:

als Antwort auf eine Benutzeranfrage für eine Webseite mit CAPTCHA,

Erstellung einer Mediendatei in Echtzeit, wobei die erstellte Mediendatei durch Verzerrungstörungen gekennzeichnet ist und einem ausgewählten Thema aus einer Mehrzahl von Themen entspricht, die realen Szenarien zugeordnet sind (202); die zufällige Auswahl einer Verständnisfrage für die erstellte Mediendatei als CAPTCHA, wobei die Verständnisfrage auf dem ausgewählten Thema, der Geographie, die einem Benutzer, der die Webseite anfordert, zugeordnet ist, und dem Kontext der erstellten Mediendatei basiert (204);

die Übermittlung der Webseite einschließlich des CAPTCHA (206); und

als Antwort auf eine Benutzereingabe zur Verständnisfrage,
Erfassen entweder einer menschlichen Eingabe oder einer maschinellen Eingabe basierend auf einem selbstlernenden CAPTCHA-Entscheidungsmodul (208) auf intelligente Weise,

wobei das Erfassen entweder einer menschlichen Eingabe oder einer maschinellen Eingabe auf der Grundlage eines selbstlernenden CAPTCHA-Entscheidungsmoduls umfasst:

Vergleich der Benutzereingabe mit zuvor gespeicherten Benutzereingaben aus einem Entscheidungsrepository des selbstlernenden CAPTCHA-Entscheidungsmoduls unter Berücksichtigung von einem oder mehreren aus Rechtschreibfehlern, unvollständigen Antworten, kontextuellen Metonymen, Synonymen und Varianten davon;

Erkennung entweder einer menschlichen oder maschinellen Eingabe auf der Grundlage des Vergleichs und einer entsprechenden Entscheidung; und

dynamische Aktualisierung des Entscheidungsrepository mit der Benutzereingabe und der zugeordneten Entscheidung auf der Grundlage der erkannten Eingabe.


 
2. Das prozessorimplementierte Verfahren nach Anspruch 1, wobei die erzeugte Mediendatei mindestens eine akustische Form oder eine visuelle Form ist.
 
3. Das prozessorimplementierte Verfahren nach Anspruch 2, wobei die erstellte Mediendatei eine Kombination aus zwei oder mehr Mediendateien der gleichen Form, aber unterschiedlichen Typs, oder eine Kombination aus zwei oder mehr Mediendateien unterschiedlicher Form ist, wobei jede Kombination dem ausgewählten Thema entspricht.
 
4. Das prozessorimplementierte Verfahren nach Anspruch 3, wobei die unterschiedlichen Typen der zwei oder mehr Mediendateien einer Umgebung und Kontextinformationen derselben zugeordnet sind.
 
5. Das prozessorimplementierte Verfahren nach Anspruch 1, wobei der Schritt in Antwort auf eine Benutzereingabe zu der Verständnisfrage weiterhin die Auswahl und Übermittlung mindestens einer alternativen Verständnisfrage für das ausgewählte Thema umfasst.
 
6. Das prozessorimplementierte Verfahren nach Anspruch 1, wobei jeder Verständnisfrage eine Mehrzahl von Lösungen zugeordnet sind, die der erstellten Mediendatei für das ausgewählte Thema entsprechen.
 
7. Das prozessorimplementierte Verfahren nach Anspruch 1, wobei der Schritt der Übertragung der Webseite einschließlich des CAPTCHA die Darstellung der ausgewählten Verständnisfrage in mindestens einer visuellen oder akustischen Form umfasst.
 
8. Ein System (100) zum Liefern von inklusivem CAPTCHA, umfassend:

einen oder mehrere Hardware-Prozessoren (104); und

eine oder mehrere Datenspeichervorrichtungen (102), die operativ mit dem oder den Prozessoren (104) gekoppelt sind, zum Speichern von Befehlen, die zur Ausführung durch den oder die Prozessoren (104) konfiguriert sind, wobei die Befehle enthalten sind in:

einem CAPTCHA-Generierungsmodul, das so konfiguriert ist, dass es als Antwort auf eine Benutzeranforderung für eine Webseite mit CAPTCHA,

eine Mediendatei in Echtzeit erstellt, wobei die erstellte Mediendatei durch Verzerrungsstörungen gekennzeichnet ist und einem ausgewählten Thema aus einer Mehrzahl von Themen entspricht, die realen Szenarien zugeordnet sind;

zufällig eine Verständnisfrage für die erstellte Mediendatei als CAPTCHA auswählt, wobei die Verständnisfrage auf dem ausgewählten Thema, der Geographie, die einem Benutzer, der die Webseite anfordert, zugeordnet ist, und dem Kontext der erstellten Mediendatei basiert;

die Webseite einschließlich des CAPTCHA zu überträgt;

einem selbstlernenden CAPTCHA-Entscheidungsmodul, das so konfiguriert ist, dass es als Antwort auf eine Benutzereingabe zu der Verständnisfrage
auf intelligente Weise entweder eine menschliche oder eine maschinelle Eingabe erkennt,

wobei das selbstlernende CAPTCHA-Entscheidungsmodul so konfiguriert ist, dass es entweder eine menschliche oder eine maschinelle Eingabe erkennt durch:

Vergleich der Benutzereingabe mit zuvor gespeicherten Benutzereingaben aus einem Entscheidungsrepository des selbstlernenden CAPTCHA-Entscheidungsmoduls unter Berücksichtigung eines oder mehrerer aus Rechtschreibfehlern, unvollständigen Antworten, kontextuellen Metonymen, Synonymen und Varianten davon;

Erkennung entweder einer menschlichen oder maschinellen Eingabe auf der Grundlage des Vergleichs und einer entsprechenden Entscheidung; und

Aktualisierung des Entscheidungsrepository mit der Benutzereingabe und der zugehörigen Entscheidung auf der Grundlage der erkannten Eingabe.


 
9. System nach Anspruch 8, wobei die erzeugte Mediendatei von einer akustischen Form und/oder einer visuellen Form ist und eine Kombination von zwei oder mehr Mediendateien derselben Form, aber unterschiedlichen Typs, oder eine Kombination von zwei oder mehr Mediendateien unterschiedlicher Form ist, wobei jede Kombination dem ausgewählten Thema entspricht, und wobei der unterschiedliche Typ der zwei oder mehr Mediendateien einer Umgebung und Kontextinformationen derselben zugeordnet ist.
 
10. Das System nach Anspruch 8, wobei das CAPTCHA-Generierungsmodul ferner so konfiguriert ist, dass es als Antwort auf eine Benutzereingabe zu der Verständnisfrage mindestens eine alternative Verständnisfrage für das ausgewählte Thema auswählt und übermittelt.
 
11. Das System nach Anspruch 8, bei dem jede Verständnisfrage einer Mehrzahl von Lösungen zugeordnet ist, die der erstellten Mediendatei für das ausgewählte Thema entsprechen.
 
12. Das System nach Anspruch 8, wobei das CAPTCHA-Generierungsmodul ferner dafür konfiguriert ist, die ausgewählte Verständnisfrage mindestens in einer visuellen oder akustischen Form zu präsentieren.
 
13. Ein Computerprogrammprodukt, das ein nicht transientes computerlesbares Medium mit einem darin verkörperten computerlesbaren Programm umfasst, wobei das computerlesbare Programm, wenn es auf einer Computervorrichtung ausgeführt wird, die Computervorrichtung dazu veranlasst:

als Antwort auf eine Benutzeranfrage für eine Webseite mit CAPTCHA,

eine Mediendatei in Echtzeit zu erstellen, wobei die erstellte Mediendatei durch Verzerrungsstörungen gekennzeichnet ist und einem ausgewählten Thema aus einer Mehrzahl von Themen entspricht, die realen Szenarien zugeordnet sind;

eine Verständnisfrage für die erstellte Mediendatei nach dem Zufallsprinzip als CAPTCHA auszuwählen, wobei die Verständnisfrage auf dem ausgewählten Thema, der Geographie, die einem Benutzer, der die Webseite anfordert, zugeordnet ist, und dem Kontext der erstellten Mediendatei basiert; und

die Webseite einschließlich des CAPTCHA zu übertragen;

als Antwort auf eine Benutzereingabe zur Verständnisfrage,
entweder eine menschliche oder eine maschinelle Eingabe zu erkennen,

wobei das selbstlernende CAPTCHA-Entscheidungsmodul so konfiguriert ist, dass es entweder eine menschliche oder eine maschinelle Eingabe erkennt durch:

Vergleich der Benutzereingabe mit zuvor gespeicherten Benutzereingaben aus einem Entscheidungsrepository des selbstlernenden CAPTCHA-Entscheidungsmoduls unter Berücksichtigung eines oder mehrerer aus Rechtschreibfehlern, unvollständigen Antworten, kontextuellen Metonymen, Synonymen und Varianten davon;

Erkennung entweder einer menschlichen oder maschinellen Eingabe auf der Grundlage des Vergleichs und einer zugehörigen Entscheidung; und

Aktualisierung des Entscheidungsrepository mit der Benutzereingabe und der zugehörigen Entscheidung auf der Grundlage der erkannten Eingabe.


 


Revendications

1. Procédé mis en œuvre par processeur (200) pour un test de reconnaissance humaine, CAPTCHA, inclusif, le procédé comprenant les étapes ci-dessous consistant à :

en réponse à une demande d'utilisateur pour une page web présentant un test CAPTCHA,

créer un fichier multimédia en temps réel, dans lequel le fichier multimédia créé est caractérisé par une interférence de distorsion et correspond à un thème sélectionné parmi une pluralité de thèmes associés à des scénarios du monde réel (202) ;

sélectionner de manière aléatoire une question de compréhension pour le fichier multimédia créé, en tant que le test CAPTCHA, la question de compréhension étant basée sur le thème sélectionné, une géographie associée à un utilisateur demandant la page web et un contexte du fichier multimédia créé (204) ;

transmettre la page web incluant le test CAPTCHA (206) ; et

en réponse à une entrée d'utilisateur en ce qui concerne la question de compréhension,
détecter une entrée humaine ou une entrée machine, sur la base d'un module de décision de test CAPTCHA d'auto-apprentissage (208), de manière intelligente, dans lequel l'étape de détection d'une entrée humaine ou d'une entrée machine sur la base d'un module de décision de test CAPTCHA d'auto-apprentissage comprend les étapes ci-dessous consistant à :

comparer l'entrée d'utilisateur à des entrées d'utilisateur stockées précédemment provenant d'un référentiel de décision du module de décision de test CAPTCHA d'auto-apprentissage, en tenant compte d'un ou plusieurs éléments parmi des erreurs d'orthographe, des réponses incomplètes, des métonymies contextuelles, des synonymes et des variantes connexes ;

détecter soit une entrée humaine, soit une entrée machine, sur la base de l'étape de comparaison et de la décision associée connexe ; et

mettre à jour dynamiquement le référentiel de décision avec l'entrée d'utilisateur et la décision associée sur la base de l'entrée détectée.


 
2. Procédé mis en œuvre par processeur selon la revendication 1, dans lequel le fichier multimédia créé présente au moins une forme parmi une forme sonore ou une forme visuelle.
 
3. Procédé mis en œuvre par processeur selon la revendication 2, dans lequel le fichier multimédia créé correspond à une combinaison de deux fichiers multimédias ou plus de la même forme, mais de types différents, ou à une combinaison de deux fichiers multimédias ou plus de formes différentes, chaque combinaison correspondant au thème sélectionné.
 
4. Procédé mis en œuvre par processeur selon la revendication 3, dans lequel les différents types des deux fichiers multimédias ou plus sont associés à un environnement et à des informations contextuelles de celui-ci.
 
5. Procédé mis en œuvre par processeur selon la revendication 1, dans lequel l'étape de détection en réponse à une entrée d'utilisateur en ce qui concerne la question de compréhension comprend en outre l'étape consistant à sélectionner et transmettre au moins une question de compréhension alternative pour le thème sélectionné.
 
6. Procédé mis en œuvre par processeur selon la revendication 1, dans lequel chaque question de compréhension est associée à une pluralité de solutions correspondant au fichier multimédia créé pour le thème sélectionné.
 
7. Procédé mis en œuvre par processeur selon la revendication 1, dans lequel l'étape de transmission de la page web incluant le test CAPTCHA comprend l'étape consistant à présenter la question de compréhension sélectionnée dans au moins une forme parmi une forme visuelle et une forme sonore.
 
8. Système (100) destiné à fournir un test CAPTCHA inclusif, comprenant :

un ou plusieurs processeurs matériels (104) ; et

un ou plusieurs dispositifs de stockage de données (102) couplés fonctionnellement audit un ou auxdits plusieurs processeurs (104) en vue de stocker des instructions configurées de manière à être exécutées par ledit un ou lesdits plusieurs processeurs (104), les instructions étant comprises dans :

un module de génération de test CAPTCHA configuré de manière à, en réponse à une demande d'utilisateur pour une page web présentant un test CAPTCHA,

créer un fichier multimédia en temps réel, dans lequel le fichier multimédia créé est caractérisé par une interférence de distorsion et correspond à un thème sélectionné parmi une pluralité de thèmes associés à des scénarios du monde réel ;

sélectionner de manière aléatoire une question de compréhension pour le fichier multimédia créé, en tant que le test CAPTCHA, la question de compréhension étant basée sur le thème sélectionné, une géographie associée à un utilisateur demandant la page web et un contexte du fichier multimédia créé ;

transmettre la page web incluant le test CAPTCHA ; et

un module de décision de test CAPTCHA d'auto-apprentissage configuré de manière à, en réponse à une entrée d'utilisateur en ce qui concerne la question de compréhension :
détecter une entrée humaine ou une entrée machine, de manière intelligente, dans lequel le module de décision de test CAPTCHA d'auto-apprentissage est configuré de manière à détecter soit une entrée humaine, soit une entrée machine, en mettant en œuvre les étapes ci-dessous consistant à :

comparer l'entrée d'utilisateur à des entrées d'utilisateur stockées précédemment provenant d'un référentiel de décision du module de décision de test CAPTCHA d'auto-apprentissage, en tenant compte d'un ou plusieurs éléments parmi des erreurs d'orthographe, des réponses incomplètes, des métonymies contextuelles, des synonymes et des variantes connexes ;

détecter soit une entrée humaine, soit une entrée machine, sur la base de l'étape de comparaison et de la décision associée connexe ; et

mettre à jour le référentiel de décision avec l'entrée d'utilisateur et la décision associée sur la base de l'entrée détectée.


 
9. Système selon la revendication 8, dans lequel le fichier multimédia créé présente au moins une forme parmi une forme sonore ou une forme visuelle, correspond à une combinaison de deux fichiers multimédias ou plus de la même forme, mais de types différents, ou à une combinaison de deux fichiers multimédias ou plus de formes différentes, chaque combinaison correspondant au thème sélectionné, et dans lequel les différents types des deux fichiers multimédias ou plus sont associés à un environnement et à des informations contextuelles de celui-ci.
 
10. Système selon la revendication 8, dans lequel le module de génération de test CAPTCHA est en outre configuré de manière à, en réponse à une entrée d'utilisateur en ce qui concerne la question de compréhension, sélectionner et transmettre au moins une question de compréhension alternative pour le thème sélectionné.
 
11. Système selon la revendication 8, dans lequel chaque question de compréhension est associée à une pluralité de solutions correspondant au fichier multimédia créé pour le thème sélectionné.
 
12. Système selon la revendication 8, dans lequel le module de génération de test CAPTCHA est en outre configuré de manière à présenter la question de compréhension sélectionnée dans au moins une forme parmi une forme visuelle et une forme sonore.
 
13. Produit-programme informatique comprenant un support non transitoire lisible par ordinateur dans lequel un programme lisible par ordinateur est incorporé, dans lequel le programme lisible par ordinateur, lorsqu'il est exécuté sur un dispositif informatique, amène le dispositif informatique à :

en réponse à une demande d'utilisateur pour une page web présentant un test CAPTCHA,

créer un fichier multimédia en temps réel, dans lequel le fichier multimédia créé est caractérisé par une interférence de distorsion et correspond à un thème sélectionné parmi une pluralité de thèmes associés à des scénarios du monde réel ;

sélectionner de manière aléatoire une question de compréhension pour le fichier multimédia créé, en tant que le test CAPTCHA, la question de compréhension étant basée sur le thème sélectionné, une géographie associée à un utilisateur demandant la page web et un contexte du fichier multimédia créé ;

transmettre la page web incluant le test CAPTCHA ; et

en réponse à une entrée d'utilisateur en ce qui concerne la question de compréhension,
détecter une entrée humaine ou une entrée machine, dans lequel le module de décision de test CAPTCHA d'auto-apprentissage est configuré de manière à détecter soit une entrée humaine, soit une entrée machine, en mettant en œuvre les étapes ci-dessous consistant à :

comparer l'entrée d'utilisateur à des entrées d'utilisateur stockées précédemment provenant d'un référentiel de décision du module de décision de test CAPTCHA d'auto-apprentissage, en tenant compte d'un ou plusieurs éléments parmi des erreurs d'orthographe, des réponses incomplètes, des métonymies contextuelles, des synonymes et des variantes connexes ;

détecter soit une entrée humaine, soit une entrée machine, sur la base de l'étape de comparaison et de la décision associée connexe ; et

mettre à jour le référentiel de décision avec l'entrée d'utilisateur et la décision associée sur la base de l'entrée détectée.


 




Drawing
































Cited references

REFERENCES CITED IN THE DESCRIPTION



This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.

Patent documents cited in the description




Non-patent literature cited in the description