(Für diese Definition ist die deutsche Übersetzung noch nicht abgeschlossen)
Higher-level interpretation and recognition of images or videos, which includes pattern recognition, pattern learning and semantic interpretation as fundamental aspects. These aspects involve the detection, categorisation, identification, authentication of image or video patterns. For this purpose, image or video data are acquired and preprocessed. In the next step, distinctive features are extracted. Based on these features or representations derived from them, matching, clustering or classification is performed which may lead to one or several decisions, related confidence values (e.g. probabilities), classification or clustering labels. The aim is to find an explanation or to derive a specific meaning.
Pattern recognition or pattern learning in a specific, image or video-related context that includes:
Further details are given in the Definition statement of group G06V 10/00.
Image or video recognition can be carried out by using electronic means (group G06V 10/70) or by using optical means (group G06V 10/88).
Typically, a pattern recognition system involves one or more of the following techniques:
Data entities (e.g. image objects) involved | Data entities (e.g. image objects) involved | |
Individual | Groups (classes) | |
One data sample | Authentication | Categorisation |
Several data samples | Identification | Clustering |
Pattern recognition techniques in general are classified in group G06F 18/00.
Some techniques of image or video understanding performed in the preprocessing step — which start with a bitmap image as an input and derive a non-bitmap representation from it—can also be encountered in general image analysis. If these techniques do not involve one of the functions of image or video pattern authentication, identification, categorisation or clustering, classification should be made only in the appropriate subgroups of subclass G06T.
Some examples of these techniques are: general methods for image segmentation, e.g. obtaining contiguous image regions with similar pixels, for position and size determination of an object without establishing its identity, for calculating the motion of an image region corresponding to an object irrespective as to the identity of the object, for camera calibration, etc.
Techniques based on coding, decoding, compressing or decompressing digital video signals using video object coding are classified in group H04N 19/20.
Velocity or trajectory determination systems or sense-of-movement determination systems using radar, sonar or lidar are classified in groups G01S 13/58, G01S 15/58, G01S 17/58, respectively. Radar, sonar or lidar systems specially adapted for mapping or imaging are classified in groups G01S 13/89, G01S 15/89, G01S 17/89.
General purpose image data processing, in particular image watermarking is classified in group G06T 1/00, while selective content distribution, such as generation or processing of protective or descriptive data associated with content involving watermarking is covered by group H04N 21/8358. General purpose image data acquisition and related pre-processing using digital cameras, and processing used to control digital cameras is classified in group H04N 5/00. Play-back, editing, or synchronising of a music score, including interpretation therefor, as well as transmission of a music score between systems of musical instruments for play-back, editing or synchronising is classified in subclass G10H.
Detecting, measuring and recording for medical diagnostic purposes | A61B 5/00 |
Identifications of persons in medical applications | A61B 5/117 |
Sorting of mail or documents using means for detection of the destination | B07C 3/10 |
Input arrangements for interaction between user and computer | G06F 3/01 |
Testing to determine the identity or genuineness of paper currency or similar valuable papers or for segregating those which are unacceptable, e.g. banknotes that are alien to a currency | G07D 7/00 |
Programme-controlled manipulators | B25J 9/00 |
Optical viewing arrangements in vehicles | B60R 1/00 |
Photogrammetry or videogrammetry, e.g. stereogrammetry; Photographic surveying | G01C 11/00 |
Testing balance of machines or structures | G01M |
Investigating or analysing materials by determining their chemical or physical properties | G01N |
Radio direction-finding; Radio navigation; Determining distance or velocity by use of radio waves; Locating or presence-detecting by use of the reflection or reradiation of radio waves; Analogous arrangements using other waves | G01S |
Geophysics | G01V |
Optical elements, systems or apparatus | G02B |
Photomechanical production of textured or patterned surfaces, e.g. for printing, for processing of semiconductor devices | G03F |
Control or regulating systems in general | G05B |
Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements | G06F 3/00 |
Comparing digital values in methods or arrangements for processing data by operating upon the order or content of the data handled | G06F 7/02 |
Content-based image retrieval | G06F 16/50 |
Fourier, Walsh or analogous domain transformations in digital computers | G06F 17/14 |
Security arrangements for protecting computer systems against unauthorised activity | G06F 21/00 |
Authentication, i.e. establishing the identity or authorisation of security principals | G06F 21/30 |
Computer-aided design [CAD] | G06F 30/00 |
Handling natural language data | G06F 40/00 |
Methods or arrangements for sensing record carriers | G06K 7/00 |
Record carriers for use with machines and with at least a part designed to carry digital markings | G06K 19/00 |
Computer systems based on specific computational models | G06N |
Data processing for business purposes, logistics, stock management | G06Q |
General purpose image data processing, e.g. specific image analysis processor architectures or configurations | G06T 1/00 |
Geometric image transformation in the plane of the image, e.g. rotation of a whole image or part thereof | G06T 3/00 |
Image enhancement or restoration | G06T 5/00 |
Image analysis in general | G06T 7/00 |
Motion image analysis using feature-based methods | G06T 7/246 |
Image analysis using feature-based methods for determination of transform parameters for the alignment of images | G06T 7/33 |
Image analysis of texture | G06T 7/40 |
Image analysis for depth or shape recovery | G06T 7/50 |
Image analysis using feature-based methods for determining position and orientation of objects | G06T 7/73 |
Image analysis for determination of colour characteristics | G06T 7/90 |
Image coding | G06T 9/00 |
Image contour coding, e.g. using detection of edges | G06T 9/20 |
Two-dimensional [2D] image generation | G06T 11/00 |
Three-dimensional [3D] image rendering | G06T 15/00 |
Lighting effects in 3D image rendering | G06T 15/50 |
Three-dimensional [3D] modelling for computer graphics | G06T 17/00 |
Manipulating 3D models or images for computer graphics | G06T 19/00 |
Checking-devices for individual registration on entry or exit | G07C 9/00 |
Burglar, theft or intruder alarms using image scanning and comparing means | G08B 13/194 |
Traffic control systems for road vehicles | G08G 1/00 |
Labels, tag tickets or similar identification or indication means | G09F 3/00 |
Speech recognition | G10L 15/00 |
Speaker recognition | G10L 17/00 |
Bioinformatics | G16B |
Chemoinformatics and computational material science | G16C |
Healthcare informatics | G16H |
Semiconductor devices | H01L |
Arrangements for secret or secure communications; Network security protocols | H04L 9/00 |
Scanning, transmission or reproduction of documents, e.g. facsimile transmission | H04N 1/00 |
Studio circuitry for television systems | H04N 5/222 |
Closed circuit television systems | H04N 7/18 |
Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding | H04N 19/20 |
Methods or arrangements for coding, decoding, compressing or decompressing digital video signals, region motion estimation for predictive coding | H04N 19/543 |
Pattern recognition or pattern learning techniques for image or video understanding involving feature extraction or matching, clustering or classification should be classified in groups G06V 10/40 or G06V 10/70 irrespective whether an application-related context provided by the groups G06V 20/00-G06V 40/00 exists.