Global Patent Index - EP 4052118 A4

EP 4052118 A4 20231108 - AUTOMATIC REDUCTION OF TRAINING SETS FOR MACHINE LEARNING PROGRAMS

Title (en)

AUTOMATIC REDUCTION OF TRAINING SETS FOR MACHINE LEARNING PROGRAMS

Title (de)

AUTOMATISCHE REDUZIERUNG VON TRAININGSMENGEN FÜR MASCHINELLE LERNPROGRAMME

Title (fr)

RÉDUCTION AUTOMATIQUE D'ENSEMBLES D'INSTRUCTION POUR PROGRAMMES D'APPRENTISSAGE AUTOMATIQUE

Publication

EP 4052118 A4 20231108 (EN)

Application

EP 20883285 A 20201029

Priority

  • US 201962928287 P 20191030
  • US 2020057987 W 20201029

Abstract (en)

[origin: WO2021087129A1] A computer-implemented method of creating a predictive machine learning model to predict the usefulness of digitally stored data in a second machine learning model comprises receiving an input dataset of training data, the input dataset comprising a plurality of records, the input dataset having been previously used to train the second machine learning model; measuring a usefulness value of records within the input dataset; categorizing training data into groups of usefulness; creating a data filter that is programmed to classify or rank the input dataset using the usefulness values of records in the input dataset; receiving a second dataset of prospective training data; filtering the second dataset of prospective training data using the data filter, and to output a refined training dataset comprising fewer records than the second dataset, the refined training dataset comprising only records of the second dataset having the usefulness value above a specified threshold.

IPC 8 full level

G06N 20/00 (2019.01); G06F 3/08 (2006.01); G06N 3/045 (2023.01)

CPC (source: EP US)

G06N 3/08 (2013.01 - US); G06N 20/00 (2018.12 - EP); G06N 3/045 (2023.01 - EP)

Citation (search report)

  • [XI] SMITH MICHAEL R ET AL: "The Potential Benefits of Data Set Filtering and Learning Algorithm Hyperparameter Optimization", METASEL'15, vol. 1455, 7 September 2015 (2015-09-07), XP093087022, Retrieved from the Internet <URL:https://ceur-ws.org/Vol-1455/paper-03.pdf> [retrieved on 20230928]
  • [A] PRUENGKARN RATCHAKOON ET AL: "Using misclassification data to improve classification performance", 2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), IEEE, 24 June 2015 (2015-06-24), pages 1 - 5, XP033196870, DOI: 10.1109/ECTICON.2015.7206950
  • See references of WO 2021087129A1

Designated contracting state (EPC)

AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DOCDB simple family (publication)

WO 2021087129 A1 20210506; CA 3156623 A1 20210506; EP 4052118 A1 20220907; EP 4052118 A4 20231108; US 2022138561 A1 20220505

DOCDB simple family (application)

US 2020057987 W 20201029; CA 3156623 A 20201029; EP 20883285 A 20201029; US 202117162870 A 20210129