Correction of AI systems by linear discriminants: Probabilistic foundations

A. N. Gorban*, A. Golubkov, B. Grechuk, E. M. Mirkes, I. Y. Tyukin

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

55 Citations (Scopus)

Abstract

Artificial Intelligence (AI) systems sometimes make errors, and will continue to make errors from time to time. These errors are usually unexpected and can lead to dramatic consequences. The intensive development of AI and its practical applications makes the problem of errors increasingly important. Total re-engineering of the systems can create new errors and is not always possible due to the resources involved. The important challenge is to develop fast methods to correct errors without damaging existing skills. We formulate the technical requirements for ‘ideal’ correctors. Such correctors include binary classifiers, which separate the situations with a high risk of error from the situations in which the AI system works properly. Surprisingly, for essentially high-dimensional data such methods are possible: a simple linear Fisher discriminant can separate the situations with errors from correctly solved tasks, even for exponentially large samples. The paper presents the probabilistic basis for fast non-destructive correction of AI systems. A series of new stochastic separation theorems is proven. These theorems provide new instruments for fast non-iterative correction of errors in legacy AI systems. The new approaches become efficient in high dimensions, for the correction of high-dimensional systems in a high-dimensional world (i.e. for the processing of essentially high-dimensional data by large systems). We prove that this separability property holds for a wide class of distributions, including log-concave distributions and distributions with a special ‘SMeared Absolute Continuity’ (SmAC) property defined through relations between the volume and probability of sets of vanishing volume. These classes are much wider than the Gaussian distributions. The requirement of independence and identical distribution of the data is significantly relaxed. The results are supported by computational analysis of empirical data sets.
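The stochastic separation phenomenon behind the abstract can be illustrated numerically. The sketch below is not the paper's exact construction; it is a minimal illustration, assuming isotropic Gaussian data, that in high dimension a single randomly chosen "error" point is separated from almost the entire sample by one linear functional (a Fisher-style discriminant reduces to projection onto the point when the data are already whitened). The dimension, sample size, and threshold values are illustrative choices, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 200, 10_000                    # dimension, sample size (illustrative)
X = rng.standard_normal((n, d)) / np.sqrt(d)  # isotropic sample, E||y||^2 = 1

x = X[0]                              # the "error" point to be separated
rest = X[1:]                          # the correctly handled situations

# Linear functional y -> <y, x>/<x, x>; for whitened data this is the
# Fisher-style discriminant separating x from the rest of the sample.
proj = rest @ x / (x @ x)
theta = 0.5                           # separation threshold
separated = np.mean(proj < theta)     # fraction on the "correct" side
print(f"fraction separated from x by one hyperplane: {separated:.4f}")
```

With d = 200, the projections of the other points concentrate near 0 with standard deviation of order 1/sqrt(d), so essentially the whole sample falls below the threshold and the fraction printed is close to 1. Repeating the experiment with small d (say d = 2) shows the separation failing, which is the "blessing of dimensionality" the paper formalizes.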

Original language: English
Pages (from-to): 303-322
Number of pages: 20
Journal: Information Sciences
Volume: 466
DOIs
Publication status: Published - Oct 2018

Keywords

  • Big data
  • Blessing of dimensionality
  • Error correction
  • Linear discriminant
  • Measure concentration
  • Non-iterative learning

