Reliable Machine Learning for Communication Systems: Tools and Applications

Kfir Cohen

Engineering

Student thesis: Doctoral Thesis › Doctor of Philosophy

Abstract

Two of the main principles underlying the life cycle of an artiﬁcial intelligence (AI) module in communication networks are adaptation and monitoring. Adaptation refers to the need to adjust the operation of an AI module depending on the current conditions; while monitoring requires measures of the reliability of an AI module’s decisions. Furthermore, the dynamic nature of a communication system imposes short coherent resources due to rapid conditions changes. This makes the available data for module training, e.g., known pilot transmission, to be of small size, requiring the AI module to be sample-eﬃcient.

In the ﬁrst part of this thesis, we integrate both meta-learning and Bayesian learning for these challenges. Meta-learning addresses sample-eﬃciency by extracting useful shared properties across diﬀerent channel conditions by learning how to learn when facing a new condition with few pilots. Bayesian learning increases reliability by producing better-calibrated, e.g., less overconﬁdent, decisions. A well-calibrated AI model is one that can reliably quantify the uncertainty of its decisions, assigning high conﬁdence levels to decisions that are likely to be correct and low conﬁdence levels to decisions that are likely to be erroneous. As an application, we apply the integration of meta-learning and Bayesian learning to symbol demodulation and validate its improvements. The capacity to quantify uncertainty in the model parameter space is further leveraged by extending Bayesian meta-learning to an active setting. In it, the designer can select in a sequential fashion channel conditions under which to generate data for meta-learning from a channel simulator. Bayesian active meta-learning is seen in experiments to signiﬁcantly reduce the number of frames required to obtain eﬃcient adaptation procedure for new frames of an equalization problem.

While Bayesian meta-learning is better than frequentist learning, it is a best-eﬀort approach with no formal guarantees. To obtain mathematical reliability guarantees, we in-corporate in the second part of the thesis the framework of conformal prediction. Conformal prediction post-processes in an ad-hoc manner a probabilistic predictor into set predictor, producing set of labels that is guaranteed to contain the correct label with a probability chosen by the designer. This is done irrespectively of the true, unknown, distribution underlying the generation of the variables of interest, and can be deﬁned in terms of ensemble or time-averaged probabilities. We apply recent conformal prediction advances, including cross-validation-based schemes that reduce average set sizes, to communication problems such as symbol demodulation and modulation classiﬁcation. As a communication system may have its data distribution change over time, online conformal prediction is used and investigated for received signal strength prediction, as well for 5G dynamic scheduling of ultra-reliable and low-latency communication traﬃc. Experiments compare empirical coverage rate and averaged set sizes of diﬀerent schemes.

Conformal prediction is a special case of conformal risk control for which the risk is the miscoverage indicator. We develop a novel cross-validation-based conformal risk control and show its ability to meet a target risk while keeping the average predictive set sizes smaller than the validation-based counterparts.

Date of Award	1 Jun 2024
Original language	English
Awarding Institution	King's College London
Supervisor	Osvaldo Simeone (Supervisor)

Cite this

Documents

2024_Cohen_Kfir_20064738_ethesis
File: application/pdf, 3.33 MB
Type: Thesis