Hyper-parameter initialization for squared exponential kernel-based Gaussian Process regression

Nalika Ulapane, Karthick Thiyagarajan, Sarath Kodagoda

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

Abstract

![CDATA[Hyper-parameter optimization is an essential task in the use of machine learning techniques. Such optimizations are typically done starting with an initial guess provided to hyper-parameter values followed by optimization (or minimization) of some cost function via gradient-based methods. The initial values become crucial since there is every chance for reaching local minimums in the cost functions being minimized, especially since gradient-based optimizing is done. Therefore, initializing hyper-parameters several times and repeating optimization to achieve the best solutions is usually attempted. Repetition of optimization can be computationally expensive when using techniques like Gaussian Process (GP) which has an O(n 3 ) complexity, and not having a formal strategy to initialize hyper-parameter values is an additional challenge. In general, re-initialization of hyper-parameter values in the contexts of many machine learning techniques including GP has been done at random over the years; some recent developments have proposed some initialization strategies based on the optimization of some meta loss cost functions. To simplify this challenge of hyper-parameter initialization, this paper introduces a data-dependent deterministic initialization technique. The specific case of the squared exponential kernel-based GP regression problem is focused on, and the proposed technique brings novelty by being deterministic as opposed to random initialization, and fast (due to the deterministic nature) as opposed to optimizing some form of meta cost function as done in some previous works. Although global suitability of this initialization technique is not proven in this paper, as a preliminary study the technique's effectiveness is demonstrated via several synthetic as well as real data-based nonlinear regression examples, hinting that the technique may have the effectiveness for broader usage.]]
Original languageEnglish
Title of host publicationProceedings of the 15th IEEE Conference on Industrial Electronics and Applications (ICIEA 2020), 9 - 13 November 2020, Virtual
PublisherIEEE
Pages1154-1159
Number of pages6
ISBN (Print)9781728151694
DOIs
Publication statusPublished - 2020
EventIEEE Conference on Industrial Electronics and Applications -
Duration: 1 Jan 2021 → …

Conference

ConferenceIEEE Conference on Industrial Electronics and Applications
Period1/01/21 → …

Fingerprint

Dive into the research topics of 'Hyper-parameter initialization for squared exponential kernel-based Gaussian Process regression'. Together they form a unique fingerprint.

Cite this