On partial least-squares estimation in scalar-on-function regression models

Saricam, Semanur; Beyaztaş, Ufuk; Asikgil, Baris; Shang, Han

doi:10.1002/cem.3452

Saricam S., Beyaztaş U., Asikgil B., Shang H. L.

Journal of Chemometrics, vol. 36, no. 12, 2022 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 36, Issue: 12
Publication Date: 2022
DOI: 10.1002/cem.3452
Journal Name: Journal of Chemometrics
Indexes Covering the Journal: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Analytical Abstracts, Chemical Abstracts Core, Chimica, Communication Abstracts, Metadex, DIALNET, Civil Engineering Abstracts
Keywords: Bidiag1, Bidiag2, bidiagonalization, NIPALS, SIMPLS
Open Archive Collection: AVESİS Open Access Collection
Affiliated with İstanbul Gelişim Üniversitesi: No

Abstract

Scalar-on-function regression, where the response is scalar valued and the predictor consists of random functions, is one of the most important tools for exploring the functional relationship between a scalar response and functional predictor(s). The functional partial least-squares method estimates the regression coefficient function more accurately than other existing methods, such as least squares, maximum likelihood, and maximum penalized likelihood. The functional partial least-squares method is often based on the SIMPLS or NIPALS algorithm, but these algorithms can be computationally slow when analyzing large datasets. In this study, we propose two modified functional partial least-squares methods to efficiently estimate the regression coefficient function in scalar-on-function regression. In the proposed methods, the infinite-dimensional functional predictors are first projected onto a finite-dimensional space using a basis expansion method. Then, two partial least-squares algorithms, based on re-orthogonalization of the score and loading vectors, are used to estimate the linear relationship between the scalar response and the basis coefficients of the functional predictors. The finite-sample performance and computing speed are evaluated using a series of Monte Carlo simulation studies and a sugar process dataset.
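
The two-step workflow described in the abstract (basis expansion, then partial least squares on the basis coefficients) can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: an orthonormal Fourier basis on [0, 1], simulated curves, and a plain NIPALS PLS1 routine without re-orthogonalization. It is not the Bidiag1/Bidiag2-based procedure the authors propose, and the function names `fourier_basis` and `pls1_nipals` are hypothetical.

```python
import numpy as np

def fourier_basis(grid, n_basis):
    """Orthonormal Fourier basis on [0, 1], evaluated on `grid` (assumed basis choice)."""
    cols = [np.ones_like(grid)]
    k = 1
    while len(cols) < n_basis:
        cols.append(np.sqrt(2) * np.sin(2 * np.pi * k * grid))
        if len(cols) < n_basis:
            cols.append(np.sqrt(2) * np.cos(2 * np.pi * k * grid))
        k += 1
    return np.column_stack(cols)  # shape (len(grid), n_basis)

def pls1_nipals(A, y, n_comp):
    """Plain NIPALS PLS1 on centered data; returns the coefficient vector."""
    A = A - A.mean(axis=0)
    y = y - y.mean()
    W, P, q = [], [], []
    for _ in range(n_comp):
        w = A.T @ y
        w = w / np.linalg.norm(w)      # weight vector
        s = A @ w                      # score vector
        ss = s @ s
        p = A.T @ s / ss               # loading vector
        q.append(y @ s / ss)           # regression of y on the score
        A = A - np.outer(s, p)         # deflate the predictor matrix
        W.append(w)
        P.append(p)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    return W @ np.linalg.solve(P.T @ W, q)

# Simulated data (assumption): curves lie in the span of the basis.
rng = np.random.default_rng(0)
n, n_basis, n_comp = 200, 5, 3
grid = np.linspace(0.0, 1.0, 101)
B = fourier_basis(grid, n_basis)

b_true = np.array([0.0, 1.0 / np.sqrt(2), 0.0, 0.0, 0.0])
beta_true = B @ b_true                       # equals sin(2*pi*t)
A_true = rng.normal(size=(n, n_basis))
X = A_true @ B.T + 0.01 * rng.normal(size=(n, grid.size))
# With an orthonormal basis, the integral of X_i(t) * beta(t) equals a_i' b.
y = A_true @ b_true + 0.05 * rng.normal(size=n)

# Step 1: project the discretized curves onto the basis by least squares.
A_hat = np.linalg.lstsq(B, X.T, rcond=None)[0].T
# Step 2: PLS regression of the scalar response on the basis coefficients.
b_hat = pls1_nipals(A_hat, y, n_comp)
beta_hat = B @ b_hat                         # estimated coefficient function on the grid
```

Because the assumed basis is orthonormal, the basis Gram matrix is the identity and the coefficients can be passed to PLS directly; for a non-orthonormal basis such as B-splines, the coefficients would typically be transformed by a square root of the Gram matrix first.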