Multimodal AutoML via Representation Evolution

Škrlj, Blaž; Bevec, Matej; Lavrač, Nada

Published in

MDPI, Machine Learning and Knowledge Extraction, 1(5), p. 1-13, 2022

DOI: 10.3390/make5010001

Journal article published in 2022 by Blaž Škrlj, Matej Bevec, Nada Lavrač

This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving allowed

Abstract

With increasing amounts of available data, learning simultaneously from different types of inputs is becoming necessary to obtain robust and well-performing models. With the advent of representation learning in recent years, lower-dimensional vector-based representations have become available for both images and texts, yet automating simultaneous learning from multiple modalities remains a challenging problem. This paper presents an AutoML (automated machine learning) approach to identifying model configurations for data composed of two modalities: texts and images. The approach is based on the idea of representation evolution, the process of automatically amplifying heterogeneous representations across several modalities, optimized jointly with a collection of fast, well-regularized linear models. The proposed approach is benchmarked against 11 unimodal and multimodal (texts and images) approaches on four real-life benchmark datasets from different domains. It achieves competitive performance with minimal human effort and low computing requirements, enabling learning from multiple modalities in an automated manner for a wider community of researchers.
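The abstract names the idea of representation evolution but does not spell out the procedure, so the following is only a toy illustration of the general pattern and not the authors' implementation. It assumes that "amplifying" a representation can be modeled as a per-modality scale factor, that a small mutate-and-select loop stands in for the evolutionary search, and that an L2-regularized logistic regression stands in for the "fast, well-regularized linear models". All function names, parameters, and the synthetic data are hypothetical.

```python
import math
import random


def make_toy_data(n, seed):
    """Synthetic stand-in: 3-d 'text' vectors carry the label signal, 3-d 'image' vectors are noise."""
    rng = random.Random(seed)
    labels = [rng.randint(0, 1) for _ in range(n)]
    text = [[rng.gauss(1.0 if y else -1.0, 1.0) for _ in range(3)] for y in labels]
    image = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in labels]
    return text, image, labels


def combine(text_vecs, image_vecs, scales):
    """Concatenate the two modality blocks, each multiplied by its own scale factor."""
    s_text, s_img = scales
    return [[s_text * v for v in t] + [s_img * v for v in i]
            for t, i in zip(text_vecs, image_vecs)]


def train_logreg(X, y, l2=0.1, lr=0.5, epochs=50):
    """L2-regularized logistic regression fitted by batch gradient descent (assumed learner)."""
    d, n = len(X[0]), len(X)
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi)) + b
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            for j in range(d):
                gw[j] += err * xi[j]
            gb += err
        w = [wj - lr * (gj / n + l2 * wj) for wj, gj in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def accuracy(model, X, y):
    w, b = model
    hits = 0
    for xi, yi in zip(X, y):
        score = sum(wj * xj for wj, xj in zip(w, xi)) + b
        hits += int((score > 0) == (yi == 1))
    return hits / len(y)


def fitness(scales, train, valid):
    """Validation accuracy of a linear model trained on the scaled, concatenated representation."""
    (tt, ti, ty), (vt, vi, vy) = train, valid
    model = train_logreg(combine(tt, ti, scales), ty)
    return accuracy(model, combine(vt, vi, scales), vy)


def evolve(train, valid, generations=5, pop_size=6, seed=0):
    """Mutate-and-select search over per-modality scale factors in [0, 1]."""
    rng = random.Random(seed)
    population = [(rng.random(), rng.random()) for _ in range(pop_size)]
    best, best_fit = None, -1.0
    for _ in range(generations):
        scored = sorted(((fitness(s, train, valid), s) for s in population), reverse=True)
        if scored[0][0] > best_fit:
            best_fit, best = scored[0]
        parents = [s for _, s in scored[: pop_size // 2]]
        # Each parent yields one child with Gaussian-perturbed scales, clipped to [0, 1].
        children = [tuple(min(1.0, max(0.0, g + rng.gauss(0.0, 0.2))) for g in p)
                    for p in parents]
        population = parents + children
    return best, best_fit
```

Calling `evolve(make_toy_data(80, seed=1), make_toy_data(60, seed=2))` returns the best pair of scale factors found and its validation accuracy. In this sketch the search is confined to two numbers; a realistic system would evolve a much richer configuration, which is outside what the abstract specifies.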
