RITA: Group Attention is All You Need for Timeseries Analytics

Liang, Jiaming; Cao, Lei; Madden, Samuel; Ives, Zack; Li, Guoliang

Published in

Proceedings of the ACM on Management of Data, 1(2), p. 1-28, 2024

DOI: 10.1145/3639317

Tools

Export citation

Search in Google Scholar

RITA: Group Attention is All You Need for Timeseries Analytics

Journal article published in 2024 by Jiaming Liang

, Lei Cao

, Samuel Madden

, Zack Ives

, Guoliang Li

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Timeseries analytics is important in many real-world applications. Recently, the Transformer model, popular in natural language processing, has been leveraged to learn high quality feature embeddings from timeseries: embeddings are key to the performance of various timeseries analytics tasks such as similarity-based timeseries queries within vector databases. However, quadratic time and space complexities limit Transformers' scalability, especially for long timeseries. To address these issues, we develop a timeseries analytics tool, RITA, which uses a novel attention mechanism, named group attention, to address this scalability issue. Group attention dynamically clusters the objects based on their similarity into a small number of groups and approximately computes the attention at the coarse group granularity. It thus significantly reduces the time and space complexity, yet provides a theoretical guarantee on the quality of the computed attention. The dynamic scheduler of RITA continuously adapts the number of groups and the batch size in the training process, ensuring group attention always uses the fewest groups needed to meet the approximation quality requirement. Extensive experiments on various timeseries datasets and analytics tasks demonstrate that RITA outperforms the state-of-the-art in accuracy and is significantly faster --- with speedups of up to 63X.

Published in

Links

Tools

RITA: Group Attention is All You Need for Timeseries Analytics

Abstract