Episodes and Topics in Multivariate Temporal Data
View/ Open
Date
2023Author
Andrienko, Natalia
Andrienko, Gennady
Shirato, Gota
Metadata
Show full item recordAbstract
The term ‘episode’ refers to a time interval in the development of a dynamic process or behaviour of an entity. Episode‐based data consist of a set of episodes that are described using time series of multiple attribute values. Our research problem involves analysing episode‐based data in order to understand the distribution of multi‐attribute dynamic characteristics across a set of episodes. To solve this problem, we applied an existing theoretical model and developed a general approach that involves incrementally increasing data abstraction. We instantiated this general approach in an analysis procedure in which the value variation of each attribute within an episode is represented by a combination of symbols treated as a ‘word’. The variation of multiple attributes is thus represented by a combination of ‘words’ treated as a ‘text’. In this way, the the set of episodes is transformed to a collection of text documents. Topic modelling techniques applied to this collection find groups of related (i.e. repeatedly co‐occurring) ‘words’, which are called ‘topics’. Given that the ‘words’ encode variation patterns of individual attributes, the ‘topics’ represent patterns of joint variation of multiple attributes. In the following steps, analysts interpret the topics and examine their distribution across all episodes using interactive visualizations. We test the effectiveness of the procedure by applying it to two types of episode‐based data with distinct properties and introduce a range of generic and data type‐specific visualization techniques that can support the interpretation and exploration of topic distribution.
BibTeX
@article {10.1111:cgf.14926,
journal = {Computer Graphics Forum},
title = {{Episodes and Topics in Multivariate Temporal Data}},
author = {Andrienko, Natalia and Andrienko, Gennady and Shirato, Gota},
year = {2023},
publisher = {© 2023 Eurographics ‐ The European Association for Computer Graphics and John Wiley & Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14926}
}
journal = {Computer Graphics Forum},
title = {{Episodes and Topics in Multivariate Temporal Data}},
author = {Andrienko, Natalia and Andrienko, Gennady and Shirato, Gota},
year = {2023},
publisher = {© 2023 Eurographics ‐ The European Association for Computer Graphics and John Wiley & Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14926}
}