Mapping the Unknown: A New Approach to Open-World Video Recognition

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Parga, César D.
Pardo, Xosé Manuel

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Parga, C.D., Pardo, X.M., Regueiro, C.V. (2025). Mapping the Unknown: A New Approach to Open-World Video Recognition. In: Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, CL., Bhattacharya, S., Pal, U. (eds) Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science, vol 15309. Springer, Cham. https://doi.org/10.1007/978-3-031-78189-6_10

Type of academic work

Academic degree

Abstract

[Abstract]: Intelligent agents must strive to (re)map the constantly changing environment in which they operate, in order to remain adaptive and efficient. In open-world recognition (OWR) a system has to: detect new emerging categories, recognize new instances of known categories, and continually update knowledge based on the data streams it receives, mostly unannotated. In this work, we propose a hybrid method to deal with OWR that combines deep feature embeddings with dynamic ensemble methods for a continuous reshaping of boundaries in feature space. Our approach is flexible to update to patterns in the border of what is already known (concept-drift), detect and create models for new categories, recover from mistakes, and mitigate catastrophic forgetting, even in semi-supervised contexts. As an application use case, we have considered the problem of semi-supervised video face recognition, where the spatial-temporal coherence is harnessed to augment data. Our experiments shown that the system responds adequately to the unknowns, adding models for new identities, and improving its performance.

Description

Presentada no International Conference on Pattern Recognition, 2024. Parte da serie: Lecture Notes in Computer Science (LNCS,volume 15309). This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-78189-6_10

Rights

Copyright © 2025, The Author(s), under exclusive license to Springer Nature Switzerland AG