Mapping the Unknown: A New Approach to Open-World Video Recognition

Parga, César D.; Pardo, Xosé Manuel; Regueiro, Carlos V.

Use this link to cite:

https://hdl.handle.net/2183/45520

Mapping the Unknown: A New Approach to Open-World Video Recognition

Files

VazquezRegueiro_Carlos_2025_Mapping_the_Unknown.pdf (1.06 MB)

Identifiers

URI: https://hdl.handle.net/2183/45520

DOI: 10.1007/978-3-031-78189-6_10

Publication date

2024-12

Authors

Parga, César D.

Pardo, Xosé Manuel

Regueiro, Carlos V.

Bibliographic citation

Parga, C.D., Pardo, X.M., Regueiro, C.V. (2025). Mapping the Unknown: A New Approach to Open-World Video Recognition. In: Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, CL., Bhattacharya, S., Pal, U. (eds) Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science, vol 15309. Springer, Cham. https://doi.org/10.1007/978-3-031-78189-6_10

Abstract

[Abstract]: Intelligent agents must strive to (re)map the constantly changing environment in which they operate, in order to remain adaptive and efficient. In open-world recognition (OWR) a system has to: detect new emerging categories, recognize new instances of known categories, and continually update knowledge based on the data streams it receives, mostly unannotated. In this work, we propose a hybrid method to deal with OWR that combines deep feature embeddings with dynamic ensemble methods for a continuous reshaping of boundaries in feature space. Our approach is flexible to update to patterns in the border of what is already known (concept-drift), detect and create models for new categories, recover from mistakes, and mitigate catastrophic forgetting, even in semi-supervised contexts. As an application use case, we have considered the problem of semi-supervised video face recognition, where the spatial-temporal coherence is harnessed to augment data. Our experiments shown that the system responds adequately to the unknowns, adding models for new identities, and improving its performance.

Description

Presentada no International Conference on Pattern Recognition, 2024. Parte da serie: Lecture Notes in Computer Science (LNCS,volume 15309). This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/978-3-031-78189-6_10

Keywords

Ensemble learning Incremental Learning Open-World Instance Recognition

Editor version

https://doi.org/10.1007/978-3-031-78189-6_10

Rights

Collections

Investigación (FIC)

Full item page

Mapping the Unknown: A New Approach to Open-World Video Recognition

Files

Identifiers

Publication date

Authors

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Type of academic work

Academic degree

Abstract

Description

Keywords

Editor version

Rights

Collections