Please use this identifier to cite or link to this item:
https://hdl.handle.net/2445/219021
Title: | Deep ensemble-based hard sample mining for food recognition |
Author: | Nagarajan, Bhalaji Bolaños Solà, Marc Aguilar Torres, Eduardo Radeva, Petia |
Keywords: | Aprenentatge automàtic Reconeixement de formes (Informàtica) Visió per ordinador Machine learning Pattern recognition systems Computer vision |
Issue Date: | Sep-2023 |
Publisher: | Elsevier |
Abstract: | Deep neural networks represent a compelling technique to tackle complex real-world problems, but are over-parameterized and often suffer from over- or under-confident estimates. Deep ensembles have shown better parameter estimations and often provide reliable uncertainty estimates that contribute to the robustness of the results. In this work, we propose a new metric to identify samples that are hard to classify. Our metric is defined as coincidence score for deep ensembles which measures the agreement of its individual models. The main hypothesis we rely on is that deep learning algorithms learn the low-loss samples better compared to large-loss samples. In order to compensate for this, we use controlled over-sampling on the identified ”hard” samples using proper data augmentation schemes to enable the models to learn those samples better. We validate the proposed metric using two public food datasets on different backbone architectures and show the improvements compared to the conventional deep neural network training using different performance metrics. |
Note: | Reproducció del document publicat a: https://doi.org/10.1016/j.jvcir.2023.103905 |
It is part of: | Journal of Visual Communication and Image Representation, 2023, vol. 95 |
URI: | https://hdl.handle.net/2445/219021 |
Related resource: | https://doi.org/10.1016/j.jvcir.2023.103905 |
ISSN: | 1047-3203 |
Appears in Collections: | Articles publicats en revistes (Matemàtiques i Informàtica) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
853422.pdf | 3.94 MB | Adobe PDF | View/Open |
This item is licensed under a
Creative Commons License