Recognizing handshapes using small datasets

Detalles Bibliográficos
Autor Principal: Cornejo Fandos, U.
Otros autores o Colaboradores: Ríos, Gastón Gustavo, Ronchetti, Franco, Quiroga, Facundo Manuel, Hasperué, Waldo, Lanzarini, Laura Cristina
Formato: Capítulo de libro
Lengua:inglés
Temas:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/90457
Consultar en el Cátalogo
Resumen:Advances in convolutional neural networks have made possible significant improvements in the state-of-the-art in image classification. However, their success on a particular field rests on the possibility of obtaining labeled data to train networks. Handshape recognition from images, an important subtask of both gesture and sign language recognition, suffers from such a lack of data. Furthermore, hands are highly deformable objects and therefore handshape classification models require larger datasets. We analyze both state of the art models for image classification, as well as data augmentation schemes and specific models to tackle problems with small datasets. In particular, we perform experiments with Wide- DenseNet, a state of the art convolutional architecture and Prototypical Networks, a state of the art few-shot learning meta model. In both cases, we also quantify the impact of data augmentation on accuracy. Our results show that on small and simple data sets such as CIARP, all models and variations of achieve perfect accuracy, and therefore the utility of the data is highly doubtful, despite its having 6000 samples. On the other hand, in small but complex datasets such as LSA16 (800 samples), specialized methods such as Prototypical Networks do have an advantage over other methods. On RWTH, another complex and small dataset with close to 4000 samples, a traditional and state-of-the-art method such as Wide-DenseNet surpasses all other models. Also, data augmentation consistently increases accuracy for Wide-DenseNet, but not fo Prototypical Networks.
Notas:Formato de archivo PDF. -- Este documento es producción intelectual de la Facultad de Informática - UNLP (Colección BIPA/Biblioteca)
Descripción Física:1 archivo (939,8 kB)

MARC

LEADER 00000naa a2200000 a 4500
003 AR-LpUFIB
005 20250311170527.0
008 230201s2019 xx o 000 0 eng d
024 8 |a DIF-M8772  |b 8999  |z DIF008040 
040 |a AR-LpUFIB  |b spa  |c AR-LpUFIB 
100 1 |a Cornejo Fandos, U. 
245 1 0 |a Recognizing handshapes using small datasets 
300 |a 1 archivo (939,8 kB) 
500 |a Formato de archivo PDF. -- Este documento es producción intelectual de la Facultad de Informática - UNLP (Colección BIPA/Biblioteca) 
520 |a Advances in convolutional neural networks have made possible significant improvements in the state-of-the-art in image classification. However, their success on a particular field rests on the possibility of obtaining labeled data to train networks. Handshape recognition from images, an important subtask of both gesture and sign language recognition, suffers from such a lack of data. Furthermore, hands are highly deformable objects and therefore handshape classification models require larger datasets. We analyze both state of the art models for image classification, as well as data augmentation schemes and specific models to tackle problems with small datasets. In particular, we perform experiments with Wide- DenseNet, a state of the art convolutional architecture and Prototypical Networks, a state of the art few-shot learning meta model. In both cases, we also quantify the impact of data augmentation on accuracy. Our results show that on small and simple data sets such as CIARP, all models and variations of achieve perfect accuracy, and therefore the utility of the data is highly doubtful, despite its having 6000 samples. On the other hand, in small but complex datasets such as LSA16 (800 samples), specialized methods such as Prototypical Networks do have an advantage over other methods. On RWTH, another complex and small dataset with close to 4000 samples, a traditional and state-of-the-art method such as Wide-DenseNet surpasses all other models. Also, data augmentation consistently increases accuracy for Wide-DenseNet, but not fo Prototypical Networks. 
534 |a Congreso Argentino de Ciencias de la Computación (25to : 2019 : Río Cuarto, Córdoba) 
650 4 |a REDES NEURONALES 
653 |a lenguaje de señas 
700 1 |a Ríos, Gastón Gustavo 
700 1 |a Ronchetti, Franco 
700 1 |a Quiroga, Facundo Manuel 
700 1 |a Hasperué, Waldo 
700 1 |a Lanzarini, Laura Cristina 
856 4 0 |u http://sedici.unlp.edu.ar/handle/10915/90457 
942 |c CP 
952 |0 0  |1 0  |4 0  |6 A1285  |7 3  |8 BD  |9 84198  |a DIF  |b DIF  |d 2025-03-11  |l 0  |o A1285   |r 2025-03-11 17:05:27  |u http://catalogo.info.unlp.edu.ar/meran/getDocument.pl?id=2667  |w 2025-03-11  |y CP 
999 |c 57812  |d 57812