Use this identifier to cite or link to this record: http://hdl.handle.net/10071/29139
Authors: Dias, J. M. S.
Nande, P.
Santos, P.
Barata, N.
Correia, A.
Editor: Marcos, A., Mendonça, A., Leitão, M., Costa, A., and Jorge, J.
Date: 2021
Title: Image manipulation through gestures
Book title and volume: Atas do 12º Encontro Português de Computação Gráfica
Pages: 111 - 118
Event title: 12º Encontro Português de Computação Gráfica
Citation: Dias, J. M. S., Nande, P., Santos, P., Barata, N., & Correia, A. (2021). Image manipulation through gestures. In A. Marcos, A. Mendonça, M. Leitão, A. Costa, & J. Jorge (Eds.), Actas do 12º Encontro Português de Computação Gráfica (pp. 111-118). Eurographics Association. https://doi.org/10.2312/pt.20031431
ISBN: 978-3-03868-163-2
DOI (Digital Object Identifier): 10.2312/pt.20031431
Keywords: Augmented virtuality
Tangible interfaces
AR toolkit
Hand and finger gesture
Digital video editing
Image browsing
Filmstrip
Abstract: In this work, we present a novel free-hand gesture user interface based on detecting the trajectory of fiducial markers attached to the user's fingers and wrists, able to interact with the image sequence of a digital video piece. The video is modelled as a decomposition into a sequence of frames, or filmstrip. Sensor-less and cable-less interfaces let a user interact intuitively with the filmstrip through gestures, within an Augmented Virtuality usage scenario. By simply gesturing, users can select at random, drag, release, delete or zoom image frames, browse the filmstrip at a controlled user-defined rate, and issue start, end, stop and play commands to better control the digital video sequence. A fixed video camera monitors user interaction by tracking the fiducial markers as the user gestures. This scheme lets the system simplify the more complex problem of markerless free-hand gesture tracking. Once the computer vision layer detects and recognises the markers in real time, the system obtains the 3D pose (position and orientation) of each marker centre relative to a virtual camera reference frame, whose mathematical model matches the real video camera. We are specifically interested in the poses of the left and right wrists, thumbs and index fingers. By projecting the positions of these poses onto the 2D visualization window, a simple topological analysis based on the kinematic evolution of distances and angles can be implemented, enabling gesture recognition, the activation of system functions and, subsequently, specific gesture-based user interaction for a given active functionality. This interaction affects the shape, scale factor, position and visualisation of scene objects, that is, filmstrip frames.
For the computer vision layer, our system adopts ARToolKit, a C/OpenGL-based open-source library that uses accurate vision-based tracking methods to determine the virtual camera pose through the real-time detection of fiducial markers. The graphical output is implemented with C++/OpenGL. Our proposed system is general because it can interact with any filmstrip obtained a priori from a digital video source.
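Illustrative sketch (not part of the original record): the abstract describes projecting marker positions onto the 2D window and analysing distances and angles between them. The Python below is a minimal sketch of that idea under stated assumptions. It uses an ideal pinhole camera, and all function names, the focal length, the principal point and the pixel threshold are hypothetical, chosen only for illustration. It is not the authors' implementation and does not use the ARToolKit API.

```python
import math


def project_point(point_3d, focal_length, principal_point):
    """Project a camera-frame 3D point onto the image plane (ideal pinhole model)."""
    x, y, z = point_3d
    if z <= 0:
        raise ValueError("point must lie in front of the camera (z > 0)")
    cx, cy = principal_point
    return (focal_length * x / z + cx, focal_length * y / z + cy)


def distance_2d(a, b):
    """Euclidean distance between two 2D points, in pixels."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def angle_at(vertex, a, b):
    """Unsigned angle in degrees at `vertex` between the rays towards `a` and `b`."""
    v1 = (a[0] - vertex[0], a[1] - vertex[1])
    v2 = (b[0] - vertex[0], b[1] - vertex[1])
    cross = v1[0] * v2[1] - v1[1] * v2[0]
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    return abs(math.degrees(math.atan2(cross, dot)))


def is_pinch(thumb_2d, index_2d, threshold_px):
    """Hypothetical gesture test: thumb and index closer than a pixel threshold."""
    return distance_2d(thumb_2d, index_2d) < threshold_px


# Assumed camera intrinsics and marker positions (metres, camera frame).
FOCAL_LENGTH = 500.0
PRINCIPAL_POINT = (320.0, 240.0)

thumb = project_point((0.00, 0.0, 1.0), FOCAL_LENGTH, PRINCIPAL_POINT)
index = project_point((0.02, 0.0, 1.0), FOCAL_LENGTH, PRINCIPAL_POINT)
wrist = project_point((0.00, 0.1, 1.0), FOCAL_LENGTH, PRINCIPAL_POINT)

pinch_detected = is_pinch(thumb, index, threshold_px=15.0)
opening_angle = angle_at(wrist, thumb, index)
```

Evaluating these distances and angles on every video frame and watching how they change over time would give the kinematic evolution mentioned in the abstract; the fixed threshold here is only a placeholder for whatever decision rule a real system applies.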
Peer reviewed: yes
Access: Open Access
Appears in collections: ISTAR-CRN - Comunicações a conferências nacionais

Files in this record:
File                          Size        Format
conferenceobject_96743.pdf    209.24 kB   Adobe PDF



All records in the repository are protected by copyright law, with all rights reserved.