Utilize este identificador para referenciar este registo: http://hdl.handle.net/10071/23678
Autoria: Coelho, J.
Neto, A.
Tavares, M.
Coutinho, C.
Oliveira, J.
Ribeiro, R.
Batista, F.
Editor: Cucchiara, R., Fred, A., & Filipe, J.
Data: 2021
Título próprio: Transformer-based language models for semantic search and mobile applications retrieval
Volume: 1
Paginação: 225 - 232
Título do evento: 13th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR
ISSN: 2184-3228
ISBN: 978-989-758-533-3
DOI (Digital Object Identifier): 10.5220/0010657300003064
Palavras-chave: Semantic search
Word embeddings
ElasticSearch
Mobile applications
Transformer-based models
Resumo: Search engines are being extensively used by Mobile App Stores, where millions of users world-wide use them every day. However, some stores still resort to simple lexical-based search engines, despite the recent advances in Machine Learning, Information Retrieval, and Natural Language Processing, which allow for richer semantic strategies. This work proposes an approach for semantic search of mobile applications that relies on transformer-based language models, fine-tuned with the existing textual information about known mobile applications. Our approach relies solely on the application name and on the unstructured textual information contained in its description. A dataset of about 500 thousand mobile apps was extended in the scope of this work with a test set, and all the available textual data was used to fine-tune our neural language models. We have evaluated our models using a public dataset that includes information about 43 thousand applications, and 56 manually annotated non- exact queries. The results show that our model surpasses the performance of all the other retrieval strategies reported in the literature. Tests with users have confirmed the performance of our semantic search approach, when compared with an existing deployed solution.
Arbitragem científica: yes
Acesso: Acesso Aberto
Aparece nas coleções:CTI-CRI - Comunicações a conferências internacionais
ISTAR-CRI - Comunicações a conferências internacionais
IT-CRI - Comunicações a conferências internacionais

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
conferenceobject_82724.pdfVersão Aceite274,2 kBAdobe PDFVer/Abrir


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis Logotipo do Orcid 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.