Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/83559
DC FieldValueLanguage
dc.contributor.advisorCosta, Ernesto Jorge Fernandes-
dc.contributor.advisorSousa, Ricardo Jorge Gamelas de-
dc.contributor.authorMarcelino, José Miguel Simões-
dc.date.accessioned2019-01-04T22:26:05Z-
dc.date.available2019-01-04T22:26:05Z-
dc.date.issued2018-07-10-
dc.date.submitted2019-01-22-
dc.identifier.urihttps://hdl.handle.net/10316/83559-
dc.descriptionDissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia-
dc.description.abstractThe world of fashion has recently been receiving a lot of at- tention. At Farfetch, the global platform for luxury, a robust search engine is required to cope with its users’ needs and ex- pectations. The fashion language, with its semantically rich expressions, context and linguistic structure give origin to chal- lenges such as the inability 1) to capture nuances of fashion epithets (for instance, semantically speaking, what is a floral dress?) and, 2) of having an algorithm able to translate the do- main into an effective identification of fashion entities. Natural Language Processing (NLP) endows our systems with capabil- ities to tackle these challenging tasks.Historically, the search engine parser has been keyword-based, involving a large number of constraint rules. This work presents a hierarchical deep learning natural language parser for fash- ion. Our proposal intends not only to recognize fashion-domain entities but also to expose syntactic and morphologic insights. We leverage the usage of an architecture of specialist models, each one for a different task (from parsing to entity recogni- tion). Such architecture renders a hierarchical model able to capture the nuances of the fashion language. The natural lan- guage parser is able to deal with textual ambiguities which are left unresolved by our currently existing solution.We conducted an extensive experimental work to assess the quality of our proposal. Having in mind the complexity of our proposed framework, we conducted an analysis of each module. Achieving 0.98, 0.8 and 0.99 in terms of F1 score in our extrinsic evaluation into the each module (Part-of-Speech (POS), Depen- dency Parsing and Named Entity Recognition (NER) respec- tively) our empirical results establish a robust baseline, which justifies the use of hierarchical architectures of deep learning models while opening new research avenues to explore.eng
dc.description.abstractO mundo da moda tem vindo a ser alvo de grande destaque. Na Farfetch, a plataforma global de luxo, ́e necess ́ario um mecanismo de pesquisa robusto de modo a cumprir as necessi- dades e expectativas dos utilizadores. A linguagem da moda, com as suas express ̃oes semanticamente ricas, com a sua es- trutura lingu ́ıstica complexa e o seu vago contexto originam grandes desafios, tais como a incapacidade: 1) de captar nu- ances de vocabul ́ario (por exemplo, semanticamente falando, o que ́e um vestido floral?) e, 2) de ter um algoritmo ca- paz de traduzir o dom ́ınio numa identifica ̧c ̃ao efetiva das enti- dades da moda. O Processamento de Linguagem Natural dota os nossos sistemas com capacidades para ultrapassar estas de- safiantes tarefas. Historicamente, o processador do motor de pesquisa tem sido baseado em palavras-chave, envolvendo um vasto nu ́mero de restri ̧c ̃oes. Este trabalho apresenta um pro- cessador de linguagem natural baseado em redes de aprendiza- gem profunda estruturadas de forma hier ́arquica no dom ́ınio da moda. A nossa abordagem pretende n ̃ao s ́o reconhecer as en- tidades do dom ́ınio da moda mas tamb ́em expor conhecimento acerca da sintaxe e morfologia. Baseamo-nos numa arquitetura de modelos especialistas, cada um com um diferente prop ́osito (desde an ́alise sint ́atica a reconhecimento de entidades). Esta arquitetura hier ́arquica permite captar as nuances da moda. O processador de linguagem natural ́e capaz de lidar com am- biguidades que n ̃ao possuem resposta adequada da solu ̧c ̃ao em vigor. Conduzimos uma vasta experimentac ̧ ̃ao de forma a val- idar os nossos resultados. Tendo em conta a complexidade da nossa solu ̧c ̃ao efetuamos uma avalia ̧c ̃ao orientada a cada m ́odulo, tendo obtido valores de ‘F1 score’ de 0.98, 0.8 e 0.99 (respetivamente, An ́alise Sint ́atica, An ́alise de Dependˆencias e Reconhecimento de Entidades). Os nossos resultados emp ́ıricos estabelecem uma base robusta, que justifica o uso de arquite- turas hier ́arquicas de modelos de aprendizagem profunda en- quanto abre novas oportunidades de investiga ̧c ̃ao.por
dc.language.isoeng-
dc.rightsclosedAccess-
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/-
dc.subjectProcessamento de Linguagem Naturalpor
dc.subjectAprendizagem Computacionalpor
dc.subjectModelos de Linguagempor
dc.subjectExtração de Informaçãopor
dc.subjectE-commerce de modapor
dc.subjectNatural Language Processing (NLP)eng
dc.subjectMachine Learning (ML)eng
dc.subjectLanguage Modelseng
dc.subjectInformation Extractioneng
dc.subjectFashion e-commerceeng
dc.titleSmartSearch – Search Engine with Natural Language Processingeng
dc.title.alternativeSmartSearch – Search Engine with Natural Language Processingpor
dc.typemasterThesis-
degois.publication.locationFarfetch-
degois.publication.titleSmartSearch – Search Engine with Natural Language Processingeng
dc.peerreviewedyes-
dc.identifier.tid202129748-
thesis.degree.disciplineInformática-
thesis.degree.grantorUniversidade de Coimbra-
thesis.degree.level1-
thesis.degree.nameMestrado em Engenharia Informática-
uc.degree.grantorUnitFaculdade de Ciências e Tecnologia - Departamento de Engenharia Informática-
uc.degree.grantorID0500-
uc.contributor.authorMarcelino, José Miguel Simões::0000-0001-7153-2205-
uc.degree.classification18-
uc.degree.presidentejuriCabral, Bruno Miguel Brás-
uc.degree.elementojuriCosta, Ernesto Jorge Fernandes-
uc.degree.elementojuriOliveira, Hugo Ricardo Gonçalo-
uc.contributor.advisorCosta, Ernesto Jorge Fernandes-
uc.contributor.advisorSousa, Ricardo Jorge Gamelas de-
uc.controloAutoridadeSim-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.openairetypemasterThesis-
item.cerifentitytypePublications-
item.grantfulltextreserved-
item.fulltextCom Texto completo-
item.languageiso639-1en-
crisitem.advisor.researchunitCISUC - Centre for Informatics and Systems of the University of Coimbra-
crisitem.advisor.parentresearchunitFaculty of Sciences and Technology-
crisitem.advisor.orcid0000-0002-8460-4033-
Appears in Collections:UC - Dissertações de Mestrado
Files in This Item:
File Description SizeFormat Login
master-thesis-final-version.pdf2.72 MBAdobe PDF    Request a copy
Show simple item record

Page view(s) 50

470
checked on Apr 16, 2024

Download(s) 50

366
checked on Apr 16, 2024

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons