Show simple item record

dc.contributor.authorChaudhary, Akhil
dc.date.accessioned2022-12-15T15:16:10Z
dc.date.available2022-12-15T15:16:10Z
dc.date.issued2022-12-14
dc.identifier.urihttp://hdl.handle.net/10222/82151
dc.descriptionWe presented the first zero-shot model to generate all three types of textual labels (i.e., 1. One Word Label, 2. Sentence Label, and 3. Summary Label) for automatically generated topics. We have defined our evaluation matrix based on BERTScore, which is used to measure the similarities between the generated label and gold standard labels in the case of One Worded Label and between the original Article and generated label for Sentence and Summary labels. Our zero-shot approach is sound and produces appropriate labels.en_US
dc.description.abstractAutomatic topic labelling aims to generate sound, interpretable, and meaningful topic labels used to interpret topics. A topic is usually represented by a list of terms and documents, ranked by their probability, and we are using Top2Vec for our topic modelling. Automatic Topic labelling intends to reduce the effort to interpret while investigating the topics. In this study, we introduce a novel three-phase zero-shot topic labelling framework using the ConceptNet knowledge graph (a freely-available the semantic network of words and phrases) and language models as external sources of information. The first phase uses the knowledge graph by extending the top n words (based on semantic similarity) neighbourhood and filling missing connections and information gaps by querying ConceptNet and generating a candidate sub-graph to generate candidate labels. In the second phase, it develops a neighbourhood graph for each candidate label, scores each node based on its semantic similarity with the topic and retains the best sub-graph based on semantic similarity. In the third phase, we utilize the language model to determine the labels using the final graph as input. We use a knowledge graph and language model to extend the knowledge beyond topic documents to optimize discovered topics with better representative terms while retaining the topic information. The proposed framework decreases the computation burden by utilizing a zero-shot approach and reduces the cognitive and interpretation load of the end-user by creating three types of labels for each topic, i.e., a one-word label, sentence label and summary label. The experimental results showed that our model significantly outperforms the unsupervised baselines and classic topic labelling models and is comparable to supervised baselines topic labelling models.en_US
dc.language.isoenen_US
dc.subjectTopic Labellingen_US
dc.subjectTopic Modellingen_US
dc.subjectdocument labellingen_US
dc.subjectsummarizationen_US
dc.subjectconceptneten_US
dc.subjectKnowledge Graphen_US
dc.subjectExplainableen_US
dc.subjectZero-Shoten_US
dc.subjectConceptNeten_US
dc.titleTOP2LABEL: EXPLAINABLE ZERO SHOT TOPIC LABELLING USING KNOWLEDGE GRAPHSen_US
dc.date.defence2022-12-02
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeMaster of Computer Scienceen_US
dc.contributor.external-examinerNAen_US
dc.contributor.graduate-coordinatorDr. Michael McAllisteren_US
dc.contributor.thesis-readerDr. Vlado Keseljen_US
dc.contributor.thesis-readerDr. Hassan Sajjaden_US
dc.contributor.thesis-supervisorDr. Evangelos Miliosen_US
dc.contributor.thesis-supervisorDr. Enayat Rajabien_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record