dc.contributor.author | Sarvmaili, Mahtab | |
dc.date.accessioned | 2024-08-26T18:54:11Z | |
dc.date.available | 2024-08-26T18:54:11Z | |
dc.date.issued | 2024-08-23 | |
dc.identifier.uri | http://hdl.handle.net/10222/84476 | |
dc.description.abstract | Over the past decade, complex black-box models have excelled in various tasks, but their lack of transparency undermines trust in their predictions. This study contributes to Explainable AI (XAI) by introducing data-centric post-hoc explainers. We present two frameworks, FEHAN and DICTA, for locally explaining text classifiers through interpretable surrogate models. Experimental evaluations on four datasets demonstrate the effectiveness of both frameworks, which focus on simplifying the explanation process. Additionally, we explore the explainability of Graph Convolutional Networks (GCNs) applied to molecular structures, offering multiple perspectives on their predictions. We also introduce HD-Explain, a post-hoc, model-aware, example-based explanation method for neural classifiers. HD-Explain uses Kernelized Stein Discrepancy (KSD) to identify influential training data points and potential distribution mismatches. This research advances the understanding of data contributions to machine learning models and addresses the emerging challenge of Machine Unlearning (MU) by leveraging insights into data-model interactions. | en_US |
dc.language.iso | en | en_US |
dc.subject | Data Centric Explainable AI | en_US |
dc.subject | Data Centric Model Editing | en_US |
dc.subject | Data Centric Prediction Explanation | en_US |
dc.title | Data-centric Prediction Explanation and Model Editing for Deep Neural Networks | en_US |
dc.date.defence | 2024-08-16 | |
dc.contributor.department | Faculty of Computer Science | en_US |
dc.contributor.degree | Doctor of Philosophy | en_US |
dc.contributor.external-examiner | Randy Goebel | en_US |
dc.contributor.thesis-reader | Vlado Keselj | en_US |
dc.contributor.thesis-reader | Hassan Sajjad | en_US |
dc.contributor.thesis-supervisor | Ga Wu | en_US |
dc.contributor.ethics-approval | Not Applicable | en_US |
dc.contributor.manuscripts | Not Applicable | en_US |
dc.contributor.copyright-release | Not Applicable | en_US |