dc.contributor.author | Li, Yansong | |
dc.date.accessioned | 2023-08-21T16:47:30Z | |
dc.date.available | 2023-08-21T16:47:30Z | |
dc.date.issued | 2023-08-21 | |
dc.identifier.uri | http://hdl.handle.net/10222/82808 | |
dc.description.abstract | The Burrows-Wheeler Transform (BWT) is a widely used succinct data structure in bioinformatics. However, one of the main concerns that researchers still face when using BWT is its processing time due to the lack of access locality. The (Last to First) LF mapping is derived from the BWT to facilitate backward searches, each step of the LF mapping tends to jump to a completely different spot of the BWT, resulting in at least one cache miss per step. Our project endeavours to minimize the processing time of BWT through the optimization of BWT layout. This objective is achieved by block partitioning and rearranging the layout of the BWT. Two proxy measures were introduced to represent the cache misses, but the reduced proxies did not correspond to an improved running time. However, the blocked partitioned BWT representation still achieved a 50% speedup with limited memory and a hard disk drive (HDD). | en_US |
dc.language.iso | en | en_US |
dc.subject | Data Structures | en_US |
dc.subject | Algorithms | en_US |
dc.subject | Bioinformatics | en_US |
dc.title | A Cache-Friendly BWT Layout | en_US |
dc.date.defence | 2023-07-21 | |
dc.contributor.department | Faculty of Computer Science | en_US |
dc.contributor.degree | Master of Computer Science | en_US |
dc.contributor.external-examiner | n/a | en_US |
dc.contributor.graduate-coordinator | Michael McAllister | en_US |
dc.contributor.thesis-reader | Chris Whidden | en_US |
dc.contributor.thesis-reader | Meng He | en_US |
dc.contributor.thesis-supervisor | Travis Gagie | en_US |
dc.contributor.thesis-supervisor | Norbert Zeh | en_US |
dc.contributor.ethics-approval | Not Applicable | en_US |
dc.contributor.manuscripts | Not Applicable | en_US |
dc.contributor.copyright-release | Not Applicable | en_US |