doi: 10.53962/q5n4-a3jp
Originally published on 2022-03-12 under a CC BY 4.0
We have implemented a two-step scalable approach to detect variants: first we construct a graph pangenome from a graphical fragment assembly (GFA) file that stores the fragments, where each fragment corresponds to a vertex of the graph, then we analyze the graph to detect all variants. We have tested our approach on a SARS-CoV-2 dataset with over 7800 fragments and on a dataset that contains all alternative sequences of the highly polymorphic human leukocyte antigen (HLA) complex.