Publication:
A draft UAE-based Arab pangenome reference.

dc.contributor.authorNassir, Nasna
dc.contributor.authorAlmarri, Mohamed A
dc.contributor.authorKumail, Muhammad
dc.contributor.authorMohamed, Nesrin
dc.contributor.authorBalan, Bipin
dc.contributor.authorHanif, Shehzad
dc.contributor.authorAlObathani, Maryam
dc.contributor.authorJamalalail, Bassam
dc.contributor.authorElsokary, Hanan
dc.contributor.authorKondaramage, Dasuki
dc.contributor.authorShiyas, Suhana
dc.contributor.authorKosaji, Noor
dc.contributor.authorSatsangi, Dharana
dc.contributor.authorSuwaidi, Hanan Al
dc.contributor.authorAlbanna, Ammar
dc.contributor.authorS Du Plessis, Stefan
dc.contributor.authorKhansaheb, Hamda Hassan
dc.contributor.authorAlsheikh-Ali, Alawi
dc.contributor.authorUddin, Mohammed
dc.date.accessioned2025-09-02T04:46:47Z
dc.date.available2025-09-02T04:46:47Z
dc.date.issued2025-07-24
dc.description.abstractPangenomes provide a robust and comprehensive portrayal of genetic diversity in humans, but Arab populations remain underrepresented. We present a preliminary UAE-based Arab Pangenome Reference (UPR) utilizing 53 individuals of diverse Arab ethnicities residing in the United Arab Emirates. We assembled nuclear and mitochondrial pangenomes using 35.27X high-fidelity long reads, 54.22X ultralong reads and 65.46X Hi-C reads. This approach yielded contiguous haplotype-phased de novo assemblies of exceptional quality, with an average N50 of 124.28 Mb. We discovered 111.96 million base pairs of previously uncharacterized euchromatic sequences absent from existing human pangenomes, the T2T-CHM13 and GRCh38 reference human genomes, and other public datasets. Moreover, we identified 8.94 million population-specific small variants and 235,195 structural variants within the Arab pangenome, not present in linear and pangenome references and public datasets. We detected 883 gene duplications, including the TATA-binding protein gene TAF11L5, which was uniquely duplicated across all Arab populations and that included 15.06% of genes associated with recessive diseases. By exploring the mitochondrial pangenome, we identified 1,436 bp of previously unreported sequences. Our study provides a valuable resource for future genetic research and genomic medicine initiatives in Arab population and other population with similar genetic backgrounds.
dc.identifier.other40707445
dc.identifier.urihttps://repository.mbru.ac.ae/handle/1/1727
dc.language.isoen
dc.subjectArab pangenome reference
dc.subjectPangenomes
dc.subjectUAE
dc.subjectUnited Arab Emirates
dc.titleA draft UAE-based Arab pangenome reference.
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
A draft UAE-based Arab pangenome reference.pdf
Size:
2.23 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: