NyuWa Genome resource: a deep whole-genome sequencing-based variation profile and reference panel for the Chinese population

P Zhang, H Luo, Y Li, Y Wang, J Wang, Y Zheng, Y Niu, Y Shi, H Zhou, T Song, Q Kang, T Xu…
Cell Reports, 2021
Summary
The lack of haplotype reference panels and whole-genome sequencing resources specific to the Chinese population has greatly hindered genetic studies in the world's largest population. Here, we present the NyuWa genome resource, based on deep (26.2×) sequencing of 2,999 Chinese individuals, and construct a NyuWa reference panel of 5,804 haplotypes and 19.3 million variants, a high-quality, publicly available, Chinese population-specific reference panel with thousands of samples. Compared with other panels, the NyuWa reference panel reduces the imputation error rate for Han Chinese by 30% to 51%. Population structure and imputation simulation tests support the applicability of one integrated reference panel for both northern and southern Chinese. In addition, a total of 22,504 loss-of-function variants in coding and noncoding genes are identified, including 11,493 novel variants. These results highlight the value of the NyuWa genome resource in facilitating genetic research in Chinese and Asian populations.