Abstract: | MOTIVATION: DNA methylation plays an important role in regulating gene expression. DNA methylation is commonly analyzed using bisulfite sequencing (BS-seq)-based designs, such as whole-genome bisulfite sequencing (WGBS), reduced representation bisulfite sequencing (RRBS), and oxidative bisulfite sequencing (oxBS-seq). Furthermore, there has been growing interest in investigating the roles that genetic variants play in changing the methylation levels (i.e., methylation quantitative trait loci or meQTLs), how methylation regulates the imprinting of gene expression (i.e., allele-specific methylation or ASM), and the differentially methylated regions (DMRs) among different cell types. However, none of the current simulation tools can generate different BS-seq data types (e.g., WGBS, RRBS, and oxBS-seq) while modeling meQTLs, ASM, and DMRs. RESULTS: We developed pWGBSSimla, a profile-based bisulfite sequencing data simulator, which simulates WGBS, RRBS, and oxBS-seq data for different cell types based on real data. meQTLs and ASM are modeled based on the block structures of the methylation status at CpGs, whereas the simulation of DMRs is based on observations of methylation rates in real data. We demonstrated that pWGBSSimla adequately simulates data and allows performance comparisons among different methylation analysis methods. AVAILABILITY: pWGBSSimla is available at https://omicssimla.sourceforge.io. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |