A systematic evaluation of Hi-C data enhancement methods for enhancing PLAC-seq and HiChIP data

Huang, Le; Yang, Yuchen; Li, Gang; Jiang, Minzhi; Wen, Jia; Abnousi, Armen; Rosen, Jonathan D.; Hu, Ming; Li, Yun

Published in

Oxford University Press, Briefings in Bioinformatics, 3(23), 2022

DOI: 10.1093/bib/bbac145

Tools

Export citation

Search in Google Scholar

A systematic evaluation of Hi-C data enhancement methods for enhancing PLAC-seq and HiChIP data

Journal article published in 2022 by Le Huang, Yuchen Yang, Gang Li

, Minzhi Jiang, Jia Wen, Armen Abnousi, Jonathan D. Rosen, Ming Hu

, Yun Li

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Abstract The three-dimensional organization of chromatin plays a critical role in gene regulation. Recently developed technologies, such as HiChIP and proximity ligation-assisted ChIP-Seq (PLAC-seq) (hereafter referred to as HP for brevity), can measure chromosome spatial organization by interrogating chromatin interactions mediated by a protein of interest. While offering cost-efficiency over genome-wide unbiased high-throughput chromosome conformation capture (Hi-C) data, HP data remain sparse at kilobase (Kb) resolution with the current sequencing depth in the order of 108 reads per sample. Deep learning models, including HiCPlus, HiCNN, HiCNN2, DeepHiC and Variationally Encoded Hi-C Loss Enhancer (VEHiCLE), have been developed to enhance the sequencing depth of Hi-C data, but their performance on HP data has not been benchmarked. Here, we performed a comprehensive evaluation of HP data sequencing depth enhancement using models developed for Hi-C data. Specifically, we analyzed various HP data, including Smc1a HiChIP data of the human lymphoblastoid cell line GM12878, H3K4me3 PLAC-seq data of four human neural cell types as well as of mouse embryonic stem cells (mESC), and mESC CCCTC-binding factor (CTCF) PLAC-seq data. Our evaluations lead to the following three findings: (i) most models developed for Hi-C data achieve reasonable performance when applied to HP data (e.g. with Pearson correlation ranging 0.76–0.95 for pairs of loci within 300 Kb), and the enhanced datasets lead to improved statistical power for detecting long-range chromatin interactions, (ii) models trained on HP data outperform those trained on Hi-C data and (iii) most models are transferable across cell types. Our results provide a general guideline for HP data enhancement using existing methods designed for Hi-C data.

Published in

Links

Tools

A systematic evaluation of Hi-C data enhancement methods for enhancing PLAC-seq and HiChIP data

Abstract