A binary segmentation method for detecting topological domains in Hi-C data

N. Raveendran, G. Sofronov

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

Abstract

The three-dimensional (3D) architecture of chromosomes in nuclear space plays an important role in studying gene expression and regulation in cell biology. In particular, chromosome conformation capture (3C) techniques are used to study the spatial structure of chromosomes. Many such methods have been developed in the last two decades. Among them, Hi-C is a technology that uses a deep sequencing approach to detect the 3D spatial organization of a genome. That is, Hi-C allows us to evaluate spatial proximity between any pair of loci along the genome. This results in an interaction matrix with the frequency of interactions between genomic loci that physically interact in the nucleus. Highly self-interacting regions appear in the contact map of such interaction matrices. These regions are called topological domains and they play an important role in regulating gene expression and other genomic functions. Thus detecting such topological domains will provide new insights on chromosomal conformation in better understanding of cell functioning and various diseases. The topological domains centered along a diagonal region in contact maps are more likely to exist and prominent in data. In this study, we focus on detecting such domains, and we approach this problem as a two-dimensional segmentation problem. To solve this segmentation problem, we propose an algorithm based on the binary segmentation method, a well-known recursive partitioning technique used in change point detection problems. Our numerical experiments illustrate the usefulness of this approach. We obtain estimates for the number of diagonal blocks and their boundaries in an artificially generated data matrix and compare the results of these estimates to those obtained with the HiCSeg R package. We conclude that binary segmentation method works well in identifying such domains with easy implementation and a low computational cost.
Original languageEnglish
Title of host publicationProceedings of the 25th International Congress on Modelling and Simulation (MODSIM 2023), Darwin Convention Centre, Darwin, Northern Territory, 9 - 14 July 2023
EditorsJai Vaze, Chris Chilcott, Lindsay Hutley, Susan M. Cuddy
PublisherModelling and Simulation Society of Australia and New Zealand
Pages737-743
Number of pages7
ISBN (Print)9780987214300
DOIs
Publication statusPublished - 2023
EventMSSANZ/IMACS Biennial Conference on Modelling and Simulation -
Duration: 9 Jul 2023 → …

Publication series

NameProceedings of the International Congress on Modelling and Simulation, MODSIM
ISSN (Electronic)2981-8001

Conference

ConferenceMSSANZ/IMACS Biennial Conference on Modelling and Simulation
Period9/07/23 → …

Bibliographical note

Publisher Copyright:
© 2023 Proceedings of the International Congress on Modelling and Simulation, MODSIM. All rights reserved.

Open Access - Access Right Statement

These proceedings are licensed under the terms of the Creative Commons Attribution 4.0 International CC BY License (http://creativecommons.org/ licenses/by/4.0)_which permits use_sharing_adaptation_distribution and reproduction in any medium or format_as long as you attribute MSSANZ and the original author(s) and source_provide a link to the Creative Commons licence and indicate if changes were made. Images or other third party material are included in this licence_unless otherwise indicated in a credit line to the material. Individual MODSIM papers are copyright of the Authors and Modelling and Simulation Society of Australia and New Zealand Inc. (MSSANZ). MSSANZ is the publisher of the MODSIM Proceedings.

Keywords

  • binary segmentation method
  • Hi-C data
  • two-dimensional segmentation

Fingerprint

Dive into the research topics of 'A binary segmentation method for detecting topological domains in Hi-C data'. Together they form a unique fingerprint.

Cite this