About
Apps
Protocols
Data
Workshops/Meetings

Proximity Ligation Resource

A centralized resource for researchers to self-report basic methodological details

For more information, please check out the pre-print:

Synthetic community Hi-C benchmarking provides a baseline for virus-host inferences

https://www.biorxiv.org/content/10.1101/2025.02.12.637985v2

Recommendations

As Hi-C proximity ligation becomes more widely adopted, it is important to assess whether a Z-score threshold of ≥ 0.5 is broadly generalizable. While our SynCom data show that this threshold effectively filters spurious linkages, these results are based on controlled marine virus-host systems. The underlying principle of Hi-C, cross-linking physically proximal DNA fragments, should theoretically allow its application across diverse environments. However, natural systems may introduce variability that impacts the reliability of this threshold.

To support the responsible use of Hi-C for virus-host linkage inference, we offer the following recommendations:

  1. Benchmarking and Cross-validation: Hi-C predictions should be cross-validated with in silico tools, and any discrepancies carefully examined.

  2. Z-score Filtering: Z-score thresholds should, where possible, be guided by SynCom-based benchmarking. In new environments or with poorly characterized taxa, this means conducting controlled benchmarking experiments. Where such benchmarking is not feasible, limitations should be transparently acknowledged, and associated databases annotated to avoid propagating uncertain results.

  3. Threshold Selection: The trade-off between specificity and sensitivity should be calibrated to the research question. Stricter thresholds are warranted for bold claims (e.g., cross-phyla or cross-domain linkages), while broader ecological assessments may allow more lenient filtering.

To facilitate consistency in future work, we provide a centralized database for researchers (here) to self-report basic methodological details, promoting transparency and reproducibility across studies. We will be updating this resource with new literature every 6-12 months as funding allows.

Please contact sullivan.948 at osu in order for us to add your details here!

Name
DOI
HiC Linkage Identification
Normalization Tool
Phage Identification
Publication Date
Sample Used
Virus-Host Interactions
Virus Contigs Identified
Virus Availability
Hi-C Reads Availability
Linkage Data Availably
MAGs Availability
Normalization of Linkages
Filtering for Spurious Linkages
Improving gut virome comparisons using predicted phage host information
https://doi.org/10.1128/msystems.01364-24
Proximeta bioinformatic pipeline
NA
Vibrant, Proximeta
04/2025
Fecal
1.587 phage-host pairings
3,870 vOTUs
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Host-specific viral predation network on coral reefs
https://doi.org/10.1093/ismejo/wrae240
Read Recruitment
normalization by abundance
Vibrant
01/2024
Coral
1959 Hi-C links
4,019 representative viral genomes
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Hi-C metagenome sequencing reveals soil phage–host interactions
https://doi.org/10.1038/s41467-023-42967-z
Proximeta bioinformatic pipeline
NA
VirSorter and Vibrant
11/2023
Soil
19
479 vOTU
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Virus impacted community adaptation in oligotrophic groundwater environment revealed by Hi-C coupled metagenomic and viromic study
https://doi.org/10.1016/j.jhazmat.2023.131944
Read Recruitment
Virsorter
09/2000
Groundwater
Combined with CRISPR. 469 phages linked to their host
1920 potential viruses, finding that nearly half grouped into 454 clusters
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Hi-C Metagenomics in the ICU: Exploring Clinically Relevant Features of Gut Microbiome in Chronically Critically Ill Patients
https://doi.org/10.3389/fmicb.2021.770323
Read Recruitment
HiCZin
Phaster
09/2023
Human Stool
107-159 to one MAG, 12-167 to many
1156 to 1612
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data***
https://doi.org/10.1038/s41467-023-35945-y
Binning
HiCZin
VirSorter
01/2000
Human gut, cow fecal, wastewater
not reported
791, 1338, and 2757 viral contigs from the human gut, cow fecal, and wastewater samples
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
MetaHiC phage-bacteria infection network reveals active cycling phages of the healthy human gut
https://elifesciences.org/articles/60608
Binning
Normalized by read coverage
VirSorter and Vibrant
02/2017
Human Stool
7458
9488
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Viruses interact with hosts that span distantly related microbial domains in dense hydrothermal mats
https://doi.org/10.1038/s41564-023-01347-5
Read Recruitment
HiCZin
VirSorter and Vibrant
04/2000
Hydrothermal Mat
5292 consolidated to 859
46vMAG
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Assignment of virus and antimicrobial resistance genes to microbial hosts in a complex microbial community by combined long-read assembly and proximity ligation
https://doi.org/10.1186/s13059-019-1760-x
Read Recruitment
NA
Blobtools/DIAMOND
04/2023
Cow Rumen
297
8702
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay
https://doi.org/10.1126/sciadv.1602105
Read Recruitment
Normalized by read coverage
Blast
02/2000
Mice Feces
40 phages linked to 47 hosts
43
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No