Skip to content
Convergence
GitHub

Usage

build_bedpe

build_bedpe builds pairs between elements in two bed files. Pairs can be constrained by a third bed file (usually TADs) or by user-defined minimum and maximum distances. Pairs are printed in bedpe format to standard out. Pairs can be use to query .hic files with query_bedpe. View the tutorial here.

Usage and Option Summary

build_bedpe -A path/to/bed1.bed -B path/to/bed2.bed -T path/to/TADfile.bed

(or):

build_bedpe -A path/to/bed1.bed -B path/to/bed2.bed -d 10000 -D 100000

Required

Short OptionLong OptionDescription
-A--bed_APath to the first bed file
-B--bed_BPath to the second bed file

Optional

Short OptionLong OptionDescription
-T--TADPath of the TAD file to restrict pairings outside of the TAD
-d--min_distMinimum distance between pairs used to drop results. Default 0 bp
-D--max_distMaximum distance between pairs used to drop results. Default 5 Mb
-m--preserve_metaIf bed file meta data columns should be preserved. Default FALSE
-i--get_transIf pairs between different chromosomes should be made. If TRUE, will print trans pairs only. Default FALSE
-f--fractionControl the number of possible trans pairs to be printed. Between 0-1. Default = 1. Only applicable if --get_trans is TRUE.
-h--helpHelp message

get_loops

get_loops obtains loops with scores > 1. Scores are calculated using inherent normalization and printed in bedpe format to standard out.

Usage and Option Summary

get_loops \
  -A H3K27ac \
  -G hg38 \
  -R chr1:1000:5000 \
  -r 5000

Required

Short OptionLong OptionDescription
-A--sample1Name of the sample you want to use as it appears on the Tinker box
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38
-R--rangeRange to obtain loops from, in chr:start
format.
-r--resolutionResolution of sample in base pairs. Only 5000 and 1000 supported.

Optional

Short OptionLong OptionDescription
-T--TADFull path to the TAD file, the boundaries of which will be used to obtain loops
-S--scoreMinimum inherent score to get loops. Default = 1
-d--min_distMinimum distance to filter obtained loops. Default 0 bp
-h--helpHelp message

get_multisample_viewpoints

get_multisample_viewpoints is used to extract contact values from specific genomic viewpoints for multiple samples simultaneously.

Usage and Option Summary

get_multisample_viewpoints \
  -G hg38 \
  -L LAMP_DMSO,LAMP_dCBP1 \
  -R chr1:40280000:40530000:MCC7_MYCL \
  -V chr1:40400000:anchor1

Required

Short OptionLong OptionDescription
-L--listComma separated list of sample names. For ex., LAMP_DMSO,LAMP_dCBP1
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38
-R--rangeThe genomic range to extract the contact values of, in chr:start:end
format. For example: -R chr1:40280000:40530000:MCC7_MYCL
-V--viewpointViewpoint in chr:start
format. For example: -V chr1:40400000:anchor1

Optional

Short OptionLong OptionDescription
-T--tablePath to 1-col .txt file containing list of sample names, if --list option is not used
-Q--normWhich normalization to use. Strictly none, cpm or aqua in lower case. Non-spike-in samples default to cpm. Spike-in samples default to aqua.
-r--resolutionResolution of sample in base pairs, using which the contact values should be calculated. Default 5000. Accepted resolutions- 1000,5000,10000,25000,50000,100000,250000,500000,1000000,2500000
-O--output_nameIf saving to a file is desired, provide a name for the output
-h--helpHelp message

intersect_bedpe

Given a bedpe file, intersect_bedpe prints those rows of the bedpe in standard out that intersect with rows of given bed file(s) on either foot of the pair. intersect_bedpe is useful for extracting biological subsets from the bedpe.

Usage and Option Summary

intersect_bedpe -A H3K27ac -P /path/to/bedpe

(or):

intersect_bedpe -A H3K27ac -B H3K27me3 -P /path/to/bedpe

Required

Short OptionLong OptionDescription
-A--bed_APath to the first bed file
-P--bedpePath to the bedpe file

Optional

Short OptionLong OptionDescription
-F--flankGenome distance in bp that the bed should be in vicinity of either foot. Default is 0
-V--absenceIf specified, reports those rows of the bedpe that do not intersect with rows of given bed file. Default FALSE
-B--bed_BPath to the second bed file
--print_bedIf specified, reports rows of bed instead of bedpe
-h--helpHelp message

plot_APA

plot_APA generates APA (Aggregate Peak Analysis) plots using AQuA normalized contact values from genomic pair data.

Usage and Option Summary

plot_APA \
   -P /path/to/example_pairs.bedpe \
   -A H3K27ac \
   -G hg38 \
   -O /path/to/output_directory \
   -B SampleB \
   --bin_size 10000 \
   --hard_cap_cpm 50

(or):

plot_APA \
   -P /path/to/example_pairs.bedpe \
   -A H3K27ac \
   -G hg38 \
   -O /path/to/output_directory \
   -B H3K27me3

Required

Short OptionLong OptionDescription
-P--pairPath to the bedpe (pairs) file you want to use, without headers.
-A--sample1Name of the sample you want to use to create the plot, name it as it appears on the Tinker box
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38.
-O--out-dirFull path of the directory you want to store the output plots in.

Optional

Short OptionLong OptionDescription
-B--sample2The name of the second sample. If triggered, plots the delta AQuA normalized values from both samples for that pair. Useful in case vs control.
--cpmlNo input required. If —cpml is specified, CPM and AQuA APA values get normalized by the number of loops in the bedpe.
--bin_sizeBin size you want to use for the APA plots. Default = 5000.
--hard_cap_cpmIf saving to a file is desired, provide a name for the output.
--hard_cap_cpm_deltaUpper limit of the CPM delta plot range. Only for two sample analysis. If not specified, upper limit will be calculated using max delta value.
--hard_cap_aquaUpper limit of the AQuA plot range. If not specified, upper limit will be calculated using max bin value.
--hard_cap_aqua_deltaUpper limit of the AQuA delta plot range. Only for two sample analysis. If not specified, upper limit will be calculated using max delta value.
-h--helpHelp message

plot_contacts

plot_contacts creates contact plots with CPM/AQuA normalized contact values. View the tutorial here.

Usage and Option Summary

plot_contacts -A H3K27ac -R chr1:40280000:40530000:MCC7_MYCL -G hg38

(or):

plot_contacts -A H3K27ac -B H3K27me3 -g MCC7_MYCL -G hg38

Required

Short OptionLong OptionDescription
-A--sample_1Name of sample you want to use to create the contact plot, name it as it appears on the Tinkerbox
-R--rangeThe genomic range that is to be plotted, in chr:start:end
format. For example: -R chr1:40280000:40530000
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38

Optional

Short OptionLong OptionDescription
-O--output_nameProvide a name for the output pdf
-Q--normWhich normalization to use. Strictly ‘none’, ‘cpm’ or ‘aqua’ in lower case. Non-spike-in samples default to cpm. Spike-in samples default to aqua.
-B--sample_2For two sample delta plots, name of the second sample.
-r--resolutionResolution of sample in base pairs, using which the contact values should be calculated. Default 5000. Accepted resolutions- 1000,5000,10000,25000,50000,100000,250000,500000,1000000,2500000
-p--profilesIf contact profiles should be drawn along the diagonal, x axis and y axis. Default = FALSE
-o--color_one_sampleColor for contacts for single sample plots in RGB hexadecimal, ex: red = FF0000 (RRGGBB). Default = FF0000
-t--color_two_sampleColor for contacts for two sample plots (delta) in RGB hexadecimal separated by ’-’, ex: 1E90FF-C71585
--annotations_defaultDraw bed annotations; TSSs, ENCODE 3 enhancers, CpG islands. Default = TRUE
--annotations_customPath to bed file to draw custom annotations. Only one custom bed supported
--quant_cutBetween 0.00-1.00. Rather than using the max value of the matrix as the highest color, cap the values at a given percentile. Default 1.00
--max_capSet a hard cap, all values greater contact values than this will be brought down to cap value supplied
--use_dumpTRUE or FALSE. Obtain raw contact matrices along with contact plot. Default FALSE
--bedpeSupply path to a bedpe file to highlight tiles of interacting bedpe feet
--bedpe_colorColor for supplied bedpe in RGB hexadecimal. ex: C71585
-i--inherentTRUE or FALSE. If TRUE, normalize the contact plot using inherent normalization
-w--widthManually set width of printed bin between 0 and 1. Default width calculated automatically.
-g--geneProvide a gene name to automatically select TAD coordinates for interval range (-R). -g can be used in place of -R.
-h--helpHelp message. Primer can be found at https://rb.gy/fjkwkr

plot_virtual_4C

plot_virtual_4C is used for visualizing chromatin interactions, similar to what the 4C (Circular Chromosome Conformation Capture) technique does. However, instead of performing a wet-lab 4C experiment, the tool uses processed data to virtually generate a 4C profile focused on interactions of a specific genomic region (viewpoint) with the rest of the genome.

Usage and Option Summary

plot_virtual_4C
   -A H2K27ac \
   -G hg38 \
   -R chr1:40280000:40530000:MCC7_MYCL \
   -V chr1:40400000:anchor1

(or):

plot_virtual_4C
   -A H3K27ac \
   -B H3K27me3 \
   -G hg38 \
   -R chr1:40280000:40530000:MCC7_MYCL \
   -V chr1:40400000:anchor1

Required

Short OptionLong OptionDescription
-A--sample1Name of the first sample you want to use as it appears on the tinker box
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38
-R--rangeThe genomic range that is to be plotted, in chr:start:end
format. For example: -R chr1:40280000:40530000:MCC7_MYCL
-V--viewpointThe viewpoint to be considered in chr:start
format. For example: -R chr1:40400000:anchor1

Optional

Short OptionLong OptionDescription
-B--sample2Name of the second sample you want to use as it appears on the tinker box
-Q--normWhich normalization to use. Strictly none, cpm or aqua in lower case. Non-spike-in samples default to cpm. Spike-in samples default to aqua.
-r--resolutionResolution of sample in base pairs. Default 5000. Accepted resolutions: 1000,5000,10000,25000,50000,100000,250000,500000,1000000,2500000
-O--output_nameOptional: provide a name for the plot
--quant_cutHelp message
--max_capSet a hard cap, all values greater than this will be brought down to cap value supplied
--widthNumber of bins up and downstream of viewpoint locus to be considered for drawing profiles. Default 0
--heightNumeric factor to control the height of the Virtual 4C profile in the plot. Default 0.3
-h--helpHelp message

query_bedpe

query_bedpe uses a bedpe file to calculate AQuA normalized or counts-per-million (CPM) contact values for given ranges in a sample and prints to standard out. View the tutorial here.

Usage and Option Summary

query_bedpe -A H3K27ac -P path/to/pairs.bedpe -G hg38

(or):

query_bedpe -A H3K27ac -B H3K27me3 -P path/to/pairs.bedpe -G hg38

Required

Short OptionLong OptionDescription
-P--pairFull path to the bedpe (pairs) file you want to query, without headers!
-A--sample_1Name of the sample you want to use as it appears on the Tinker box
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38
-Q--normWhich normalization to use. Strictly ‘none’, ‘cpm’ or ‘aqua’ in lower case. Non-spike-in samples default to cpm. Spike-in samples default to aqua.

Optional

Short OptionLong OptionDescription
-B--sample_2The name of the second sample. If triggered, calculates the delta contact values for that pair. Useful in case vs control
-R--resolutionResolution of sample in base pairs. Default 5000. Accepted resolutions: 1000,5000,10000,25000,50000,100000,250000,500000,1000000,2500000
-f--formulaArithmetic to use to report contact values. Options: center, max, average, sum. Default = center
-F--fixIf FALSE, reports new coordinates based on arithmetic center or max. Default = TRUE
--shrink_wrapSqueezes a 2D bedpe interval until supplied value is reached. Default = FALSE
--splitSplits a 2D bedpe interval into multiple sub-intervals greater than supplied value. Default = FALSE
--paddingJoins sub-intervals in 2D space reported by —split, based on supplied value in bin units. Default = 2
--expandExpands 1D bedpe feet in both directions based on supplied value (in bin units). Default = 0
-I--inherentIf TRUE, hic values transformed to inherent units. For one-sample tests only. Default = FALSE
-h--helpHelp message. Primer can be found at https://rb.gy/zyfjxc

summarize_interval

summarize_interval counts both short-range and long-range 3D contacts within specified genomic intervals.

Usage and Option Summary

summarize_interval \
  -G hg38 \
  -I /path/to/input_bed_file.bed \
  -A H3K27ac

(or):

summarize_interval \
  -G hg38 \
  -I /path/to/input_bed_file.bed \
  -A H3K27ac \
  -B H3K27me3

Required

Short OptionLong OptionDescription
-I--inputFull path to the bed (intervals) file without headers
-A--sample1Name of the sample you want to use to calculate the contact values, as it appears on the Tinker box
-G--genomeThe genome build the sample(s) has been processed using. Strictly hg19 or hg38

Optional

Short OptionLong OptionDescription
-B--sample2Name of the second sample you want to use as it appears on the Tinker box. Useful in case vs control.
-Q--AQuAUse AQuA factors: TRUE or FALSE. Non-spike-in samples default to FALSE (CPM). Spike-in samples default to TRUE (AQUA).
-r--resolutionResolution of sample in base pairs. Default 5000. Accepted resolutions: 1000,5000,10000,25000,50000,100000,250000,500000,1000000,2500000
-D--distanceDistance in base pairs to classify short-range and long-range contact values. Default 15000.
-h--helpHelp message