Provided by: metabat_2.15-4_amd64 

NAME
metabat1 - MetaBAT: Metagenome Binning based on Abundance and Tetranucleotide frequency (version 1)
DESCRIPTION
MetaBAT: Metagenome Binning based on Abundance and Tetranucleotide frequency (version 1) by Don Kang
(ddkang@lbl.gov), Jeff Froula, Rob Egan, and Zhong Wang (zhongwang@lbl.gov)
OPTIONS
-h [ --help ]
produce help message
-i [ --inFile ] arg
Contigs in (gzipped) fasta file format [Mandatory]
-o [ --outFile ] arg
Base file name for each bin. The default output is fasta format. Use -l option to output only
contig names [Mandatory]
-a [ --abdFile ] arg
A file having mean and variance of base coverage depth (tab delimited; the first column should be
contig names, and the first row will be considered as the header and be skipped) [Optional]
--cvExt
When a coverage file without variance (from third party tools) is used instead of abdFile from
jgi_summarize_bam_contig_depths
-p [ --pairFile ] arg
A file having paired reads mapping information. Use it to increase sensitivity. (tab delimited;
should have 3 columns of contig index (ordered by), its mate contig index, and supporting mean
read coverage. The first row will be considered as the header and be skipped) [Optional]
--p1 arg (=0)
Probability cutoff for bin seeding. It mainly controls the number of potential bins and their
specificity. The higher, the more (specific) bins would be. (Percentage; Should be between 0 and
100)
--p2 arg (=0)
Probability cutoff for secondary neighbors. It supports p1 and better be close to p1. (Percentage;
Should be between 0 and 100)
--minProb arg (=0)
Minimum probability for binning consideration. It controls sensitivity. Usually it should be >=
75. (Percentage; Should be between 0 and 100)
--minBinned arg (=0)
Minimum proportion of already binned neighbors for one's membership inference. It contorls
specificity. Usually it would be <= 50 (Percentage; Should be between 0 and 100)
--verysensitive
For greater sensitivity, especially in a simple community. It is the shortcut for --p1 90 --p2 85
--pB 20 --minProb 75 --minBinned 20 --minCorr 90
--sensitive
For better sensitivity [default]. It is the shortcut for --p1 90 --p2 90 --pB 20 --minProb 80
--minBinned 40 --minCorr 92
--specific
For better specificity. Different from --sensitive when using correlation binning or ensemble
binning. It is the shortcut for --p1 90 --p2 90 --pB 30 --minProb 80 --minBinned 40 --minCorr 96
--veryspecific
For greater specificity. No correlation binning for short contig recruiting. It is the shortcut
for --p1 90 --p2 90 --pB 40 --minProb 80 --minBinned 40
--superspecific
For the best specificity. It is the shortcut for --p1 95 --p2 90 --pB 50 --minProb 80 --minBinned
20
--minCorr arg (=0)
Minimum pearson correlation coefficient for binning missed contigs to increase sensitivity
(Helpful when there are many samples). Should be very high (>=90) to reduce contamination.
(Percentage; Should be between 0 and 100; 0 disables)
--minSamples arg (=10)
Minimum number of sample sizes for considering correlation based recruiting
-x [ --minCV ] arg (=1)
Minimum mean coverage of a contig to consider for abundance distance calculation in each library
--minCVSum arg (=2)
Minimum total mean coverage of a contig (sum of all libraries) to consider for abundance distance
calculation
-s [ --minClsSize ] arg (=200000) Minimum size of a bin to be considered as the output
-m [ --minContig ] arg (=2500)
Minimum size of a contig to be considered for binning (should be >=1500; ideally >=2500). If # of
samples >= minSamples, small contigs (>=1000) will be given a chance to be recruited to existing
bins by default.
--minContigByCorr arg (=1000)
Minimum size of a contig to be considered for recruiting by pearson correlation coefficients
(activated only if # of samples >= minSamples; disabled when minContigByCorr > minContig)
-t [ --numThreads ] arg (=0)
Number of threads to use (0: use all cores)
--minShared arg (=50)
Percentage cutoff for merging fuzzy contigs
--fuzzy
Binning with fuzziness which assigns multiple memberships of a contig to bins (activated only with
--pairFile at the moment)
-l [ --onlyLabel ]
Output only sequence labels as a list in a column without sequences
-S [ --sumLowCV ]
If set, then every sample that falls below the minCV will be used in an aggregate sample
-V [ --maxVarRatio ] arg (=0)
Ignore any contigs where variance / mean exceeds this ratio (0 disables)
--saveTNF arg
File to save (or load if exists) TNF matrix for each contig in input
--saveDistance arg
File to save (or load if exists) distance graph at lowest probability cutoff
--saveCls
Save cluster memberships as a matrix format
--unbinned
Generate [outFile].unbinned.fa file for unbinned contigs
--noBinOut
No bin output. Usually combined with --saveCls to check only contig memberships
-B [ --B ] arg (=20)
Number of bootstrapping for ensemble binning (Recommended to be >=20)
--pB arg (=50)
Proportion of shared membership in bootstrapping. Major control for sensitivity/specificity. The
higher, the specific. (Percentage; Should be between 0 and 100)
--seed arg (=0)
For reproducibility in ensemble binning, though it might produce slightly different results. (0:
use random seed)
--keep Keep the intermediate files for later usage
-d [ --debug ]
Debug output
-v [ --verbose ]
Verbose output
AUTHOR
This manpage was written by Andreas Tille for the Debian distribution and
can be used for any other usage of the program.
metabat1 2.15 May 2020 METABAT1(1)