find_redundant_contigs
find_redundant_contigs |
R Documentation |
Find redundant contigs within a bin
Description
Find contigs with overlapping marker genes in a given bin, and suggest contigs to be removed in order to reduce contamination without affecting completeness. Note that this can give a quick idea of the contigs that are sources of contamination within a bin, but is not a replacement for proper bin refininement with other tools such as anvi\’o.
Usage
find_redundant_contigs(SQM, bin, minimum_overlap_for_removal = 1)
Arguments
|
A SQM object. |
|
character. Name of the bin to be created. |
|
numeric. Fraction of marker genes
in the contigs present in another
contig needed to suggest it for
removal. If set to |
Value
A character vector with the contigs deemed to be redundant. A heatmap showing how marker genes overlap over different contigs will also be produced.
See Also
create_bin, remove_contigs_from_bin
Examples
data(Hadza)
bin_name = "Hadza2merged.concoct.28.fa.contigs"
# Get redundant contigs that could be removed from our bin
candidates_for_removal = find_redundant_contigs(Hadza, bin_name)
# We can now remove them from the bin
Hadza.new.1 = remove_contigs_from_bin(Hadza, bin_name, candidates_for_removal)
# Or we can create a new bin out of them
# which will also remove them from the original bin
Hadza.new.2 = create_bin(Hadza, "new_bin_name", candidates_for_removal)