Skip to main content

dRep

Rapid and accurate comparison and de-replication of microbial genomes


File naming

The dRep format is a special case that requires two files:

  • Cdb.csv
  • Wdb.csv

Their names have to be exactly as above.

File format

tip

For more information on the dRep output files, visit the dRep documentation.

Cdb.tsv

This file informs the cluster of every MAG.

The file must follow the Tab Separated Values (TSV). It must have columns representing the following data, in that order and with a header:

Column nameColumn obligatorinessData typeData nullability
genomeMandatoryStringNot nullable
secondary_clusterMandatoryStringNullable
thresholdOptional (ignored)N/AN/A
cluster_methodOptional (ignored)N/AN/A
comparison_algorithmOptional (ignored)N/AN/A
primary_clusterOptional (ignored)N/AN/A

Wdb.tsv

This file informs the "winners" (i.e. best representatives) of each cluster.

The file must follow the Tab Separated Values (TSV). It must have columns representing the following data, in that order and with a header:

Column nameColumn obligatorinessData typeData nullability
genomeMandatoryStringNot nullable
scoreOptional (ignored)N/AN/A
clusterOptional (ignored)N/AN/A

Mapping to database

DrepDirectory

Original dataDrepDirectory field
dRep directory pathpath 1

DrepEntry

Original dataDrepEntry field
genome column of Wdb.csvwinner 2
genome column of Cdb.csvgenome_name
secondary_cluster column of Cdb.csvgenome_cluster_name

Footnotes

  1. This is the path to the directory that contains both Cdb.csv and Wdb.csv.

  2. MAGs whose names are in Wdb.csv are the winners of theirs clusters.