Database Darkly provides insight into a deep-sea survey of microbial eukaryote sequences. Because these sequences belong to protistan species that we do not necessarily have microscopic images of or representatives in culture… we worked to compile what we know about each reference here.
As a community, we hope that this information can be used to link across other studies so we can expand what we know about these environmental strains.
First, download qiime2-output-files_Hu-et-al.tar from Zenodo link, extract files.
mkdir qiime2-output
mv qiime2-output-files_Hu-et-al.tar qiime2-output
cd qiime2-output
tar -xf qiime2-output-files_Hu-et-al.tar
Since DADA2 determination of ASVs is most appropriate by sequence library run, there are three separate ASV datasets. These were merged to create the microeuk-merged data.
1.2.2 Extract fasta files for reference database
You do not need QIIME2 installed to obtain the reference sequences. This can be extracted like a normal zip file. Move the reference sequences to the qiime2 output directory.
Rows: 17934 Columns: 98
── Column specification ────────────────────────────────────────────────────────
Delimiter: "\t"
chr (1): #OTU ID
dbl (97): 101_GR_substrate_MC3_Riftia_6_0_Jun2021, 102_GR_substrate_MC3_Shel...
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
# head(asv_table)
Import taxonomy information
tax_table <-read_delim("input-data/taxonomy.tsv")
Rows: 17934 Columns: 3
── Column specification ────────────────────────────────────────────────────────
Delimiter: "\t"
chr (2): Feature ID, Taxon
dbl (1): Consensus
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 100 Columns: 20
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (17): SAMPLE, VENT, COORDINATES, SITE, Sample_or_Control, SAMPLEID, SAMP...
dbl (3): ref_num, YEAR, Perc
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.