<ddi:DDIInstance xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="ddi:instance:3_3 http://ddialliance.org/Specification/DDI-Lifecycle/3.3/XMLSchema/instance.xsd" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:ddi="ddi:instance:3_3" xmlns:r="ddi:reusable:3_3" xmlns:s="ddi:studyunit:3_3" xmlns:d="ddi:datacollection:3_3" xmlns:a="ddi:archive:3_3" xmlns:c="ddi:conceptualcomponent:3_3" xmlns:cm="ddi:comparative:3_3" xmlns:g="ddi:group:3_3" xmlns:l="ddi:logicalproduct:3_3" xmlns:p="ddi:physicaldataproduct:3_3" xmlns:pi="ddi:physicalinstance:3_3" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xhtml="http://www.w3.org/1999/xhtml" xmlns:xml="http://www.w3.org/XML/1998/namespace" isMaintainable="true" scopeOfUniqueness="Agency">
  <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964:0</r:URN>
  <r:Agency>SND</r:Agency>
  <r:ID>doi-10-17044-scilifelab-28327964</r:ID>
  <r:Version>0</r:Version>
  <g:ResourcePackage>
    <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.ResourcePackage:2.0</r:URN>
    <r:OtherMaterialScheme>
      <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.OtherMaterialScheme:2.0</r:URN>
    </r:OtherMaterialScheme>
    <a:OrganizationScheme>
      <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.OrganizationScheme-0:2.0</r:URN>
      <a:Individual>
        <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.Individual-0:2.0</r:URN>
        <r:UserAttributePair>
          <r:AttributeKey>affiliation</r:AttributeKey>
          <r:AttributeValue>Science for Life Laboratory</r:AttributeValue>
        </r:UserAttributePair>
        <a:IndividualIdentification>
          <a:IndividualName>
            <a:FirstGiven>Sarahi</a:FirstGiven>
            <a:LastFamily>Garcia</a:LastFamily>
            <a:FullName>
              <r:String>Sarahi Garcia</r:String>
            </a:FullName>
          </a:IndividualName>
          <a:ResearcherID>
            <a:TypeOfID>ORCID</a:TypeOfID>
            <a:ResearcherIdentification>0000-0002-8622-0308</a:ResearcherIdentification>
          </a:ResearcherID>
        </a:IndividualIdentification>
      </a:Individual>
      <a:Individual>
        <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.Individual-0:2.0</r:URN>
        <a:IndividualIdentification>
          <a:IndividualName>
            <a:FirstGiven>Alejandro</a:FirstGiven>
            <a:LastFamily>Rodríguez Gijón</a:LastFamily>
            <a:FullName>
              <r:String>Alejandro Rodríguez Gijón</r:String>
            </a:FullName>
          </a:IndividualName>
          <a:ResearcherID>
            <a:TypeOfID>ORCID</a:TypeOfID>
            <a:ResearcherIdentification>0000-0002-1649-6894</a:ResearcherIdentification>
          </a:ResearcherID>
        </a:IndividualIdentification>
      </a:Individual>
    </a:OrganizationScheme>
  </g:ResourcePackage>
  <s:StudyUnit>
    <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.StudyUnit:2.0</r:URN>
    <r:UserID typeOfUserID="datasetIdentifier">doi-10-17044-scilifelab-28327964</r:UserID>
    <r:Citation>
      <r:Title>
        <r:String xml:lang="en">FRESH-MAP dataset: study on the ecological success of streamlined aquatic microorganisms</r:String>
      </r:Title>
      <r:Creator>
        <r:CreatorReference>
          <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.Individual-0:2.0</r:URN>
          <r:TypeOfObject>Individual</r:TypeOfObject>
        </r:CreatorReference>
      </r:Creator>
      <r:Publisher>
        <r:PublisherName>
          <r:String xml:lang="sv">Stockholms universitet</r:String>
          <r:String xml:lang="en">Stockholm University</r:String>
        </r:PublisherName>
      </r:Publisher>
      <r:Publisher>
        <r:PublisherName>
          <r:String xml:lang="sv">Stockholms universitet</r:String>
          <r:String xml:lang="en">Stockholm University</r:String>
        </r:PublisherName>
      </r:Publisher>
      <r:PublicationDate>
        <r:SimpleDate>2025-02-04</r:SimpleDate>
      </r:PublicationDate>
      <r:InternationalIdentifier>
        <r:IdentifierContent>10.17044/SCILIFELAB.28327964</r:IdentifierContent>
        <r:ManagingAgency controlledVocabularyAgencyName="DOI">DOI</r:ManagingAgency>
      </r:InternationalIdentifier>
    </r:Citation>
    <r:Abstract>
      <r:Content xml:lang="en">We release the FRESH-MAP dataset, a compilation of 9028 prokaryotic species-clusters (ANI &gt;95%) detected on a set of 636 globally-distributed freshwater metagenomes after competitive mapping. The main goal of our study was to provide the first systematic evaluation of the ‘Black Queen Hypothesis’ on a global scale based on aquatic metagenomic datasets. In this repository, we provide the supplementary tables and the full set of genomes. Moreover, we include all 9028 representative genomes (FRESH-MAP dataset) as a zip file (FreshMap_dataset.zip). You can also find 12 supplementary tables that include the following information. 

- Table S1: Genomic statistics of all 80561 medium-to-high quality genomes (&gt;50% completeness and 95%), including the best representative genome. We include origin of the species-clusters (freshwater, non-freshwater or mixed) and the type of genome (omics, isolate or both). Genome stats here included refer to the best representative genome of each species-cluster. 

- Table S3: List of all 636 freshwater genomes used for competitive mapping. We include accession numbers, reference of publication and metadata. 

- Table S4: Mapping statistics after trimming. We include the total number of reads, number of mapped reads, average length of the reads (bp), and GC content (%). 

- Table S5: Relative abundance results for each of the 24050 species-clusters (ANI &gt;95%) across the freshwater metagenomes. 

- Table S6: List of 1202 representative genomes part of the co-occurrence network, including information on the degree of connectedness and cohort. 

- Table S7: Recruitment of species-clusters per phyla across the different cohorts. Yellow indicates phyla linked to a specific cohort. 

- Table S8: Completeness (ranging from 0 to 1) of all KEGG modules involved in biosynthesis of amino acids, nucleotides and vitamins for each of the 9028 species-clusters (ANI &gt; 95%) detected on ≥1 freshwater metagenomes. 

- Table S9: Number of copies of each KEGG KO involved in flagellar, sigma factors, two-component systems, carbon fixation, nitrogen cycle, and sulfur cycle for each of the 9028 species-clusters (ANI &gt; 95%) detected on ≥1 freshwater metagenomes. 

- Table S10: Metadata of the two newly sequenced metagenomic samples from Stadsträdgården, Uppsala (Sweden), including accession numbers. 

- Table S11: Metagenomic samples used for re-binning from the StratfreshDB (Buck et al., 2021), including ENA accession numbers. 

- Table S12: Genomic statistics, including completeness and contamination of the re-binned (n = 11146) and original (n = 7837) MAGs.</r:Content>
    </r:Abstract>
    <r:Coverage>
      <r:TopicalCoverage>
        <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.TopicalCoverage:2.0</r:URN>
        <r:Subject xml:lang="en" controlledVocabularyID="3" controlledVocabularyName="Standard för svensk indelning av forskningsämnen 2025">Medical and Health Sciences</r:Subject>
        <r:Subject xml:lang="sv" controlledVocabularyID="3" controlledVocabularyName="Standard för svensk indelning av forskningsämnen 2025">Medicin och hälsovetenskap</r:Subject>
        <r:Subject xml:lang="en" controlledVocabularyID="10611" controlledVocabularyName="Standard för svensk indelning av forskningsämnen 2025">Ecology</r:Subject>
        <r:Subject xml:lang="sv" controlledVocabularyID="10611" controlledVocabularyName="Standard för svensk indelning av forskningsämnen 2025">Ekologi</r:Subject>
      </r:TopicalCoverage>
      <r:SpatialCoverage />
    </r:Coverage>
    <a:Archive>
      <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.Archive:2.0</r:URN>
      <a:ArchiveSpecific>
        <a:Item>
          <a:Access>
            <r:URN>urn:ddi:se.researchdata:doi-10-17044-scilifelab-28327964.Archive-ArchiveSpecificType-AccessType:2.0</r:URN>
            <a:TypeOfAccess controlledVocabularyName="info:eu-repo-Access-Terms vocabulary"></a:TypeOfAccess>
          </a:Access>
          <a:DataFileQuantity>0</a:DataFileQuantity>
        </a:Item>
      </a:ArchiveSpecific>
    </a:Archive>
  </s:StudyUnit>
</ddi:DDIInstance>