<codeBook xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xsi:schemaLocation="ddi:codebook:2_5 http://www.ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" xmlns="ddi:codebook:2_5">
  <docDscr>
    <citation>
      <titlStmt>
        <titl xml:lang="sv">10X singel-cell RNA sekvensering av benmärgsceller från MDS-RS patienter och friska donatorer</titl>
        <parTitl xml:lang="en">10X single-cell RNA sequencing of bone marrow cells from MDS-RS patients and healthy donors</parTitl>
        <IDNo agency="SND">2023-121-1-1</IDNo>
        <IDNo agency="DOI">https://doi.org/10.48723/nq2a-1e03</IDNo>
      </titlStmt>
      <prodStmt>
        <producer xml:lang="en" abbr="SND">Swedish National Data Service</producer>
        <producer xml:lang="sv" abbr="SND">Svensk nationell datatjänst</producer>
      </prodStmt>
      <holdings URI="https://doi.org/10.48723/nq2a-1e03">Landing page</holdings>
    </citation>
  </docDscr>
  <stdyDscr>
    <citation>
      <titlStmt>
        <titl xml:lang="sv">10X singel-cell RNA sekvensering av benmärgsceller från MDS-RS patienter och friska donatorer</titl>
        <parTitl xml:lang="en">10X single-cell RNA sequencing of bone marrow cells from MDS-RS patients and healthy donors</parTitl>
        <IDNo agency="SND">2023-121-1-1</IDNo>
        <IDNo agency="DOI">https://doi.org/10.48723/nq2a-1e03</IDNo>
        <IDNo agency="DOI">10.1158/0008-5472.CAN-23-3038</IDNo>
      </titlStmt>
      <rspStmt>
        <AuthEnty xml:lang="en" affiliation="Department of Medicine, Huddinge / Center for Hematology and Regenerative Medicine (HERM), Karolinska Institutet">Moura, Pedro Luis</AuthEnty>
        <AuthEnty xml:lang="sv" affiliation="Institutionen för medicin, Huddinge / Centrum för hematologi och regenerativ medicin (HERM), Karolinska Institutet">Moura, Pedro Luis</AuthEnty>
        <AuthEnty xml:lang="en" affiliation="Department of Medicine, Huddinge / Center for Hematology and Regenerative Medicine (HERM), Karolinska Institutet">Hellström-Lindberg, Eva</AuthEnty>
        <AuthEnty xml:lang="sv" affiliation="Institutionen för medicin, Huddinge / Centrum för hematologi och regenerativ medicin (HERM), Karolinska Institutet">Hellström-Lindberg, Eva</AuthEnty>
      </rspStmt>
      <prodStmt>
        <grantNo xml:lang="en" agency="Knut and Alice Wallenberg Foundation">2017.0359</grantNo>
        <grantNo xml:lang="sv" agency="Knut och Alice Wallenbergs stiftelse">2017.0359</grantNo>
        <grantNo xml:lang="en" agency="Swedish Cancer Society">21 0340</grantNo>
        <grantNo xml:lang="sv" agency="Cancerfonden">21 0340</grantNo>
        <grantNo xml:lang="en" agency="Swedish Cancer Society">19 0200</grantNo>
        <grantNo xml:lang="sv" agency="Cancerfonden">19 0200</grantNo>
        <grantNo xml:lang="en" agency="Swedish Research Council">2021-01404_VR</grantNo>
        <grantNo xml:lang="sv" agency="Vetenskapsrådet">2021-01404_VR</grantNo>
      </prodStmt>
      <distStmt>
        <distrbtr xml:lang="en" abbr="SND" URI="https://snd.se">Swedish National Data Service</distrbtr>
        <distrbtr xml:lang="sv" abbr="SND" URI="https://snd.se">Svensk nationell datatjänst</distrbtr>
        <distDate xml:lang="en" date="2023-10-19" />
      </distStmt>
      <verStmt>
        <version elementVersion="1" elementVersionDate="2023-10-19" />
      </verStmt>
      <holdings URI="https://doi.org/10.48723/nq2a-1e03">Landing page</holdings>
    </citation>
    <stdyInfo>
      <subject />
      <abstract xml:lang="en" contentType="abstract">This dataset consists of single-cell RNA sequencing data of bone marrow cells (CD34+ stem cells, GPA+ erythroblasts, ring sideroblasts and mononuclear cells) obtained from multiple healthy bone marrow donors and MDS-RS patients. The objective of this data collection was to assess several parameters on how the bone marrow of MDS-RS patients differs from that of healthy donors.

This dataset includes raw sequencing data in .fastq format, processed count matrices and associated pseudonymized metadata.
 
Processing: All samples were loaded onto Chromium Single Cell Chips (10x Genomics, CA, USA) at a target capture rate of 10,000 cells per sample. Single cell libraries were prepared using Chromium Next GEM Single Cell 3ʹ Kits v3.1 (10x Genomics) as per the manufacturer’s instructions, except 1µl additive ADT primers were added to the initial cDNA PCR amplification buffer and ADT libraries prepared as described in the Total-Seq B protocol (BioLegend) from the initial cDNA SPRI clean up. Libraries were pooled and sequenced on an Illumina NovaSeq 6000 (Illumina). Read pseudoalignment was performed against the GRCh38.p13 human genome assembly through kallisto v0.46.1 and bustools v0.40.0 was used for barcode and UMI counting.

The dataset consists of 2 folders: 
- Processed_Count_Matrices
- Raw_FASTQ

And one xlsx file:
- Sample_key.xlsx

The folder Processed_Count_Matrices contains 1 rds file, 1 tsv file, 9 mtx files, and 18 txt files.
The folder Raw_FASTQ contains 27 GNU zipped fastq files, and 5 txt files.

The documentation file File_list_10x.txt contains a full list of the files in the dataset.

The total size of the dataset is approximately 21 GB.</abstract>
      <abstract xml:lang="sv" contentType="abstract">Syftet med denna datainsamling var att bedöma flera parametrar om hur benmärgen hos MDS-RS-patienter skiljer sig från den hos friska donatorer. Denna datauppsättning består av singel-cell RNA sekvensering av benmärgsceller från MDS-RS patienter och friska donatorer. Se den engelska beskrivningen för mer information.

Datasetet består av två mappar:
- Processed_Count_Matrices
- Raw_FASTQ

och en xlsx-fil:
- Sample_key.xlsx

Mappen Processed_Count_Matrices innehåller 1 rds-fil, 1 tsv-fil, 9 mtx-filer och 18 txt-filer.
Mappen Raw_FASTQ innehåller 27 GNU-zippade fastq-filer och 5 txt-filer.

Dokumentationsfilen File_list_10x.txt innehåller en lista över datasetets alla filer.

Datasetets totala storlek är ungefär 21 GB.</abstract>
      <sumDscr>
        <anlyUnit xml:lang="en" unit="Cells">Cells<concept vocab="DDI Analysis Unit" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/AnalysisUnit/2.1.3?languageVersion=en-2.1.3">Cells</concept></anlyUnit>
        <anlyUnit xml:lang="sv" unit="Celler">Celler<concept vocab="DDI Analysis Unit" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/AnalysisUnit/2.1.3?languageVersion=sv-2.1.3">Celler</concept></anlyUnit>
        <universe xml:lang="en">Patients with Myelodysplastic neoplasms with ring sideroblasts (MDS-RS) and healthy donors</universe>
        <universe xml:lang="sv">Patienter med myelodysplastisk syndrom med ringsideroblaster (MDS-RS) samt friska donatorer</universe>
        <dataKind xml:lang="en">Numeric</dataKind>
        <dataKind xml:lang="en">Other</dataKind>
      </sumDscr>
    </stdyInfo>
    <method>
      <dataColl>
        <sampProc xml:lang="en">Bone marrow (BM) and/or peripheral blood (PB) samples were collected from 36 MDS-RS and 3 MDS non-RS patients evaluated at Karolinska University Hospital, Huddinge, Sweden. Diagnostic procedures were performed according to the European LeukemiaNet recommendation and WHO classification for myeloid neoplasms. As the specific purpose was to dissect the pathobiology of SF3B1-mutant MDS-RS, all MDS-RS patients belonged to the SF3B1α category in the IPSS-M risk classification. RS presence was quantified according to standard clinical practice. Additional samples were collected from a total of 40 healthy normal bone marrow (NBM) donors for control purposes. Please note that a deidentified donor and experiment index is provided in the companion publication for this dataset, including clinical and mutational status. All source material was provided with written informed consent for research use, given in accordance with the Declaration of Helsinki.<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=en-2.0.1">Bone marrow (BM) and/or peripheral blood (PB) samples were collected from 36 MDS-RS and 3 MDS non-RS patients evaluated at Karolinska University Hospital, Huddinge, Sweden. Diagnostic procedures were performed according to the European LeukemiaNet recommendation and WHO classification for myeloid neoplasms. As the specific purpose was to dissect the pathobiology of SF3B1-mutant MDS-RS, all MDS-RS patients belonged to the SF3B1α category in the IPSS-M risk classification. RS presence was quantified according to standard clinical practice. Additional samples were collected from a total of 40 healthy normal bone marrow (NBM) donors for control purposes. Please note that a deidentified donor and experiment index is provided in the companion publication for this dataset, including clinical and mutational status. All source material was provided with written informed consent for research use, given in accordance with the Declaration of Helsinki.</concept></sampProc>
        <sampProc xml:lang="en">Probability: Simple random<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=en-2.0.1">Probability: Simple random</concept></sampProc>
        <sampProc xml:lang="sv">Sannolikhetsurval: obundet slumpmässigt urval<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=sv-2.0.1">Sannolikhetsurval: obundet slumpmässigt urval</concept></sampProc>
        <sampProc xml:lang="en">Non-probability: Availability<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=en-2.0.1">Non-probability: Availability</concept></sampProc>
        <sampProc xml:lang="sv">Icke-sannolikhetsurval: tillgänglighetsurval<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=sv-2.0.1">Icke-sannolikhetsurval: tillgänglighetsurval</concept></sampProc>
        <sampProc xml:lang="en">Mixed probability and non-probability<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=en-2.0.1">Mixed probability and non-probability</concept></sampProc>
        <sampProc xml:lang="sv">Blandat sannolikhets- och icke-sannolikhetsurval<concept vocab="DDI Sampling Procedure" vocabURI="https://vocabularies.cessda.eu/v2/vocabularies/SamplingProcedure/2.0.1?languageVersion=sv-2.0.1">Blandat sannolikhets- och icke-sannolikhetsurval</concept></sampProc>
      </dataColl>
    </method>
    <dataAccs>
      <useStmt>
        <restrctn xml:lang="en">Access to data through SND. Access to data is restricted.</restrctn>
        <restrctn xml:lang="sv">Åtkomst till data via SND. Tillgång till data är begränsad.</restrctn>
        <conditions elementVersion="info:eu-repo-Access-Terms vocabulary">restrictedAccess</conditions>
      </useStmt>
    </dataAccs>
    <othrStdyMat>
      <relPubl>
        <citation>
          <titlStmt>
            <titl xml:lang="sv">Moura PL, Mortera Blanco T, Hofman IJ, Todisco G, Kretzschmar WW, Björklund AC, Creignou M, Hagemann-Jensen M, Ziegenhain C, Cabrerizo Granados D, Barbosa I, Walldin G, Jansson M, Ashley N, Mead AJ, Lundin V, Dimitriou M, Yoshizato T, Woll PS, Ogawa S, Sandberg R, Jacobsen SW, Hellström-Lindberg E. Erythroid differentiation enhances RNA mis-splicing in SF3B1-mutant myelodysplastic syndromes with ring sideroblasts. Cancer Res. 2023 Nov 3. doi: 10.1158/0008-5472.CAN-23-3038.</titl>
            <parTitl xml:lang="en">Moura PL, Mortera Blanco T, Hofman IJ, Todisco G, Kretzschmar WW, Björklund AC, Creignou M, Hagemann-Jensen M, Ziegenhain C, Cabrerizo Granados D, Barbosa I, Walldin G, Jansson M, Ashley N, Mead AJ, Lundin V, Dimitriou M, Yoshizato T, Woll PS, Ogawa S, Sandberg R, Jacobsen SW, Hellström-Lindberg E. Erythroid differentiation enhances RNA mis-splicing in SF3B1-mutant myelodysplastic syndromes with ring sideroblasts. Cancer Res. 2023 Nov 3. doi: 10.1158/0008-5472.CAN-23-3038.</parTitl>
            <IDNo agency="DOI">10.1158/0008-5472.CAN-23-3038</IDNo>
          </titlStmt>
          <distStmt>
            <distDate date="2023">2023</distDate>
          </distStmt>
        </citation>
      </relPubl>
    </othrStdyMat>
  </stdyDscr>
</codeBook>