Bulk RNA sequencing of erythroblasts from a pair of SF3B1-mutated and SF3B1-wildtype induced pluripotent stem cell (iPSC) lines
https://doi.org/10.48723/3hs1-0v44
This dataset consists of bulk RNA sequencing data of MACS-separated GPA+ erythroblasts obtained from a pair of induced pluripotent stem cell (iPSC) lines with and without SF3B1-mutation, generated from an MDS patient (Asimomitis G et al. 2022, Blood Advances). The objective of this data collection was to assess how SF3B1 mutation changes the molecular profile of RNA splicing in erythropoiesis.
This dataset includes minimally processed, visualisation-ready .bam format sequencing data for both of the lines.
Processing:
MDS patient iPSC line-derived hematopoietic stem and progenitor cells (HSPC) were cultured for 14 days in erythroid specification media (StemPro-34 SFM [Gibco] + 1% Pen/Strep [Cytiva], 2 mM L-glutamine [Sigma-Aldrich], 3.5 µM 1-Thioglycerol [Sigma-Aldrich], 1% Bovine Albumin Fraction V [Gibco], 150 µg/mL holo-transferrin [Sigma-Aldrich], 2 U/mL erythropoietin [Pfizer], 50 ng/mL Stem Cell Factor [PeproTech] and 50 ng/mL interleukin-3 [PeproTech]).
At Day 14 of culture, mixed glycophorin A-positive (GPA+) erythroblast samples were isolated through MACS. Cells were lysed in RLT (Qiagen) + 40 mM dithiothreitol (Sigma-Aldrich) and RNA extraction was performed with RNeasy Micro Kit (Qiagen) with RNase-free DNase treatment according to the manufacturer’s protocol. RNA integrity numbers (RIN) were estimated using Agilent RNA 6000 Pico Kits (Agilent Technologies, CA, USA). A minimum RIN value of 6.5 was considered adequate. RNA sequencing (RNAseq) libraries were prepared from total RNA using SMARTer Stranded Total RNA-Seq Kits v2 - Pico Input Mammalian (Takara Bio, Japan), including enzymatic ribosomal depletion steps. Libraries were sequenced using an Illumina Novaseq 6000 S4 (Illumina, CA, USA) with paired-end 150bp configuration. Reads were pre-processed with TrimGalore v. 0.6.7 using CutAdapt v. 3.5 and BAM files were generated through via two-pass alignment with STAR v. 2.7.9a against the GRCh38.p13 human genome assembly.
The dataset consists of 13 files:
- 2 .bam files, one for the SF3B1-mutant sample and one for the wildtype sample;
- 2. bai bam index files, one for each sample to facilitate analysis of the .bam files.
- 8 .fastq raw data files, corresponding to a paired-end run of the two samples in two different lanes (2 x 2 x 2).
- 1 gene-collapsed read count matrix (.txt) summarising read counts for both samples.
The documentation file iPSCEB_FileList.txt contains a full list of the files in the dataset.
The total size of the dataset is approximately 80 GB.
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
Research principal:
Data contains personal data:
Yes
Type of personal data:
Pseudonymized genetic data from an MDS patient
Code key exists:
Yes
Sensitive personal data:
Yes
Citation:
Language:
Method and outcome
Method and outcome
Unit of analysis:
Population:
Cell lines from a patient with SF3B1-mutant myelodysplastic neoplasm with ring sideroblasts (MDS-RS)
Study design:
- Preclinical study
Sampling procedure:
Data format/data structure:
Samples/material - Existing from scientific collection/biobank
Samples/material - Existing from scientific collection/biobank
Name:
Karolinska Institutet MDS biobank
Type(s) of sample:
Bone marrow cells
Administrative information
Administrative information
Responsible department/unit:
Department of Medicine, Huddinge [H7]
Ethics Review:
Stockholm - 2017/1090-31/4 (Stockholm)
Topic and keywords
Topic and keywords
Standard för svensk indelning av forskningsämnen 2025:
Publications
Publications
Citation:
Thier, J., Hofmann, S., Kirchhof, K.M. et al. SF3B1-mutant models of RNA mis-splicing uncover UBA1 as a therapeutic target in myelodysplastic neoplasms. Leukemia (2025). https://doi.org/10.1038/s41375-025-02740-1Opens in a new tab
