SCANIA Component X Dataset: A Real-World Multivariate Time Series Dataset for Predictive Maintenance
https://doi.org/10.58141/1w9m-yz81
This is a real-world, multivariate time series dataset collected from an anonymized engine component (called Component X) of a fleet of trucks from SCANIA, Sweden. The dataset includes diverse variables capturing detailed operational data, repair records, and truck specifications, with confidentiality maintained through anonymization. It is well suited to a range of machine learning applications, such as classification, regression, survival analysis, and anomaly detection, particularly in predictive maintenance scenarios. The large population size, the variety of features (provided as histograms and numerical counters), and the inclusion of temporal information make this real-world dataset unique in the field. The dataset is released to give a broad range of researchers the opportunity to work with real-world data from a well-known international company and to introduce a standard benchmark for the predictive maintenance field, fostering reproducible research.
Data files
Citation and access
Data access level:
Creator/Principal investigator(s):
- Olof Steinert - Scania CV AB - Strategic Product Planning and Advanced Analytics
- Oskar Andersson Reyna - Scania CV AB - Connected Intelligence
Research principal:
Data contains personal data:
No
Citation:
Language:
Method and outcome
Data format/data structure:
Data collection - Physical measurements and tests
Mode of collection:
Physical measurements and tests
Data collector:
- Scania CV AB
Source of the data:
- Processes
Administrative information
Other research principals:
Topic and keywords
Standard for Swedish classification of research subjects 2025:
Publications
Citation:
Zahra Kharazian, Tony Lindgren, Sindri Magnússon, Olof Steinert, Oskar Andersson Reyna. (2024). SCANIA Component X Dataset: A Real-World Multivariate Time Series Dataset for Predictive Maintenance. arXiv:2401.15199.
Versions
Version:
3
Metadata added:
Replaced the manuscript PDF under documentation files with the final article version.
Published:
Version:
2
Data added:
The test labels are now added to the dataset.
Metadata added:
The data descriptor paper is now updated with the journal paper.
Published:
Metadata

Scania CV AB