Data for: Coverage of Web Accessibility Guidelines Provided by Automated Checking Tools
https://doi.org/10.5878/qe0c-kb63
This data set contains three parts:
1. A collection of the raw data, which includes (a) the retrieved landing page of each analyzed PSO (to be precise, the DOM presentation from a browser showing this page) both in HTML and text (text without HTML tags), (b) for each of the six automated checker/engine combination one log file, (c) other metadata such as text file containing tools' and libraries' version information.
Data of case 1(a) may contain personal data (details see below) and is thus kept in a separate archive file and is only available upon request. Data of case 1(b) has been stripped of personal data and thus may get shared freely.
This data allows investigating how the webpages looked at the time of the study and to which assessments the then-current automated checkers came. Future studies can reproduce the same setup and, for example, compare changes over time in PSOs' webpages' accessibility.
2. A "coverage" file that is essentially a big database on WCAG-2 success criteria, their metadata, and links to automated checkers' documentation and source code. The "coverage" file combines information from various sources, such as information scrapped from W3C web page, accessibility tools' Git repositories, or AXE's documentation. Other researchers can load this "coverage" file to get a database of WCAG-2 success criteria and associated metadata in their data analysis without performing those error-prone and tedious steps themselves.
3. A collection of Python files. This not only allows reproducing how raw data was process and filtered (up to the output of LaTeX code), but allows other researchers to get inspiration how to solve problems addressed in this code base as well as to re-use code in their own projects.
The data covered by case 1(a) above includes textual data collected from publicly available web pages of Swedish public sector organizations (PSOs), which may include names, contact details, or other personal or biographical information. Due to the directory structure, for every file the origin of the data is determined, so any further questions about the handling of personal data shall be directed to the respective PSO.
Documentation files
Documentation files
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
Research principal:
Principal's reference number:
- 20200013
Data contains personal data:
Yes
Type of personal data:
Textual data collected from publicly available web pages of Swedish public sector organizations (PSOs), which may include names, contact details, or other personal or biographical information. Due to the directory structure, for every file the origin of the data is determined, so any further questions about the handling of personal data shall be directed to the respective PSO.
Citation:
Language:
Method and outcome
Method and outcome
Data collection - Compilation/Synthesis
Data collection - Compilation/Synthesis
Geographic coverage
Geographic coverage
Administrative information
Administrative information
Topic and keywords
Topic and keywords
Metadata
Metadata
Version 1

University of Skövde