Parallel texts from the Swedish Work Environment Authority
Parallel texts downloaded from the websites of the Swedish Work Environment Authority. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.
Go to data source
Opens in a new tabhttps://sprakresurser.isof.se/myndighetsdata/texter/Arbetsmiljoeverket/
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
- Simon Dahlberg - Institute for Language and Folklore - Language Council of Sweden
Research principal:
Citation:
Corpus
Corpus
Foreseen use:
NLP application
Text part
Text part
Linguality:
Multilingual
Language:
Swedish (swe)
Texts: 21
English (eng)
Texts: 19
Bulgarian (bul)
Texts: 2
Czech (ces)
Texts: 2
German (deu)
Texts: 3
Estonian (est)
Texts: 3
Finnish (fin)
Texts: 1
Hungarian (hun)
Texts: 1
Latvian (lav)
Texts: 3
Lithuanian (lit)
Texts: 3
Polish (pol)
Texts: 4
Romanian (ron)
Texts: 3
Spanish (spa)
Texts: 2
Chinese (zho)
Texts: 2
Russian (rus)
Texts: 3
Arabic (ara)
Texts: 1
Turkish (tur)
Texts: 2
Thai (tha)
Texts: 1
Hindi (hin)
Texts: 1
Modality:
Written Language
Size:
Words: 166367 (swe)
Texts: 21 (swe)
Words: 432133 (tot)
Texts: 78
Annotation:
Original source:
Link to other media:
Method and outcome
Method and outcome
Data format/data structure:
Geographic coverage
Geographic coverage
Geographic location:
Administrative information
Administrative information
Responsible department/unit:
Language Council of Sweden
Contributor(s):
Topic and keywords
Topic and keywords
Standard för svensk indelning av forskningsämnen 2025:
Keywords:
Relations
Relations
Is part of:
Related research data:
Metadata
Metadata
Version 1

Institute for Language and Folklore