Parallel texts from the Swedish Competition Agency
Texts collected from the Swedish Competition Authority's website around March 2018. The texts are yearly reviews and other public information from this authority.
Parallel texts downloaded from the agency's website.
What was downloaded were pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.
Go to data source
Opens in a new tabhttps://sprakresurser.isof.se/myndighetsdata/texter/Konkurrensverket/
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
- Simon Dahlberg - Institute for Language and Folklore
Research principal:
Data contains personal data:
No
Citation:
Copyright:
Public Domain Mark (https://creativecommons.org/publicdomain/mark/1.0/deed.sv)
Corpus
Corpus
Foreseen use:
NLP application
Text part
Text part
Linguality:
Bilingual
Language:
English (eng)
Texts: 15
Swedish (swe)
Texts: 15
Modality:
Written Language
Size:
Words: 479760 (tot)
Texts: 30 (tot)
Words: 217870 (swe)
Texts: 15 (swe)
Annotation:
Original source:
Link to other media:
Method and outcome
Method and outcome
Data format/data structure:
Administrative information
Administrative information
Responsible department/unit:
Language Council of Sweden
Contributor(s):
Topic and keywords
Topic and keywords
Standard för svensk indelning av forskningsämnen 2025:
Keywords:
Relations
Relations
Is part of:
Related research data:
Metadata
Metadata
Version 1

Institute for Language and Folklore