Texts from the Swedish Consumer Agency
Parallel texts downloaded from the website, hallåkonsument.se, run by of the Swedish Consumer Agency.
The texts have been downloaded using the command 'w3m -dump' from an ubuntu shell, whereafter the resulting text files were stripped to contain only the interesting text (no menus and such).
Go to data source
Opens in a new tabhttps://sprakresurser.isof.se/myndighetsdata/texter/Konsumentverket/
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
- Simon Dahlberg - Institute for Language and Folklore - Language Council of Sweden
Research principal:
Data contains personal data:
No
Citation:
Corpus
Corpus
Foreseen use:
NLP application
Text part
Text part
Linguality:
Multilingual
Language:
Swedish (swe)
Texts: 42
English (eng)
Texts: 42
French (fra)
Texts: 31
Spanish (spa)
Texts: 31
German (deu)
Texts: 31
Polish (pol)
Texts: 31
Finnish (fin)
Texts: 31
Arabic (ara)
Texts: 42
Persian (fas)
()
Texts: 42
Somali (som)
Texts: 6
Albanian (sqi)
Texts: 31
Tigrinya (tir)
Texts: 6
Central Kurdish (ckb)
Texts: 37
Croatian (hrv)
Texts: 31
Modality:
Written Language
Size:
Words: 190126 (tot)
Texts: 434 (tot)
Words: 21535 (swe)
Texts: 42 (swe)
Annotation:
Original source:
Link to other media:
Method and outcome
Method and outcome
Data format/data structure:
Geographic coverage
Geographic coverage
Geographic location:
Administrative information
Administrative information
Responsible department/unit:
Language Council of Sweden
Contributor(s):
Topic and keywords
Topic and keywords
Standard för svensk indelning av forskningsämnen 2025:
Keywords:
Relations
Relations
Is part of:
Related research data:
Metadata
Metadata
Version 1

Institute for Language and Folklore