Parallel texts from the Swedish Migration Agency
Parallel texts downloaded with "w3m -dump" from an ubuntu shell, from the website of the Swedish Migration Agency.
The texts have been downloaded using the command 'w3m -dump' from an ubuntu shell, whereafter the resulting text files were stripped to contain only the interesting text (no menus and such).
Go to data source
Opens in a new tabhttps://sprakresurser.isof.se/myndighetsdata/texter/Migrationsverket/
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
- Simon Dahlberg - Institute for Language and Folklore - Language Council of Sweden
Research principal:
Data contains personal data:
No
Citation:
Corpus
Corpus
Foreseen use:
NLP application
Text part
Text part
Linguality:
Multilingual
Language:
Swedish (swe)
Texts: 33
Amharic (amh)
Texts: 23
Arabic (ara)
Texts: 33
Azerbaijani (aze)
Texts: 27
Central Kurdish (ckb)
Texts: 29
English (eng)
Texts: 33
Persian (fas)
Texts: 32
Croatian (hrv)
Texts: 23
Armenian (hye)
Texts: 24
Georgian (kat)
Texts: 1
Northern Kurdish (kmr)
Texts: 28
Mongolian (mon)
Texts: 25
Dari (prs)
Texts: 28
Pushto (pus)
Texts: 28
Romany (rom)
Arli (Dialect)
Texts: 24
Russian (rus)
Texts: 33
Somali (som)
Texts: 29
Spanish (spa)
Texts: 31
Albanian (sqi)
Texts: 27
Thai (tha)
Texts: 4
Tigrinya (tir)
Texts: 29
Turkish (tur)
Texts: 2
Uzbek (uzb)
Texts: 25
Chinese (zho)
Texts: 3
French (fra)
Texts: 31
Modality:
Written Language
Size:
Words: 29008 (swe)
Texts: 33 (swe)
Words: 438614 (tot)
Texts: 580 (tot)
Annotation:
Original source:
Link to other media:
Method and outcome
Method and outcome
Data format/data structure:
Geographic coverage
Geographic coverage
Geographic location:
Administrative information
Administrative information
Responsible department/unit:
Language Council of Sweden
Contributor(s):
Topic and keywords
Topic and keywords
Standard för svensk indelning av forskningsämnen 2025:
Keywords:
Relations
Relations
Is part of:
Related research data:
Metadata
Metadata
Version 1

Institute for Language and Folklore