Gold standard for English-Swedish Europarl data (GES)
Reference corpus for word linking, divided into training data and test data. The sentences come from the English and Swedish parts of Europarl.
The data are drawn from the English-Swedish part of the Europarl corpus. For each sentence pair in the selected subset, token correspondences are stated as pairs of integer token identifiers.
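To illustrate what "pairs of integer token identifiers" can look like in practice, here is a minimal Python sketch. The on-disk layout of GES is not described in this entry, so the sketch assumes a hypothetical representation: whitespace-separated `i-j` pairs with zero-based token positions, a common convention for word alignment files. The function names `parse_links` and `linked_tokens` and the example sentence pair are illustrative only and are not taken from the resource.

```python
from typing import List, Tuple


def parse_links(line: str) -> List[Tuple[int, int]]:
    """Parse a line of 'i-j' pairs into (source_id, target_id) tuples.

    Assumption: identifiers are zero-based positions; the real GES files
    may use a different layout or numbering.
    """
    links = []
    for item in line.split():
        src, tgt = item.split("-")
        links.append((int(src), int(tgt)))
    return links


def linked_tokens(
    src_tokens: List[str],
    tgt_tokens: List[str],
    links: List[Tuple[int, int]],
) -> List[Tuple[str, str]]:
    """Resolve identifier pairs to the tokens they point at."""
    return [(src_tokens[i], tgt_tokens[j]) for i, j in links]


# Made-up sentence pair, not drawn from the corpus.
english = ["I", "agree", "."]
swedish = ["Jag", "håller", "med", "."]

links = parse_links("0-0 1-1 1-2 2-3")
pairs = linked_tokens(english, swedish, links)
```

One source token may link to several target tokens (here `agree` to both `håller` and `med`), which is why the correspondences are kept as a list of pairs rather than a one-to-one mapping.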
Go to data source
https://www.ida.liu.se/divisions/hcs/nlplab/resources/ges/
Citation and access
Creator/Principal investigator(s):
- Lars Ahrenberg - Linköping University - Department of Computer and Information Science
- Maria Holmqvist - Linköping University - Department of Computer and Information Science
