TimeBankPT

TimeBankPT is a corpus of Portuguese text annotated with temporal information. Its annotation scheme is similar to TimeML. The corpus was created by adapting the English corpus used in the first TempEval challenge to Portuguese.

Citation

The preferred citation is Costa and Branco (2012). Further details about the corpus can be found in the following publications:

Features

Some of the features of TimeBankPT:

Size of TimeBankPT

                                                    Train Set   Test Set
Sentences                                               2,281        351
Word Tokens
  According to white space                             60,782      8,920
  Splitting contractions and detaching punctuation     68,351      9,829
Events                                                  6,790      1,097
Temporal Expressions                                    1,244        165
Temporal Relations                                      5,781        758
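
The two word-token counts above differ because contractions and punctuation are handled differently. The following Python sketch illustrates the difference on a made-up sentence. It is not the tooling used to build TimeBankPT: the function names, the example sentence, and the tiny contraction table are all assumptions made for illustration, and a real tokenizer for Portuguese would need a much larger table and would have to handle ambiguous forms and hyphenated clitics.

```python
import re

# Hypothetical, deliberately tiny contraction table (an assumption for this
# sketch); real Portuguese has many more contracted forms.
CONTRACTIONS = {
    "do": ["de", "o"],
    "da": ["de", "a"],
    "no": ["em", "o"],
    "na": ["em", "a"],
}


def whitespace_tokens(text):
    """Count scheme 1: tokens are whatever white space separates."""
    return text.split()


def detached_tokens(text):
    """Count scheme 2: detach punctuation and split known contractions."""
    tokens = []
    for chunk in text.split():
        # Separate runs of word characters from individual punctuation marks.
        for piece in re.findall(r"\w+|[^\w\s]", chunk):
            # Replace a known contraction by its parts, else keep the piece.
            tokens.extend(CONTRACTIONS.get(piece.lower(), [piece]))
    return tokens


sentence = "O presidente da empresa chegou no domingo."
print(len(whitespace_tokens(sentence)))  # 7
print(len(detached_tokens(sentence)))    # 10
```

In this example, "da" and "no" each become two tokens and the final period is detached from "domingo", so the second count is higher, in the same direction as the difference reported in the table.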

License

Coming soon.

Example

This short text is an example of the annotated data found in TimeBankPT.

Download

Version 1 of TimeBankPT is available for download.

Last update: December 13, 2012