PIE Corpus

  • Hessel Haagsma (Creator)
  • Johan Bos (Contributor)
  • Barbara Plank (Contributor)



    An evaluation corpus for the automatic detection of potentially idiomatic expressions (PIEs), based on the British National Corpus (BNC). This repository contains six json-files containing the annotations.
    Date made available17-Oct-2017
    PublisherUniversity of Groningen
    Date of data production1-Sep-2017 - 17-Oct-2017

    Keywords on Datasets

    • idiomatic expressions
    • British National Corpus

    Cite this