Textual features and metadata for DBNL novels 1800-2000

Dataset

Description

This dataset contains a corpus of 1346 novels from DBNL. Included are metadata, word counts, and syntactic features for the novels. The metadata includes variables related to canonicity: library information, secondary references, Wikipedia mentions, etc.

The titles have been selected using the following criteria:

- Novels and novellas
- Originally written in Dutch
- First published 1800-2000
- TEI from titles available on https://www.DBNL.org

Acknowledgements: Information from libraries was contributed by Trudie Stoutjesdijk and Eddie de Kok from Data Warehouse.
Datum van beschikbaarheid31-jan.-2022
UitgeverKoninklijke Bibliotheek
Tijdelijke dekking1-jan.-1800 - 1-jan.-2000
Datum van data-aanmaak31-jan.-2022

Citeer dit