Textual features and metadata for DBNL novels 1800-2000

Dataset

Description

This dataset contains a corpus of 1346 novels from DBNL. Included are metadata, word counts, and syntactic features for the novels. The metadata includes variables related to canonicity: library information, secondary references, Wikipedia mentions, etc.

The titles have been selected using the following criteria:

- Novels and novellas
- Originally written in Dutch
- First published 1800-2000
- TEI from titles available on https://www.DBNL.org

Acknowledgements: Information from libraries was contributed by Trudie Stoutjesdijk and Eddie de Kok from Data Warehouse.
Date made available31-Jan-2022
PublisherKoninklijke Bibliotheek
Temporal coverage1-Jan-1800 - 1-Jan-2000
Date of data production31-Jan-2022

Cite this