The Classical Model of Type-Token Systems Compared with Items from the Standardized Project Gutenberg Corpus

We compare the “classical” equations of type-token systems, namely Zipf’s laws, Heaps’ law and the relationships between their indices, with data selected from the Standardized Project Gutenberg Corpus (SPGC). Selected items all exceed 100,000 word-tokens and are trimmed to 100,000 word-tokens each....
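As a rough illustration of the quantities the abstract refers to, the sketch below trims a token list to a fixed length and computes a rank-frequency list (the quantity Zipf's law describes) and a vocabulary-growth curve (the quantity Heaps' law describes). The function name `type_token_profile`, the whitespace tokenisation, and the sample sentence are assumptions made for illustration only; they are not taken from the article or from the SPGC tooling.

```python
from collections import Counter


def type_token_profile(tokens, limit=100_000):
    """Return (rank_frequency, vocab_growth) for the first `limit` tokens.

    Hypothetical helper for illustration; not the authors' procedure.
    """
    trimmed = tokens[:limit]  # trim to a fixed number of word-tokens
    counts = Counter()
    vocab_growth = []  # entry n-1 holds the number of distinct types in the first n tokens
    for tok in trimmed:
        counts[tok] += 1
        vocab_growth.append(len(counts))
    # Frequencies sorted in descending order: position r-1 holds the frequency of rank r.
    rank_frequency = sorted(counts.values(), reverse=True)
    return rank_frequency, vocab_growth


# Toy input; a real item would be a full tokenised text (assumed whitespace split).
sample = "the cat sat on the mat and the dog sat too".split()
rank_frequency, vocab_growth = type_token_profile(sample)
```

Plotting `rank_frequency` against rank and `vocab_growth` against token count, both on log-log axes, is one common way to inspect how closely a text follows the two laws.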

Bibliographic Details
Main Authors: Martin Tunnicliffe, Gordon Hunter
Format: Article
Language: English
Published: MDPI AG 2025-06-01
Series: Analytics
Online Access: https://www.mdpi.com/2813-2203/4/2/16