Creating a typology of parishes in England and Wales: Mining 1881 census data

The paper presents the application of principal component analysis and cluster analysis to historical individual level census data in order to explore social and economic variations and patterns in household structure across mid-Victorian England and Wales. Principal component analysis is used in or...

Full description

Saved in:
Bibliographic Details
Main Authors: Kevin Schürer, Tatiana Penkova
Format: Article
Language:English
Published: International Institute of Social History 2015-09-01
Series:Historical Life Course Studies
Subjects:
Online Access:http://hdl.handle.net/10622/23526343-2015-0004?locatt=view:master
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832570913376174080
author Kevin Schürer
Tatiana Penkova
author_facet Kevin Schürer
Tatiana Penkova
author_sort Kevin Schürer
collection DOAJ
description The paper presents the application of principal component analysis and cluster analysis to historical individual level census data in order to explore social and economic variations and patterns in household structure across mid-Victorian England and Wales. Principal component analysis is used in order to identify and eliminate unimportant attributes within the data and the aggregation of the remaining attributes. By combining Kaiser’s rule and the Broken-stick model, four principal components are selected for subsequent data modelling. Cluster analysis is used in order to identify associations and structure within the data. A hierarchy of cluster structures is constructed with two, three, four and five clusters in 21-dimensional data space. The main differences between clusters are described in this paper.
format Article
id doaj-art-a8fec3371b654d55afcc7a2e7c35e2fe
institution Kabale University
issn 2352-6343
2352-6343
language English
publishDate 2015-09-01
publisher International Institute of Social History
record_format Article
series Historical Life Course Studies
spelling doaj-art-a8fec3371b654d55afcc7a2e7c35e2fe2025-02-02T13:34:54ZengInternational Institute of Social HistoryHistorical Life Course Studies2352-63432352-63432015-09-0123857Creating a typology of parishes in England and Wales: Mining 1881 census dataKevin Schürer0Tatiana Penkova1University of LeicesterInstitute of Computational Modelling SB RASThe paper presents the application of principal component analysis and cluster analysis to historical individual level census data in order to explore social and economic variations and patterns in household structure across mid-Victorian England and Wales. Principal component analysis is used in order to identify and eliminate unimportant attributes within the data and the aggregation of the remaining attributes. By combining Kaiser’s rule and the Broken-stick model, four principal components are selected for subsequent data modelling. Cluster analysis is used in order to identify associations and structure within the data. A hierarchy of cluster structures is constructed with two, three, four and five clusters in 21-dimensional data space. The main differences between clusters are described in this paper.http://hdl.handle.net/10622/23526343-2015-0004?locatt=view:masterPrincipal Component AnalysisCluster AnalysisCensus DataHousehold Structures
spellingShingle Kevin Schürer
Tatiana Penkova
Creating a typology of parishes in England and Wales: Mining 1881 census data
Historical Life Course Studies
Principal Component Analysis
Cluster Analysis
Census Data
Household Structures
title Creating a typology of parishes in England and Wales: Mining 1881 census data
title_full Creating a typology of parishes in England and Wales: Mining 1881 census data
title_fullStr Creating a typology of parishes in England and Wales: Mining 1881 census data
title_full_unstemmed Creating a typology of parishes in England and Wales: Mining 1881 census data
title_short Creating a typology of parishes in England and Wales: Mining 1881 census data
title_sort creating a typology of parishes in england and wales mining 1881 census data
topic Principal Component Analysis
Cluster Analysis
Census Data
Household Structures
url http://hdl.handle.net/10622/23526343-2015-0004?locatt=view:master
work_keys_str_mv AT kevinschurer creatingatypologyofparishesinenglandandwalesmining1881censusdata
AT tatianapenkova creatingatypologyofparishesinenglandandwalesmining1881censusdata