Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus
To analyse the characteristics of utterances in Japanese novels, several attributes (e.g., the speaker, listener, relationship between the speaker and listener, and gender of the speaker) were added to a randomly extracted Japanese novel corpus. A total of 887 data sets, with 5632 annotated utteranc...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2018-01-01
|
Series: | Advances in Human-Computer Interaction |
Online Access: | http://dx.doi.org/10.1155/2018/5056268 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832560233851912192 |
---|---|
author | Hajime Murai |
author_facet | Hajime Murai |
author_sort | Hajime Murai |
collection | DOAJ |
description | To analyse the characteristics of utterances in Japanese novels, several attributes (e.g., the speaker, listener, relationship between the speaker and listener, and gender of the speaker) were added to a randomly extracted Japanese novel corpus. A total of 887 data sets, with 5632 annotated utterances, were prepared. Based on the attribute annotated utterance corpus, the characteristics of utterance styles were extracted quantitatively. A chi-square test was used for particles and auxiliary verbs to extract utterance characteristics which reflected the genders of and relationships between the speakers and listeners. Results revealed that the use of imperative words was higher among male characters than their female counterparts, who used more particle verbs, and that auxiliaries of politeness were used more frequently for ‘coworkers’ and ‘superior authorities’. In addition, utterances varied between close and intimate relationships between the speaker and listener. Moreover, repeated factor analyses for 7576 data sets in BCCWJ speaker information corpus revealed ten typical utterance styles (neutral, frank, dialect, polite, feminine, crude, aged, interrogative, approval, and dandy). The factor scores indicated relationships between various utterance styles and fundamental attributes of speakers. Thus, results of this study would be utilisable in speaker identification tasks, automatic speech generation tasks, and scientific interpretation of stories and characters. |
format | Article |
id | doaj-art-29956a7c993e45d380f2ee999c374c79 |
institution | Kabale University |
issn | 1687-5893 1687-5907 |
language | English |
publishDate | 2018-01-01 |
publisher | Wiley |
record_format | Article |
series | Advances in Human-Computer Interaction |
spelling | doaj-art-29956a7c993e45d380f2ee999c374c792025-02-03T01:28:04ZengWileyAdvances in Human-Computer Interaction1687-58931687-59072018-01-01201810.1155/2018/50562685056268Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information CorpusHajime Murai0Department of Complex and Intelligent Systems, Future University Hakodate, Hakodate 041-8655, JapanTo analyse the characteristics of utterances in Japanese novels, several attributes (e.g., the speaker, listener, relationship between the speaker and listener, and gender of the speaker) were added to a randomly extracted Japanese novel corpus. A total of 887 data sets, with 5632 annotated utterances, were prepared. Based on the attribute annotated utterance corpus, the characteristics of utterance styles were extracted quantitatively. A chi-square test was used for particles and auxiliary verbs to extract utterance characteristics which reflected the genders of and relationships between the speakers and listeners. Results revealed that the use of imperative words was higher among male characters than their female counterparts, who used more particle verbs, and that auxiliaries of politeness were used more frequently for ‘coworkers’ and ‘superior authorities’. In addition, utterances varied between close and intimate relationships between the speaker and listener. Moreover, repeated factor analyses for 7576 data sets in BCCWJ speaker information corpus revealed ten typical utterance styles (neutral, frank, dialect, polite, feminine, crude, aged, interrogative, approval, and dandy). The factor scores indicated relationships between various utterance styles and fundamental attributes of speakers. Thus, results of this study would be utilisable in speaker identification tasks, automatic speech generation tasks, and scientific interpretation of stories and characters.http://dx.doi.org/10.1155/2018/5056268 |
spellingShingle | Hajime Murai Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus Advances in Human-Computer Interaction |
title | Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus |
title_full | Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus |
title_fullStr | Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus |
title_full_unstemmed | Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus |
title_short | Factor Analysis of Utterances in Japanese Fiction-Writing Based on BCCWJ Speaker Information Corpus |
title_sort | factor analysis of utterances in japanese fiction writing based on bccwj speaker information corpus |
url | http://dx.doi.org/10.1155/2018/5056268 |
work_keys_str_mv | AT hajimemurai factoranalysisofutterancesinjapanesefictionwritingbasedonbccwjspeakerinformationcorpus |