Staff
YI Yeong-il

Research Keywords
Korean language; Corpus linguistics; Digital humanities

I study the Korean language by developing a corpus from closed captions

Research on the Korean language has advanced by drawing on abundant linguistic resources and diverse methods, going beyond analysis based solely on the introspection of native speakers. In recent years, large-scale corpora have been actively developed, and a substantial body of usage-based research has accumulated. Written corpora in particular are well established in both size and annotation, and I have used them in my own linguistic research. Spoken language data, by contrast, remains relatively limited. To address this gap, I am building an original, large-scale spoken corpus from closed captions extracted from multimedia content broadcast in South Korea. The corpus aims to cover language use across a wide range of genres and can be scaled up more easily than existing corpora. Through quantitative analysis of this corpus, I am advancing the study of Korean grammar, with a particular focus on negation.