Speech recognition technology can help users search and understand the contents of the audio-visual records that archives hold. However, old video records often yield poor recognition results because of low signal quality or because the vocabulary used at that time is not covered. This paper presents experimental results and attempts to improve accuracy using a deep-learning-based speech recognition toolkit trained on corpus data relevant to video records from the 1950s and 1970s. It also proposes a strategy for future records management applications that takes into account accuracy and service purposes.
Jae-Pyeong Kim, Yong-Min Shin, Sang-Kook Kim, "Research on Applying Speech Recognition for Audio-Visual Records at the National Archives of Korea," in Proc. IS&T Archiving 2018, 2018, pp. 88-92, https://doi.org/10.2352/issn.2168-3204.2018.1.0.20