Back to articles
Proceedings Paper
Volume: 22 | Article ID: 20
Image
Simple and Effective ASR for Archives
  DOI :  10.2352/issn.2168-3204.2025.22.1.20  Published OnlineJune 2025
Abstract
Abstract

Archives are traditionally identified as holders of text-based information. However, they also possess audio and video materials, which are the focus of this paper. In archival institutions, the absence of transcriptions for audio and video materials presents significant challenges. These materials often hold historical, cultural, and research value, but without transcriptions, their accessibility and usability are limited. The lack of transcriptions makes it difficult to index and search the content, hindering effective utilization. While existing ASR (Automatic Speech Recognition) technologies can assist, these may suffer from mediocre accuracy, especially with older or poor-quality materials. This work addresses the challenge by utilizing state of the art multilingual LLM (Large Language Model), simple to use UI (User Interface) and GPU (Graphics Processing Unit) ready containers to create a simple and effective multilingual transportable ASR module.

Subject Areas :
Views 0
Downloads 0
 articleview.views 0
 articleview.downloads 0
  Cite this article 

Anssi Jääskeläinen, "Simple and Effective ASR for Archivesin Archiving Conference,  2025,  pp 107 - 111,  https://doi.org/10.2352/issn.2168-3204.2025.22.1.20

 Copy citation
  Copyright statement 
Copyright ©2025 Society for Imaging Science and Technology 2025
archiving
Archiving Conference
2161-8798
2161-8798
Society for Imaging Science and Technology
IS&T 7003 Kilworth Lane, Springfield, VA 22151 USA