Korpusi za učenje srpskog jezika kao stranog u eri veštačke inteligencije

Perisic, Olja

doi:10.18485/judig.2025.1

This study explores the challenges of corpus-based teaching of Serbian as a Foreign Language in the context of artificial intelligence technologies. While there are ongoing global discussions about the future of DDL in the AI era, the Serbian context demonstrates a disconnect between technological advancements, particularly those developed by IT experts from JeRTeh, and their application in foreign language teaching. The research focuses on specific linguistic areas that have been examined in previous corpus studies: polysemy (kuća–dom), lexical gaps (babine) and positive or negative word connotations (žena). The findings show that, in tasks such as distinguishing between closely related synonyms, corpus searches offer more precise and reliable results than AI, due to their ability to provide frequency-based data. Generative models are useful for lexical gaps, particularly for beginner learners, though corpusbased methods yield strong results when sufficient technical knowledge is available. However, when analysing word connotations, generative models demonstrate significant limitations, frequently avoiding negative content due to embedded safety filters, rendering them unsuitable for discourse analysis. The main issue is that corpora are not sufficiently integrated into foreign language teaching in Serbia, which limits the ability to make comprehensive comparisons with AI tools. The study recommends training more teachers in corpus-based methods and promoting integrative approaches that combine corpora and generative models. It also advocates cultivating critical thinking, which is central to DDL and involves learner engagement with search engines and AI tools, as well as corpora. Ultimately, this research offers a model for the thoughtful incorporation of AI into language learning, particularly for less commonly taught languages such as Serbian, while preserving the strengths of corpusbased methodology.