An Evolutionary Overview of Large Language Models: From Statistical Methods to the Transformer Era

Authors

DOI:

https://doi.org/10.7251/JIT2502145D

Keywords:

Artificial intelligence, large language models, Transformer architecture, self-attention

Abstract

The early evolution of large language models (LLMs), including the shift from statistical approaches to the Transformer architecture, illustrates their historical impact on natural language processing; at the same time, the latest research in neural networks has enabled the faster and more powerful rise of language models grounded in solid theoretical foundations. These advances, driven by progress in computing systems (e.g., ultra-powerful processing and memory capabilities), enable the development of numerous new models based on emerging technologies such as artificial intelligence (AI). We therefore provide an evolutionary overview of LLMs covering the shift from the statistical to the deep learning approach, highlighting the key stages of their development, with a particular focus on concepts such as self-attention, the Transformer architecture, BERT, GPT, DeepSeek, and Claude. Finally, our conclusions offer a reference point for future research associated with the emergence of new AI-supported models that are irreversibly transforming the way an increasing number of human activities are performed.

Published

2026-01-07

Issue

Section

Articles