Журнал «Современная Наука»

Russian (CIS)English (United Kingdom)
MOSCOW +7(495)-142-86-81

ANALYSIS OF MODERN FUNDAMENTAL ARCHITECTURAL APPROACHES AND METHODS FOR TRAINING LARGE LANGUAGE MODELS: FROM THE TRANSFORMER REVOLUTION TO A NEW PARADIGM OF EFFECTIVE ARTIFICIAL INTELLIGENCE

Belov Vyacheslav Viktorovich  (postgraduate student, Russian Presidential Academy of National Economy and Public Administration)

Nikishov Sergey Ivanovich  (Doctor of Economics, Associate Professor, Russian Presidential Academy of National Economy and Public Administration. Moscow, Russia )

This article presents a comprehensive analysis of the current state of research in the field of Large Language Models (LLMs). It examines the evolution of fundamental architectural solutions from the classical transformer to modern specialized approaches: decoder models with effective attention mechanisms, Mixture of Experts, and multimodal architectures. This book provides a detailed analysis of modern learning methods, including optimal scaling according to Chincha's law, advanced approaches to training data curation, methods for aligning with human preferences (RLHF, DPO), and effective fine-tuning strategies (PEFT, LoRA). Key trends are identified: the shift from extensive parameter augmentation to intelligent architecture design, the democratization of access through open models, and the shift toward next-generation multimodal systems. Particular attention is given to promising research areas, including state-space models for infinite contexts and hybrid architectures.

Keywords:Large language models, transformer architecture, mixed experts, multimodal learning, AI alignment, efficient fine-tuning, long context, open models

 

Read the full article …



Citation link:
Belov V. V., Nikishov S. I. ANALYSIS OF MODERN FUNDAMENTAL ARCHITECTURAL APPROACHES AND METHODS FOR TRAINING LARGE LANGUAGE MODELS: FROM THE TRANSFORMER REVOLUTION TO A NEW PARADIGM OF EFFECTIVE ARTIFICIAL INTELLIGENCE // Современная наука: актуальные проблемы теории и практики. Серия: Естественные и Технические Науки. -2026. -№02. -С. 17-22 DOI 10.37882/2223-2966.2026.02.02
LEGAL INFORMATION:
Reproduction of materials is permitted only for non-commercial purposes with reference to the original publication. Protected by the laws of the Russian Federation. Any violations of the law are prosecuted.
© ООО "Научные технологии"