Pre-training Large Language Models (LLMs) on high-quality, meticulously curated datasets is widely recognized as critical for enhancing their performance and generalization capabilities. This study ...
Abstract: Nowadays there are several classes of constrained codes intended for different applications. The following two large classes can be distinguished. The first class contains codes with local ...
Abstract: This paper provides a comprehensive review of recent advances in machine learning (ML) technologies applied to code generation, with a focus on literature published between 2022 and 2024. We ...