From deep neural language models to LLMs

Kucharavy, Andrei

doi:10.1007/978-3-031-54827-7_1

From deep neural language models to LLMs

Kucharavy, Andrei

2024

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Cite

Résumé

Large Language Models (LLMs) are scaled-up instances of Deep Neural Language Models—a type of Natural Language Processing (NLP) tools trained with Machine Learning (ML). To best understand how LLMs work, we must dive into what technologies they build on top of and what makes them different. To achieve this, an overview of the history of LLMs development, starting from the 1990s, is provided before covering the counterintuitive purely probabilistic nature of the Deep Neural Language Models, continuous token embedding spaces, recurrent neural networks-based models, what self-attention brought to the table, and finally, why scaling Deep Neural Language Models led to a qualitative change, warranting a new name for the technology.

Détails

Titre

From deep neural language models to LLMs

Auteur(s)/ trice(s)

Kucharavy, Andrei (School of Management, HES-SO University of Applied Sciences and Arts Western Switzerland)

Editeur(s) scientifique(s)

Kucharavy, Andrei ; School of Management, HES-SO University of Applied Sciences and Arts Western Switzerland
Plancherel, Octave ; Cyber-Defence Campus, armasuisse Science and Technology, Thun, Switzerland
Mulder, Valentin ; Cyber-Defence Campus, armasuisse Science and Technology, Thun, Switzerland
Mermoud, Alain ; Cyber-Defence Campus, armasuisse Science and Technology, Thun, Switzerland
Lenders, Vincent ; Cyber-Defence Campus, armasuisse Science and Technology, Thun, Switzerland

Date

2024-04

Publié dans

Large language models in cybersecurity

Publié par

Cham, Springer

Pagination & équivalents

pp. 3–17

ISBN

978-3-031-54826-0

DOI

https://doi.org/10.1007/978-3-031-54827-7_1

Domaine

Economie et Services

Ecole

HEG-VS

Institut

Institut Entrepreneuriat & Management

Lien vers catalogue collection papier

Accès au catalogue des bibliothèques

Le document apparaît dans

Chapitres de livres
Global

From deep neural language models to LLMs

Résumé

Détails

Actions

PDF