Part 1: Introduction to LLMs


Exploring what makes up Large Language Models (LLMs), from training neural networks to downloading and deploying them.

Author

Victor Roussekov

Published

September 19, 2023

Modified

September 26, 2023


What are Large Language Models?

Large Language Models (LLMs) are neural networks that generate output from a given input, primarily text (some multimodal models also work with images). Each model has its own architecture and training methodology, reflected in how the neural network is configured.

Before a neural network can be used, it must first be “trained”. The training process involves passing “training” data through the network, comparing its output against the expected output, and then checking how well the network generalises on a held-out “validation” or “testing” set.

This process adjusts the model “weights” along the way, and once the training process is finished, the resulting weights can be distributed for use.
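To make the idea of "adjusting weights" concrete, here is a minimal sketch of a training loop. It is nowhere near the scale of an LLM: it assumes a toy model with just two weights (`w` and `b`), synthetic data, and plain gradient descent. The function name `train_linear_model`, the learning rate, and the 80/20 split are all illustrative choices, not anything prescribed by a real framework.

```python
import random


def train_linear_model(train, validation, lr=0.1, epochs=500):
    """Fit y = w * x + b with plain gradient descent (toy example)."""
    w, b = 0.0, 0.0  # the model "weights", starting untrained
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in train:
            error = (w * x + b) - y  # how far the output is from the target
            grad_w += 2 * error * x / len(train)
            grad_b += 2 * error / len(train)
        # Nudge each weight in the direction that reduces the training error.
        w -= lr * grad_w
        b -= lr * grad_b
    # The held-out validation set is only used to measure the result.
    val_loss = sum(((w * x + b) - y) ** 2 for x, y in validation) / len(validation)
    return w, b, val_loss


# Synthetic data generated from y = 3x + 1 (an assumption for this sketch).
rng = random.Random(0)
data = [(x, 3.0 * x + 1.0) for x in (rng.uniform(-1, 1) for _ in range(100))]
train, validation = data[:80], data[80:]

w, b, val_loss = train_linear_model(train, validation)
print(f"w={w:.3f} b={b:.3f} validation loss={val_loss:.6f}")
```

A real LLM follows the same broad pattern, but with billions of weights and a far more elaborate network in place of `w * x + b`.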

The resulting weights are generally referred to as a “pre-trained” model, and they are distributed in a file format that can be “loaded” into a matching network architecture.
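As a rough illustration of what "distributing" and "loading" weights means, the sketch below writes a tiny set of weights to a file and reads them back into a model function. It assumes a toy two-weight model and uses JSON purely for readability; real models ship in binary formats (for example safetensors files or PyTorch checkpoints). The helper names `save_weights`, `load_weights`, and `predict` are made up for this example.

```python
import json
import os
import tempfile


def save_weights(weights, path):
    """Write the trained weights to disk so they can be shared."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(weights, f)


def load_weights(path):
    """Read previously trained weights back from disk."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def predict(weights, x):
    """The "model": fixed code whose behaviour depends on the loaded weights."""
    return weights["w"] * x + weights["b"]


# Pretend these values came out of a training run (hypothetical numbers).
trained = {"w": 3.0, "b": 1.0}

path = os.path.join(tempfile.mkdtemp(), "weights.json")
save_weights(trained, path)

# Anyone with the file and the same model code can now reuse the weights.
restored = load_weights(path)
print(predict(restored, 2.0))
```

The point of the split is that the model code and the weights are separate things: the code defines the shape of the network, and the file supplies the numbers learned during training.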

There are many different models, from OpenAI’s proprietary GPT models (GPT-3, GPT-3.5, GPT-4) to open source models like Meta’s Llama 2.

There are many open source models that can be downloaded from a model artefact repository called Hugging Face. You can think of Hugging Face as the Docker Hub for LLMs.
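To give a feel for what "downloading a model" looks like, here is a small sketch that builds the direct download URL for a file in a Hugging Face repository and, optionally, fetches it. It assumes the Hub's `resolve` URL layout and uses `gpt2` and `config.json` purely as illustrative names; the helpers `hub_file_url` and `download` are hypothetical and written for this post. In practice you would more likely use the official `huggingface_hub` library (its `hf_hub_download` function), and gated models additionally need an access token, which this sketch does not handle.

```python
from urllib.parse import quote
from urllib.request import urlretrieve

HUB = "https://huggingface.co"


def hub_file_url(repo_id, filename, revision="main"):
    """Build the URL of a single file inside a model repository."""
    return (
        f"{HUB}/{quote(repo_id)}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


def download(repo_id, filename, dest):
    """Fetch one file from a public repository (needs network access)."""
    urlretrieve(hub_file_url(repo_id, filename), dest)
    return dest


# Only the URL is built here; call download(...) to actually fetch the file.
url = hub_file_url("gpt2", "config.json")
print(url)
```

Much like pulling an image from Docker Hub, a repository is addressed by name and a revision (here the default `main` branch), and the individual files inside it are the artefacts you load.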

There are also platforms that offer LLM hosting services, such as Replicate, which run these models in the cloud for you, meaning you don’t have to run them on your own hardware.

It is still possible to run LLMs on consumer hardware, and a MacBook should be able to at least run small-to-medium sized models within reasonable timeframes.
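A quick back-of-the-envelope calculation helps explain why model size matters on a laptop. The sketch below estimates the memory needed just to hold the weights, assuming memory is roughly the number of parameters multiplied by the bytes used per weight. It ignores everything else a running model needs (activations, caches, the operating system), so treat the numbers as a lower bound. The function name `weight_memory_gb` and the 7 billion parameter figure are illustrative.

```python
def weight_memory_gb(n_params, bits_per_weight):
    """Approximate memory for the weights alone, in gigabytes (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# A hypothetical 7-billion-parameter model stored at different precisions.
n_params = 7e9
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: ~{weight_memory_gb(n_params, bits):.1f} GB")
```

Storing each weight in fewer bits shrinks the footprint proportionally, which is one reason smaller or reduced-precision models are the ones that tend to fit on consumer hardware.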


Citation

BibTeX citation:
@online{roussekov2023,
  author = {Roussekov, Victor},
  title = {Exploring {LLMs,} {Part} 1: {Introduction} to {LLMs}},
  date = {2023-09-19},
  url = {https://victorouse.zip/blog/exploring-llms/1-introduction-to-llms},
  langid = {en}
}
For attribution, please cite this work as:
Roussekov, Victor. 2023. “Exploring LLMs, Part 1: Introduction to LLMs.” September 19, 2023. https://victorouse.zip/blog/exploring-llms/1-introduction-to-llms.