LLMs

Two types of LLMs:

  • Base LLM: predicts the next word or token based on its training data.
  • Instruction-tuned LLM: a base LLM fine-tuned on instructions.

Instruction-based LLMs generally imply an RLHF layer: reinforcement learning with human feedback.