LLMs
Two types of LLMs:
- Base LLM: predicts the next word or token based on its training data.
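- Instruction-tuned LLM: a base LLM fine-tuned on examples of instructions paired with good responses, so that it follows instructions instead of just continuing the text.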
Instruction-tuned LLMs are often further refined with RLHF (reinforcement learning from human feedback), where human preference ratings of model outputs guide additional training.
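To make "predicts the next word based on its training data" concrete, here is a minimal sketch of a toy bigram predictor. It is only an illustration of the idea, not how a real LLM is built: real models use neural networks over tokens, and the names `train_bigram` and `predict_next` and the tiny corpus are made up for this example.

```python
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count, for each word, which words follow it in the corpus."""
    counts = defaultdict(Counter)
    words = corpus.lower().split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Hypothetical toy "training data".
corpus = "the cat sat on the mat and the cat slept"
model = train_bigram(corpus)

print(predict_next(model, "the"))  # prints "cat" ("cat" follows "the" twice, "mat" once)
```

Like a base LLM, this toy model can only continue text. Given an instruction, it would just carry on with whatever words tend to follow, which is the gap that instruction tuning is meant to close.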