Models and SDG¶

Creating fine tuned models with various techniques and Synthetic Data Generation

Models¶

To create our own specialist LLMs as Judge we can either use In Context Learning - LLMs with many examples in the prompt - or fine tune models.

We can create smaller and more accurate models using techniques listed below. I am currently creating notebooks and content about this:

Supervised Fine Tuning where we give ground truth input/output pairs.
Direct Preference Optimisation where give grount truth input-chosen-rejected pairs.
Distillation where a larger model teaches a smaller model that can even outperform the teacher.

As a result, we have powerful tools for our evaluation pipeline.

Synthetic Data Generation enables us to take some seed examples and generate synthetic data. These will need human evaluation.