Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU | Textpad