BayJarvis: Blogs on alignment

llm Fine-tuning Zephyr 7B GPTQ with 4-Bit Quantization for Custom Data and Inference - 2023-11-08

Model fine-tuning and quantization play pivotal roles in creating efficient and robust machine learning solutions. This blog post explores the fine-tuning process of the Zephyr 7B GPT-Q model using 4-bit quantization to boost its performance for custom data inference tasks. …

paper Constitutional AI - Training AI Systems to Be Helpful and Harmless Using AI Feedback - 2023-11-04

The paper proposes a new technique called "Constitutional AI" (CAI) to train AI systems like chatbots to be helpful, honest, and harmless without needing human feedback labels identifying harmful behaviors. Instead, the training relies entirely on AI-generated feedback guided by simple principles. This makes it possible to control AI behavior more precisely with far less human input. …