Raj.

01

LLM Optimization Using QLoRA and AWQ

Applied QLoRA and AWQ to the Flan-T5 (Base, Large, XL) LLMs and evaluated their practicality for real-world scenarios.

  • Python
  • HuggingFace
  • Google Colab
  • Kaggle