pytorch causal-lm

Introduction

A 70M-parameter GPT-NeoX model for the Tabby Triton backend.

This model is intended mainly for integration testing and development. Do not rely on it in a production setup.
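To give a sense of where the "70M" figure comes from, the sketch below estimates the parameter count of a GPT-NeoX-style model from its dimensions. The configuration values (vocabulary size, hidden size, feed-forward size, layer count) are not stated in this README; they are assumptions based on the published configuration of the upstream Pythia 70M checkpoint, and `NeoXConfig` and `count_parameters` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NeoXConfig:
    # Assumed values (not taken from this README); they mirror the
    # upstream Pythia 70M configuration as far as we know.
    vocab_size: int = 50304
    hidden_size: int = 512
    intermediate_size: int = 2048
    num_layers: int = 6


def count_parameters(cfg: NeoXConfig) -> int:
    """Estimate trainable parameters of a GPT-NeoX-style decoder."""
    h, ff = cfg.hidden_size, cfg.intermediate_size

    # Attention: fused QKV projection plus output projection, both with bias.
    attention = (h * 3 * h + 3 * h) + (h * h + h)
    # Feed-forward: up projection and down projection, both with bias.
    mlp = (h * ff + ff) + (ff * h + h)
    # Two layer norms per block, each with a weight and a bias vector.
    layer_norms = 2 * (2 * h)
    per_layer = attention + mlp + layer_norms

    # Final layer norm after the last block.
    final_norm = 2 * h
    # Input embedding and output projection, assumed untied and bias-free.
    embeddings = 2 * cfg.vocab_size * h

    return cfg.num_layers * per_layer + final_norm + embeddings


if __name__ == "__main__":
    total = count_parameters(NeoXConfig())
    print(f"{total:,} parameters")
```

Under these assumptions, most of the parameters sit in the two embedding matrices rather than in the transformer blocks, which is typical for a model this small with a large vocabulary.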

Acknowledgement

This repository is derived from EleutherAI/pythia-70m-deduped.