A copy of crumbly/gpt2-linear-xl, sharded into 1 GiB chunks and stored in bf16 precision.
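
For readers who want to reproduce a similar layout, the following is a minimal sketch, not necessarily how this checkpoint was produced. It assumes the `transformers` and `torch` packages, and that the source repository loads with `AutoModelForCausalLM` without extra options (if it ships custom model code, `trust_remote_code=True` may be needed). The helper names `estimate_shard_count` and `reshard_to_bf16`, the output directory, and the round parameter count in the demo are all hypothetical, chosen for illustration.

```python
import math

GIB = 2**30  # 1 GiB in bytes


def estimate_shard_count(num_params: int, bytes_per_param: int = 2,
                         max_shard_bytes: int = GIB) -> int:
    """Rough shard count: total weight bytes divided by shard size, rounded up.

    bf16 stores each parameter in 2 bytes. The real count can differ slightly,
    because whole tensors are packed into shards rather than split mid-tensor.
    """
    return math.ceil(num_params * bytes_per_param / max_shard_bytes)


def reshard_to_bf16(source_repo: str, out_dir: str) -> None:
    """Load a checkpoint in bf16 and save it in shards of at most 1 GiB."""
    # Imported lazily so the estimate above works without these packages.
    import torch
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(
        source_repo, torch_dtype=torch.bfloat16
    )
    # max_shard_size caps the size of each saved weight file.
    model.save_pretrained(out_dir, max_shard_size="1GiB")


if __name__ == "__main__":
    # 1.5 billion is an assumed round figure, used only to show the arithmetic.
    print(estimate_shard_count(1_500_000_000))
```

Calling `reshard_to_bf16("crumbly/gpt2-linear-xl", "./gpt2-linear-xl-bf16")` would download the source weights and write the sharded copy to the (hypothetical) local directory. Smaller shards mainly help on machines with limited RAM, since the weights can be loaded one file at a time.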