gpt2 exbert

crumbly/gpt2-linear-xl sharded to 1GiB chunks, in fp32 precision.