A miniature llama model for testing the llama GQA variant in the BetterTransformer framework.