Total ~75k data, batch size 256

Eval results: https://huggingface.co/spaces/pe-nlp/mt-bench-zh