Testig reward model for RLHF on 1000 examples from Anthropic/hh-rlhf.