AI Model Zoo
Home
AI Tools
BimAnt
Testig reward model for RLHF on 1000 examples from
Anthropic/hh-rlhf
.