SciNCL based on training data w/o SciDocs leakage.

See malteos/scincl for more details.