GORANI 100k

<br>

The project is currently in progress. Please refrain from using weights and datasets.

KORANI is derived from GORANI, a project within llama2 that experiments with the distribution of appropriate datasets to transfer or distill knowledge based on English datasets. Officially, it's called Grid Of Ranvier Node In llama2 (GORANI), based on the biological term Ranvier Node, and aims to explore the optimal dataset for transferring knowledge in various languages and specific domains. Due to strict licensing issues with English datasets, gorani is primarily for research purposes. Therefore, we are refining and training a commercially usable Korean dataset on top of llama2, based on the experimental results of the GORANI project, and this project is named KORANI (Korean GORANI).

<br>

Template

I use llama2-13b with LFM, but I have used it without a default system message. If a system message is specified in some datasets, I use that content.

### System:
{System}

### User:
{New_User_Input}

### Input:
{New User Input}

### Assistant:
{New_Assistant_Answer}

Update

Update Schedule Task Description Status
23-10-05 Completed training - 19.7k 13b weight (specific data) Done
23-10-06 Submitted hf model weights (REV 01) Done
23-10-20 Q.C On Process
23-10- Completed training - 50k 13b weight
23-10- Q.C
23-10- Submitted hf model weights
23-10- Completed training - 100k 13b weight
23-10- Q.C
23-10- Q.A
23-11- Official weight release

Caution

The model weights and dataset have not been properly curated yet and are strictly prohibited for use under any license. In relation to this, the developers do not assume any responsibility, either implicitly or explicitly.

Revisions

Revision Commit Hash Updated Train Process Status
Revision 01 6d30494fa8da84128499d55075eef57094336d03 23.10.04 19,740/100,000 On Training