bart-large
This model is a fine-tuned version of bart-large on a manually created dataset. It achieves the following results on the evaluation set:
- Loss: 0.40
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 64
- eval_batch_size: 64
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- num_epochs: 3
Training results
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
- | 1.0 | 47 | 4.5156 |
... | |||
- | 10 | 490 | 0.4086 |
How to use
def generate_text(input_text):
# Tokenize the input text
input_tokens = tokenizer(input_text, return_tensors='pt')
# Move the input tokens to the same device as the model
input_tokens = input_tokens.to(model.device)
# Generate text using the fine-tuned model
output_tokens = model.generate(**input_tokens)
# Decode the generated tokens to text
output_text = tokenizer.decode(output_tokens[0], skip_special_tokens=True)
return output_text
from transformers import BartForConditionalGeneration
# Load the pre-trained BART model from the Hugging Face model hub
model = BartForConditionalGeneration.from_pretrained('rasta/BART-FHIR-question')
input_text = "List all procedures with reason reference to resource with ID 24680135."
output_text = generate_text(input_text)
print(output_text)
Framework versions
- Transformers 4.18.0
- Pytorch 1.11.0+cu113
- Datasets 2.1.0
- Tokenizers 0.12.1