llama toolformer text-generation-inference

Model converted and quantized to ggml-4bit by ItsBradarr