Arabic-Whisper Small

Description

Whisper-small-ar is an automatic speech recognition (ASR) model for Arabic, fine-tuned from the Whisper small model. ASR models convert spoken language into written text. This model was fine-tuned on the Mozilla Common Voice dataset (version 11.0) to transcribe spoken Arabic into text.

Key Features

Use Cases

Whisper-small-ar can be employed in a variety of ASR use cases, including:

Usage

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="ayoubkirouane/whisper-small-ar")

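def transcribe(audio):
    # `audio` is passed straight to the pipeline, e.g. a path to an audio file.
    # The pipeline returns a dict; the transcription is under the "text" key.
    return pipe(audio)["text"]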
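
To transcribe several recordings in one go, the helper above can be wrapped in a small loop. The following is a minimal sketch, not part of the original model card: the function name `transcribe_files`, its `asr` parameter, and the example file names are hypothetical and chosen for illustration. The pipeline is loaded only when no `asr` callable is supplied, so the function can be tried out with a stand-in before the model is downloaded.

```python
from typing import Callable, Dict, Iterable, Optional


def transcribe_files(
    paths: Iterable[str],
    asr: Optional[Callable[[str], dict]] = None,
) -> Dict[str, str]:
    """Return a mapping from each audio path to its transcription.

    `asr` is any callable that takes an audio path and returns a dict
    with a "text" key, which is the shape returned by the transformers
    ASR pipeline. Both the name and the parameter are illustrative.
    """
    if asr is None:
        # Imported lazily so the function can be exercised without
        # transformers installed when a stand-in `asr` is passed.
        from transformers import pipeline

        asr = pipeline(
            "automatic-speech-recognition",
            model="ayoubkirouane/whisper-small-ar",
        )

    # Transcribe each file in turn and keep only the text field.
    return {path: asr(path)["text"] for path in paths}
```

Calling `transcribe_files(["clip1.wav", "clip2.wav"])` (hypothetical file names) would load the model once and return a dictionary keyed by file path.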