developers.home-assistant/docs/core/entity/stt.md

1.6 KiB

title sidebar_label
Speech-to-text entity Speech-to-text

A speech-to-text (STT) entity allows other integrations or applications to stream speech data to the STT API and get text back.

A speech-to-text entity is derived from the homeassistant.components.stt.SpeechToTextEntity.

Properties

:::tip Properties should always only return information from memory and not do I/O (like network requests). :::

Name Type Default Description
supported_languages list[str] Required The supported languages of the STT service.
supported_formats list[AudioFormats] Required The supported audio formats of the STT service, wav or ogg.
supported_codecs list[AudioCodecs] Required The supported audio codecs of the STT service, pcm or opus.
supported_bit_rates list[AudioBitRates] Required The supported audio bit rates of the STT service, 8, 16, 24 or 32.
supported_sample_rates list[AudioSampleRates] Required The supported audio sample rates of the STT service.
supported_channels list[AudioChannels] Required The supported audio channels of the STT service, 1 or 2.

Methods

Process audio stream

The process audio stream method is used to send audio to an STT service and get text back.

class MySpeechToTextEntity(SpeechToTextEntity):
    """Represent a Speech To Text entity."""

    async def async_process_audio_stream(self) -> None:
        """Process an audio stream to STT service.

        Only streaming content is allowed!
        """