Speech-to-Text Adapter - Likepersonsapp | Generated by Eircodex v1.0.0

Description

Wraps Flutter's speech_to_text package and exposes a simplified interface for per-field transcription. Handles microphone permission requests, locale detection, recognition lifecycle, and error recovery. Shared across any feature requiring spoken text input.

Feature: Structured Post-Session Report

speech-to-text-adapter

Summaries

The Speech-to-Text Adapter enables hands-free data entry across the mobile application, directly reducing the time and friction required for field workers to log activities and complete forms. By allowing users to dictate rather than type, the organization benefits from faster data capture, fewer input errors, and improved adoption rates among staff who work in physically demanding or time-pressured environments. This capability is a meaningful differentiator in markets where competitors rely on manual keyboard entry, and it directly supports accessibility compliance by offering an alternative input modality. Because the component is shared across all features requiring spoken input, the investment in building and maintaining it yields compounding returns every time a new form or workflow is introduced — no additional voice-input development cost is incurred per feature.

The Speech-to-Text Adapter carries medium complexity and sits on the critical path for any feature that requires voice input. Because it is a shared component, it must be delivered and stabilized before dependent features can be fully tested end-to-end. Key delivery risks include platform-specific permission handling differences across Android and iOS versions, variability in speech recognition accuracy across device models and locales, and the need for physical device testing that cannot be replicated in emulators. The QA plan must include real-device test coverage for at least two locales and scenarios where the microphone permission is denied or revoked mid-session.

Estimate an additional buffer for error recovery UX, as graceful fallback messaging is required to maintain user trust when recognition fails.

Speech-to-Text Adapter wraps the Flutter `speech_to_text` package behind a clean, application-level interface, decoupling all consuming widgets from the underlying plugin API. The public interface exposes a full recognition lifecycle: `initialize()` sets up the plugin instance, `requestPermission()` handles OS-level microphone authorization, `startListening(locale, onResult)` opens a recognition session and streams partial and final transcript callbacks, and `stopListening()`/`cancelListening()` cleanly terminate the session with or without committing the result. `isAvailable()` and `isListening()` provide state guards for UI logic. Locale is resolved via `getLocale()` and passed explicitly per session, supporting multilingual deployments.

The adapter has no external dependencies beyond the Flutter plugin itself and should be registered as a singleton in the DI container to avoid redundant initialization. Error recovery emits typed error states rather than throwing, keeping consumer widgets stateless and predictable. Extend carefully — the underlying plugin's platform channel is stateful, so concurrent session attempts must be guarded against at the adapter level.

Responsibilities

Request and verify microphone permissions
Start, stop, and cancel speech recognition sessions
Stream partial and final transcription results
Handle recognition errors with user-facing fallback messages

Interfaces

initialize()

requestPermission()

startListening(locale, onResult)

stopListening()

cancelListening()

isAvailable()

isListening()

getLocale()

Relationships

Dependents (1)

Components that depend on this component

ui Speech-to-Text Field Overlay

Used Integrations (1)

External integrations and APIs this component relies on

speech_to_text Flutter Package (iOS SFSpeechRecognizer / Android SpeechRecognizer) API

API Contract

View full contract →

REST /api/v1/speech 7 endpoints

GET /api/v1/speech List speech recognition sessions (history)

GET /api/v1/speech/:id Get a speech recognition session by ID

POST /api/v1/speech/start Start a speech recognition session for a form field

PUT /api/v1/speech/:id Update or correct a speech transcript

DELETE /api/v1/speech/:id Delete a speech recognition session

POST /api/v1/speech/:id/stop Stop an active speech recognition session and return transcript

+1 more