[API Proposal]: Speech to Text Abstractions for Microsoft.Extensions.AI #5534

Open

Description

Background and motivation

It's great that dotnet/extensions is providing abstractions for AI paradigms. Besides chat and embeddings, Speech to Text would probably be a great addition.

This is more of a conversation starter, as we need to look at what the best pattern would be to accommodate different approaches. I would be happy to contribute an implementation for https://github.com/sandrohanea/whisper.net (local inference using whisper.cpp).

It should most likely also accommodate the Whisper API (from OpenAI / Azure) as well as https://www.nuget.org/packages/Microsoft.CognitiveServices.Speech.

API Proposal

TBD

API Usage

TBD

Alternative Designs

No response

Risks

No response


Metadata

Labels

api-suggestion (Early API idea and discussion, it is NOT ready for implementation), area-AI, untriaged
