Speech-to-Text API

High-accuracy transcription API with speaker diarization and affordable prices. Designed for call centers and voice analytics.

Speaker Diarization

Two options available:

  • stereo Left/right channel separation. Use when your PBX records agent and customer on separate channels.
  • ai Neural network-based detection. For single-channel recordings.

Response Example

[
    {
      "start": 0.00,
      "end": 2.34,
      "speaker": 0,
      "text": "Good afternoon!"
    },
    {
      "start": 2.80,
      "end": 5.12,
      "speaker": 1,
      "text": "Hi, how are you?"
    },
]

Try it

Register, upload a file via web interface, check the result. No code required.

Register

For any questions please contact us.