2,636 questions
1
vote
1
answer
55
views
How to reset sequence playback index in Godot AudioStreamRandomizer
I am using an AudioStreamRandomizer with playback mode = Sequential to play a few dozen sound files. When I play the audio stream, I get the next sound in the sequence, but the starting index is ...
0
votes
0
answers
43
views
How to stitch together Amazon Connect, Kinesis Video Streams, and an in-house AI agent pipeline (TTS-STT-LLM)
I built an AI voice agent with a TTS->LLM->STT pipeline; it should make outbound calls and interact with customers.
How do I use the Amazon contact center with Kinesis Video Streams to manage this ...
1
vote
0
answers
53
views
Sounddevice plays loud noise instead of speech
I'm trying to make a script that takes some text, turns it into speech using edge-tts, and then streams it using sounddevice, so that audio playback begins right after edge-tts has finished its job....
0
votes
0
answers
36
views
Is it possible to switch between multiple audio tracks in a media file while streaming over RTSP?
I have been trying to figure out whether it is possible to stream a media file that contains more than one audio track and a single video track.
As per the steps of connection establishment in RTSP, when the two-way ...
1
vote
1
answer
85
views
Handling audio streaming over WebSocket in FastAPI for live transcription
I am trying to achieve live transcription using the OpenAI Whisper model in my app, but I am having some issues with processing the audio to get the waveform.
@router.websocket("/stt/predict/live")
...
0
votes
0
answers
44
views
How to get streaming audio output from gTTS in Python
I am making a voice assistant using gTTS, but because the full speech output is delivered before it is played, it takes a long time to respond.
While searching the Internet about this, I found a GitHub issue:
https:...
1
vote
1
answer
59
views
Is there a way, understood by browsers, to interleave an MJPEG stream served over HTTP with an MP3 stream and play them without a JavaScript player?
I'm serving an MJPEG stream and an MP3 stream from an ESP32. The individual JPEG frames are compressed by the camera module itself, but the microcontroller is not powerful enough to decode and re-...
0
votes
0
answers
79
views
Syncing voice to text with the OpenAI Realtime API in ReactJS
I am using OpenAI’s real-time API (gpt-4o-realtime-preview-2024-12-17) in a React-based application for live transcription and response generation. However, I am facing an issue where the transcribed ...
0
votes
1
answer
127
views
How can I convert 'AudioDataStream' to 'Stream' or 'byte[]'?
I'm using the Microsoft.CognitiveServices.Speech SDK to convert text to speech by creating a custom API in C#. The return type is 'AudioDataStream'. However, I need to convert it to Stream or bytes ...
0
votes
0
answers
31
views
Distorted Audio When Capturing FreeTTS Output as a Byte Array
I am currently facing an issue while trying to capture and store synthesized speech from FreeTTS as a byte array. The goal is to take the transcribed text, generate an audio output using FreeTTS, and ...
0
votes
1
answer
224
views
Playing base64 audio chunks in Swift
I'm receiving audio chunks over WebSocket and storing them in an AVAudioPCMBuffer for playback using AVAudioEngine. However, when I start playing the audio, I hear noise instead of the expected sound. ...
-1
votes
1
answer
73
views
HTML audio element not working with Google Drive or Dropbox
I'm having a problem streaming audio on my site with the audio element.
<audio controls="controls">
<source src="?" type="audio/mpeg" />
Your browser does not ...
0
votes
0
answers
291
views
Real-time audio stream chunks do not play sequentially using JavaScript + ElevenLabs
I am trying to play the audio stream chunks I receive from ElevenLabs in real time. When the next chunk is received, the audio player should preload the audio so that once the first chunk finishes, ...
0
votes
1
answer
446
views
How to implement audio streaming in React + Node using the OpenAI TTS model
I want to implement audio streaming using the OpenAI TTS model.
It will receive audio data from the OpenAI TTS model as a stream and send it to the frontend via WebSocket, where it is played. The frontend is ...
0
votes
1
answer
278
views
Real-time Audio Streaming Not Playing on iOS with WebSocket and Audio APIs
I'm building a real-time audio streaming app using WebSocket and audio APIs, but I'm facing issues specifically on iOS. The app streams audio chunks received via WebSocket from the ElevenLabs API and ...
1
vote
1
answer
165
views
How can I stream audio directly from the response of an API request?
I am using an API that returns streamable audio when I make a POST request and send an audio file as form data. I don't know any method to play the audio immediately without having to ...
0
votes
0
answers
26
views
Stitching React Native streaming data back together
I'm using the react-native-live-audio-stream package to record.
I followed the instructions in the npm
// yarn add buffer
import { Buffer } from 'buffer';
...
LiveAudioStream.on('data', data => ...
0
votes
2
answers
71
views
How to play audio before the download is complete?
I have an Angular client and a Spring Boot server. On the server, I have an endpoint that generates an audio file from text. The audio file generation depends on another API that takes too long (5 ...
0
votes
0
answers
49
views
Why does my music sound weaker when using the Web Audio API?
Only on mobile Chrome, the speaker is switched to phone-call mode while music is playing. I figured out that this is caused by gainNode, but gainNode is important for gapless looping, among other things. I don't want to ...
2
votes
1
answer
60
views
Failed to play m3u8 stream continuously with FFmpeg APIs
I am working on an embedded Linux system (5.10.24), where I want to play an m3u8 audio stream with the FFmpeg APIs.
Here is my code.
#include <stdio.h>
#include <stdbool.h>
#include <alsa/...
0
votes
3
answers
144
views
How to send ICY format message in audio stream from server in Python?
I implemented a StreamingResponse in FastAPI with audio bytes from async generator sources. In addition, I need to insert some messages for the client-side audio player (currently React Native) just in the ...
2
votes
1
answer
445
views
How to make the microphone suppress audio currently playing from the speakers on iPhone?
I implemented the ChatGPT Realtime API, which allows chatting with a bot in real time by voice, among other methods. But I've run into a problem: ChatGPT always starts talking to itself when using ...
-3
votes
1
answer
68
views
HTTP audio streaming
Is it possible to serve an audio file from an HTTP server so that clients consume it via streaming (loading the file partially)?
The HTTP server shouldn't care about streaming; it thinks consumers just ...
1
vote
1
answer
52
views
Audio intermittency in JS
The page is displayed on a TV in a reception area. When it opens, we get a dialog with a button to allow the webpage to play audio, and the page refreshes itself after 40 seconds.
For most of the day, ...
8
votes
0
answers
319
views
Combining threading and asyncio to process audio stream through a WebSocket connection
Overview
I have a server that has an open WebSocket connection with one client application. This client application (an Android app) can send live microphone audio data. What the server has to do in ...