35,925 questions
1
vote
1
answer
117
views
Web Audio API preprocessing not improving Azure Speech SDK recognition accuracy for real-time meeting transcription
I'm working on a real-time speech-to-text application where microphone input is processed through the Web Audio API before being sent to the Azure Speech SDK. The main issue is that some audio content is ...
0
votes
0
answers
43
views
One-way audio only on iOS softphone, need more complete solution
I have developed an application for both iOS and Android using Flutter/Dart and the sip_ua package. I have everything working great on both platforms, with one small exception. On iOS, the audio ...
1
vote
0
answers
56
views
When I resume an FMOD Studio instance from pause, the song position shifts a little bit
I'm using FMOD Studio with banks in my project, and I've encountered an issue when pausing and resuming audio. I load my audio as a bank event (i.e., an FMOD Studio Event Instance) and use the ...
1
vote
1
answer
449
views
No Output When Using Faster-Whisper for Transcription in Python
I'm trying to use faster-whisper for speech-to-text transcription in a Jupyter Notebook. I followed a YouTube tutorial and refined my code with ChatGPT, but I'm not getting any transcription output. ...
3
votes
1
answer
123
views
Lowpass filter is slower on GPU than CPU in PyTorch
I have been trying out some of the Torchaudio functionalities and I can't seem to figure out why lowpass_biquad is running slower on the GPU than on the CPU. And this is true for other effects like, ...
0
votes
0
answers
44
views
How to work with the librosa.amplitude_to_db function
I am working with librosa and I need to estimate the dB level of a few audio files. I thought I could easily use the librosa.amplitude_to_db function, but I have a problem.
I don't know what I did ...
1
vote
0
answers
24
views
Realtime generation of audio to write to `SourceDataLine`
I'm wondering if it's possible to constantly output audio generated on the fly to a DataLine, e.g. so the sound changes in response to some kind of state like user input. I'm at a point where I can ...
2
votes
1
answer
76
views
How can I visualize audio data amplitude graphically using UIGraphics iOS
I want to show an interactive audio waveform like this.
I've extracted the sample data using AVAssetReader. Using this data, I'm drawing a UIBezierPath in a Scrollview's contentView. Currently, when I ...
0
votes
2
answers
113
views
Can't play sound file in Compose Desktop
I'm trying to play a WAV file using Java Clip
val uri = Res.getUri("files/frog.wav")
val audioInputStream = AudioSystem.getAudioInputStream(File(uri))
val clip: Clip = AudioSystem.getClip()
...
0
votes
0
answers
86
views
Why does the Android Noise Suppressor capture more background noise, and how to handle this?
I'm working on a project where I need to record speech audio and perform processing on it. I'm using the Noise Suppressor API to reduce background noise, but I’ve noticed that the behavior of this API ...
0
votes
0
answers
39
views
Send filename to LV2 plugin from host through Atom Sequence
Objective
I want to load an LV2 plugin and then send a filename to that plugin through LV2 Atom Sequences, but I am unable to get my head around LV2 atoms.
Background
I am the author of Amp Rack. It's ...
0
votes
0
answers
42
views
HTMLAudioElement on iOS is paused when video plays again
Here's the description:
On iOS (iPhone or iPad) the following behaviour happens:
when both an HTML video element and an HTML audio element are playing on a web page,
and the video source ends,
and ...
0
votes
0
answers
36
views
CORS errors on a random basis when creating download functionality
I have created download functionality, and here is the code for it:
$.ajax({
url: filename, // URL of the file to be downloaded
method: 'GET',
xhrFields: {
...
2
votes
0
answers
180
views
How to implement spatial audio with Swift on iOS 18?
I’m facing a problem implementing spatial audio effects in my iOS 18 app. I have tried several approaches to achieve a 3D audio effect, but the result never felt good enough or didn’t work at all.
...
2
votes
0
answers
44
views
How to properly map SF2 preset headers to their associated instruments and samples?
I'm working on parsing an SF2 file in C and extracting information like sample data and preset headers. I've successfully identified the main chunks (INFO, sdta, and pdta) and read several sub-chunks ...
0
votes
0
answers
43
views
Embedding image to mp3 using Mutagen
I'm currently writing code to mass update my music library with Python (data is gathered through a csv file).
I'm currently using Mutagen because of its simplicity, and it does the trick, except ...
2
votes
0
answers
147
views
How to stream live audio in Flutter and visualize waveform?
I'm trying to build a Flutter app that does the following:
Extract data from the audio stream and use it to create a real-time waveform visualization of the audio.
Stream live audio from a URL.
...
1
vote
0
answers
454
views
iOS Audio is very quiet (React-Native, Expo-Go)
EDIT: Fixed by setting "allowsRecordingIOS" to false, as per Gordon Childs' comment:
"""
You need to set allowsRecordingIOS to false when you want to playback. It sounds like ...
0
votes
1
answer
56
views
Audio fadeout using exponentialRampToValueAtTime in Chrome or Firefox is not reliable
The following code respects the MDN documentation but results in an abrupt mute instead of a 2-second-long fadeout:
const audioContext = new window.AudioContext();
let oscillator;
let gainNode;
...
1
vote
0
answers
25
views
Bluetooth Communication Issue: Receiving Garbled Audio Data Instead of Properly Parsed Files in Android App
I am developing an Android app that handles Bluetooth communication to send and receive text messages and audio files. The sending process works fine, and the headers seem to be transmitted correctly. ...
1
vote
0
answers
169
views
How to use a compiled Audio Processing Object (APO) .DLL?
I have compiled a test APO, but I am not sure whether it will even work.
When you compile an APO, you'll end up with a .DLL which usually has these methods exposed.
EXPORTS
DllCanUnloadNow ...
2
votes
1
answer
110
views
Change video onclick function from play/pause to mute/unmute
I have a header video that needs to be muted by default so it can autoplay in all browsers. I want to hide all video controls while still giving users the option to mute and unmute the ...
0
votes
2
answers
68
views
How can I improve my sound module in Java? [closed]
I am developing a 2D game in Java and sometimes I have problems with sound. At the moment, I only use sound for music, walking, attacking, and when enemies take damage. ...
0
votes
1
answer
28
views
Convert an uploaded audio chunk to be playable on the server
I have an hour-long audio stream sent to the backend over WebSockets. I need to start transcribing the audio and return responses in near real time, or at least keep users from waiting an hour before ...
2
votes
0
answers
97
views
How to sync multiple Android media MusicPlayers with low latency
I’m developing an Android app that needs to play five audio tracks of a song simultaneously. I'm currently using the Android MediaPlayer package for playback. However, precise synchronization is ...