Sound effects should use the same audio format as TTS #262

aaronchantrill · 2020-04-26T22:17:54Z

Detailed Description

Flite produces audio output that is formatted as mono 16 bit little endian at 16MHz, and this appears to be the format generally preferred for both generating and recording audio. The beep_hi.wav and beep_lo.wav files which are used for testing the audio system and providing signals to the user are recorded in 16 bit little endian format, but at 44.1MHz stereo rather than 16MHz mono.

Context

The beep_lo.wav file is played to test whether the user's audio system appears to be set up correctly. The fact that this is playing at a different rate than virtually any other audio file Naomi plays means that it is possible for this test to succeed, but then for Naomi's output to fail.

Possible Implementation

We could just load the current files into audacity, reduce the channels to 1 and change the frequency to 16MHz. We may also want to generate some custom tones. I don't know where these sound effects came from, but they sound uncomfortably close to those used by some other assistants to me.

Your Environment

Version used: naomi-dev
Environment name and version (e.g. PHP 5.4 on nginx 1.9.1): Python 3.7.3
Server type and version: Raspberry Pi 4B
Operating System and version: Raspbian Buster

sank8-2 · 2024-10-16T12:01:09Z

Hi I have changed the audio files to mono and frequency to 16MHz, but I still didn't get the second part. Shall I raise a PR for the audio files?

aaronchantrill · 2024-10-19T22:22:45Z

@sank8-2 please do issue a pull request. I'm not sure what second part you are referring to, but getting a pull request with those files in the correct format would be a big help. Sorry it's taken me so long to get back to you.

sank8-2 · 2024-10-20T01:08:58Z

@aaronchantrill Second part I was referring to generation of custom tones.
Alright I'll raise a PR

aaronchantrill · 2024-10-20T16:46:06Z

Yes, generating custom tones isn't anything I would expect someone new to the project to do, but standardizing the audio out on 16000Hz 1channel 16bit will help.

aaronchantrill · 2024-10-26T14:38:35Z

Merged #428 - Closed

aaronchantrill added Good First Issue! Priority: Low Status: Available Type: Maintenance labels Apr 26, 2020

aaronchantrill added the Hacktoberfest Small or non-core issues that could be worked on by Hacktoberfest participants label Aug 30, 2020

sank8-2 mentioned this issue Oct 20, 2024

Changed audio to mono and 16MHz #428

Merged

8 tasks

aaronchantrill closed this as completed Oct 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sound effects should use the same audio format as TTS #262

Sound effects should use the same audio format as TTS #262

aaronchantrill commented Apr 26, 2020

sank8-2 commented Oct 16, 2024

aaronchantrill commented Oct 19, 2024

sank8-2 commented Oct 20, 2024

aaronchantrill commented Oct 20, 2024

aaronchantrill commented Oct 26, 2024

Sound effects should use the same audio format as TTS #262

Sound effects should use the same audio format as TTS #262

Comments

aaronchantrill commented Apr 26, 2020

Detailed Description

Context

Possible Implementation

Your Environment

sank8-2 commented Oct 16, 2024

aaronchantrill commented Oct 19, 2024

sank8-2 commented Oct 20, 2024

aaronchantrill commented Oct 20, 2024

aaronchantrill commented Oct 26, 2024