AudioTranscription Fails #194

abegehr · 2024-04-07T03:19:02Z

Describe the bug
On the latest version 0.2.7, AudioTranscriptionQueries fail.

To Reproduce
Run an audio transcription with the following code:

let query = AudioTranscriptionQuery(file: data, fileType: .m4a, model: .whisper_1)
let result = try await openAI.audioTranscriptions(query: query)

Expected behavior
I'd expect the transcription to run successfully.

Desktop (please complete the following information):

OS: macOS 14

Additional context
The error: APIErrorResponse(error: OpenAI.APIError(message: "Invalid file format. Supported formats: [\'flac\', \'m4a\', \'mp3\', \'mp4\', \'mpeg\', \'mpga\', \'oga\', \'ogg\', \'wav\', \'webm\']", type: "invalid_request_error", param: nil, code: nil))

I tried different values for fileType, but all of them fail with the same error. The failure occurs quite quickly, so I'd assume it is a metadata check by OpenAI's API and not an issue with the data.
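For context, a multipart/form-data upload carries a file name and a content type alongside the raw bytes, which is the kind of metadata such a check could inspect. The following is a minimal sketch of a single file part. `makeFilePart`, the boundary string, and the content type value are assumptions for illustration and are not taken from the library's source.

```swift
import Foundation

/// Builds one multipart/form-data part for a file upload.
/// Hypothetical helper for illustration; not part of the OpenAI package.
func makeFilePart(name: String, fileName: String, contentType: String,
                  data: Data, boundary: String) -> Data {
    var part = Data()
    // Part headers: the field name, file name, and content type travel as metadata.
    part.append(Data("--\(boundary)\r\n".utf8))
    part.append(Data("Content-Disposition: form-data; name=\"\(name)\"; filename=\"\(fileName)\"\r\n".utf8))
    part.append(Data("Content-Type: \(contentType)\r\n\r\n".utf8))
    // Part body: the raw audio bytes, followed by the line break that ends the part.
    part.append(data)
    part.append(Data("\r\n".utf8))
    return part
}

// Example values only; the content type here is an assumed placeholder.
let examplePart = makeFilePart(name: "file",
                               fileName: "record.m4a",
                               contentType: "audio/m4a",
                               data: Data([0x00, 0x01]),
                               boundary: "example-boundary")
let examplePartText = String(decoding: examplePart, as: UTF8.self)
```

A server can reject such a part based on the `filename` or `Content-Type` header alone, without decoding the audio, which would be consistent with a fast failure.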

Transcription worked on version 0.2.6 with the following code:

let query = AudioTranscriptionQuery(file: data, fileName: "record.m4a", model: .whisper_1)
let result = try await openAI.audioTranscriptions(query: query)


abegehr · 2024-04-07T04:56:55Z

I'm working around the issue by making the HTTP call using Vapor's req.client directly:

struct AudioTranscriptionRequestBody: Content {
    var file: File
    var model: String
    var prompt: String?
    var temperature: String?
}
let file = File(data: buffer, filename: "speech.m4a")
let body = AudioTranscriptionRequestBody(file: file, model: "whisper-1")
let res = try await req.client.post(.init(string: "https://api.openai.com/v1/audio/transcriptions")) { request in
    request.headers.bearerAuthorization = .init(
        token: openAI.configuration.token)
    request.headers.contentType = .formData
    try request.content.encode(body, as: .formData)
}
let result = try res.content.decode(AudioTranscriptionResult.self)

pradeepb28 · 2024-04-07T09:43:57Z

I honestly don't know why they are mapping the m4a file type to the mp4 type. That could be why we are facing this problem.
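To illustrate the concern, here is a hedged sketch of a file-type enum in which the uploaded file name always keeps the extension of the selected case, so that m4a is never sent under another extension. `AudioFileType`, `fileName`, and `hasSupportedExtension` are hypothetical names, not the library's API, and this thread does not confirm whether the server checks the file name, the content type, or both.

```swift
/// Hypothetical file-type enum; names are illustrative, not the library's API.
enum AudioFileType: String, CaseIterable {
    case flac, m4a, mp3, mp4, mpeg, mpga, oga, ogg, wav, webm

    /// File name whose extension is taken directly from the case name.
    var fileName: String { "speech.\(rawValue)" }
}

/// Extensions listed as supported in the API error message quoted in the issue.
let supportedExtensions: Set<String> = [
    "flac", "m4a", "mp3", "mp4", "mpeg", "mpga", "oga", "ogg", "wav", "webm"
]

/// Returns true when the file name ends in one of the supported extensions.
func hasSupportedExtension(_ fileName: String) -> Bool {
    guard fileName.contains("."),
          let ext = fileName.split(separator: ".").last else { return false }
    return supportedExtensions.contains(ext.lowercased())
}
```

With this shape, `AudioFileType.m4a.fileName` is `speech.m4a`, and every case produces a name that passes `hasSupportedExtension`.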

abegehr · 2024-04-08T09:30:43Z

Relevant commit here: 905e317
It was committed by James J Kalafus; however, the commit is not linked to a GitHub profile.

Demircivi · 2024-04-10T09:58:37Z

I created a PR that addresses this issue.

AT5HK · 2024-04-19T23:12:55Z

Same issue, please fix this. I had to remove it from SPM and add OpenAI locally to change the code and fix it.

pradeepb28 · 2024-04-21T05:55:15Z

> Same issue, please fix this. I had to remove it from SPM and add OpenAI locally to change the code and fix it.

Please post your concern on the above PR to urge the MacPaw team to merge it. (I did.)

Demircivi mentioned this issue Apr 10, 2024

Fixes the m4a content type sent as mp4 instead #197

Merged

Krivoblotsky closed this as completed in #197 Apr 21, 2024
