-
Notifications
You must be signed in to change notification settings - Fork 645
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Replace runners prefix amz2023. #3818
base: master
Are you sure you want to change the base?
Commits on May 23, 2023
-
[BugFix] Fix extract_features method for WavLM models (#3350)
Summary: resolve #3347 `position_bias` is ignored in `extract_features` method, this doesn't affect Wav2Vec2 or HuBERT models, but it changes the output of transformer layers (except the first layer) in WavLM model. This PR fixes it by adding `position_bias` to the method. Pull Request resolved: #3350 Reviewed By: mthrok Differential Revision: D46112148 Pulled By: nateanl fbshipit-source-id: 3d21aa4b32b22da437b440097fd9b00238152596
Configuration menu - View commit details
-
Copy full SHA for 7d0f336 - Browse repository at this point
Copy the full SHA 7d0f336View commit details
Commits on May 24, 2023
-
Resolve lint issue on LaTeX (#3366)
Summary: Pull Request resolved: #3366 Reviewed By: nateanl Differential Revision: D46136238 Pulled By: mthrok fbshipit-source-id: 3432f5d007293831bab21460a79ae26b1bbc81a8
Configuration menu - View commit details
-
Copy full SHA for 8690e6e - Browse repository at this point
Copy the full SHA 8690e6eView commit details -
Remove CUDA 11.7 builds; replace with 11.8 (#3360)
Summary: CC atalman malfet Pull Request resolved: #3360 Reviewed By: mthrok Differential Revision: D46150898 Pulled By: atalman fbshipit-source-id: 985a0ef69406f48fb15f239d6b16616c0a5379f5
Configuration menu - View commit details
-
Copy full SHA for 5a6f4eb - Browse repository at this point
Copy the full SHA 5a6f4ebView commit details -
Amend commit to gh-pages branch (#3345)
Summary: This commit changes the way doc is pushed. It ammends instead of adding a new commit. Currently each commit in gh-pages contain like 100MB of data. gh-pages branch is fetched by default when `git clone`. So the size of torchaudio repo grows significantly. Pull Request resolved: #3345 Reviewed By: nateanl Differential Revision: D46136612 Pulled By: mthrok fbshipit-source-id: 39479ee5d1a6888254ef50f0db252453d976d183
Configuration menu - View commit details
-
Copy full SHA for a79cf3b - Browse repository at this point
Copy the full SHA a79cf3bView commit details -
Summary: * Delay the import of torchaudio until the CLI options are parsed. * Add option to set log level to DEBUG so that it's easy to see the issue with external libraries. Pull Request resolved: #3346 Reviewed By: nateanl Differential Revision: D46022546 Pulled By: mthrok fbshipit-source-id: 9f988bbd770c2fd2bb260c3cfe02b238a9da2808
Configuration menu - View commit details
-
Copy full SHA for 71b2634 - Browse repository at this point
Copy the full SHA 71b2634View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b85ca5 - Browse repository at this point
Copy the full SHA 8b85ca5View commit details -
Add StreamReader/Writer custom IO to doc (#3367)
Summary: Pull Request resolved: #3367 Reviewed By: nateanl Differential Revision: D46148139 Pulled By: mthrok fbshipit-source-id: 50f297ac69bb95562976eb452e4e382b8c064c3c
Configuration menu - View commit details
-
Copy full SHA for f41ba26 - Browse repository at this point
Copy the full SHA f41ba26View commit details
Commits on May 25, 2023
-
Add LRS3 AV-ASR recipe (#3278)
Summary: This PR adds AV-ASR recipe which contains sample implementations of training and evaluation pipelines for RNNT based automatic, visual, and audio-visual (ASR, VSR, AV-ASR) models on LRS3. This repository includes both streaming/non-streaming modes. CC stavros99 xiaohui-zhang YumengTao mthrok nateanl hwangjeff Pull Request resolved: #3278 Reviewed By: nateanl Differential Revision: D46121550 Pulled By: mpc001 fbshipit-source-id: bb44b97ae25e87df2a73a707008be46af4ad0fc6
Configuration menu - View commit details
-
Copy full SHA for c6624fa - Browse repository at this point
Copy the full SHA c6624faView commit details
Commits on May 26, 2023
-
Improve RNN-T streaming decoding (#3295)
Summary: This commit fixes the following issues affecting streaming decoding quality 1. The `init_b` hypothesis is only regenerated from blank token if no initial hypotheses are provided. 2. Allows the decoder to receive top-K hypothesis to continue decoding from, instead of using just the top hypothesis at each decoding step. This dramatically affects decoding quality especially for speech with long pauses and disfluencies. 3. Some minor errors regarding shape checking for length. This also means that the resulting output is the entire transcript up until that time step, instead of just the incremental change in transcript. Pull Request resolved: #3295 Reviewed By: nateanl Differential Revision: D46216113 Pulled By: hwangjeff fbshipit-source-id: 8f7efae28dcca4a052f434ca55a2795c9e5ec0b0
Configuration menu - View commit details
-
Copy full SHA for 9fc0dca - Browse repository at this point
Copy the full SHA 9fc0dcaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 37779ef - Browse repository at this point
Copy the full SHA 37779efView commit details -
Temporarily remove test for extract_features (#3378)
Summary: The tests failed for several bundles. Remove them and will re-add once the root cause is figured out. Pull Request resolved: #3378 Reviewed By: atalman Differential Revision: D46230884 Pulled By: nateanl fbshipit-source-id: 42056a29b2ec2335268b273d3e37fb517035be92
Configuration menu - View commit details
-
Copy full SHA for 05649ca - Browse repository at this point
Copy the full SHA 05649caView commit details -
Use cuda 11.8 for circleci tests (#3381)
Summary: Use cuda 11.8 for circleci tests. 11.7 was deprecated Pull Request resolved: #3381 Reviewed By: osalpekar Differential Revision: D46236223 Pulled By: atalman fbshipit-source-id: 6d6a8e09603807a07241f31c1bd1e6d3a2b67d9d
Configuration menu - View commit details
-
Copy full SHA for 5c0249b - Browse repository at this point
Copy the full SHA 5c0249bView commit details -
Use the same CUDNN version on Windows as PyTorch (#3380)
Summary: 11.7 uses 8.5.0; 11.8 uses 8.7.0; 12.1 uses 8.8.1. Otherwise, Windows vision job (8.5.0) would overwrite the CUDNN version setup by PyTorch (8.7.0) leading to this flaky failures https://github.com/pytorch/pytorch/actions/runs/5088860652/jobs/9146641450 ``` RuntimeError: cuDNN version incompatibility: PyTorch was compiled against (8, 7, 0) but found runtime version (8, 5, 0). PyTorch already comes bundled with cuDNN. One option to resolving this error is to ensure PyTorch can find the bundled cuDNN. ``` Pull Request resolved: #3380 Reviewed By: atalman Differential Revision: D46236286 Pulled By: huydhn fbshipit-source-id: 9ca12d5068c3029688347d52c5c284488f33728d
Configuration menu - View commit details
-
Copy full SHA for c120f31 - Browse repository at this point
Copy the full SHA c120f31View commit details -
Fix encoding g722 format (#3373)
Summary: g722 format only supports 16k Hz, but AVCodec does not list this. The implementation does not insert resampling and the resulting audio can be slowed down or sped up. Pull Request resolved: #3373 Reviewed By: hwangjeff Differential Revision: D46233181 Pulled By: mthrok fbshipit-source-id: 902b3f862a8f7269dc35bc871e868b0e78326c6c
Configuration menu - View commit details
-
Copy full SHA for 1b05ca7 - Browse repository at this point
Copy the full SHA 1b05ca7View commit details
Commits on May 27, 2023
-
Fix AudioEffector for mulaw (#3372)
Summary: When encoding audio with mulaw, the resulting data does not have header, and the StreamReader defaults to 16k Hz, which can strech/shrink the resulting waveform. Pull Request resolved: #3372 Reviewed By: hwangjeff Differential Revision: D46234772 Pulled By: mthrok fbshipit-source-id: 942c89a8cfe29b0b6f57b3e5b6c9dfd3524ca552
Configuration menu - View commit details
-
Copy full SHA for af932cc - Browse repository at this point
Copy the full SHA af932ccView commit details
Commits on May 29, 2023
-
[Nova] Windows CPU Unittests on Nova (#3329)
Summary: Continuing with the job migrations from CCI to Nova, this PR introduces the Windows CPU Unittest job as a Nova workflow. The job is passing: https://github.com/pytorch/audio/actions/runs/5094569687/jobs/9159020192?pr=3329. Pull Request resolved: #3329 Reviewed By: huydhn Differential Revision: D46265649 Pulled By: atalman fbshipit-source-id: 7659dfbcc8ad400f2e109ff64530e1f768e82ef9
Configuration menu - View commit details
-
Copy full SHA for 6425d46 - Browse repository at this point
Copy the full SHA 6425d46View commit details
Commits on May 30, 2023
-
Simplify sox namespace (#3383)
Summary: Pull Request resolved: #3383 This commit reduces `torchaudio::sox_*` namespace into `torchaudio::sox`. Also put Pybind11 registration and TorchBind registration into anonymous namescope. Differential Revision: D46257367 fbshipit-source-id: 0f0f181eaa72036916e223263daf4b7c298fca0d
Configuration menu - View commit details
-
Copy full SHA for a81b0ed - Browse repository at this point
Copy the full SHA a81b0edView commit details -
Summary: Pull Request resolved: #3389 Adopt more of const reference in sox source code. Differential Revision: D46264068 fbshipit-source-id: 809d34a6e16f621c856d4278ef7ce45a5868a717
Configuration menu - View commit details
-
Copy full SHA for 9cdf26f - Browse repository at this point
Copy the full SHA 9cdf26fView commit details -
Configuration menu - View commit details
-
Copy full SHA for caf3ac0 - Browse repository at this point
Copy the full SHA caf3ac0View commit details
Commits on May 31, 2023
-
Fixes to #3295 Improve RNN-T streaming decoding (#3379)
Summary: Pull Request resolved: #3379 Fixes `RNNTBeamSearch.infer`'s docstring and removes unused import from tutorial. Reviewed By: mthrok Differential Revision: D46227174 fbshipit-source-id: 7c1c3f05a6476cb0437622dea6f3ae6cb3ea9468
Configuration menu - View commit details
-
Copy full SHA for b8016e4 - Browse repository at this point
Copy the full SHA b8016e4View commit details -
Summary: Windows GPU workflows Pull Request resolved: #3364 Reviewed By: mthrok Differential Revision: D46292403 Pulled By: atalman fbshipit-source-id: ee3c6f8082ca77bdc1ffdb930c59fa5a9cb25a4a
Configuration menu - View commit details
-
Copy full SHA for 92d0fb5 - Browse repository at this point
Copy the full SHA 92d0fb5View commit details -
[Nova] Deprecate windows circleci unit tests (#3393)
Summary: Nova - Deprecate windows circleci unit tests Pull Request resolved: #3393 Reviewed By: malfet Differential Revision: D46315608 Pulled By: atalman fbshipit-source-id: 3d7b5d0618b9d2e12e5f97e21d7becdc61d85c69
Configuration menu - View commit details
-
Copy full SHA for c5d3706 - Browse repository at this point
Copy the full SHA c5d3706View commit details -
Surface test failures on CI (#3394)
Summary: Set the directory of JUnitText XML file to the one where test-infra picks up and put them in summary. Example: https://github.com/pytorch/audio/actions/runs/5136305988 Pull Request resolved: #3394 Differential Revision: D46328832 Pulled By: mthrok fbshipit-source-id: f0b5020a911ca4ec09345a965bdec769300859f0
Configuration menu - View commit details
-
Copy full SHA for 2283df8 - Browse repository at this point
Copy the full SHA 2283df8View commit details -
Summary: See title. If all is well, we can deprecate the CCI job in a few days. Pull Request resolved: #3341 Reviewed By: mthrok Differential Revision: D46324265 Pulled By: osalpekar fbshipit-source-id: bc706c6ae4285d4085dc5f0223ea41d8fc290f1c
Configuration menu - View commit details
-
Copy full SHA for 5d0697b - Browse repository at this point
Copy the full SHA 5d0697bView commit details -
[Nova] Stylechecks on Nova (#3390)
Summary: Introducing the stylecheck job on Nova. It seems like it is failing on trunk, but the functionality of this job itself is working and it fails with the same error as it does on trunk with CCI. Pull Request resolved: #3390 Reviewed By: mthrok Differential Revision: D46324223 Pulled By: osalpekar fbshipit-source-id: 1324202e53569d610559ef6f1b90cb5c364e6909
Configuration menu - View commit details
-
Copy full SHA for f7cb6c6 - Browse repository at this point
Copy the full SHA f7cb6c6View commit details
Commits on Jun 1, 2023
-
Delete CCI Linux and MacOS Unittest Jobs (#3391)
Summary: Deprecates the Linux and MacOS Unittest jobs now that they've been running on Nova for over a week. Aside: There was also a stylecheck job that was dependent on the Linux Unittest job. I also put up #3390 to move that stylecheck job to Nova. I'm happy to reintroduce the CCI stylecheck job standalone in CCI if we want the Nova version to run on main for a week. Pull Request resolved: #3391 Reviewed By: mthrok Differential Revision: D46324198 Pulled By: osalpekar fbshipit-source-id: 2115748e153c5dee1a38db2b6230acebc4f56927
Configuration menu - View commit details
-
Copy full SHA for d5d94b7 - Browse repository at this point
Copy the full SHA d5d94b7View commit details -
Update and deprecate apply_codec function (#3386)
Summary: To prepare for the upcoming removal of file-like object support from sox_io backend, this commit changes apply_codec function to use tempfile. `apply_codec` function is now deprecated and users are encourated to use `torchaudio.io.AudioEffector`. We will not remove the function itself, but will remove the entry from the doc. Pull Request resolved: #3386 Reviewed By: hwangjeff Differential Revision: D46330610 Pulled By: mthrok fbshipit-source-id: 3071bdefa05b4cbb9f00629bef50f0981eae89b4
Configuration menu - View commit details
-
Copy full SHA for d6dd497 - Browse repository at this point
Copy the full SHA d6dd497View commit details -
Refactor arg mapping in ffmpeg save function (#3387)
Summary: The arguments of TorchAudio's save function ("format", "bits_per_sample" and "encoding") are not one-to-one mapping to the arguments of FFmpeg encoding. For example, to use vorbis codec, FFmpeg expects "ogg" container/extension with "vorbis" encoder. It does not recognize "vorbis" extension like TorchAudio (libsox) does. This commit refactors the logic to parse/map the arguments. As a result it now properly works with vorbis and mp3 extension. Pull Request resolved: #3387 Reviewed By: hwangjeff Differential Revision: D46328787 Pulled By: mthrok fbshipit-source-id: 36f993952a062bfec58a8b51be6aa86297571f90
Configuration menu - View commit details
-
Copy full SHA for b99e5f4 - Browse repository at this point
Copy the full SHA b99e5f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1dfac46 - Browse repository at this point
Copy the full SHA 1dfac46View commit details -
Summary: Pull Request resolved: #3398 Reviewed By: nateanl Differential Revision: D46354862 Pulled By: mthrok fbshipit-source-id: b86dcdfeff8ed9db87b0b78eca20f6f18117e97e
Configuration menu - View commit details
-
Copy full SHA for c7ac1af - Browse repository at this point
Copy the full SHA c7ac1afView commit details -
[Nova] Deleting Remaining CircleCI jobs (#3399)
Summary: This job completely deletes the CircleCI `config.yml`. Here is what was remaining in the config at the point of deletion: Used Jobs: * **Lint** - Now running on Nova - see https://github.com/pytorch/audio/actions/runs/5144082942 for an example run on the latest PR in trunk * **CircleCI Consistency** - Not needed anymore now if there is no CCI config. Unused Jobs: * **build-ffmpeg-$OS** - For the build jobs, we are already building FFMPEG from source as part of the Nova workflows. * **download-third-parties** - This is caching. We currently do not have caching in Nova jobs, but atalman is working on adding support for this as a future optimization. Pull Request resolved: #3399 Reviewed By: mthrok Differential Revision: D46363921 Pulled By: osalpekar fbshipit-source-id: 8abf5b0c1612c3492908fb2f5797e6b0a3c70766
Configuration menu - View commit details
-
Copy full SHA for cc89f74 - Browse repository at this point
Copy the full SHA cc89f74View commit details -
[BC-breaking] Remove file-like object support from sox_io backend (#3035
) Summary: This commit removes file-like obejct support so that we can remove custom patch The motivation and plan is outlined in #2950. Pull Request resolved: #3035 Reviewed By: hwangjeff Differential Revision: D44695647 Pulled By: mthrok fbshipit-source-id: 13af0234e288c041bc7b490e1f967f85ce7eb8ec
Configuration menu - View commit details
-
Copy full SHA for bc54ac8 - Browse repository at this point
Copy the full SHA bc54ac8View commit details -
Summary: This commit changes the way FFmpeg extension is built and used. Instead of linking (LGPL) FFmpeg libraries to torchaudio at build time, It uses dlopen to search and link them at run time. For dlopen-ing, we use PyTorch's `at::DynamicLibrary` class, which provides portable wrapper. Pull Request resolved: #3353 Differential Revision: D46059199 Pulled By: mthrok fbshipit-source-id: 4493a5fd8a4c802178d20276522f5334d637307d
Configuration menu - View commit details
-
Copy full SHA for b14ced1 - Browse repository at this point
Copy the full SHA b14ced1View commit details
Commits on Jun 2, 2023
-
Revert D46059199: [audio][PR] Use dlopen for FFmpeg
Differential Revision: D46059199 Original commit changeset: 4493a5fd8a4c Original Phabricator Diff: D46059199 fbshipit-source-id: 71cde3f8cd870d1ad9114e3e87cdd1ba564441c0
Configuration menu - View commit details
-
Copy full SHA for ab7a39f - Browse repository at this point
Copy the full SHA ab7a39fView commit details -
Update data augmentation tutorial (#3375)
Summary: Replace sox_effects with `torchaudio.io.AudioEffector` 1. To show case the new and better feature 2. To prepare for the upcoming removal of file-like support object Pull Request resolved: #3375 Reviewed By: nateanl Differential Revision: D46379016 Pulled By: mthrok fbshipit-source-id: 70f24b62494204949f327f6ac6c49f315c9ee315
Configuration menu - View commit details
-
Copy full SHA for 2ba36b4 - Browse repository at this point
Copy the full SHA 2ba36b4View commit details -
[BC-Breaking] Remove compute_kaldi_pitch (#3368)
Summary: This commit removes compute_kaldi_pitch function and the underlying Kaldi integration from torchaudio. Kaldi pitch function was added in a short period of time by integrating the original Kaldi implementation, instead of reimplementing it in PyTorch. The Kaldi integration employed a hack which replaces the base vector/matrix implementation of Kaldi with PyTorch Tensor so that there is only one blas library within torchaudio. Recently, we are making torchaudio more lean, and we don't see a wide adoption of kaldi_pitch feature, so we decided to remove them. See some of the discussion #1269 Pull Request resolved: #3368 Differential Revision: D46406176 Pulled By: mthrok fbshipit-source-id: ee5e24d825188f379979ddccd680c7323b119b1e
Configuration menu - View commit details
-
Copy full SHA for 5bbbb1d - Browse repository at this point
Copy the full SHA 5bbbb1dView commit details
Commits on Jun 3, 2023
-
[audio][PR] Add option to dlopen FFmpeg libraries (#3402)
Summary: Pull Request resolved: #3402 This is a second attempt of #3353. The basic logic to enable dlopen for FFmpeg libraries are same. It uses `at::DynamicLibrary`, which allows to compile torchaudio without linking FFmpeg libraries. This time, the option to enable this feature DLOPEN_FFMPEG has been added, so that users have a way to disable this feature and keep using build-time linking. Please refer to stub.h for more technical detail. Differential Revision: D46403783 fbshipit-source-id: ca3db57ff6bdc50c8c225d22f12f3e76c6dc3f16
Configuration menu - View commit details
-
Copy full SHA for b7d3e89 - Browse repository at this point
Copy the full SHA b7d3e89View commit details
Commits on Jun 4, 2023
-
Update HuBERT/SSL training recipes to support Lightning 2.x (#3396)
Summary: There are some BC-Breaking changes from pytorch_lightning to lightning library. The PR adjust those changes to support latest lightning library. Pull Request resolved: #3396 Reviewed By: mthrok Differential Revision: D46345206 Pulled By: nateanl fbshipit-source-id: 59469c15dc5fe5466a99a5b5380eb4f98c2c633f
Configuration menu - View commit details
-
Copy full SHA for e908357 - Browse repository at this point
Copy the full SHA e908357View commit details
Commits on Jun 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c076d1a - Browse repository at this point
Copy the full SHA c076d1aView commit details
Commits on Jun 6, 2023
-
Update forced_align method to only support batch Tensors (#3365)
Summary: Pull Request resolved: #3365 Current design of forced_align accept 2D Tensor for `log_probs` and 1D Tensor for `targets`. To make the API simple, the PR make changes to only support batch Tensors (3D Tensor for `log_probs` and 2D Tensor for `targets`). Reviewed By: vineelpratap Differential Revision: D46126226 fbshipit-source-id: 42cb52b19d91bbff7dc040ccf60350545d75b3a2
Configuration menu - View commit details
-
Copy full SHA for 5f17d81 - Browse repository at this point
Copy the full SHA 5f17d81View commit details -
Revert D46126226: Update forced_align method to only support batch Te…
…nsors Differential Revision: D46126226 Original commit changeset: 42cb52b19d91 Original Phabricator Diff: D46126226 fbshipit-source-id: 372b2526d9e196e37e014f1556bf117d29bb1ac6
Configuration menu - View commit details
-
Copy full SHA for bbc13b9 - Browse repository at this point
Copy the full SHA bbc13b9View commit details -
[Nova] Remove unused files (#3409)
Summary: We are using Project Nova workflows now. These are not required. Same as: pytorch/vision#7656 Pull Request resolved: #3409 Reviewed By: mthrok Differential Revision: D46494331 Pulled By: atalman fbshipit-source-id: a642ae55b75482918e0afb7c55dc876bc8356e70
Configuration menu - View commit details
-
Copy full SHA for 23e756a - Browse repository at this point
Copy the full SHA 23e756aView commit details -
Summary: Pull Request resolved: #3410 Differential Revision: D46496786 Pulled By: mthrok fbshipit-source-id: e517b273c40b340f39ce7db7ab1be1c3eb5f2059
Configuration menu - View commit details
-
Copy full SHA for 27aa52f - Browse repository at this point
Copy the full SHA 27aa52fView commit details
Commits on Jun 7, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 91db978 - Browse repository at this point
Copy the full SHA 91db978View commit details -
Fix style to prep #3414 (#3415)
Summary: Pull Request resolved: #3415 Differential Revision: D46526437 Pulled By: mthrok fbshipit-source-id: f78d19c19d7e68f67712412de35d9ed50f47263b
Configuration menu - View commit details
-
Copy full SHA for 4771677 - Browse repository at this point
Copy the full SHA 4771677View commit details
Commits on Jun 8, 2023
-
[Nova] Add cache ffmpeg before building (#3417)
Summary: [Nova] Add cache ffmpeg before building Pull Request resolved: #3417 Reviewed By: mthrok Differential Revision: D46537892 Pulled By: atalman fbshipit-source-id: 9f8dc0ecfc305c3b378557d46f89a5d7de67a165
Configuration menu - View commit details
-
Copy full SHA for 5ca03f4 - Browse repository at this point
Copy the full SHA 5ca03f4View commit details -
Merge all the lint/style checks to pre-commit hook (#3414)
Summary: Pull Request resolved: #3414 Differential Revision: D46536717 Pulled By: mthrok fbshipit-source-id: 505bdcdd1b59ca9fe5afc2c8516a0a821e2b8d7e
Configuration menu - View commit details
-
Copy full SHA for c3ca256 - Browse repository at this point
Copy the full SHA c3ca256View commit details -
Optimize Torchaudio Vad (#3382)
Summary: Pull Request resolved: #3382 The voice activity detector function was unoptimized, confusingly written, and buggy. The optimizations created here allow for the function to run roughly 17x faster. The main optimizations were to loop over windows of audio rather than individual audio samples. Reducing the number of copies also helped. There was an off by one error where the array slice referenced was [1: 16001] (for the default settings) instead of [0: 16000] Reviewed By: hwangjeff Differential Revision: D44749359 fbshipit-source-id: c76c9412e70cdc6fcd527d113603c88f78480558
Configuration menu - View commit details
-
Copy full SHA for 1e117f5 - Browse repository at this point
Copy the full SHA 1e117f5View commit details -
Summary: - Moving the unit test scripts from .circleci to .github - Remove docker file for unit test base - Use the Conda from Docker image in Linux jobs. Remaining follow-up items - Reuse the unittest script in Linux GPU job like done in Linux CPU job. The unit test script needs to be fixed to be used for Linux GPU job in new GHA workflow. Keeping it as a separate follow-up work item. Pull Request resolved: #3407 Differential Revision: D46498263 Pulled By: mthrok fbshipit-source-id: d8256717a55bb4257151d819d3b2ebd453601eac
Configuration menu - View commit details
-
Copy full SHA for f080315 - Browse repository at this point
Copy the full SHA f080315View commit details -
Remove CCI badge from README (#3420)
Summary: CI jobs are migrated from CCI to GHA Pull Request resolved: #3420 Differential Revision: D46548562 Pulled By: mthrok fbshipit-source-id: d7e17201e8b256efaa54543e445a0f139aa549b2
Configuration menu - View commit details
-
Copy full SHA for a7fea8a - Browse repository at this point
Copy the full SHA a7fea8aView commit details -
Delay the initialization of CUDA tensor converter (#3419)
Summary: StreamReader decoding process is composed of the three steps; 1. Decode the incoming AVPacket into AVFrame 2. Pass AVFrame through AVFilter to perform post process 3. Convert the resulgint AVFrame The internal of StreamReader was refactored in #3188 so that the above pipeline is initialized at the time output stream is defined and output stream shape can be retrieved. For CPU decoder, this works fine because resizing happens in step 2, and the resulting shape can be retrievable. However, this is problematic for GPU decoder, as resizing is currently done using GPU decoder option (step 1) and there seems to be no interface to retrieve the output shape. This refactor introduced regression, which is described in #3405 AVFilter internally is adoptive to the change of input frame size. This commit changes the conversion process to be similar, so that it will wait until the first frame comes in to finalize the frame shape. Fix #3405 Pull Request resolved: #3419 Differential Revision: D46557505 Pulled By: mthrok fbshipit-source-id: 46ad2d82c8c30f368ebfbaf6947718a5036c7dc6
Configuration menu - View commit details
-
Copy full SHA for 7dff24c - Browse repository at this point
Copy the full SHA 7dff24cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 25e96f4 - Browse repository at this point
Copy the full SHA 25e96f4View commit details -
Introduce chroma filter bank function (#3395)
Summary: Pull Request resolved: #3395 Adds chroma filter bank function `chroma_filterbank` to `torchaudio.prototype.functional`. Reviewed By: mthrok Differential Revision: D46307672 fbshipit-source-id: c5d8104a8bb03da70d0629b5cc224e0d897148d5
Configuration menu - View commit details
-
Copy full SHA for dfd0c5f - Browse repository at this point
Copy the full SHA dfd0c5fView commit details
Commits on Jun 9, 2023
-
Fix the input pixel format when using GPU video encoder (#3426)
Summary: StreamWriter's encoding pipeline looks like the following 1. convert tensor to AVFrame 2. pass AVFrame to AVFilter 3. pass the resulting AVFrame to AVCodecContext (encoder) and AVFormatContext (muxer) When dealing with CUDA tensor, the AVFilter becomes no-op, as we have not added support for CUDA-compatible filters. When CUDA frame is passed, the existing solution passes the software pixel format to AVFilter, which issues warning later as what AVFilter sees is AV_PIX_FMT_CUDA. Since the filter itself is no-op, it functions as expected. But this commit fixes it. See #3317 Pull Request resolved: #3426 Differential Revision: D46562370 Pulled By: mthrok fbshipit-source-id: ce0131f1e50bcc826ee036fc0f35db2a5162b660
Configuration menu - View commit details
-
Copy full SHA for 30afaa9 - Browse repository at this point
Copy the full SHA 30afaa9View commit details -
Disable HF integration test (#3431)
Summary: The new version of transformers changed the format of pre-trained weight. Fixing it is low-priority for the maintanance team so we disable the test. See #3430 Pull Request resolved: #3431 Differential Revision: D46592883 Pulled By: mthrok fbshipit-source-id: d8f54a281a92cac60c469c48f95345bcf0e959d6
Configuration menu - View commit details
-
Copy full SHA for f5d7635 - Browse repository at this point
Copy the full SHA f5d7635View commit details -
Use torch/types.h where possible (#3422)
Summary: Pull Request resolved: #3422 Differential Revision: D46558184 Pulled By: mthrok fbshipit-source-id: a775c4fb193496d9b2bf9db7bee186ee23512b99
Configuration menu - View commit details
-
Copy full SHA for c587715 - Browse repository at this point
Copy the full SHA c587715View commit details
Commits on Jun 12, 2023
-
feat: add guard in
lfilter
for a non-default cuda device (#3432)Configuration menu - View commit details
-
Copy full SHA for c76d952 - Browse repository at this point
Copy the full SHA c76d952View commit details
Commits on Jun 13, 2023
-
Summary: Pull Request resolved: #3435 Reviewed By: nateanl Differential Revision: D46659362 Pulled By: mthrok fbshipit-source-id: ffa033ad6759de6fd958b63ac51a4a1153ffb45d
Configuration menu - View commit details
-
Copy full SHA for 0f682c7 - Browse repository at this point
Copy the full SHA 0f682c7View commit details -
[SoX/Flac] disable xmms_plugin dependency (#3436)
Summary: This plugin pulls glib and gtk which breaks the build on some headless systems Since the plugin is not actually used, it seems right to disable it This change fixed the build on my system Pull Request resolved: #3436 Differential Revision: D46683297 Pulled By: mthrok fbshipit-source-id: 5b1c1eee1929f4a69a1cc6c7d7bb3ed998ec5872
Configuration menu - View commit details
-
Copy full SHA for 58a51b5 - Browse repository at this point
Copy the full SHA 58a51b5View commit details
Commits on Jun 14, 2023
-
Add resample option to AudioEffector (#3374)
Summary: Currently, AudioEffector always resample to the original sample rate. It is more flexible to allow overriding this to any sample rate. Pull Request resolved: #3374 Differential Revision: D46235358 Pulled By: mthrok fbshipit-source-id: 39a5d4e38d9b90380da31d0ce9ee8090668b54e4
Configuration menu - View commit details
-
Copy full SHA for 406e9c8 - Browse repository at this point
Copy the full SHA 406e9c8View commit details
Commits on Jun 15, 2023
-
Update forced alignment tutorial (#3440)
Summary: * Fix backtrack visualization (the cooridnate was off-by-one.) * Add note about the simplification and the new align API * Explicitly handle SOS and EOS Pull Request resolved: #3440 Reviewed By: xiaohui-zhang Differential Revision: D46761282 Pulled By: mthrok fbshipit-source-id: b0b6c9754674e8e23543e9f002e29b55102c92f8
Configuration menu - View commit details
-
Copy full SHA for 1860169 - Browse repository at this point
Copy the full SHA 1860169View commit details
Commits on Jun 16, 2023
-
Add LRS3 data preparation (#3421)
Summary: This PR adds a data preparation recipe that uses the ultra face detector to extract full-face video. The resulting video output is then used as input for training and evaluating RNNT-based models for automatic speech recognition (ASR), visual speech recognition (VSR), and audio-visual ASR (AV-ASR) on the LRS3 dataset. This PR also updates the word error rate (WER) for AV-ASR LRS3 models and improves the code readability. Pull Request resolved: #3421 Reviewed By: mpc001 Differential Revision: D46799748 Pulled By: mthrok fbshipit-source-id: 97af3feac0592b240617faaffa4c0ac8cef614a9
Configuration menu - View commit details
-
Copy full SHA for 77cdd16 - Browse repository at this point
Copy the full SHA 77cdd16View commit details
Commits on Jun 21, 2023
-
Split the CTC forced aligment API tutorial into two tutorials (#3443)
Summary: Splitting the multilingual example part into another tutorial. Pull Request resolved: #3443 Reviewed By: mthrok Differential Revision: D46802844 Pulled By: xiaohui-zhang fbshipit-source-id: a7093053cac8b79d650d4f665db7fde2d8254998
Configuration menu - View commit details
-
Copy full SHA for 627c37a - Browse repository at this point
Copy the full SHA 627c37aView commit details -
Introduce chroma spectrogram transform (#3427)
Summary: Pull Request resolved: #3427 Adds transform `ChromaSpectrogram` for generating chromagrams from waveforms as well as transform `ChromaScale` for generating chromagrams from linear-frequency spectrograms. Reviewed By: mthrok Differential Revision: D46547418 fbshipit-source-id: 250f298b8e11d8cf82f05536c29d51cf8d77a960
Configuration menu - View commit details
-
Copy full SHA for 7096829 - Browse repository at this point
Copy the full SHA 7096829View commit details
Commits on Jun 26, 2023
-
Add more explanation about
n_fft
(#3442)Summary: Pull Request resolved: #3442 Differential Revision: D46797481 Pulled By: mthrok fbshipit-source-id: 3513037cbb8f2edb70fdab0fec5c7c554a697abe
Configuration menu - View commit details
-
Copy full SHA for 105b77f - Browse repository at this point
Copy the full SHA 105b77fView commit details
Commits on Jun 28, 2023
-
Follow up on tutorial update (#3449)
Summary: Pull Request resolved: #3449 Differential Revision: D47094402 Pulled By: mthrok fbshipit-source-id: 43e6994604f0e6c06a5f19c5e8599e2ce12ae622
Configuration menu - View commit details
-
Copy full SHA for 4a121aa - Browse repository at this point
Copy the full SHA 4a121aaView commit details -
include a link to index.rst (#3441)
Summary: Include Conformer/Emformer RNN-T ASR/VSR/AV-ASR link to index.rst Pull Request resolved: #3441 Differential Revision: D47094158 Pulled By: mthrok fbshipit-source-id: 9ab42ac2bf52a5ce488003897ffba2f10a6ca941
Configuration menu - View commit details
-
Copy full SHA for a8ce4a8 - Browse repository at this point
Copy the full SHA a8ce4a8View commit details
Commits on Jul 3, 2023
-
Summary: Pull Request resolved: #3434 Add one bullet point for `torchaudio.functional` and forced alignment as one example. Reviewed By: mthrok Differential Revision: D46658058 fbshipit-source-id: 6e037b7bb6ed2fc2e27ad1e55c5728c17ce69ce8
Configuration menu - View commit details
-
Copy full SHA for 163157d - Browse repository at this point
Copy the full SHA 163157dView commit details
Commits on Jul 5, 2023
-
Update forced_align method to only support batch Tensors (#3433)
Summary: Pull Request resolved: #3433 Current design of forced_align accept 2D Tensor for `log_probs` and 1D Tensor for `targets`. To make the API simple, the PR make changes to only support batch Tensors (3D Tensor for `log_probs` and 2D Tensor for `targets`). Reviewed By: mthrok Differential Revision: D46657526 fbshipit-source-id: af17ec3f92f1a2c46dba91c6db2488a11de36f89
Configuration menu - View commit details
-
Copy full SHA for cc16447 - Browse repository at this point
Copy the full SHA cc16447View commit details -
Untangle third party inclusion in CMake (#3457)
Summary: Pull Request resolved: #3457 Differential Revision: D47241343 Pulled By: mthrok fbshipit-source-id: fd1bfd1531397cb59e9cf11de9dede6949f8517e
Configuration menu - View commit details
-
Copy full SHA for c34a1d6 - Browse repository at this point
Copy the full SHA c34a1d6View commit details -
Add stand alone job to build FFmpeg binaries (#3455)
Summary: Pull Request resolved: #3455 Differential Revision: D47242316 Pulled By: mthrok fbshipit-source-id: 0eb4bdb0a45fccfe9ff97eaed79db63cd7bfc7d8
Configuration menu - View commit details
-
Copy full SHA for 662f067 - Browse repository at this point
Copy the full SHA 662f067View commit details -
Revert "[audio][PR] Add option to dlopen FFmpeg libraries (#3402)" (#…
Configuration menu - View commit details
-
Copy full SHA for ca66a1d - Browse repository at this point
Copy the full SHA ca66a1dView commit details
Commits on Jul 6, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2fa39db - Browse repository at this point
Copy the full SHA 2fa39dbView commit details -
Add ARM linux ffmpeg build (#3462)
Summary: Pull Request resolved: #3462 Differential Revision: D47270241 Pulled By: mthrok fbshipit-source-id: 6a3b02380dfb381ffb47c1f46b46f4833c765246
Configuration menu - View commit details
-
Copy full SHA for d9f51ce - Browse repository at this point
Copy the full SHA d9f51ceView commit details
Commits on Jul 7, 2023
-
Use pre-built binaries for ffmpeg extension (#3460)
Summary: This commit changes the way FFmpeg extension is built. Originally, the build process expected the FFmpeg binaries to be somehow available in build env. This makes the build process unpredictable and prevents default enabling FFmpeg extension. The proposed change uses pre-built FFmpeg binaries as build-time only scaffold, which are built in our CI job https://github.com/pytorch/audio/actions/workflows/ffmpeg.yml. This makes the build process more predictable and removes the necessity to build FFmpeg in our CI. Currently, it supports macOS (arm64, x86_64), unix (x86_64, aarch64) and windows (amd64). The downside is that it no longer works with the architecture not listed above. We can potentially workaround by searching the FFmpeg binaries available in system (the old way) for these system, but since they are not supported by PyTorch, the priority is low. Pull Request resolved: #3460 Differential Revision: D47261885 Pulled By: mthrok fbshipit-source-id: 223a15e95c9140c95688af968beb35ff40354476
Configuration menu - View commit details
-
Copy full SHA for f77c3e5 - Browse repository at this point
Copy the full SHA f77c3e5View commit details -
Fix StreamWriter regression around RGB0/BGR0 (#3428)
Summary: - Add RGB0/BGR0 support to CPU encoder - Allow to pass RGB/BGR when expectged format is RGB0/BGR0 Pull Request resolved: #3428 Differential Revision: D47274370 Pulled By: mthrok fbshipit-source-id: d34d940e04b07673bb86f518fe895c0735912444
Configuration menu - View commit details
-
Copy full SHA for 9210cba - Browse repository at this point
Copy the full SHA 9210cbaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c7bf1b - Browse repository at this point
Copy the full SHA 9c7bf1bView commit details
Commits on Jul 10, 2023
-
Update package smoke test (#3465)
Summary: 1. Update smoke test script to change directory so that there is no `torchaudio` directory in CWD when smoke test is being executed. 2. Disable the part of smoke test which requires FFmpeg for wheel. The preparation for pytorch/test-infra#4358 Pull Request resolved: #3465 Reviewed By: nateanl Differential Revision: D47345117 Pulled By: mthrok fbshipit-source-id: 95aad0a22922d44ee9a24a05d9ece85166b8c17e
Configuration menu - View commit details
-
Copy full SHA for 589de10 - Browse repository at this point
Copy the full SHA 589de10View commit details
Commits on Jul 11, 2023
-
Clean up FFMPEG env var and remove pre/post build script (#3466)
Summary: Now that we do not build FFmpeg as part of CI build process, we can remove the pre/post build scripts. Needs to land after pytorch/test-infra#4358 Pull Request resolved: #3466 Reviewed By: atalman Differential Revision: D47367022 Pulled By: mthrok fbshipit-source-id: 17aafff74ee7d269236cffb8a88c803a8d4c44b7
Configuration menu - View commit details
-
Copy full SHA for c825c01 - Browse repository at this point
Copy the full SHA c825c01View commit details -
Summary: Pull Request resolved: #3469 Differential Revision: D47368140 Pulled By: mthrok fbshipit-source-id: d82ddb91ae1f6612298486fb8401f95c48db5620
Configuration menu - View commit details
-
Copy full SHA for 216146a - Browse repository at this point
Copy the full SHA 216146aView commit details -
Summary: Pull Request resolved: #3468 Differential Revision: D47368070 Pulled By: mthrok fbshipit-source-id: 9b5d57b0cb861a2556a1903121f526f8011a0e2d
Configuration menu - View commit details
-
Copy full SHA for 18b20f7 - Browse repository at this point
Copy the full SHA 18b20f7View commit details -
Clean up FFmpeg build scripts (#3470)
Summary: Pull Request resolved: #3470 Differential Revision: D47374347 Pulled By: mthrok fbshipit-source-id: 003b83e50a70f6e1d06eb196f0be5dbba1640226
Configuration menu - View commit details
-
Copy full SHA for cc41178 - Browse repository at this point
Copy the full SHA cc41178View commit details
Commits on Jul 12, 2023
-
Support multiple FFmpeg versions (#3464)
Summary: This commit introduces support for multiple FFmpeg versions for OSS binary distributions. Currently torchaudio only works with FFmpeg 4. This is inconvenient from installing to runtime linking. This commit allows to pick FFmpeg 4, 5 or 6 at runtime, instead of just looking for v4. The way it works is that we compile the FFmpeg extension three times with different FFmpeg and ship them. At runtime, we look for libavutil of specific version and when one is found, load the corresponding FFmpeg extension. The order of preference is 6, 5, then 4. To make the build process simple and reproducible, we use pre-built binaries of FFmpeg during the build. They are LGPL and downloaded from S3 at build time, instead of building every time. The use of pre-built binaries as scaffolding limits the system that can build torchaudio, so it also introduces single FFmpeg version support mode. setting FFMPEG_ROOT during the build will change the way binaries are built so that it will only support one specific version of FFmpeg. Pull Request resolved: #3464 Differential Revision: D47300223 Pulled By: mthrok fbshipit-source-id: 560c7968315e4c8922afa11a4693f648c0356d04
Configuration menu - View commit details
-
Copy full SHA for 786066b - Browse repository at this point
Copy the full SHA 786066bView commit details -
Fix FFmpeg initialization logic (#3474)
Summary: Pull Request resolved: #3474 Differential Revision: D47398447 fbshipit-source-id: f77b685d54ddfc222b806475707d4a10239872f5
Configuration menu - View commit details
-
Copy full SHA for 49e269a - Browse repository at this point
Copy the full SHA 49e269aView commit details -
Use FFmpeg6 in build doc (#3475)
Summary: Pull Request resolved: #3475 Differential Revision: D47403772 Pulled By: mthrok fbshipit-source-id: 5cdde521dbbbbf33856470a9dc79419b4a3a1683
Configuration menu - View commit details
-
Copy full SHA for 989702b - Browse repository at this point
Copy the full SHA 989702bView commit details -
Fix resampling to support dynamic input lengths for onnx exports. (#3473
) Summary: This is a port of adefossez/julius#17 for torchaudio. Not sure if it's possible/desirable to add tests to test the functionality of ONNX exports, but I did a quick test on my machine to ensure this works. The logic is a bit simpler compared to the other PR because the torchaudio version does not support the additional flags available in julius. Pull Request resolved: #3473 Differential Revision: D47401988 Pulled By: mthrok fbshipit-source-id: 62fa1e4388923f6a62cef2c0f902a79ea179cec4
Configuration menu - View commit details
-
Copy full SHA for a3b6bfb - Browse repository at this point
Copy the full SHA a3b6bfbView commit details -
Resolve some compilation warnings (#3471)
Summary: - FFmpeg 6 deprecated attributes - Guard CUDA specific functions not used in CPU builds Pull Request resolved: #3471 Differential Revision: D47402174 Pulled By: mthrok fbshipit-source-id: 00c0719ab1849b50c0b56b03d8fb38bc7aa74538
Configuration menu - View commit details
-
Copy full SHA for a6d1fec - Browse repository at this point
Copy the full SHA a6d1fecView commit details
Commits on Jul 13, 2023
-
Revert D47402174: [audio][PR] Resolve some compilation warnings
Differential Revision: D47402174 Original commit changeset: 00c0719ab184 Original Phabricator Diff: D47402174 fbshipit-source-id: b1f6ea4cc3ecef3f72a87bf2f67bf9644c847546
Configuration menu - View commit details
-
Copy full SHA for 155d1ba - Browse repository at this point
Copy the full SHA 155d1baView commit details -
Linux CPU job should respect set Python version (#3477)
Summary: Reintroduce a conda environment within which we will do all deps installation, audio builds, and tests runs. This conda environment will use the python version set by the GHA job - previously this just defaulted to using the system 3.10 python which was default inside the container. Pull Request resolved: #3477 Reviewed By: mthrok Differential Revision: D47414572 Pulled By: osalpekar fbshipit-source-id: 80760f82c7726205b29812d576e498db2a7a80a0
Configuration menu - View commit details
-
Copy full SHA for 86cb1e0 - Browse repository at this point
Copy the full SHA 86cb1e0View commit details
Commits on Jul 14, 2023
-
Update the logic to fetch pixel format from filter graph (#3479)
Summary: When using GPU decoder in some environments, attempting to read the output formats from filter graph caused an issue in which the software pixel format cannot be determined. We do not know the exact cause but when it happens, the input link of buffer sink does not have HW frames context. Since currently no filter can convert the pixel format of CUDA frame, we resort to the HW frames context of the output link of buffer source. Environments this was observed. Env1 - OS: Fedora 36 (x86_64) - GCC 12.2.1 - Python 3.10.12 - GPU: GeForce RTX 3070 Ti Laptop GPU - FFmpeg: 5.1.3 - nv-codec-header: n11.1.5.2 - CUDA: 12.1 Env2 - Ubuntu 20.04.4 LTS (x86_64) - GCC 9.4.0 - Python 3.11.3 - GPU: Quadro GV100 - FFmpeg: 5.1.3 - nv-codec-header: n11.1.5.2 - CUDA: 11.4 Pull Request resolved: #3479 Differential Revision: D47482407 Pulled By: mthrok fbshipit-source-id: 1c53096b27824453b260138ab64e1948afeeefc7
Configuration menu - View commit details
-
Copy full SHA for cf53a48 - Browse repository at this point
Copy the full SHA cf53a48View commit details
Commits on Jul 15, 2023
-
Update notes on FFmpeg version (#3480)
Summary: The nightly builds support FFmpeg version 4, 5 and 6. Pull Request resolved: #3480 Differential Revision: D47482841 Pulled By: mthrok fbshipit-source-id: 88267f5e83ddc7b1e866b35e57a87b985e2c78c9
Configuration menu - View commit details
-
Copy full SHA for 5a809aa - Browse repository at this point
Copy the full SHA 5a809aaView commit details -
Use more recent FFmpeg in unit tests (#3476)
Summary: Pull Request resolved: #3476 Differential Revision: D47494211 Pulled By: mthrok fbshipit-source-id: 230bbf0a271b070d1dea34146d0d466e666cccdc
Configuration menu - View commit details
-
Copy full SHA for ea7a96d - Browse repository at this point
Copy the full SHA ea7a96dView commit details
Commits on Jul 17, 2023
-
Ensure StreamReader returns tensors with requires_grad is False (#3467)
Summary: Pull Request resolved: #3467 Differential Revision: D47482388 Pulled By: mthrok fbshipit-source-id: abff36491dc28b83270673860d6457a084b1327d
Configuration menu - View commit details
-
Copy full SHA for 44b9206 - Browse repository at this point
Copy the full SHA 44b9206View commit details
Commits on Jul 18, 2023
-
Extract NVDEC tutorial from the current notebook (#3478)
Summary: Now that GPU video decoders are available in doc CI, we run the tutorials with GPU decoders. Pull Request resolved: #3478 Differential Revision: D47519672 Pulled By: mthrok fbshipit-source-id: 2f95243100e9c75e17c2b4d306da164f0e31f8f2
Configuration menu - View commit details
-
Copy full SHA for 6324462 - Browse repository at this point
Copy the full SHA 6324462View commit details
Commits on Jul 24, 2023
-
Move examples/asr/avsr_rnnt to examples/avsr folder (#3489)
Summary: Pull Request resolved: #3489 Reviewed By: mthrok Differential Revision: D47726448 Pulled By: mpc001 fbshipit-source-id: 3d5aa7646c6bb816dcbbf70c61e98404bb148841
Configuration menu - View commit details
-
Copy full SHA for 66f661d - Browse repository at this point
Copy the full SHA 66f661dView commit details
Commits on Jul 25, 2023
-
Update AV-ASR recipe link to index.rst. (#3492)
Summary: Pull Request resolved: #3492 Reviewed By: mthrok Differential Revision: D47755638 Pulled By: mpc001 fbshipit-source-id: 729efdb2a69b5656dbc0b70dd623c1509123d3aa
Configuration menu - View commit details
-
Copy full SHA for ae8c131 - Browse repository at this point
Copy the full SHA ae8c131View commit details -
Configuration menu - View commit details
-
Copy full SHA for 135cb7b - Browse repository at this point
Copy the full SHA 135cb7bView commit details -
Run GPU video decoder/encoder tests in CI (#3490)
Summary: Pull Request resolved: #3490 Differential Revision: D47757316 Pulled By: mthrok fbshipit-source-id: cfb376be29980f9e452f291c4fa25780e9f85a97
Configuration menu - View commit details
-
Copy full SHA for df65560 - Browse repository at this point
Copy the full SHA df65560View commit details -
Update nvdec/nvenc tutorials (#3483)
Summary: Pull Request resolved: #3483 Differential Revision: D47725664 Pulled By: mthrok fbshipit-source-id: e4249e1488fa7af8670be4a5077957912ff3420b
Configuration menu - View commit details
-
Copy full SHA for 56e2266 - Browse repository at this point
Copy the full SHA 56e2266View commit details -
Summary: This PR is to include few changes in the AV-ASR recipe. The changes include better results, a faster face detector (Mediapipe), renamed variable names, a streamlined dataloader, and a few illustrated examples. These changes were made to improve the usability of the recipe. Pull Request resolved: #3493 Reviewed By: mthrok Differential Revision: D47758072 Pulled By: mpc001 fbshipit-source-id: 4533587776f3a7a74f3f11b0ece773a0934bacdc
Configuration menu - View commit details
-
Copy full SHA for d464479 - Browse repository at this point
Copy the full SHA d464479View commit details -
Fix and update doc deployment (#3495)
Summary: - Fix condition to add new commit to gh-pages - Allow to deploy docs from workflow dispatch Pull Request resolved: #3495 Differential Revision: D47767443 Pulled By: mthrok fbshipit-source-id: 9ca858868f3e822e532c21cde9d7499af9891a51
Configuration menu - View commit details
-
Copy full SHA for e483a67 - Browse repository at this point
Copy the full SHA e483a67View commit details -
Disable some tests that need libsox (#3494)
Summary: In preparation for #3082 Disable those FFmpeg tests that depend on sox CLI. These tests need to be updated or removed so as not to use sox CLI. Auto-skip some sox tests if decoder/encoder are not available Pull Request resolved: #3494 Differential Revision: D47761948 Pulled By: mthrok fbshipit-source-id: 3a48d7f280f8376a48d223947dd41a7cdc8cbc30
Configuration menu - View commit details
-
Copy full SHA for 49e9ed9 - Browse repository at this point
Copy the full SHA 49e9ed9View commit details
Commits on Jul 26, 2023
-
Add nightly doc update (#3496)
Summary: Add scheduled doc update job so that docs are updated at least once a day. Pull Request resolved: #3496 Differential Revision: D47795577 Pulled By: mthrok fbshipit-source-id: aba5376ec51f07560014d250a16fef8b8a11b43e
Configuration menu - View commit details
-
Copy full SHA for f082e6c - Browse repository at this point
Copy the full SHA f082e6cView commit details -
Summary: Pull Request resolved: #3499 Differential Revision: D47803654 Pulled By: mthrok fbshipit-source-id: 2b916fa66d84c91c01b4dfe6dd5ee3501159f451
Configuration menu - View commit details
-
Copy full SHA for da21202 - Browse repository at this point
Copy the full SHA da21202View commit details -
av-asr: move video loading outside detector (#3498)
Summary: This PR moves video loading outside detector during pre-processing. Pull Request resolved: #3498 Reviewed By: mthrok Differential Revision: D47811044 Pulled By: mpc001 fbshipit-source-id: f17839b695b13d3cf2d9db343d7e9a0202eea7d5
Configuration menu - View commit details
-
Copy full SHA for c977afe - Browse repository at this point
Copy the full SHA c977afeView commit details
Commits on Jul 27, 2023
-
Add switch to disable sox integration and ffmpeg integration at runti…
…me (#3500) Summary: Since libsox and ffmpeg extensions now depend on external libraries, their initialization processes might cause unrecoverable issue, such as segfault. This commit adds environment variable to disable them so that importing torchaudio won't attempt to load these libraries. Pull Request resolved: #3500 Differential Revision: D47808178 Pulled By: mthrok fbshipit-source-id: 80c1c6b5f4bc608d4e209473702680db093c95ee
Configuration menu - View commit details
-
Copy full SHA for 29903c5 - Browse repository at this point
Copy the full SHA 29903c5View commit details -
Replace libsox with stub library (#3497)
Summary: This commit updates the way libsox is integrated to torchaudio 1. We stop statically linking libsox, so torchaudio will not ship libsox. 2. We link libsox dynamically. Users are expected to install libsox by themselves. 3. We use stab library to build torchaudio. Pull Request resolved: #3497 Differential Revision: D47803706 Pulled By: mthrok fbshipit-source-id: 31b05495d81069186fa52d67beea360cc7e817a8
Configuration menu - View commit details
-
Copy full SHA for 8588fba - Browse repository at this point
Copy the full SHA 8588fbaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7368e33 - Browse repository at this point
Copy the full SHA 7368e33View commit details
Commits on Jul 28, 2023
-
Add real-time av-asr tutorial (#3511)
Summary: Pull Request resolved: #3511 Reviewed By: mthrok Differential Revision: D47852108 Pulled By: mpc001 fbshipit-source-id: c0ecb4b5bcc8670013dcbe1164e3929f5793c8aa
Configuration menu - View commit details
-
Copy full SHA for d6aeaa7 - Browse repository at this point
Copy the full SHA d6aeaa7View commit details -
Move TorchAudio-Squim models to Beta (#3512)
Summary: The PR move `SquimObjective` and `SquimSubjective` models and corresponding factory functions and pre-trained pipelines out of prototype and to the core directory. They will be included in the next official release. Pull Request resolved: #3512 Reviewed By: mthrok Differential Revision: D47837434 Pulled By: nateanl fbshipit-source-id: d0639f29079f7e1afc30f236849e530c8cadffd8
Configuration menu - View commit details
-
Copy full SHA for b7d2d92 - Browse repository at this point
Copy the full SHA b7d2d92View commit details -
Update documentation about dependencies (#3517)
Summary: Pull Request resolved: #3517 Reviewed By: huangruizhe Differential Revision: D47858452 Pulled By: mthrok fbshipit-source-id: 62ee6c8bb2669dd70f8ca25703a04dc8a9d19aec
Configuration menu - View commit details
-
Copy full SHA for a051985 - Browse repository at this point
Copy the full SHA a051985View commit details -
Remove ffmpeg fallback from sox_io backend (#3516)
Summary: In #2419, we added ffmpeg as fallback for sox_io backend. The was a warkaround for solving the issue with libmad removal. Now that we introduced `backend` argument to I/O functions, and libsox integration is moved to dynamic binding where users can use libsox with libmad integration, we do not need the workaround. This commit is based on reverting #2416 (fd7ace1). Pull Request resolved: #3516 Reviewed By: huangruizhe Differential Revision: D47855272 Pulled By: mthrok fbshipit-source-id: 5af73af7865f6e545ccb052d478e86588ff2a014
Configuration menu - View commit details
-
Copy full SHA for 2c8665d - Browse repository at this point
Copy the full SHA 2c8665dView commit details -
Amend amp_to_db docstring (#3519)
Summary: Context: #3448 The documentation of amplitude_to_DB is ambigious on how cut-off values are computed when the input tensor is 3D. This commit clarifies that. Closes: #3448 Pull Request resolved: #3519 Reviewed By: huangruizhe Differential Revision: D47875505 Pulled By: mthrok fbshipit-source-id: e06bb997e7a27e2abe35c8e2ac91ddfbded4e641
Configuration menu - View commit details
-
Copy full SHA for 61cbf79 - Browse repository at this point
Copy the full SHA 61cbf79View commit details
Commits on Jul 29, 2023
-
Summary: The I/O functions in _compat module was introduced there so that everything related to FFmpeg is in torchaudio.io and FFmpeg library initialization can be carried out in `torchaudio.io.__init__`. Now that this constraint is removed, (all the initialization happens at `torchaudio._extension.__init__`) and `_compat` is only used by FFmpeg dispatcher backend, we move the module to `torchaudio._backend` for better locality. Pull Request resolved: #3518 Reviewed By: huangruizhe Differential Revision: D47877412 Pulled By: mthrok fbshipit-source-id: aa18c8cb6e5d5360950df5158c33c653e37c565f
Configuration menu - View commit details
-
Copy full SHA for 8497ee9 - Browse repository at this point
Copy the full SHA 8497ee9View commit details
Commits on Jul 31, 2023
-
Set and tweak global matplotlib configuration in tutorials (#3515)
Summary: - Set global matplotlib rc params - Fix style check - Fix and updates FA tutorial plots - Add av-asr index cars Pull Request resolved: #3515 Reviewed By: huangruizhe Differential Revision: D47894156 Pulled By: mthrok fbshipit-source-id: b40d8d31f12ffc2b337e35e632afc216e9d59a6e
Configuration menu - View commit details
-
Copy full SHA for 84b1230 - Browse repository at this point
Copy the full SHA 84b1230View commit details -
Migrate torch.norm to torch.linalg.vector_norm (#3522)
Summary: torch.norm is now deprecated. The usages in torchaudio seems to be vector norm, so replacing them with torch.linalg.vector_norm Resolves #3484 Pull Request resolved: #3522 Reviewed By: huangruizhe Differential Revision: D47926659 Pulled By: mthrok fbshipit-source-id: f7428cf0168109a3d340b8784adc99bb5f781084
Configuration menu - View commit details
-
Copy full SHA for 8a2e12d - Browse repository at this point
Copy the full SHA 8a2e12dView commit details
Commits on Aug 1, 2023
-
Add pretrained VGGish inference pipeline (#3491)
Summary: Adds pre-trained VGGish inference pipeline ported from https://github.com/harritaylor/torchvggish and https://github.com/tensorflow/models/tree/master/research/audioset. Pull Request resolved: #3491 Reviewed By: mthrok Differential Revision: D47738130 Pulled By: hwangjeff fbshipit-source-id: 859c1ff1ec1b09dae4e26586169544571657cc67
Configuration menu - View commit details
-
Copy full SHA for cbfde17 - Browse repository at this point
Copy the full SHA cbfde17View commit details -
Summary: torch.nn.utils.weight_norm is deprecated. Replacing this with new API Pull Request resolved: #3523 Reviewed By: huangruizhe Differential Revision: D47932384 Pulled By: mthrok fbshipit-source-id: 344abfa12bd11da779f7fd13b74a1e009a582b52
Configuration menu - View commit details
-
Copy full SHA for 144cfcf - Browse repository at this point
Copy the full SHA 144cfcfView commit details -
Add cuctc tutorial, change blank skip threshold into prob (#3297)
Configuration menu - View commit details
-
Copy full SHA for 732c94a - Browse repository at this point
Copy the full SHA 732c94aView commit details
Commits on Aug 2, 2023
-
Fix save INT16 sox backend (#3524)
Summary: When passing int16 type tensor to `save(backend="sox")`, the resulting file should be 16-bit signed PCM, but instead is 32-bit signed PCM. Resolves #3304 Pull Request resolved: #3524 Reviewed By: huangruizhe Differential Revision: D47941090 Pulled By: mthrok fbshipit-source-id: 2622b31eb1cbf03969f67ab2b2adec6e2ba677c4
Configuration menu - View commit details
-
Copy full SHA for 3f9b517 - Browse repository at this point
Copy the full SHA 3f9b517View commit details
Commits on Aug 3, 2023
-
Relax Conformer RNN-T numerical parity tests (#3525)
Summary: Increases numerical tolerance on Conformer RNN-T TorchScript consistency tests to resolve CI test failures. Pull Request resolved: #3525 Reviewed By: mthrok Differential Revision: D48000613 Pulled By: hwangjeff fbshipit-source-id: 1d35ba58055a8346dc40e2b67f37ccfd2e015894
Configuration menu - View commit details
-
Copy full SHA for 72b0917 - Browse repository at this point
Copy the full SHA 72b0917View commit details -
Refactor wav2vec2 pipeline misc helper functions (#3527)
Summary: Pull Request resolved: #3527 Reviewed By: huangruizhe Differential Revision: D48008822 Pulled By: mthrok fbshipit-source-id: 4beae2956dfd1f00534832b70a1bf0897cba7812
Configuration menu - View commit details
-
Copy full SHA for 09aabcc - Browse repository at this point
Copy the full SHA 09aabccView commit details
Commits on Aug 4, 2023
-
Update ctc forced alignment tutorial (#3529)
Summary: - Simplify the step to generate token-level alignment Pull Request resolved: #3529 Reviewed By: huangruizhe Differential Revision: D48066787 Pulled By: mthrok fbshipit-source-id: 452c243d278e508926a59894928e280fea76dcc6
Configuration menu - View commit details
-
Copy full SHA for b645c07 - Browse repository at this point
Copy the full SHA b645c07View commit details -
Revise VGGish pipeline to accept arbitrary state dict function (#3531)
Summary: Pull Request resolved: #3531 Revises VGGish pipeline to accept arbitrary state dict function to accommodate loading weights from any source. Reviewed By: mthrok Differential Revision: D48056390 fbshipit-source-id: 2767699b58442ad132b518b4a6435f2772a637c3
Configuration menu - View commit details
-
Copy full SHA for b976c8f - Browse repository at this point
Copy the full SHA b976c8fView commit details
Commits on Aug 7, 2023
-
Make target_lengths/input_lengths in forced_align optional (#3533)
Summary: Currently `torchaudio.functional.forced_align` function requires full information on input/target lengths. When performing non-batched alignment, these can be inferred from the size of Tensor. Pull Request resolved: #3533 Reviewed By: nateanl Differential Revision: D48111041 Pulled By: mthrok fbshipit-source-id: fbf07124d3959c5cc5533dcd86296851587082fb
Configuration menu - View commit details
-
Copy full SHA for cd80976 - Browse repository at this point
Copy the full SHA cd80976View commit details -
Add merge_tokens / TokenSpan (#3535)
Summary: This commit adds `merge_tokens` function which removes repeated tokens from CTC token sequences returned from `forced_align`. Resolving repeated tokens is a necessary step and almost universal, thus it makes sense to have such helper function in torchaudio. Pull Request resolved: #3535 Reviewed By: huangruizhe Differential Revision: D48111202 Pulled By: mthrok fbshipit-source-id: 25354bfa210aa5c03f8c1d3e201f253ca3761b24
Configuration menu - View commit details
-
Copy full SHA for 30668af - Browse repository at this point
Copy the full SHA 30668afView commit details -
Summary: Port the MMS FA model from tutorial to the library with post-processing module. Pull Request resolved: #3521 Reviewed By: huangruizhe Differential Revision: D48038285 Pulled By: mthrok fbshipit-source-id: 571cf0fceaaab4790983be2719f1a85805b814f5
Configuration menu - View commit details
-
Copy full SHA for 5e211d6 - Browse repository at this point
Copy the full SHA 5e211d6View commit details -
Move alignment code to separate submodule (#3536)
Summary: Pull Request resolved: #3536 Reviewed By: huangruizhe Differential Revision: D48120170 Pulled By: mthrok fbshipit-source-id: dec7575db07734490099b35a8bfc854252952c6e
Configuration menu - View commit details
-
Copy full SHA for 90143e9 - Browse repository at this point
Copy the full SHA 90143e9View commit details
Commits on Aug 8, 2023
-
Add abstraction for download util (#1959)
Summary: Pull Request resolved: #1959 Reviewed By: hwangjeff Differential Revision: D32078361 fbshipit-source-id: 50b56bac9593c36197998e89db19cd6d65b793cc
Configuration menu - View commit details
-
Copy full SHA for 3f98fb9 - Browse repository at this point
Copy the full SHA 3f98fb9View commit details -
Librispeech RNNT recipe updates for pytorch lightening 2.0 (#3336)
Summary: Pull Request resolved: #3336 Reviewed By: mthrok Differential Revision: D47846814 Pulled By: huangruizhe fbshipit-source-id: dc12362bf243c52222dccadec3176e25e43dd652
Configuration menu - View commit details
-
Copy full SHA for e6c8973 - Browse repository at this point
Copy the full SHA e6c8973View commit details -
Summary: Pull Request resolved: #3538 Reviewed By: huangruizhe Differential Revision: D48154056 Pulled By: mthrok fbshipit-source-id: 72f58c501c5302d40f1d059f95bd6fe40d4a52aa
Configuration menu - View commit details
-
Copy full SHA for 7e85f62 - Browse repository at this point
Copy the full SHA 7e85f62View commit details -
Adopt MMS_FA bundle in multilingual FA tutorials (#3534)
Summary: Pull Request resolved: #3534 Reviewed By: huangruizhe Differential Revision: D48155817 Pulled By: mthrok fbshipit-source-id: a3d45fdfd360f9668063a3ecb3b00364290134c9
Configuration menu - View commit details
-
Copy full SHA for 19e9046 - Browse repository at this point
Copy the full SHA 19e9046View commit details -
Add tutorial link to AVSR recipe (#3532)
Summary: Pull Request resolved: #3532 Reviewed By: mthrok Differential Revision: D48165499 Pulled By: mpc001 fbshipit-source-id: c87b3361f0e6282684f218b32888df883d56682b
Configuration menu - View commit details
-
Copy full SHA for f7ab406 - Browse repository at this point
Copy the full SHA f7ab406View commit details -
Updating CTC FA tutorial (#3542)
Summary: Pull Request resolved: #3542 Reviewed By: huangruizhe Differential Revision: D48166025 Pulled By: mthrok fbshipit-source-id: 29fee7dbf08394993972ec2967f94ce9fcb1c853
Configuration menu - View commit details
-
Copy full SHA for eab8aa7 - Browse repository at this point
Copy the full SHA eab8aa7View commit details
Commits on Aug 9, 2023
-
Revise VGGish inference pipeline test (#3544)
Summary: Pull Request resolved: #3544 Revises VGGish inference pipeline test to support internal testing. Reviewed By: mthrok Differential Revision: D48058409 fbshipit-source-id: 045140a0e9d50128d32ef6510bdb2f642a365c83
Configuration menu - View commit details
-
Copy full SHA for 9f5fa84 - Browse repository at this point
Copy the full SHA 9f5fa84View commit details
Commits on Aug 10, 2023
-
Summary: Pull Request resolved: #3546 Reviewed By: huangruizhe Differential Revision: D48219274 Pulled By: mthrok fbshipit-source-id: 6881f039bf70cf7240fbcfeb48443471ef457bd4
Configuration menu - View commit details
-
Copy full SHA for bc26425 - Browse repository at this point
Copy the full SHA bc26425View commit details -
Fix SoundfileBackend method decorators (#3550)
Summary: Pull Request resolved: #3550 Reviewed By: hwangjeff Differential Revision: D48219176 Pulled By: mthrok fbshipit-source-id: 4b11111dd3853cbef4ffe1859ec428ca05394824
Configuration menu - View commit details
-
Copy full SHA for 2d1138c - Browse repository at this point
Copy the full SHA 2d1138cView commit details -
Move backend initialization to toplevel (#3548)
Summary: The backend dispatcher is implemented in `torchaudio._backend`, while the legacy backend is implemented in `torchaudio.backend`. The initialization happen in `torchaudio._backend`. This commit moves it to `torchaudio.__init__`, so that `backend` and `_backend` is more independent. Pull Request resolved: #3548 Reviewed By: huangruizhe Differential Revision: D48219244 Pulled By: mthrok fbshipit-source-id: e694cb232794f90902a60ee51c7bf11b7f0548a0
Configuration menu - View commit details
-
Copy full SHA for 6fb21ab - Browse repository at this point
Copy the full SHA 6fb21abView commit details -
[aarch64] Add aarch64 workflow (#3553)
Summary: # Changes * Adding workflow for building aarch64 wheels. Pull Request resolved: #3553 Reviewed By: hwangjeff, osalpekar Differential Revision: D48239384 Pulled By: atalman fbshipit-source-id: dfa00edb3fee0acaf2b83fb420eaf12bddc6980e
Configuration menu - View commit details
-
Copy full SHA for 8d858c3 - Browse repository at this point
Copy the full SHA 8d858c3View commit details -
Add Frechet distance function (#3545)
Summary: Pull Request resolved: #3545 Adds function for computing the Fréchet distance between two multivariate normal distributions. Reviewed By: mthrok Differential Revision: D48126102 fbshipit-source-id: e4e122b831e1e752037c03f5baa9451e81ef1697
Configuration menu - View commit details
-
Copy full SHA for 06301c0 - Browse repository at this point
Copy the full SHA 06301c0View commit details -
Refactor _backend module (#3547)
Summary: * Move Backend implementations to separate files Pull Request resolved: #3547 Reviewed By: hwangjeff Differential Revision: D48233538 Pulled By: mthrok fbshipit-source-id: bcc63fc07a5dfcd48929f0a2fb64bfcb3282eb92
Configuration menu - View commit details
-
Copy full SHA for 1e6a8f9 - Browse repository at this point
Copy the full SHA 1e6a8f9View commit details
Commits on Aug 11, 2023
-
Support writing opus and mp3 with soundfile (#3554)
Summary: Pull Request resolved: #3554 Reviewed By: huangruizhe Differential Revision: D48240906 Pulled By: mthrok fbshipit-source-id: 1936757646f8ebba74e8b65e2ffe2a8b74fdfeeb
Configuration menu - View commit details
-
Copy full SHA for 9bd7ca5 - Browse repository at this point
Copy the full SHA 9bd7ca5View commit details -
Revise VGGish pipeline test again (#3551)
Summary: Pull Request resolved: #3551 Restores VGGish pipeline test to be a function rather than class. Reviewed By: mthrok Differential Revision: D48236197 fbshipit-source-id: 25ac19d87a7a0964a9c3f7552037cd6c21dc38a9
Configuration menu - View commit details
-
Copy full SHA for f2b2f05 - Browse repository at this point
Copy the full SHA f2b2f05View commit details -
Summary: `torchaudio.info` returns `AudioMetaData`. It should be exposed as public API, without referring `backend` submodule. Pull Request resolved: #3556 Reviewed By: huangruizhe Differential Revision: D48267349 Pulled By: mthrok fbshipit-source-id: 6ccc0c32bf62fbdcb71495fc7d8d4cc29891538a
Configuration menu - View commit details
-
Copy full SHA for 9467fc4 - Browse repository at this point
Copy the full SHA 9467fc4View commit details
Commits on Aug 14, 2023
-
Add default use_tmp_hub_dir value for integration tests (#3558)
Summary: Pull Request resolved: #3558 In the event that `use_tmp_hub_dir` isn't specified as an option, pytest shouldn't fail. To resolve such failures, this PR modifies function `temp_hub_dir` to fall back on a default value of `False` for `use_tmp_hub_dir`. Reviewed By: mthrok Differential Revision: D48318947 fbshipit-source-id: 5dd692f9202ef37ec3e2c9ea39896156f928d693
Configuration menu - View commit details
-
Copy full SHA for d1d41fd - Browse repository at this point
Copy the full SHA d1d41fdView commit details -
Update ffmpeg pre-built binary to 4.4.4 (#3557)
Summary: Pull Request resolved: #3557 Reviewed By: huangruizhe Differential Revision: D48326462 Pulled By: mthrok fbshipit-source-id: c37ae38e28e4514ea284613636604a725829346d
Configuration menu - View commit details
-
Copy full SHA for a9e38e7 - Browse repository at this point
Copy the full SHA a9e38e7View commit details -
Update integration test CI config (#3502)
Summary: Update the ubuntu image so that CI is triggered. There is some issue with FFmpeg 4, so that CI does not succeed. This will be is handled separately. Pull Request resolved: #3502 Reviewed By: huangruizhe Differential Revision: D48327431 Pulled By: mthrok fbshipit-source-id: 5ea639f3e20c3aaf460e6030f6cb1ad2daa00172
Configuration menu - View commit details
-
Copy full SHA for 9d8f76d - Browse repository at this point
Copy the full SHA 9d8f76dView commit details -
Update I/O and backend docs (#3555)
Summary: * Merge backend doc into torchaudio toplevel doc * Update backend, dispatcher, installation doc Pull Request resolved: #3555 Reviewed By: huangruizhe Differential Revision: D48326812 Pulled By: mthrok fbshipit-source-id: cc0d7326eacfebd341323b5d613ca1777255748b
Configuration menu - View commit details
-
Copy full SHA for c0f25f2 - Browse repository at this point
Copy the full SHA c0f25f2View commit details -
Move essential backend implementations to _backend (#3549)
Summary: Move the actual I/O implementation to `_backend` submodule so that the existing `backend` submodule contains only what's related to legacy backend utilities. Pull Request resolved: #3549 Reviewed By: huangruizhe Differential Revision: D48253550 Pulled By: mthrok fbshipit-source-id: c23f1664458c723f63e134c7974b3f7cf17a1e98
Configuration menu - View commit details
-
Copy full SHA for 2e0dfaf - Browse repository at this point
Copy the full SHA 2e0dfafView commit details
Commits on Aug 15, 2023
-
[BC-breaking] Update pre-built ffmpeg4 to 4.4.4 (#3561)
Summary: In #3460, we switched the build process for FFmpeg extension. Since it is complicated to install FFmpeg in some environments, at build time, pre-built binaries and its headers are downloaded and used as a scaffolding for torchaudio build. Now even though we did not change any code or FFmpeg version, it turned out that this causes segmentation fault on Ubuntu when using system Python and FFmpeg 4.4 installed via aptitude. While investigating the issue, I swapped the said pre-built FFmpeg scaffolding with FFmpeg 4.4 from aptitude, and the segmentation fault did not happen. This indicates that it is binary compatibility issue. Before #3460, each binary build job was building FFmpeg 4.1.8 using the same compiler used to build torchaudio, but after #3460 the environments to build FFmpeg 4.1.8 and torchaudio are different. My hypothesis is that this difference is causing some ABI incompatibility when linking against FFmpeg 4.4. (Also, I don't remember well, but I read somewhere that 4.4 has a different ABI) Through experiments, it turned out upgrading the pre-built FFmpeg scaffolding to 4.4 resolves this. So this commit upgrade the pre-built FFmpeg 4 to 4.4. The potential (yet unconfirmed) downside is that torchaudio will no longer work with 4.1, 4.2, and 4.3. Since FFmpeg 4.4 is what Ubuntu 20.04 and 22.04 support by default, and Google Colab is also on 20.04, I think it is more important to support 4.4. Therefore we drop the support for 4.1-4.3 from normal build (and official distributions). Those who wish to use 4.1-4.3 can build torchaudio from source by linking to specific FFmpeg. Pull Request resolved: #3561 Reviewed By: hwangjeff Differential Revision: D48340201 Pulled By: mthrok fbshipit-source-id: 7ece82910f290c7cf83f58311c4cf6a384e8795e
Configuration menu - View commit details
-
Copy full SHA for bf07ea6 - Browse repository at this point
Copy the full SHA bf07ea6View commit details -
Use pytorch/manylinuxaarch64-builder:cpu-aarch64 docker image (#3560)
Summary: Use pytorch/manylinuxaarch64-builder:cpu-aarch64 Introduced in pytorch/builder#1472 Pull Request resolved: #3560 Reviewed By: mthrok Differential Revision: D48366572 Pulled By: atalman fbshipit-source-id: 6de15f81abb09c737e6a1271226259483141e8f4
Configuration menu - View commit details
-
Copy full SHA for 126f9f6 - Browse repository at this point
Copy the full SHA 126f9f6View commit details
Commits on Aug 18, 2023
-
Summary: Remove mention of backend and quick usage. Those are explained in the documentation in detail. Pull Request resolved: #3567 Reviewed By: huangruizhe Differential Revision: D48471832 Pulled By: mthrok fbshipit-source-id: 467efc1f11f66534c33cf4751de27b08176c31bf
Configuration menu - View commit details
-
Copy full SHA for 1638efe - Browse repository at this point
Copy the full SHA 1638efeView commit details
Commits on Aug 19, 2023
-
Enable ROCm RNN-T Loss (#2485)
Summary: Added HIPIFY code and small changes for ROCm. Targeting RNN-T loss. Pull Request resolved: #2485 Reviewed By: huangruizhe Differential Revision: D43537864 Pulled By: mthrok fbshipit-source-id: 4bdb1f291dc51a12232ccd072b97ae94ae20cc0c
Configuration menu - View commit details
-
Copy full SHA for c593961 - Browse repository at this point
Copy the full SHA c593961View commit details
Commits on Aug 20, 2023
-
Add detail about CTC peaky behavior (#3566)
Summary: Pull Request resolved: #3566 Reviewed By: huangruizhe Differential Revision: D48499338 Pulled By: mthrok fbshipit-source-id: 7f837e1a1f8116d7d82411607c91628b729077d8
Configuration menu - View commit details
-
Copy full SHA for a25bcb6 - Browse repository at this point
Copy the full SHA a25bcb6View commit details -
Fix style check CI job (#3564)
Summary: It seems that the default Python version was updated to 3.11. libcst does not have binary release for 3.11, so the CI attempts to build from source but it fails because building libcst requires Rust compiler. This commit fix the Python version of style check job to 3.10 so that the issue with Rust compiler is avoided. Pull Request resolved: #3564 Reviewed By: huangruizhe Differential Revision: D48499560 Pulled By: mthrok fbshipit-source-id: 53ab77268d8143f4946d92e8cd1f96aea55e7b72
Configuration menu - View commit details
-
Copy full SHA for a5da0a2 - Browse repository at this point
Copy the full SHA a5da0a2View commit details -
Summary: Turned out FFmpeg 5 installed via conda reports video frame rate -1. FFmpeg 4 and 6 are fine. This is either a regression in FFmpeg or in the underlying decoding library. Make the reference value adoptive. Pull Request resolved: #3568 Reviewed By: huangruizhe Differential Revision: D48499621 Pulled By: mthrok fbshipit-source-id: fb64187bcf0dc57b753cb6c05f04d436238f5c51
Configuration menu - View commit details
-
Copy full SHA for 0688863 - Browse repository at this point
Copy the full SHA 0688863View commit details
Commits on Aug 21, 2023
-
Summary: Pull Request resolved: #3569 Reviewed By: huangruizhe Differential Revision: D48508244 Pulled By: mthrok fbshipit-source-id: 6e14267e2dbdf08ea3c25a1dab480cb0e908e0c3
Configuration menu - View commit details
-
Copy full SHA for 3318bce - Browse repository at this point
Copy the full SHA 3318bceView commit details -
Use FFmpeg6 in unittest (#3570)
Summary: Pull Request resolved: #3570 Reviewed By: huangruizhe Differential Revision: D48518568 Pulled By: mthrok fbshipit-source-id: 0fdfb8b3988789c7ded0fb336824034bedf6a394
Configuration menu - View commit details
-
Copy full SHA for 9d11563 - Browse repository at this point
Copy the full SHA 9d11563View commit details
Commits on Aug 23, 2023
-
update CUDA to 12.1 U1 (#3563)
Summary: Follow-up of: pytorch/builder#1485 CC atalman Pull Request resolved: #3563 Reviewed By: kit1980 Differential Revision: D48610200 Pulled By: atalman fbshipit-source-id: 61c9981da5a343a3cbce97b0a77ab91f37560087
Configuration menu - View commit details
-
Copy full SHA for 47eaab4 - Browse repository at this point
Copy the full SHA 47eaab4View commit details
Commits on Aug 29, 2023
-
Remove random print statement (#3577)
Summary: Pull Request resolved: #3577 Reviewed By: atalman Differential Revision: D48763580 Pulled By: mthrok fbshipit-source-id: 6ab155a5dd4cf11b2a58f26ced369107f0a2f08f
Configuration menu - View commit details
-
Copy full SHA for 5ee254e - Browse repository at this point
Copy the full SHA 5ee254eView commit details -
Separate Test Token for Conda Uploads (#3582)
Summary: We will use a separate token for uploading test binaries (instead of reusing the nightly token). This PR adds that token to the caller workflow. Pull Request resolved: #3582 Reviewed By: atalman Differential Revision: D48803009 Pulled By: osalpekar fbshipit-source-id: c2af57f6946da51a7b56c975614e60f243e3f6fb
Configuration menu - View commit details
-
Copy full SHA for 6fb6854 - Browse repository at this point
Copy the full SHA 6fb6854View commit details
Commits on Aug 30, 2023
-
Bump Version 2.1 -> 2.2 (#3585)
Summary: Pull Request resolved: #3585 Reviewed By: atalman Differential Revision: D48832146 Pulled By: osalpekar fbshipit-source-id: c36dd79a0c0f407d490a488902207c8066c01383
Configuration menu - View commit details
-
Copy full SHA for bf77b2a - Browse repository at this point
Copy the full SHA bf77b2aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5cf7d2d - Browse repository at this point
Copy the full SHA 5cf7d2dView commit details
Commits on Sep 1, 2023
-
Automate Release-Only Changes (#3590)
Summary: First Pass for automating release-only changes. Tested this in audio to ensure it works. Pull Request resolved: #3590 Reviewed By: huydhn Differential Revision: D48921925 Pulled By: osalpekar fbshipit-source-id: 2920e4d984ffb0c1957dae6fb6bab10344e65ff7
Configuration menu - View commit details
-
Copy full SHA for af79867 - Browse repository at this point
Copy the full SHA af79867View commit details
Commits on Sep 2, 2023
-
Moving to More Efficient Windows GPU Runner (#3587)
Summary: Pull Request resolved: #3587 Reviewed By: huydhn Differential Revision: D48847224 Pulled By: osalpekar fbshipit-source-id: 15ccae56f0e5e8e366b5feb66fdb42bb5cba97e1
Configuration menu - View commit details
-
Copy full SHA for e057d7d - Browse repository at this point
Copy the full SHA e057d7dView commit details
Commits on Sep 4, 2023
-
Summary: Pull Request resolved: #3593 Reviewed By: nateanl Differential Revision: D48933041 Pulled By: mthrok fbshipit-source-id: cd05d3cf5006206ba441fdc05548bcd922ce0598
Configuration menu - View commit details
-
Copy full SHA for 3e7e696 - Browse repository at this point
Copy the full SHA 3e7e696View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6fbc1e6 - Browse repository at this point
Copy the full SHA 6fbc1e6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d37f69 - Browse repository at this point
Copy the full SHA 7d37f69View commit details -
[BC-Breaking] Remove legacy global backend switch (#3559)
Summary: This PR removes the legacy backend switch mechanism. The implementation itself is still available. Merge after v2.1 release Pull Request resolved: #3559 Reviewed By: nateanl Differential Revision: D48353764 Pulled By: mthrok fbshipit-source-id: 4d3924dbe6f334ecebe2b12fcd4591c61c4aa656
Configuration menu - View commit details
-
Copy full SHA for 454418d - Browse repository at this point
Copy the full SHA 454418dView commit details
Commits on Sep 5, 2023
-
Fix backward compatibility layer in backend module (#3595)
Summary: The PR #3549 re-organized the backend implementations and deprecated the direct access to torchaudio.backend. The change was supposed to be BC-compatible while issuing a warning to users, but the implementation of module-level `__getattr__` was not quite right. See an issue pyannote/pyannote-audio#1456. This commit fixes it so that the following imports work; ```python from torchaudio.backend.common import AudioMetaData from torchaudio.backend import sox_io_backend from torchaudio.backend.sox_io_backend import save, load, info from torchaudio.backend import no_backend from torchaudio.backend.no_backend import save, load, info from torchaudio.backend import soundfile_backend from torchaudio.backend.soundfile_backend import save, load, info ``` Pull Request resolved: #3595 Reviewed By: nateanl Differential Revision: D48957446 Pulled By: mthrok fbshipit-source-id: ebb256461dd3032025fd27d0455ce980888f7778
Configuration menu - View commit details
-
Copy full SHA for 931598c - Browse repository at this point
Copy the full SHA 931598cView commit details -
Update FFmpeg arm64 CI (#3578)
Summary: Pull Request resolved: #3578 Reviewed By: nateanl Differential Revision: D48971591 Pulled By: mthrok fbshipit-source-id: a76cf8f1287c076d7b6919d901eda0ce832bdb7d
Configuration menu - View commit details
-
Copy full SHA for ede4309 - Browse repository at this point
Copy the full SHA ede4309View commit details
Commits on Sep 7, 2023
-
Fix type casting issue in mask length calculation (#3599)
Pingchuan Ma authoredSep 7, 2023 Configuration menu - View commit details
-
Copy full SHA for e756b23 - Browse repository at this point
Copy the full SHA e756b23View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e1d8f3 - Browse repository at this point
Copy the full SHA 3e1d8f3View commit details
Commits on Sep 8, 2023
-
Simplify trainining step in av-asr recipe (#3598)
* Simplify trainining step in av-asr recipe * Run pre-commit
Pingchuan Ma authoredSep 8, 2023 Configuration menu - View commit details
-
Copy full SHA for 5e893d6 - Browse repository at this point
Copy the full SHA 5e893d6View commit details
Commits on Sep 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b4c9aa5 - Browse repository at this point
Copy the full SHA b4c9aa5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4bbf65e - Browse repository at this point
Copy the full SHA 4bbf65eView commit details
Commits on Sep 19, 2023
-
Fix doc nightly doc CI (#3611)
Some changes at matplotlib 3.8.0 rejects torch.Tensor passed to `plot` function.
Configuration menu - View commit details
-
Copy full SHA for ac63c45 - Browse repository at this point
Copy the full SHA ac63c45View commit details -
Configuration menu - View commit details
-
Copy full SHA for 402939e - Browse repository at this point
Copy the full SHA 402939eView commit details -
Add wall implementation for RIR ray tracing (#3612)
Extracted from #3604 Add Wall helper class and C++ unit test
Configuration menu - View commit details
-
Copy full SHA for 94aafd8 - Browse repository at this point
Copy the full SHA 94aafd8View commit details
Commits on Sep 20, 2023
-
replace avsr model used in the tutorial (#3602)
* replace model used in the tutorial * Upload a torchscript model;Remove model components * Update download url * Switch from download_url_to_file to download_asset
Pingchuan Ma authoredSep 20, 2023 Configuration menu - View commit details
-
Copy full SHA for 0ff6d26 - Browse repository at this point
Copy the full SHA 0ff6d26View commit details
Commits on Sep 21, 2023
-
Optimize merge_tokens method (#3615)
Optimizes merge_tokens method as discussed in #3614 Co-authored-by: Eyal Cohen <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0a27bb0 - Browse repository at this point
Copy the full SHA 0a27bb0View commit details
Commits on Sep 23, 2023
-
Configuration menu - View commit details
-
Copy full SHA for f1a9f79 - Browse repository at this point
Copy the full SHA f1a9f79View commit details
Commits on Sep 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 71104e8 - Browse repository at this point
Copy the full SHA 71104e8View commit details -
Disable Windows CPP test (#3613)
The test executable is not found. Need to investigate more later.
Configuration menu - View commit details
-
Copy full SHA for 4dc06ce - Browse repository at this point
Copy the full SHA 4dc06ceView commit details
Commits on Sep 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for acf1c7b - Browse repository at this point
Copy the full SHA acf1c7bView commit details
Commits on Sep 29, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d07aebf - Browse repository at this point
Copy the full SHA d07aebfView commit details -
Remove Python2 style object inheritance (#3597)
Summary: Python3 makes the use of `(object)` in class inheritance unnecessary. Let's modernize our code by eliminating this. Reviewed By: aaronenyeshi Differential Revision: D48957872
Configuration menu - View commit details
-
Copy full SHA for b7791ea - Browse repository at this point
Copy the full SHA b7791eaView commit details
Commits on Oct 2, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ff266b1 - Browse repository at this point
Copy the full SHA ff266b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 92ded61 - Browse repository at this point
Copy the full SHA 92ded61View commit details
Commits on Oct 3, 2023
-
change modality naming inconsistency (visual & video) (#3631)
* Update transforms.py * Update train.py
Configuration menu - View commit details
-
Copy full SHA for 47f502a - Browse repository at this point
Copy the full SHA 47f502aView commit details -
Remove 2d impl from ray tracing (#3633)
* Fix some issues * Remove 2D implementation from ray tracing We only add 3D RIR.
Configuration menu - View commit details
-
Copy full SHA for d9942ba - Browse repository at this point
Copy the full SHA d9942baView commit details
Commits on Oct 5, 2023
-
Add automated doc flow for release (#3634)
Currently, nightly documentation is pushed automatically for each commit and on daily basis. For release, automating it is tricky because we need to ensure that the version number is properly updated. (removing alpha suffix at branch cut time and updating minor version at minor release) For this reason, in release/2.1 we used manual-trigger to build and deploy doc for release. This commit back port it to main. For each release, once the version number is updated, running build_doc job from GHA will deploy the documentation automatically.
Configuration menu - View commit details
-
Copy full SHA for df44d30 - Browse repository at this point
Copy the full SHA df44d30View commit details -
Configuration menu - View commit details
-
Copy full SHA for e3b11a8 - Browse repository at this point
Copy the full SHA e3b11a8View commit details
Commits on Oct 9, 2023
-
Differential Revision: D49965263 Pull Request resolved: #3639
Configuration menu - View commit details
-
Copy full SHA for ec13a81 - Browse repository at this point
Copy the full SHA ec13a81View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2994ce2 - Browse repository at this point
Copy the full SHA 2994ce2View commit details -
Differential Revision: D50036850 Pull Request resolved: #3637
Configuration menu - View commit details
-
Copy full SHA for a8bb397 - Browse repository at this point
Copy the full SHA a8bb397View commit details
Commits on Oct 11, 2023
-
Move libtorchaudio_ffmpeg to dedicated directory
Differential Revision: D50082877 Pull Request resolved: #3646
Configuration menu - View commit details
-
Copy full SHA for 2836a23 - Browse repository at this point
Copy the full SHA 2836a23View commit details -
Configuration menu - View commit details
-
Copy full SHA for 03d1270 - Browse repository at this point
Copy the full SHA 03d1270View commit details
Commits on Oct 12, 2023
-
Move libtorchaudio to dedicated directory
Differential Revision: D50086556 Pull Request resolved: #3648
Configuration menu - View commit details
-
Copy full SHA for e65e472 - Browse repository at this point
Copy the full SHA e65e472View commit details -
Remove FFmpeg compat load/info function
Differential Revision: D50229857 Pull Request resolved: #3652
Configuration menu - View commit details
-
Copy full SHA for 57f7f52 - Browse repository at this point
Copy the full SHA 57f7f52View commit details -
Differential Revision: D50205775 Pull Request resolved: #3651
Configuration menu - View commit details
-
Copy full SHA for d947dee - Browse repository at this point
Copy the full SHA d947deeView commit details -
Simplify the logic to initialize FFmpeg
Differential Revision: D50193749 Pull Request resolved: #3650
Configuration menu - View commit details
-
Copy full SHA for f62367a - Browse repository at this point
Copy the full SHA f62367aView commit details
Commits on Oct 13, 2023
-
Simplify the logic to initialize sox
Differential Revision: D50197331 Pull Request resolved: #3654
Configuration menu - View commit details
-
Copy full SHA for dde08ba - Browse repository at this point
Copy the full SHA dde08baView commit details -
Configuration menu - View commit details
-
Copy full SHA for fa78fb6 - Browse repository at this point
Copy the full SHA fa78fb6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0b6ca28 - Browse repository at this point
Copy the full SHA 0b6ca28View commit details
Commits on Oct 17, 2023
-
The `encoder_layer_norm_first` should be set to True for the Large model of WavLM. Address #3347
Configuration menu - View commit details
-
Copy full SHA for 671261c - Browse repository at this point
Copy the full SHA 671261cView commit details
Commits on Oct 19, 2023
-
Support compression level in i/o dispatcher backend
Differential Revision: D50367721 Pull Request resolved: #3662
Configuration menu - View commit details
-
Copy full SHA for 7ea108f - Browse repository at this point
Copy the full SHA 7ea108fView commit details -
Do not autoremove prototype in release branch (#3663)
So that release engineering team does not have to manually remove prototype-related things in each release
Configuration menu - View commit details
-
Copy full SHA for d00df95 - Browse repository at this point
Copy the full SHA d00df95View commit details -
Configuration menu - View commit details
-
Copy full SHA for b7b61c2 - Browse repository at this point
Copy the full SHA b7b61c2View commit details
Commits on Oct 24, 2023
-
Differential Revision: D50506299 Pull Request resolved: #3669
Configuration menu - View commit details
-
Copy full SHA for a78ba38 - Browse repository at this point
Copy the full SHA a78ba38View commit details -
Differential Revision: D50602496 Pull Request resolved: #3670
Configuration menu - View commit details
-
Copy full SHA for 8342795 - Browse repository at this point
Copy the full SHA 8342795View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1caa3fc - Browse repository at this point
Copy the full SHA 1caa3fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d6e486 - Browse repository at this point
Copy the full SHA 7d6e486View commit details
Commits on Oct 25, 2023
-
Add torio top-level directory. It's not part of the package yet.
Configuration menu - View commit details
-
Copy full SHA for 7c988b4 - Browse repository at this point
Copy the full SHA 7c988b4View commit details -
Differential Revision: D50633306 Pull Request resolved: #3675
Configuration menu - View commit details
-
Copy full SHA for 478a852 - Browse repository at this point
Copy the full SHA 478a852View commit details
Commits on Oct 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3ff5e8c - Browse repository at this point
Copy the full SHA 3ff5e8cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2a0f4c0 - Browse repository at this point
Copy the full SHA 2a0f4c0View commit details -
Swap decoder/encoder implementation
Differential Revision: D50677606 Pull Request resolved: #3681
Configuration menu - View commit details
-
Copy full SHA for 36f5010 - Browse repository at this point
Copy the full SHA 36f5010View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0988724 - Browse repository at this point
Copy the full SHA 0988724View commit details -
Update StreamReader/Writer name
Differential Revision: D50696105 Pull Request resolved: #3682
Configuration menu - View commit details
-
Copy full SHA for fcf3894 - Browse repository at this point
Copy the full SHA fcf3894View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e26515 - Browse repository at this point
Copy the full SHA 6e26515View commit details
Commits on Oct 30, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 90186bc - Browse repository at this point
Copy the full SHA 90186bcView commit details
Commits on Oct 31, 2023
-
Configuration menu - View commit details
-
Copy full SHA for c5b6933 - Browse repository at this point
Copy the full SHA c5b6933View commit details
Commits on Nov 9, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d4cf8d5 - Browse repository at this point
Copy the full SHA d4cf8d5View commit details -
Fix vad to return zero output for zero input (#3685)
When the input is zero Tensor, the result should be empty.
Configuration menu - View commit details
-
Copy full SHA for df2a6bc - Browse repository at this point
Copy the full SHA df2a6bcView commit details -
Make pytest run ignoring the ones with collection error (#3692)
PyTorch lightening is having issue with the nightly PyTorch. Let the other tests still run.
Configuration menu - View commit details
-
Copy full SHA for 4c85f81 - Browse repository at this point
Copy the full SHA 4c85f81View commit details -
Remove global audio backend from test (#3691)
global audio backend is removed thus this is no-op.
Configuration menu - View commit details
-
Copy full SHA for a59e9ab - Browse repository at this point
Copy the full SHA a59e9abView commit details -
Fix release doc deployment (#3693)
Back port from release/2.1 branch. Need to git-fetch source code to get the version number dynamically
Configuration menu - View commit details
-
Copy full SHA for 65df10b - Browse repository at this point
Copy the full SHA 65df10bView commit details
Commits on Nov 10, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 172260f - Browse repository at this point
Copy the full SHA 172260fView commit details -
Configuration menu - View commit details
-
Copy full SHA for ccd78ff - Browse repository at this point
Copy the full SHA ccd78ffView commit details
Commits on Dec 1, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 6518fa9 - Browse repository at this point
Copy the full SHA 6518fa9View commit details
Commits on Jan 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 258169e - Browse repository at this point
Copy the full SHA 258169eView commit details
Commits on Jan 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 639a4e9 - Browse repository at this point
Copy the full SHA 639a4e9View commit details
Commits on Jan 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e3efbc2 - Browse repository at this point
Copy the full SHA e3efbc2View commit details
Commits on Jan 12, 2024
-
Remove AWS credentials on workflows (#3723) (#3723)
Summary: This is not needed anymore after pytorch/test-infra#4865. Reviewed By: malfet, jeanschmidt, clee2000, NicolasHug Differential Revision: D52735187 Pulled By: huydhn
Configuration menu - View commit details
-
Copy full SHA for b2d9c3e - Browse repository at this point
Copy the full SHA b2d9c3eView commit details
Commits on Jan 25, 2024
-
Doc: references for
lfilter
implementation (#3734)* add golf and dynonet paper * doc: add references * add EOF * fix: line too long * remove line end space * remove indentation Co-authored-by: moto <[email protected]> --------- Co-authored-by: moto <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d5a9433 - Browse repository at this point
Copy the full SHA d5a9433View commit details
Commits on Jan 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 02586da - Browse repository at this point
Copy the full SHA 02586daView commit details
Commits on Feb 9, 2024
-
Migrate the macOS runners label from macos-m1-12 to macos-m1-stable
Differential Revision: D53606067 Pull Request resolved: #3740
Configuration menu - View commit details
-
Copy full SHA for 5286f9f - Browse repository at this point
Copy the full SHA 5286f9fView commit details
Commits on Feb 16, 2024
-
Fix document for forced_align method (#3748)
The lengths of targets and log_probs should be reversed.
Configuration menu - View commit details
-
Copy full SHA for b7b7b5d - Browse repository at this point
Copy the full SHA b7b7b5dView commit details
Commits on Feb 27, 2024
-
Remove macos x86 nightly builds
Differential Revision: D54263224 Pull Request resolved: #3751
Configuration menu - View commit details
-
Copy full SHA for 87aeb55 - Browse repository at this point
Copy the full SHA 87aeb55View commit details
Commits on Mar 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1bc1479 - Browse repository at this point
Copy the full SHA 1bc1479View commit details -
Update tacotron2_pipeline_tutorial.py (#3759)
* Update tacotron2_pipeline_tutorial.py - Fixed typo - Clarified what was being done in different sections
Configuration menu - View commit details
-
Copy full SHA for 17a7081 - Browse repository at this point
Copy the full SHA 17a7081View commit details
Commits on Mar 24, 2024
-
Fix windows CI CPU and GPU tests by unpinning mkl version and install…
Configuration menu - View commit details
-
Copy full SHA for ea437b3 - Browse repository at this point
Copy the full SHA ea437b3View commit details
Commits on May 14, 2024
-
[codemod] c10::optional -> std::optional in pytorch/audio/src/libtori…
…o/ffmpeg/stream_reader/stream_processor.h +20 Differential Revision: D57294285 Pull Request resolved: #3792
Configuration menu - View commit details
-
Copy full SHA for b4407e0 - Browse repository at this point
Copy the full SHA b4407e0View commit details -
[codemod] c10::optional -> std::optional in pyspeech/experimental/csr…
…c/decoders/TransducerDecoder.h +20 Differential Revision: D57294284 Pull Request resolved: #3793
Configuration menu - View commit details
-
Copy full SHA for 9f10306 - Browse repository at this point
Copy the full SHA 9f10306View commit details -
[codemod] c10::optional -> std::optional in pytorch/audio/src/libtorc…
…haudio/sox/effects.cpp +20 Differential Revision: D57294298 Pull Request resolved: #3791
Configuration menu - View commit details
-
Copy full SHA for 1980f8a - Browse repository at this point
Copy the full SHA 1980f8aView commit details
Commits on Jun 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b829e93 - Browse repository at this point
Copy the full SHA b829e93View commit details
Commits on Jun 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 7f6209b - Browse repository at this point
Copy the full SHA 7f6209bView commit details
Commits on Jul 3, 2024
-
Fix model downloading in bento (#3803)
Summary: Pull Request resolved: #3803 The model checkpoint path can not be created for Squim models. Use the latest download_asset method to fix it. Reviewed By: moto-meta Differential Revision: D59061348
Configuration menu - View commit details
-
Copy full SHA for 69b2a0a - Browse repository at this point
Copy the full SHA 69b2a0aView commit details
Commits on Jul 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b62dc24 - Browse repository at this point
Copy the full SHA b62dc24View commit details