Replace runners prefix amz2023. #3818

Open
wants to merge 1,144 commits into
base: master
This pull request is big! We’re only showing the most recent 250 commits.

Commits on May 23, 2023

  1. [BugFix] Fix extract_features method for WavLM models (#3350)

    Summary:
    resolve #3347
    
    `position_bias` is ignored in the `extract_features` method. This doesn't affect Wav2Vec2 or HuBERT models, but it changes the output of the transformer layers (except the first layer) in WavLM models. This PR fixes it by adding `position_bias` to the method.
    
    Pull Request resolved: #3350
    
    Reviewed By: mthrok
    
    Differential Revision: D46112148
    
    Pulled By: nateanl
    
    fbshipit-source-id: 3d21aa4b32b22da437b440097fd9b00238152596
    nateanl authored and facebook-github-bot committed May 23, 2023
    Commit 7d0f336

Commits on May 24, 2023

  1. Resolve lint issue on LaTeX (#3366)

    Summary: Pull Request resolved: #3366
    
    Reviewed By: nateanl
    
    Differential Revision: D46136238
    
    Pulled By: mthrok
    
    fbshipit-source-id: 3432f5d007293831bab21460a79ae26b1bbc81a8
    mthrok authored and facebook-github-bot committed May 24, 2023
    Commit 8690e6e
  2. Remove CUDA 11.7 builds; replace with 11.8 (#3360)

    Summary:
    CC atalman malfet
    
    Pull Request resolved: #3360
    
    Reviewed By: mthrok
    
    Differential Revision: D46150898
    
    Pulled By: atalman
    
    fbshipit-source-id: 985a0ef69406f48fb15f239d6b16616c0a5379f5
    ptrblck authored and facebook-github-bot committed May 24, 2023
    Commit 5a6f4eb
  3. Amend commit to gh-pages branch (#3345)

    Summary:
    This commit changes the way the documentation is pushed.
    It amends the existing commit instead of adding a new one.
    
    Currently, each commit in gh-pages contains about 100 MB of data. The gh-pages branch is fetched by default by `git clone`, so the size of the torchaudio repo grows significantly.
    
    Pull Request resolved: #3345
    
    Reviewed By: nateanl
    
    Differential Revision: D46136612
    
    Pulled By: mthrok
    
    fbshipit-source-id: 39479ee5d1a6888254ef50f0db252453d976d183
    mthrok authored and facebook-github-bot committed May 24, 2023
    Commit a79cf3b
  4. Update smoke test (#3346)

    Summary:
    * Delay the import of torchaudio until the CLI options are parsed.
    * Add option to set log level to DEBUG so that it's easy to see the issue with external libraries.
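
A minimal sketch of the delayed-import pattern described above. The `--debug` and `--module` option names are hypothetical, and `json` stands in for torchaudio so that the sketch runs anywhere; the actual smoke test script may differ.

```python
import argparse
import importlib
import logging


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Smoke test sketch")
    # Hypothetical flag: switch the log level to DEBUG.
    parser.add_argument("--debug", action="store_true", help="Set log level to DEBUG")
    # Hypothetical option: module to import (a stand-in for torchaudio).
    parser.add_argument("--module", default="json")
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)
    # Configure logging first so that messages emitted during import are visible.
    level = logging.DEBUG if args.debug else logging.INFO
    logging.basicConfig(level=level, force=True)
    # Import only after the CLI options have been parsed.
    module = importlib.import_module(args.module)
    return module.__name__, logging.getLogger().getEffectiveLevel()
```

Because the import happens after logging is configured, any debug messages emitted while the module loads its external libraries are captured.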
    
    Pull Request resolved: #3346
    
    Reviewed By: nateanl
    
    Differential Revision: D46022546
    
    Pulled By: mthrok
    
    fbshipit-source-id: 9f988bbd770c2fd2bb260c3cfe02b238a9da2808
    mthrok authored and facebook-github-bot committed May 24, 2023
    Commit 71b2634
  5. Fix build doc (#3349)

    Summary:
    Follow-up #3045
    - Revert the removal of HW acceleration doc
    - comment out FFmpeg CLI test run
    
    Pull Request resolved: #3349
    
    Reviewed By: nateanl
    
    Differential Revision: D46121899
    
    Pulled By: mthrok
    
    fbshipit-source-id: dfc030a69f05addec73637cfb6a720c184e37323
    mthrok authored and facebook-github-bot committed May 24, 2023
    Commit 8b85ca5
  6. Add StreamReader/Writer custom IO to doc (#3367)

    Summary: Pull Request resolved: #3367
    
    Reviewed By: nateanl
    
    Differential Revision: D46148139
    
    Pulled By: mthrok
    
    fbshipit-source-id: 50f297ac69bb95562976eb452e4e382b8c064c3c
    mthrok authored and facebook-github-bot committed May 24, 2023
    Commit f41ba26

Commits on May 25, 2023

  1. Add LRS3 AV-ASR recipe (#3278)

    Summary:
    This PR adds AV-ASR recipe which contains sample implementations of training and evaluation pipelines for RNNT based automatic, visual, and audio-visual (ASR, VSR, AV-ASR) models on LRS3. This repository includes both streaming/non-streaming modes.
    
    CC stavros99 xiaohui-zhang YumengTao mthrok nateanl hwangjeff
    
    Pull Request resolved: #3278
    
    Reviewed By: nateanl
    
    Differential Revision: D46121550
    
    Pulled By: mpc001
    
    fbshipit-source-id: bb44b97ae25e87df2a73a707008be46af4ad0fc6
    Pingchuan Ma authored and facebook-github-bot committed May 25, 2023
    Commit c6624fa

Commits on May 26, 2023

  1. Improve RNN-T streaming decoding (#3295)

    Summary:
    This commit fixes the following issues affecting streaming decoding quality:
    1. The `init_b` hypothesis is only regenerated from the blank token if no initial hypotheses are provided.
    2. The decoder can now receive the top-K hypotheses to continue decoding from, instead of using just the top hypothesis at each decoding step. This dramatically affects decoding quality, especially for speech with long pauses and disfluencies.
    3. Some minor errors regarding shape checking for length.
    
    This also means that the resulting output is the entire transcript up until that time step, instead of just the incremental change in transcript.
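
A toy sketch of why carrying the top-K hypotheses across chunks can matter. This is not the `RNNTBeamSearch` implementation; the hypothesis representation, the scores, and the helper names are all made up for illustration.

```python
import heapq


def keep_top_k(hyps, k):
    """Keep the k hypotheses with the highest score. Each hypothesis is (tokens, score)."""
    return heapq.nlargest(k, hyps, key=lambda h: h[1])


def decode_chunk(initial_hyps, continuations, k):
    """Extend every initial hypothesis by one token and keep the k best results."""
    extended = []
    for tokens, score in initial_hyps:
        for token, token_score in continuations[tokens[-1]]:
            extended.append((tokens + (token,), score + token_score))
    return keep_top_k(extended, k)


# Hypotheses after the first chunk: "a" is slightly ahead of "b".
first_chunk = [(("a",), -1.0), (("b",), -1.2)]
# Made-up scores for the second chunk: "b" has a much better continuation.
continuations = {"a": [("x", -3.0)], "b": [("y", -0.5)]}

from_top_1 = decode_chunk(keep_top_k(first_chunk, 1), continuations, k=1)
from_top_2 = decode_chunk(keep_top_k(first_chunk, 2), continuations, k=1)
```

Continuing from only the top hypothesis commits to `("a", "x")`, while continuing from the top two lets `("b", "y")` win. Each result holds the full token sequence so far rather than only the newly added token.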
    
    Pull Request resolved: #3295
    
    Reviewed By: nateanl
    
    Differential Revision: D46216113
    
    Pulled By: hwangjeff
    
    fbshipit-source-id: 8f7efae28dcca4a052f434ca55a2795c9e5ec0b0
    lakshmi-speak authored and facebook-github-bot committed May 26, 2023
    Commit 9fc0dca
  2. Revert "Upgrade to FFmpeg5 (#3298)" (#3377)

    Summary:
    This reverts commit d38a785.
    
    This is a temporary revert to unblock the unit test migration from CircleCI to GitHub.
    
    Pull Request resolved: #3377
    
    Reviewed By: mthrok
    
    Differential Revision: D46230498
    
    Pulled By: atalman
    
    fbshipit-source-id: 000d8a9ca00750fc1ca61f4c2cdd6e930a5ce46d
    atalman authored and facebook-github-bot committed May 26, 2023
    Commit 37779ef
  3. Temporarily remove test for extract_features (#3378)

    Summary:
    The tests failed for several bundles. This commit removes them; they will be re-added once the root cause is figured out.
    
    Pull Request resolved: #3378
    
    Reviewed By: atalman
    
    Differential Revision: D46230884
    
    Pulled By: nateanl
    
    fbshipit-source-id: 42056a29b2ec2335268b273d3e37fb517035be92
    nateanl authored and facebook-github-bot committed May 26, 2023
    Commit 05649ca
  4. Use cuda 11.8 for circleci tests (#3381)

    Summary:
    Use cuda 11.8 for circleci tests.
    11.7 was deprecated
    
    Pull Request resolved: #3381
    
    Reviewed By: osalpekar
    
    Differential Revision: D46236223
    
    Pulled By: atalman
    
    fbshipit-source-id: 6d6a8e09603807a07241f31c1bd1e6d3a2b67d9d
    atalman authored and facebook-github-bot committed May 26, 2023
    Commit 5c0249b
  5. Use the same CUDNN version on Windows as PyTorch (#3380)

    Summary:
    11.7 uses 8.5.0; 11.8 uses 8.7.0; 12.1 uses 8.8.1. Otherwise, the Windows vision job (8.5.0) would overwrite the cuDNN version set up by PyTorch (8.7.0), leading to flaky failures such as https://github.com/pytorch/pytorch/actions/runs/5088860652/jobs/9146641450
    
    ```
    RuntimeError: cuDNN version incompatibility: PyTorch was compiled  against (8, 7, 0) but found runtime version (8, 5, 0). PyTorch already comes bundled with cuDNN. One option to resolving this error is to ensure PyTorch can find the bundled cuDNN.
    ```
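
The version pairing above can be written down as a small lookup. This is only an illustrative sketch: the table restates the versions listed in the summary, and `CUDNN_BY_CUDA` and `is_compatible` are hypothetical names, not part of the CI scripts.

```python
# cuDNN version paired with each CUDA version, as listed in the commit summary.
CUDNN_BY_CUDA = {
    "11.7": (8, 5, 0),
    "11.8": (8, 7, 0),
    "12.1": (8, 8, 1),
}


def is_compatible(cuda_version, runtime_cudnn):
    """Return True if the runtime cuDNN matches the version paired with this CUDA version."""
    return CUDNN_BY_CUDA[cuda_version] == runtime_cudnn
```

With this table, a CUDA 11.8 job that finds cuDNN `(8, 5, 0)` at run time is flagged as a mismatch, which is the situation reported in the error message.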
    
    Pull Request resolved: #3380
    
    Reviewed By: atalman
    
    Differential Revision: D46236286
    
    Pulled By: huydhn
    
    fbshipit-source-id: 9ca12d5068c3029688347d52c5c284488f33728d
    huydhn authored and facebook-github-bot committed May 26, 2023
    Commit c120f31
  6. Fix encoding g722 format (#3373)

    Summary:
    The g722 format only supports 16 kHz, but AVCodec does not list this. The implementation does not insert resampling, so the resulting audio can be slowed down or sped up.
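
One way such a mismatch could be detected, sketched under assumptions: the table below is hand-written for illustration (the summary notes that AVCodec does not report the rate), and the helper names are hypothetical rather than taken from the torchaudio source.

```python
# Assumption: a hand-maintained table of codecs that accept a single sample rate.
FIXED_SAMPLE_RATES = {"g722": 16000}


def target_sample_rate(codec, input_rate):
    """Return the sample rate the encoder should receive for the given codec."""
    return FIXED_SAMPLE_RATES.get(codec, input_rate)


def needs_resampling(codec, input_rate):
    """True when the input must be resampled before it reaches the encoder."""
    return target_sample_rate(codec, input_rate) != input_rate
```

For example, 44.1 kHz input headed for g722 would need resampling to 16 kHz, while 16 kHz input would pass through unchanged.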
    
    Pull Request resolved: #3373
    
    Reviewed By: hwangjeff
    
    Differential Revision: D46233181
    
    Pulled By: mthrok
    
    fbshipit-source-id: 902b3f862a8f7269dc35bc871e868b0e78326c6c
    mthrok authored and facebook-github-bot committed May 26, 2023
    Commit 1b05ca7

Commits on May 27, 2023

  1. Fix AudioEffector for mulaw (#3372)

    Summary:
    When encoding audio with mulaw, the resulting data does not have a header, and the StreamReader defaults to 16 kHz, which can stretch or shrink the resulting waveform.
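
The stretch or shrink follows directly from interpreting headerless samples at the wrong rate. A small arithmetic sketch (the function names are illustrative only):

```python
def decoded_duration(num_samples, assumed_rate):
    """Duration in seconds when samples are interpreted at assumed_rate."""
    return num_samples / assumed_rate


def stretch_factor(true_rate, assumed_rate):
    """Ratio of the decoded duration to the true duration."""
    return true_rate / assumed_rate
```

One second of audio recorded at 8 kHz (8000 samples) decoded at the 16 kHz default lasts 0.5 seconds, while one second recorded at 48 kHz lasts 3 seconds.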
    
    Pull Request resolved: #3372
    
    Reviewed By: hwangjeff
    
    Differential Revision: D46234772
    
    Pulled By: mthrok
    
    fbshipit-source-id: 942c89a8cfe29b0b6f57b3e5b6c9dfd3524ca552
    mthrok authored and facebook-github-bot committed May 27, 2023
    Commit af932cc

Commits on May 29, 2023

  1. [Nova] Windows CPU Unittests on Nova (#3329)

    Summary:
    Continuing with the job migrations from CCI to Nova, this PR introduces the Windows CPU Unittest job as a Nova workflow.
    
    The job is passing: https://github.com/pytorch/audio/actions/runs/5094569687/jobs/9159020192?pr=3329.
    
    Pull Request resolved: #3329
    
    Reviewed By: huydhn
    
    Differential Revision: D46265649
    
    Pulled By: atalman
    
    fbshipit-source-id: 7659dfbcc8ad400f2e109ff64530e1f768e82ef9
    osalpekar authored and facebook-github-bot committed May 29, 2023
    Commit 6425d46

Commits on May 30, 2023

  1. Simplify sox namespace (#3383)

    Summary:
    Pull Request resolved: #3383
    
    This commit reduces the `torchaudio::sox_*` namespaces into `torchaudio::sox`.
    It also puts the Pybind11 registration and TorchBind registration into an anonymous namespace.
    
    Differential Revision: D46257367
    
    fbshipit-source-id: 0f0f181eaa72036916e223263daf4b7c298fca0d
    mthrok authored and facebook-github-bot committed May 30, 2023
    Commit a81b0ed
  2. Use const reference (#3389)

    Summary:
    Pull Request resolved: #3389
    
    Use const references in more places in the sox source code.
    
    Differential Revision: D46264068
    
    fbshipit-source-id: 809d34a6e16f621c856d4278ef7ce45a5868a717
    mthrok authored and facebook-github-bot committed May 30, 2023
    Commit 9cdf26f
  3. Disable failing GPU unit test (#3384)

    Summary:
    Disable failing GPU unit test.
    See associated issue: #3376
    
    Pull Request resolved: #3384
    
    Reviewed By: mthrok
    
    Differential Revision: D46279324
    
    Pulled By: atalman
    
    fbshipit-source-id: 3a606bb992e0261451f48d1fb458e054f7fd5583
    atalman authored and facebook-github-bot committed May 30, 2023
    Commit caf3ac0

Commits on May 31, 2023

  1. Fixes to #3295 Improve RNN-T streaming decoding (#3379)

    Summary:
    Pull Request resolved: #3379
    
    Fixes `RNNTBeamSearch.infer`'s docstring and removes unused import from tutorial.
    
    Reviewed By: mthrok
    
    Differential Revision: D46227174
    
    fbshipit-source-id: 7c1c3f05a6476cb0437622dea6f3ae6cb3ea9468
    hwangjeff authored and facebook-github-bot committed May 31, 2023
    Commit b8016e4
  2. Windows GPU workflows (#3364)

    Summary:
    Windows GPU workflows
    
    Pull Request resolved: #3364
    
    Reviewed By: mthrok
    
    Differential Revision: D46292403
    
    Pulled By: atalman
    
    fbshipit-source-id: ee3c6f8082ca77bdc1ffdb930c59fa5a9cb25a4a
    atalman authored and facebook-github-bot committed May 31, 2023
    Commit 92d0fb5
  3. [Nova] Deprecate windows circleci unit tests (#3393)

    Summary:
    Nova - Deprecate windows circleci unit tests
    
    Pull Request resolved: #3393
    
    Reviewed By: malfet
    
    Differential Revision: D46315608
    
    Pulled By: atalman
    
    fbshipit-source-id: 3d7b5d0618b9d2e12e5f97e21d7becdc61d85c69
    atalman authored and facebook-github-bot committed May 31, 2023
    Commit c5d3706
  4. Surface test failures on CI (#3394)

    Summary:
    Write the JUnit XML test report to the directory that test-infra reads, so that test failures appear in the job summary.
    
    Example: https://github.com/pytorch/audio/actions/runs/5136305988
    
    Pull Request resolved: #3394
    
    Differential Revision: D46328832
    
    Pulled By: mthrok
    
    fbshipit-source-id: f0b5020a911ca4ec09345a965bdec769300859f0
    mthrok authored and facebook-github-bot committed May 31, 2023
    Commit 2283df8
  5. [Nova] Lint on GHA (#3341)

    Summary:
    See title. If all is well, we can deprecate the CCI job in a few days.
    
    Pull Request resolved: #3341
    
    Reviewed By: mthrok
    
    Differential Revision: D46324265
    
    Pulled By: osalpekar
    
    fbshipit-source-id: bc706c6ae4285d4085dc5f0223ea41d8fc290f1c
    osalpekar authored and facebook-github-bot committed May 31, 2023
    Commit 5d0697b
  6. [Nova] Stylechecks on Nova (#3390)

    Summary:
    Introducing the stylecheck job on Nova. It seems like it is failing on trunk, but the functionality of this job itself is working and it fails with the same error as it does on trunk with CCI.
    
    Pull Request resolved: #3390
    
    Reviewed By: mthrok
    
    Differential Revision: D46324223
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 1324202e53569d610559ef6f1b90cb5c364e6909
    osalpekar authored and facebook-github-bot committed May 31, 2023
    Commit f7cb6c6

Commits on Jun 1, 2023

  1. Delete CCI Linux and MacOS Unittest Jobs (#3391)

    Summary:
    Deprecates the Linux and MacOS Unittest jobs now that they've been running on Nova for over a week.
    
    Aside: There was also a stylecheck job that was dependent on the Linux Unittest job. I also put up #3390 to move that stylecheck job to Nova. I'm happy to reintroduce the CCI stylecheck job standalone in CCI if we want the Nova version to run on main for a week.
    
    Pull Request resolved: #3391
    
    Reviewed By: mthrok
    
    Differential Revision: D46324198
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 2115748e153c5dee1a38db2b6230acebc4f56927
    osalpekar authored and facebook-github-bot committed Jun 1, 2023
    Commit d5d94b7
  2. Update and deprecate apply_codec function (#3386)

    Summary:
    To prepare for the upcoming removal of file-like object support from sox_io backend,
    this commit changes apply_codec function to use tempfile.
    
    The `apply_codec` function is now deprecated and users are encouraged to use `torchaudio.io.AudioEffector`.
    We will not remove the function itself, but will remove the entry from the doc.
    
    Pull Request resolved: #3386
    
    Reviewed By: hwangjeff
    
    Differential Revision: D46330610
    
    Pulled By: mthrok
    
    fbshipit-source-id: 3071bdefa05b4cbb9f00629bef50f0981eae89b4
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit d6dd497
  3. Refactor arg mapping in ffmpeg save function (#3387)

    Summary:
    The arguments of TorchAudio's save function ("format", "bits_per_sample" and "encoding")
    do not map one-to-one to the arguments of FFmpeg encoding.
    
    For example, to use vorbis codec, FFmpeg expects "ogg" container/extension with "vorbis"
    encoder. It does not recognize "vorbis" extension like TorchAudio (libsox) does.
    
    This commit refactors the logic to parse/map the arguments.
    
    As a result, it now works properly with the vorbis and mp3 extensions.
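
A minimal sketch of the kind of mapping described above, covering only the vorbis case that the summary spells out. The function name and the fallback behaviour are assumptions for illustration, not the refactored torchaudio code.

```python
def map_format(fmt):
    """Map a TorchAudio-style format name to a (container, encoder) pair.

    Only the vorbis case comes from the commit summary. The fallback, which
    uses the format name as the container and leaves the encoder unset, is
    an assumption made for this sketch.
    """
    if fmt == "vorbis":
        # FFmpeg expects the "ogg" container with the "vorbis" encoder.
        return ("ogg", "vorbis")
    return (fmt, None)
```

Keeping the translation in one function means the rest of the save path only ever deals with names FFmpeg understands.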
    
    Pull Request resolved: #3387
    
    Reviewed By: hwangjeff
    
    Differential Revision: D46328787
    
    Pulled By: mthrok
    
    fbshipit-source-id: 36f993952a062bfec58a8b51be6aa86297571f90
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit b99e5f4
  4. Fix apply_codec to use named file (#3397)

    Summary:
    Follow-up to #3386. The intended change was to use the path of a temporary file instead of a file-like object.
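
The difference between passing a file-like object and passing a path can be sketched as follows. `save_to_path` and `load_from_path` are hypothetical stand-ins for a backend that only accepts paths; they are not torchaudio functions, and a temporary directory is used here instead of a named temporary file to keep the sketch portable.

```python
import os
import tempfile


def save_to_path(path, data):
    """Hypothetical stand-in for a backend function that only accepts a file path."""
    with open(path, "wb") as f:
        f.write(data)


def load_from_path(path):
    """Hypothetical stand-in for the matching path-based loader."""
    with open(path, "rb") as f:
        return f.read()


def roundtrip(data, suffix=".wav"):
    """Write data to a temporary file and read it back, passing only the path."""
    with tempfile.TemporaryDirectory() as tmpdir:
        path = os.path.join(tmpdir, "audio" + suffix)
        save_to_path(path, data)
        return load_from_path(path)
```

The backend never sees a Python file object, so it keeps working once file-like object support is removed.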
    
    Pull Request resolved: #3397
    
    Reviewed By: hwangjeff
    
    Differential Revision: D46346189
    
    Pulled By: mthrok
    
    fbshipit-source-id: 44da799c6587bcb63a118a6313b7299bad742a40
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit 1dfac46
  5. Fix style issue (#3398)

    Summary: Pull Request resolved: #3398
    
    Reviewed By: nateanl
    
    Differential Revision: D46354862
    
    Pulled By: mthrok
    
    fbshipit-source-id: b86dcdfeff8ed9db87b0b78eca20f6f18117e97e
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit c7ac1af
  6. [Nova] Deleting Remaining CircleCI jobs (#3399)

    Summary:
    This job completely deletes the CircleCI `config.yml`. Here is what was remaining in the config at the point of deletion:
    
    Used Jobs:
    * **Lint** - Now running on Nova - see https://github.com/pytorch/audio/actions/runs/5144082942 for an example run on the latest PR in trunk
    * **CircleCI Consistency** - No longer needed now that there is no CCI config.
    
    Unused Jobs:
    * **build-ffmpeg-$OS** - For the build jobs, we are already building FFMPEG from source as part of the Nova workflows.
    * **download-third-parties** - This is caching. We currently do not have caching in Nova jobs, but atalman is working on adding support for this as a future optimization.
    
    Pull Request resolved: #3399
    
    Reviewed By: mthrok
    
    Differential Revision: D46363921
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 8abf5b0c1612c3492908fb2f5797e6b0a3c70766
    osalpekar authored and facebook-github-bot committed Jun 1, 2023
    Commit cc89f74
  7. [BC-breaking] Remove file-like object support from sox_io backend (#3035)

    Summary:
    This commit removes file-like object support so that we can remove the custom patch.
    
    The motivation and plan is outlined in #2950.
    
    Pull Request resolved: #3035
    
    Reviewed By: hwangjeff
    
    Differential Revision: D44695647
    
    Pulled By: mthrok
    
    fbshipit-source-id: 13af0234e288c041bc7b490e1f967f85ce7eb8ec
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit bc54ac8
  8. Use dlopen for FFmpeg (#3353)

    Summary:
    This commit changes the way the FFmpeg extension is built and used.
    Instead of linking (LGPL) FFmpeg libraries to torchaudio at build time,
    it uses dlopen to search for and link them at run time.
    
    For dlopen-ing, we use PyTorch's `at::DynamicLibrary` class, which provides
    a portable wrapper.
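
The idea of resolving a library at run time rather than at build time can be illustrated with Python's `ctypes`. This is an analogy only: torchaudio does this in C++ through `at::DynamicLibrary`, and the `try_load` helper below is a hypothetical name.

```python
import ctypes
import ctypes.util


def try_load(name):
    """Try to locate and load a shared library at run time.

    Returns the loaded library handle, or None if the library cannot be
    found or loaded, so the caller can degrade gracefully instead of failing
    at import time.
    """
    path = ctypes.util.find_library(name)
    if path is None:
        return None
    try:
        return ctypes.CDLL(path)
    except OSError:
        return None
```

Calling `try_load("avcodec")` returns a handle only when an FFmpeg build is installed on the machine; otherwise it returns `None` and the program can continue without the FFmpeg features.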
    
    Pull Request resolved: #3353
    
    Differential Revision: D46059199
    
    Pulled By: mthrok
    
    fbshipit-source-id: 4493a5fd8a4c802178d20276522f5334d637307d
    mthrok authored and facebook-github-bot committed Jun 1, 2023
    Commit b14ced1

Commits on Jun 2, 2023

  1. Revert D46059199: [audio][PR] Use dlopen for FFmpeg

    Differential Revision:
    D46059199
    
    Original commit changeset: 4493a5fd8a4c
    
    Original Phabricator Diff: D46059199
    
    fbshipit-source-id: 71cde3f8cd870d1ad9114e3e87cdd1ba564441c0
    mthrok authored and facebook-github-bot committed Jun 2, 2023
    Commit ab7a39f
  2. Update data augmentation tutorial (#3375)

    Summary:
    Replace sox_effects with `torchaudio.io.AudioEffector`
    
    1. To showcase the new and better feature
    2. To prepare for the upcoming removal of file-like object support
    
    Pull Request resolved: #3375
    
    Reviewed By: nateanl
    
    Differential Revision: D46379016
    
    Pulled By: mthrok
    
    fbshipit-source-id: 70f24b62494204949f327f6ac6c49f315c9ee315
    mthrok authored and facebook-github-bot committed Jun 2, 2023
    Commit 2ba36b4
  3. [BC-Breaking] Remove compute_kaldi_pitch (#3368)

    Summary:
    This commit removes compute_kaldi_pitch function and the underlying Kaldi integration from torchaudio.
    
    Kaldi pitch function was added in a short period of time by integrating the original Kaldi implementation, instead of reimplementing it in PyTorch.
    
    The Kaldi integration employed a hack which replaces the base vector/matrix implementation of Kaldi with PyTorch Tensor so that there is only one blas library within torchaudio.
    
    We are making torchaudio leaner, and we don't see wide adoption of the kaldi_pitch feature, so we decided to remove it.
    
    See some of the discussion in #1269.
    
    Pull Request resolved: #3368
    
    Differential Revision: D46406176
    
    Pulled By: mthrok
    
    fbshipit-source-id: ee5e24d825188f379979ddccd680c7323b119b1e
    mthrok authored and facebook-github-bot committed Jun 2, 2023
    Commit 5bbbb1d

Commits on Jun 3, 2023

  1. [audio][PR] Add option to dlopen FFmpeg libraries (#3402)

    Summary:
    Pull Request resolved: #3402
    
    This is a second attempt of #3353.
    
    The basic logic to enable dlopen for FFmpeg libraries is the same.
    It uses `at::DynamicLibrary`, which allows compiling torchaudio without
    linking FFmpeg libraries.
    
    This time, an option to enable this feature, DLOPEN_FFMPEG, has been added,
    so that users have a way to disable this feature and keep using build-time
    linking.
    
    Please refer to stub.h for more technical detail.
    
    Differential Revision: D46403783
    
    fbshipit-source-id: ca3db57ff6bdc50c8c225d22f12f3e76c6dc3f16
    mthrok authored and facebook-github-bot committed Jun 3, 2023
    Commit b7d3e89

Commits on Jun 4, 2023

  1. Update HuBERT/SSL training recipes to support Lightning 2.x (#3396)

    Summary:
    There are some BC-breaking changes from the pytorch_lightning library to the lightning library. This PR adapts the recipes to those changes to support the latest lightning library.
    
    Pull Request resolved: #3396
    
    Reviewed By: mthrok
    
    Differential Revision: D46345206
    
    Pulled By: nateanl
    
    fbshipit-source-id: 59469c15dc5fe5466a99a5b5380eb4f98c2c633f
    nateanl authored and facebook-github-bot committed Jun 4, 2023
    Commit e908357

Commits on Jun 5, 2023

  1. Clean-up ComputeKaldiPitch residue (#3403)

    Summary:
    Follow up of: #3368
    
    Remove files and lines no longer used.
    
    Pull Request resolved: #3403
    
    Differential Revision: D46441462
    
    Pulled By: mthrok
    
    fbshipit-source-id: 11b881ec4b24fa0d625c6aee9f4bd91f637f9923
    mthrok authored and facebook-github-bot committed Jun 5, 2023
    Commit c076d1a

Commits on Jun 6, 2023

  1. Update forced_align method to only support batch Tensors (#3365)

    Summary:
    Pull Request resolved: #3365
    
    The current design of forced_align accepts a 2D Tensor for `log_probs` and a 1D Tensor for `targets`. To keep the API simple, this PR changes it to only support batch Tensors (a 3D Tensor for `log_probs` and a 2D Tensor for `targets`).
    
    Reviewed By: vineelpratap
    
    Differential Revision: D46126226
    
    fbshipit-source-id: 42cb52b19d91bbff7dc040ccf60350545d75b3a2
    nateanl authored and facebook-github-bot committed Jun 6, 2023
    Commit 5f17d81
  2. Revert D46126226: Update forced_align method to only support batch Tensors

    Differential Revision:
    D46126226
    
    Original commit changeset: 42cb52b19d91
    
    Original Phabricator Diff: D46126226
    
    fbshipit-source-id: 372b2526d9e196e37e014f1556bf117d29bb1ac6
    mthrok authored and facebook-github-bot committed Jun 6, 2023
    Commit bbc13b9
  3. [Nova] Remove unused files (#3409)

    Summary:
    We are using Project Nova workflows now. These are not required.
    
    Same as: pytorch/vision#7656
    
    Pull Request resolved: #3409
    
    Reviewed By: mthrok
    
    Differential Revision: D46494331
    
    Pulled By: atalman
    
    fbshipit-source-id: a642ae55b75482918e0afb7c55dc876bc8356e70
    atalman authored and facebook-github-bot committed Jun 6, 2023
    Commit 23e756a
  4. Fix style issue (#3410)

    Summary: Pull Request resolved: #3410
    
    Differential Revision: D46496786
    
    Pulled By: mthrok
    
    fbshipit-source-id: e517b273c40b340f39ce7db7ab1be1c3eb5f2059
    mthrok authored and facebook-github-bot committed Jun 6, 2023
    Commit 27aa52f

Commits on Jun 7, 2023

  1. Make dlopen ffmpeg default off (#3418)

    Summary:
    To investigate #3411
    
    Pull Request resolved: #3418
    
    Differential Revision: D46535891
    
    Pulled By: mthrok
    
    fbshipit-source-id: b90bba399eb54f9f0ae073bd590cd8a46054ed7e
    mthrok authored and facebook-github-bot committed Jun 7, 2023
    Commit 91db978
  2. Fix style to prep #3414 (#3415)

    Summary: Pull Request resolved: #3415
    
    Differential Revision: D46526437
    
    Pulled By: mthrok
    
    fbshipit-source-id: f78d19c19d7e68f67712412de35d9ed50f47263b
    mthrok authored and facebook-github-bot committed Jun 7, 2023
    Commit 4771677

Commits on Jun 8, 2023

  1. [Nova] Add cache ffmpeg before building (#3417)

    Summary:
    [Nova] Add cache ffmpeg before building
    
    Pull Request resolved: #3417
    
    Reviewed By: mthrok
    
    Differential Revision: D46537892
    
    Pulled By: atalman
    
    fbshipit-source-id: 9f8dc0ecfc305c3b378557d46f89a5d7de67a165
    atalman authored and facebook-github-bot committed Jun 8, 2023
    Commit 5ca03f4
  2. Merge all the lint/style checks to pre-commit hook (#3414)

    Summary: Pull Request resolved: #3414
    
    Differential Revision: D46536717
    
    Pulled By: mthrok
    
    fbshipit-source-id: 505bdcdd1b59ca9fe5afc2c8516a0a821e2b8d7e
    mthrok authored and facebook-github-bot committed Jun 8, 2023
    Commit c3ca256
  3. Optimize Torchaudio Vad (#3382)

    Summary:
    Pull Request resolved: #3382
    
    The voice activity detector function was unoptimized, confusingly written, and buggy.
    
    The optimizations created here allow for the function to run roughly 17x faster.
    The main optimizations were to loop over windows of audio rather than individual audio samples. Reducing the number of copies also helped.
    
    There was an off-by-one error where the array slice referenced was [1: 16001] (for the default settings) instead of [0: 16000].
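
The off-by-one slice and the window-based loop can be illustrated with plain Python lists. This is a sketch, not the torchaudio `vad` code: the non-overlapping windows and the `split_into_windows` helper are assumptions made for the example.

```python
def split_into_windows(samples, window_size):
    """Yield consecutive, non-overlapping windows, starting at index 0."""
    for start in range(0, len(samples), window_size):
        yield samples[start:start + window_size]


samples = list(range(40000))
window_size = 16000

# Buggy slice: skips sample 0 and reads one sample past the intended window.
buggy_first = samples[1:window_size + 1]
# Fixed slice: covers samples 0 through 15999.
fixed_first = samples[0:window_size]

# Looping over windows instead of individual samples needs far fewer iterations.
windows = list(split_into_windows(samples, window_size))
```

Both slices have the same length, which is why the bug is easy to miss; only the start and end samples differ.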
    
    Reviewed By: hwangjeff
    
    Differential Revision: D44749359
    
    fbshipit-source-id: c76c9412e70cdc6fcd527d113603c88f78480558
    KubaRad2 authored and facebook-github-bot committed Jun 8, 2023
    Commit 1e117f5
  4. Clean up CI scripts (#3407)

    Summary:
    - Move the unit test scripts from .circleci to .github
    - Remove docker file for unit test base
    - Use the Conda from Docker image in Linux jobs.
    
    Remaining follow-up items
    
    - Reuse the unittest script in Linux GPU job like done in Linux CPU job.
    
    The unit test script needs to be fixed to be used for Linux GPU job
    in new GHA workflow. Keeping it as a separate follow-up work item.
    
    Pull Request resolved: #3407
    
    Differential Revision: D46498263
    
    Pulled By: mthrok
    
    fbshipit-source-id: d8256717a55bb4257151d819d3b2ebd453601eac
    mthrok authored and facebook-github-bot committed Jun 8, 2023
    f080315
  5. Remove CCI badge from README (#3420)

    Summary:
    CI jobs are migrated from CCI to GHA
    
    Pull Request resolved: #3420
    
    Differential Revision: D46548562
    
    Pulled By: mthrok
    
    fbshipit-source-id: d7e17201e8b256efaa54543e445a0f139aa549b2
    mthrok authored and facebook-github-bot committed Jun 8, 2023
    a7fea8a
  6. Delay the initialization of CUDA tensor converter (#3419)

    Summary:
    The StreamReader decoding process is composed of three steps:
    
    1. Decode the incoming AVPacket into AVFrame
    2. Pass AVFrame through AVFilter to perform post process
    3. Convert the resulting AVFrame
    
    The internals of StreamReader were refactored in #3188 so that the above pipeline is initialized when the output stream is defined, which allows the output stream shape to be retrieved.
    
    For the CPU decoder, this works fine because resizing happens in step 2, and the resulting shape can be retrieved.
    However, this is problematic for the GPU decoder, as resizing is currently done using a GPU decoder option (step 1) and there seems to be no interface to retrieve the output shape. The refactor introduced a regression, which is described in #3405.
    
    AVFilter internally adapts to changes in the input frame size. This commit changes the conversion process to behave similarly, so that it waits until the first frame arrives to finalize the frame shape.
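
The "wait for the first frame" idea can be sketched as a lazily initialized converter. This is a plain-Python illustration under stated assumptions, not the actual C++ StreamReader code: `LazyFrameConverter`, its `shape` attribute, and the nested-list frame representation are all hypothetical.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]  # hypothetical stand-in for a decoded frame (rows of pixels)


class LazyFrameConverter:
    """Hypothetical converter whose output shape is unknown until the first frame."""

    def __init__(self) -> None:
        # No shape is fixed at construction time, unlike an eager design
        # that needs the shape when the output stream is defined.
        self.shape: Optional[Tuple[int, int]] = None

    def convert(self, frame: Frame) -> Frame:
        if self.shape is None:
            # First frame: finalize (height, width) from the actual data.
            self.shape = (len(frame), len(frame[0]))
        # Return a copy to mimic producing a new output buffer.
        return [row[:] for row in frame]


converter = LazyFrameConverter()
shape_before = converter.shape               # None: nothing decoded yet
converter.convert([[0, 0, 0], [0, 0, 0]])    # a 2x3 frame arrives
shape_after = converter.shape                # (2, 3)
```

The trade-off of this design is that the output shape cannot be queried before at least one frame has been decoded.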
    
    Fix #3405
    
    Pull Request resolved: #3419
    
    Differential Revision: D46557505
    
    Pulled By: mthrok
    
    fbshipit-source-id: 46ad2d82c8c30f368ebfbaf6947718a5036c7dc6
    mthrok authored and facebook-github-bot committed Jun 8, 2023
    7dff24c
  7. [Nova] Add cache ffmpeg before building #2 (#3423)

    Summary:
    [Nova] Add cache ffmpeg before building - 2
    Follow up after #3417, need to pass new arguments to test-infra workflows
    
    Pull Request resolved: #3423
    
    Reviewed By: mthrok
    
    Differential Revision: D46559344
    
    Pulled By: atalman
    
    fbshipit-source-id: fa5cccc3bfb052688de4a05cc3b4f37fcbe3a6f5
    atalman authored and facebook-github-bot committed Jun 8, 2023
    25e96f4
  8. Introduce chroma filter bank function (#3395)

    Summary:
    Pull Request resolved: #3395
    
    Adds chroma filter bank function `chroma_filterbank` to `torchaudio.prototype.functional`.
    
    Reviewed By: mthrok
    
    Differential Revision: D46307672
    
    fbshipit-source-id: c5d8104a8bb03da70d0629b5cc224e0d897148d5
    hwangjeff authored and facebook-github-bot committed Jun 8, 2023
    dfd0c5f

Commits on Jun 9, 2023

  1. Fix the input pixel format when using GPU video encoder (#3426)

    Summary:
    StreamWriter's encoding pipeline looks like the following
    
    1. convert tensor to AVFrame
    2. pass AVFrame to AVFilter
    3. pass the resulting AVFrame to AVCodecContext (encoder) and AVFormatContext (muxer)
    
    When dealing with a CUDA tensor, the AVFilter becomes a no-op, as we have not added support for CUDA-compatible filters.
    
    When a CUDA frame is passed, the existing solution passes the software pixel format to AVFilter, which later issues a warning because what AVFilter sees is AV_PIX_FMT_CUDA.
    
    Since the filter itself is a no-op, encoding still functions as expected. This commit fixes the pixel format passed to AVFilter.
    
    See #3317
    
    Pull Request resolved: #3426
    
    Differential Revision: D46562370
    
    Pulled By: mthrok
    
    fbshipit-source-id: ce0131f1e50bcc826ee036fc0f35db2a5162b660
    mthrok authored and facebook-github-bot committed Jun 9, 2023
    30afaa9
  2. Disable HF integration test (#3431)

    Summary:
    The new version of transformers changed the format of pre-trained weights. Fixing it is low-priority for the maintenance team, so we disable the test.
    
    See #3430
    
    Pull Request resolved: #3431
    
    Differential Revision: D46592883
    
    Pulled By: mthrok
    
    fbshipit-source-id: d8f54a281a92cac60c469c48f95345bcf0e959d6
    mthrok authored and facebook-github-bot committed Jun 9, 2023
    f5d7635
  3. Use torch/types.h where possible (#3422)

    Summary: Pull Request resolved: #3422
    
    Differential Revision: D46558184
    
    Pulled By: mthrok
    
    fbshipit-source-id: a775c4fb193496d9b2bf9db7bee186ee23512b99
    mthrok authored and facebook-github-bot committed Jun 9, 2023
    c587715

Commits on Jun 12, 2023

  1. feat: add guard in lfilter for a non-default cuda device (#3432)

    Summary:
    Should resolve #3425
    
    cc mthrok
    
    Pull Request resolved: #3432
    
    Differential Revision: D46656180
    
    Pulled By: mthrok
    
    fbshipit-source-id: 5c534bee2f143ef5cb5e50ec74828012dbcab7e9
    yoyololicon authored and facebook-github-bot committed Jun 12, 2023
    c76d952

Commits on Jun 13, 2023

  1. Fix build doc (#3435)

    Summary: Pull Request resolved: #3435
    
    Reviewed By: nateanl
    
    Differential Revision: D46659362
    
    Pulled By: mthrok
    
    fbshipit-source-id: ffa033ad6759de6fd958b63ac51a4a1153ffb45d
    mthrok authored and facebook-github-bot committed Jun 13, 2023
    0f682c7
  2. [SoX/Flac] disable xmms_plugin dependency (#3436)

    Summary:
    This plugin pulls glib and gtk which breaks the build on some headless systems
    
    Since the plugin is not actually used, it seems right to disable it
    
    This change fixed the build on my system
    
    Pull Request resolved: #3436
    
    Differential Revision: D46683297
    
    Pulled By: mthrok
    
    fbshipit-source-id: 5b1c1eee1929f4a69a1cc6c7d7bb3ed998ec5872
    Kyle Finn authored and facebook-github-bot committed Jun 13, 2023
    58a51b5

Commits on Jun 14, 2023

  1. Add resample option to AudioEffector (#3374)

    Summary:
    Currently, AudioEffector always resamples to the original sample rate. It is more flexible to allow overriding this with any sample rate.
    
    Pull Request resolved: #3374
    
    Differential Revision: D46235358
    
    Pulled By: mthrok
    
    fbshipit-source-id: 39a5d4e38d9b90380da31d0ce9ee8090668b54e4
    mthrok authored and facebook-github-bot committed Jun 14, 2023
    406e9c8

Commits on Jun 15, 2023

  1. Update forced alignment tutorial (#3440)

    Summary:
    * Fix backtrack visualization (the coordinate was off by one).
    * Add note about the simplification and the new align API
    * Explicitly handle SOS and EOS
    
    Pull Request resolved: #3440
    
    Reviewed By: xiaohui-zhang
    
    Differential Revision: D46761282
    
    Pulled By: mthrok
    
    fbshipit-source-id: b0b6c9754674e8e23543e9f002e29b55102c92f8
    mthrok authored and facebook-github-bot committed Jun 15, 2023
    1860169

Commits on Jun 16, 2023

  1. Add LRS3 data preparation (#3421)

    Summary:
    This PR adds a data preparation recipe that uses the ultra face detector to extract full-face video. The resulting video output is then used as input for training and evaluating RNNT-based models for automatic speech recognition (ASR), visual speech recognition (VSR), and audio-visual ASR (AV-ASR) on the LRS3 dataset.
    
    This PR also updates the word error rate (WER) for AV-ASR LRS3 models and improves the code readability.
    
    Pull Request resolved: #3421
    
    Reviewed By: mpc001
    
    Differential Revision: D46799748
    
    Pulled By: mthrok
    
    fbshipit-source-id: 97af3feac0592b240617faaffa4c0ac8cef614a9
    Pingchuan Ma authored and facebook-github-bot committed Jun 16, 2023
    77cdd16

Commits on Jun 21, 2023

  1. Split the CTC forced alignment API tutorial into two tutorials (#3443)

    Summary:
    Splitting the multilingual example part into another tutorial.
    
    Pull Request resolved: #3443
    
    Reviewed By: mthrok
    
    Differential Revision: D46802844
    
    Pulled By: xiaohui-zhang
    
    fbshipit-source-id: a7093053cac8b79d650d4f665db7fde2d8254998
    xiaohui-zhang authored and facebook-github-bot committed Jun 21, 2023
    627c37a
  2. Introduce chroma spectrogram transform (#3427)

    Summary:
    Pull Request resolved: #3427
    
    Adds transform `ChromaSpectrogram` for generating chromagrams from waveforms as well as transform `ChromaScale` for generating chromagrams from linear-frequency spectrograms.
    
    Reviewed By: mthrok
    
    Differential Revision: D46547418
    
    fbshipit-source-id: 250f298b8e11d8cf82f05536c29d51cf8d77a960
    hwangjeff authored and facebook-github-bot committed Jun 21, 2023
    7096829

Commits on Jun 26, 2023

  1. Add more explanation about n_fft (#3442)

    Summary: Pull Request resolved: #3442
    
    Differential Revision: D46797481
    
    Pulled By: mthrok
    
    fbshipit-source-id: 3513037cbb8f2edb70fdab0fec5c7c554a697abe
    mthrok authored and facebook-github-bot committed Jun 26, 2023
    105b77f

Commits on Jun 28, 2023

  1. Follow up on tutorial update (#3449)

    Summary: Pull Request resolved: #3449
    
    Differential Revision: D47094402
    
    Pulled By: mthrok
    
    fbshipit-source-id: 43e6994604f0e6c06a5f19c5e8599e2ce12ae622
    mthrok authored and facebook-github-bot committed Jun 28, 2023
    4a121aa
  2. include a link to index.rst (#3441)

    Summary:
    Include Conformer/Emformer RNN-T ASR/VSR/AV-ASR link to index.rst
    
    Pull Request resolved: #3441
    
    Differential Revision: D47094158
    
    Pulled By: mthrok
    
    fbshipit-source-id: 9ab42ac2bf52a5ce488003897ffba2f10a6ca941
    Pingchuan Ma authored and facebook-github-bot committed Jun 28, 2023
    a8ce4a8

Commits on Jul 3, 2023

  1. Update README (#3434)

    Summary:
    Pull Request resolved: #3434
    
    Add one bullet point for `torchaudio.functional` and forced alignment as one example.
    
    Reviewed By: mthrok
    
    Differential Revision: D46658058
    
    fbshipit-source-id: 6e037b7bb6ed2fc2e27ad1e55c5728c17ce69ce8
    nateanl authored and facebook-github-bot committed Jul 3, 2023
    163157d

Commits on Jul 5, 2023

  1. Update forced_align method to only support batch Tensors (#3433)

    Summary:
    Pull Request resolved: #3433
    
    The current design of forced_align accepts a 2D Tensor for `log_probs` and a 1D Tensor for `targets`. To keep the API simple, this PR changes it to only support batch Tensors (a 3D Tensor for `log_probs` and a 2D Tensor for `targets`).
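
The shape contract described above can be sketched with plain tuples. This is only an illustration: `check_batch_shapes` and `add_batch_dim` are hypothetical helpers, not torchaudio functions, and reading the dimensions as (batch, time, classes) and (batch, target length) is an assumption.

```python
from typing import Tuple

Shape = Tuple[int, ...]


def check_batch_shapes(log_probs_shape: Shape, targets_shape: Shape) -> None:
    """Hypothetical validator: accept only batched (3D, 2D) inputs."""
    if len(log_probs_shape) != 3:
        raise ValueError(f"log_probs must be 3D, got {len(log_probs_shape)}D")
    if len(targets_shape) != 2:
        raise ValueError(f"targets must be 2D, got {len(targets_shape)}D")
    if log_probs_shape[0] != targets_shape[0]:
        raise ValueError("log_probs and targets must share the batch dimension")


def add_batch_dim(shape: Shape) -> Shape:
    """Turn an unbatched shape into a batch of one, e.g. (T, C) -> (1, T, C)."""
    return (1, *shape)


# Old-style unbatched shapes: (time, classes) and (target length,)
old_log_probs, old_targets = (100, 29), (12,)

# Migrating a caller means adding a leading batch dimension to both inputs.
new_log_probs = add_batch_dim(old_log_probs)   # (1, 100, 29)
new_targets = add_batch_dim(old_targets)       # (1, 12)
check_batch_shapes(new_log_probs, new_targets) # passes silently
```

With real tensors, the equivalent migration for existing callers would be adding a leading dimension of size one to each input.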
    
    Reviewed By: mthrok
    
    Differential Revision: D46657526
    
    fbshipit-source-id: af17ec3f92f1a2c46dba91c6db2488a11de36f89
    nateanl authored and facebook-github-bot committed Jul 5, 2023
    cc16447
  2. Untangle third party inclusion in CMake (#3457)

    Summary: Pull Request resolved: #3457
    
    Differential Revision: D47241343
    
    Pulled By: mthrok
    
    fbshipit-source-id: fd1bfd1531397cb59e9cf11de9dede6949f8517e
    mthrok authored and facebook-github-bot committed Jul 5, 2023
    c34a1d6
  3. Add stand alone job to build FFmpeg binaries (#3455)

    Summary: Pull Request resolved: #3455
    
    Differential Revision: D47242316
    
    Pulled By: mthrok
    
    fbshipit-source-id: 0eb4bdb0a45fccfe9ff97eaed79db63cd7bfc7d8
    mthrok authored and facebook-github-bot committed Jul 5, 2023
    662f067
  4. Revert "[audio][PR] Add option to dlopen FFmpeg libraries (#3402)" (#3456)
    
    Summary:
    This reverts commit b7d3e89.
    
    We will use pre-built binaries instead of dlopen.
    
    Pull Request resolved: #3456
    
    Differential Revision: D47239681
    
    Pulled By: mthrok
    
    fbshipit-source-id: 0446a62410d914081184fc20c386afa00b1e41b6
    mthrok authored and facebook-github-bot committed Jul 5, 2023
    ca66a1d

Commits on Jul 6, 2023

  1. Fix mac ffmpeg build (#3459)

    Summary:
    Follow-up to #3455.
    
    The FFMPEG_VERSION env var is not defined in existing CI jobs.
    
    Pull Request resolved: #3459
    
    Reviewed By: atalman
    
    Differential Revision: D47249074
    
    Pulled By: mthrok
    
    fbshipit-source-id: 20f82d749adef5f45a984ab8125592ef36279e94
    mthrok authored and facebook-github-bot committed Jul 6, 2023
    2fa39db
  2. Add ARM linux ffmpeg build (#3462)

    Summary: Pull Request resolved: #3462
    
    Differential Revision: D47270241
    
    Pulled By: mthrok
    
    fbshipit-source-id: 6a3b02380dfb381ffb47c1f46b46f4833c765246
    mthrok authored and facebook-github-bot committed Jul 6, 2023
    d9f51ce

Commits on Jul 7, 2023

  1. Use pre-built binaries for ffmpeg extension (#3460)

    Summary:
    This commit changes the way FFmpeg extension is built.
    
    Originally, the build process expected the FFmpeg binaries to be somehow available in the build env.
    This made the build process unpredictable and prevented enabling the FFmpeg extension by default.
    
    The proposed change uses pre-built FFmpeg binaries as a build-time-only scaffold. They are built in our CI job https://github.com/pytorch/audio/actions/workflows/ffmpeg.yml.
    
    This makes the build process more predictable and removes the necessity to build FFmpeg in our CI.
    Currently, it supports macOS (arm64, x86_64), unix (x86_64, aarch64) and windows (amd64).
    The downside is that it no longer works with architectures not listed above.
    We could potentially work around this by searching for the FFmpeg binaries available in the system (the old way) on
    these systems, but since they are not supported by PyTorch, the priority is low.
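
The platform restriction can be sketched as a lookup against the combinations listed above. This is a hypothetical helper, not the project's build logic: `has_prebuilt_ffmpeg` and `SUPPORTED` are illustrative names, and mapping "unix" to the `platform.system()` value `linux` is an assumption.

```python
import platform
from typing import Optional

# (system, machine) pairs taken from the summary above.
# Treating "unix" as "linux" is an assumption made for this sketch.
SUPPORTED = {
    ("darwin", "arm64"),
    ("darwin", "x86_64"),
    ("linux", "x86_64"),
    ("linux", "aarch64"),
    ("windows", "amd64"),
}


def has_prebuilt_ffmpeg(system: Optional[str] = None, machine: Optional[str] = None) -> bool:
    """Return True if a pre-built FFmpeg scaffold would exist for this platform."""
    # Fall back to the current interpreter's platform when arguments are omitted.
    system = (system or platform.system()).lower()
    machine = (machine or platform.machine()).lower()
    return (system, machine) in SUPPORTED


supported_here = has_prebuilt_ffmpeg()  # result depends on the machine running this
```

An unsupported pair such as `("Linux", "riscv64")` returns `False`, which corresponds to the case the summary leaves as low priority.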
    
    Pull Request resolved: #3460
    
    Differential Revision: D47261885
    
    Pulled By: mthrok
    
    fbshipit-source-id: 223a15e95c9140c95688af968beb35ff40354476
    mthrok authored and facebook-github-bot committed Jul 7, 2023
    f77c3e5
  2. Fix StreamWriter regression around RGB0/BGR0 (#3428)

    Summary:
    - Add RGB0/BGR0 support to CPU encoder
    - Allow passing RGB/BGR when the expected format is RGB0/BGR0
    
    Pull Request resolved: #3428
    
    Differential Revision: D47274370
    
    Pulled By: mthrok
    
    fbshipit-source-id: d34d940e04b07673bb86f518fe895c0735912444
    mthrok authored and facebook-github-bot committed Jul 7, 2023
    9210cba
  3. Set the default #threads to 1 in StreamWriter (#3370)

    Summary:
    Similar to #2949
    
    Pull Request resolved: #3370
    
    Differential Revision: D47298746
    
    Pulled By: mthrok
    
    fbshipit-source-id: 0cc0f395772b33f8b2f5f55253d659e451f506c4
    mthrok authored and facebook-github-bot committed Jul 7, 2023
    9c7bf1b

Commits on Jul 10, 2023

  1. Update package smoke test (#3465)

    Summary:
    1. Update smoke test script to change directory so that there is no `torchaudio` directory in CWD when smoke test is being executed.
    2. Disable the part of the smoke test which requires FFmpeg for wheel. This is in preparation for pytorch/test-infra#4358
    
    Pull Request resolved: #3465
    
    Reviewed By: nateanl
    
    Differential Revision: D47345117
    
    Pulled By: mthrok
    
    fbshipit-source-id: 95aad0a22922d44ee9a24a05d9ece85166b8c17e
    mthrok authored and facebook-github-bot committed Jul 10, 2023
    589de10

Commits on Jul 11, 2023

  1. Clean up FFMPEG env var and remove pre/post build script (#3466)

    Summary:
    Now that we do not build FFmpeg as part of CI build process, we can remove the pre/post build scripts.
    
    Needs to land after pytorch/test-infra#4358
    
    Pull Request resolved: #3466
    
    Reviewed By: atalman
    
    Differential Revision: D47367022
    
    Pulled By: mthrok
    
    fbshipit-source-id: 17aafff74ee7d269236cffb8a88c803a8d4c44b7
    mthrok authored and facebook-github-bot committed Jul 11, 2023
    c825c01
  2. Update doc analytics (#3469)

    Summary: Pull Request resolved: #3469
    
    Differential Revision: D47368140
    
    Pulled By: mthrok
    
    fbshipit-source-id: d82ddb91ae1f6612298486fb8401f95c48db5620
    mthrok authored and facebook-github-bot committed Jul 11, 2023
    216146a
  3. Fix doc style (#3468)

    Summary: Pull Request resolved: #3468
    
    Differential Revision: D47368070
    
    Pulled By: mthrok
    
    fbshipit-source-id: 9b5d57b0cb861a2556a1903121f526f8011a0e2d
    mthrok authored and facebook-github-bot committed Jul 11, 2023
    18b20f7
  4. Clean up FFmpeg build scripts (#3470)

    Summary: Pull Request resolved: #3470
    
    Differential Revision: D47374347
    
    Pulled By: mthrok
    
    fbshipit-source-id: 003b83e50a70f6e1d06eb196f0be5dbba1640226
    mthrok authored and facebook-github-bot committed Jul 11, 2023
    cc41178

Commits on Jul 12, 2023

  1. Support multiple FFmpeg versions (#3464)

    Summary:
    This commit introduces support for multiple FFmpeg versions for OSS binary distributions.
    
    Currently torchaudio only works with FFmpeg 4. This is inconvenient at every stage, from installation to runtime linking.
    This commit allows picking FFmpeg 4, 5 or 6 at runtime, instead of just looking for v4.
    
    The way it works is that we compile the FFmpeg extension three times, once against each FFmpeg version, and ship all of them.
    At runtime, we look for libavutil of specific version and when one is found, load the corresponding FFmpeg extension.
    The order of preference is 6, 5, then 4.
    
    To make the build process simple and reproducible, we use pre-built binaries of FFmpeg during the build.
    They are LGPL and downloaded from S3 at build time, instead of building every time.
    
    The use of pre-built binaries as scaffolding limits the systems that can build torchaudio, so this commit also introduces
    a single FFmpeg version support mode. Setting FFMPEG_ROOT during the build changes the way binaries are built
    so that only one specific version of FFmpeg is supported.
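
The runtime selection order (6, then 5, then 4) can be sketched as below. This is not torchaudio's loader: `pick_ffmpeg_version` and the `can_load` probe are hypothetical, and the mapping relies on the libavutil major versions that FFmpeg 4, 5 and 6 ship with (56, 57 and 58).

```python
from typing import Callable, Optional

# FFmpeg major version -> libavutil major version shipped with it.
LIBAVUTIL_MAJOR = {6: 58, 5: 57, 4: 56}


def pick_ffmpeg_version(can_load: Callable[[int], bool]) -> Optional[int]:
    """Return the preferred FFmpeg major version whose libavutil can be loaded.

    `can_load` is a caller-supplied probe (hypothetical) that takes a libavutil
    major version and reports whether that library is available.
    """
    for ffmpeg_major in (6, 5, 4):  # order of preference from the summary
        if can_load(LIBAVUTIL_MAJOR[ffmpeg_major]):
            return ffmpeg_major
    return None  # no compatible FFmpeg found


# Example: a system where only FFmpeg 4 and 5 libraries are installed.
installed = {56, 57}
chosen = pick_ffmpeg_version(lambda major: major in installed)
```

Passing the probe as a callable keeps the preference logic testable without any FFmpeg installation; a real probe would attempt to load the shared library.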
    
    Pull Request resolved: #3464
    
    Differential Revision: D47300223
    
    Pulled By: mthrok
    
    fbshipit-source-id: 560c7968315e4c8922afa11a4693f648c0356d04
    mthrok authored and facebook-github-bot committed Jul 12, 2023
    786066b
  2. Fix FFmpeg initialization logic (#3474)

    Summary: Pull Request resolved: #3474
    
    Differential Revision: D47398447
    
    fbshipit-source-id: f77b685d54ddfc222b806475707d4a10239872f5
    mthrok authored and facebook-github-bot committed Jul 12, 2023
    49e269a
  3. Use FFmpeg6 in build doc (#3475)

    Summary: Pull Request resolved: #3475
    
    Differential Revision: D47403772
    
    Pulled By: mthrok
    
    fbshipit-source-id: 5cdde521dbbbbf33856470a9dc79419b4a3a1683
    mthrok authored and facebook-github-bot committed Jul 12, 2023
    989702b
  4. Fix resampling to support dynamic input lengths for onnx exports. (#3473)
    
    Summary:
    This is a port of adefossez/julius#17 for torchaudio.
    
    Not sure if it's possible/desirable to add tests to test the functionality of ONNX exports, but I did a quick test on my machine to ensure this works. The logic is a bit simpler compared to the other PR because the torchaudio version does not support the additional flags available in julius.
    
    Pull Request resolved: #3473
    
    Differential Revision: D47401988
    
    Pulled By: mthrok
    
    fbshipit-source-id: 62fa1e4388923f6a62cef2c0f902a79ea179cec4
    bogdanteleaga authored and facebook-github-bot committed Jul 12, 2023
    a3b6bfb
  5. Resolve some compilation warnings (#3471)

    Summary:
    - FFmpeg 6 deprecated attributes
    - Guard CUDA specific functions not used in CPU builds
    
    Pull Request resolved: #3471
    
    Differential Revision: D47402174
    
    Pulled By: mthrok
    
    fbshipit-source-id: 00c0719ab1849b50c0b56b03d8fb38bc7aa74538
    mthrok authored and facebook-github-bot committed Jul 12, 2023
    a6d1fec

Commits on Jul 13, 2023

  1. Revert D47402174: [audio][PR] Resolve some compilation warnings

    Differential Revision:
    D47402174
    
    Original commit changeset: 00c0719ab184
    
    Original Phabricator Diff: D47402174
    
    fbshipit-source-id: b1f6ea4cc3ecef3f72a87bf2f67bf9644c847546
    mthrok authored and facebook-github-bot committed Jul 13, 2023
    155d1ba
  2. Linux CPU job should respect set Python version (#3477)

    Summary:
    Reintroduce a conda environment within which we do all deps installation, audio builds, and test runs. This conda environment uses the Python version set by the GHA job. Previously, the job defaulted to the system Python 3.10 inside the container.
    
    Pull Request resolved: #3477
    
    Reviewed By: mthrok
    
    Differential Revision: D47414572
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 80760f82c7726205b29812d576e498db2a7a80a0
    osalpekar authored and facebook-github-bot committed Jul 13, 2023
    86cb1e0

Commits on Jul 14, 2023

  1. Update the logic to fetch pixel format from filter graph (#3479)

    Summary:
    When using GPU decoder in some environments, attempting to read the output formats from filter graph caused an issue in which the software pixel format cannot be determined.
    
    We do not know the exact cause but when it happens, the input link of buffer sink does not have HW frames context.
    
    Since currently no filter can convert the pixel format of CUDA frame, we resort to the HW frames context of the output link of buffer source.
    
    Environments in which this was observed:
    
    Env1
    - OS: Fedora 36 (x86_64)
    - GCC 12.2.1
    - Python 3.10.12
    - GPU: GeForce RTX 3070 Ti Laptop GPU
    - FFmpeg: 5.1.3
    - nv-codec-header: n11.1.5.2
    - CUDA: 12.1
    
    Env2
    - Ubuntu 20.04.4 LTS (x86_64)
    - GCC 9.4.0
    - Python 3.11.3
    - GPU: Quadro GV100
    - FFmpeg: 5.1.3
    - nv-codec-header: n11.1.5.2
    - CUDA: 11.4
    
    Pull Request resolved: #3479
    
    Differential Revision: D47482407
    
    Pulled By: mthrok
    
    fbshipit-source-id: 1c53096b27824453b260138ab64e1948afeeefc7
    mthrok authored and facebook-github-bot committed Jul 14, 2023
    cf53a48

Commits on Jul 15, 2023

  1. Update notes on FFmpeg version (#3480)

    Summary:
    The nightly builds support FFmpeg version 4, 5 and 6.
    
    Pull Request resolved: #3480
    
    Differential Revision: D47482841
    
    Pulled By: mthrok
    
    fbshipit-source-id: 88267f5e83ddc7b1e866b35e57a87b985e2c78c9
    mthrok authored and facebook-github-bot committed Jul 15, 2023
    5a809aa
  2. Use more recent FFmpeg in unit tests (#3476)

    Summary: Pull Request resolved: #3476
    
    Differential Revision: D47494211
    
    Pulled By: mthrok
    
    fbshipit-source-id: 230bbf0a271b070d1dea34146d0d466e666cccdc
    mthrok authored and facebook-github-bot committed Jul 15, 2023
    ea7a96d

Commits on Jul 17, 2023

  1. Ensure StreamReader returns tensors with requires_grad set to False (#3467)

    Summary: Pull Request resolved: #3467
    
    Differential Revision: D47482388
    
    Pulled By: mthrok
    
    fbshipit-source-id: abff36491dc28b83270673860d6457a084b1327d
    mthrok authored and facebook-github-bot committed Jul 17, 2023
    44b9206

Commits on Jul 18, 2023

  1. Extract NVDEC tutorial from the current notebook (#3478)

    Summary:
    Now that GPU video decoders are available in doc CI, we run the tutorials with GPU decoders.
    
    Pull Request resolved: #3478
    
    Differential Revision: D47519672
    
    Pulled By: mthrok
    
    fbshipit-source-id: 2f95243100e9c75e17c2b4d306da164f0e31f8f2
    mthrok authored and facebook-github-bot committed Jul 18, 2023
    6324462

Commits on Jul 24, 2023

  1. Move examples/asr/avsr_rnnt to examples/avsr folder (#3489)

    Summary: Pull Request resolved: #3489
    
    Reviewed By: mthrok
    
    Differential Revision: D47726448
    
    Pulled By: mpc001
    
    fbshipit-source-id: 3d5aa7646c6bb816dcbbf70c61e98404bb148841
    Pingchuan Ma authored and facebook-github-bot committed Jul 24, 2023
    66f661d

Commits on Jul 25, 2023

  1. Update AV-ASR recipe link to index.rst. (#3492)

    Summary: Pull Request resolved: #3492
    
    Reviewed By: mthrok
    
    Differential Revision: D47755638
    
    Pulled By: mpc001
    
    fbshipit-source-id: 729efdb2a69b5656dbc0b70dd623c1509123d3aa
    Pingchuan Ma authored and facebook-github-bot committed Jul 25, 2023
    ae8c131
  2. Fix typo in melscale_fbank (#3487)

    Summary:
    Resolves #3486
    
    Pull Request resolved: #3487
    
    Differential Revision: D47724733
    
    Pulled By: mthrok
    
    fbshipit-source-id: 26f5641a8271a7e50c4a33861d09b0c8274b29e4
    mthrok authored and facebook-github-bot committed Jul 25, 2023
    135cb7b
  3. Run GPU video decoder/encoder tests in CI (#3490)

    Summary: Pull Request resolved: #3490
    
    Differential Revision: D47757316
    
    Pulled By: mthrok
    
    fbshipit-source-id: cfb376be29980f9e452f291c4fa25780e9f85a97
    mthrok authored and facebook-github-bot committed Jul 25, 2023
    df65560
  4. Update nvdec/nvenc tutorials (#3483)

    Summary: Pull Request resolved: #3483
    
    Differential Revision: D47725664
    
    Pulled By: mthrok
    
    fbshipit-source-id: e4249e1488fa7af8670be4a5077957912ff3420b
    mthrok authored and facebook-github-bot committed Jul 25, 2023
    56e2266
  5. Update avsr recipe (#3493)

    Summary:
    This PR includes a few changes to the AV-ASR recipe: better results, a faster face detector (Mediapipe), renamed variables, a streamlined dataloader, and a few illustrated examples. These changes were made to improve the usability of the recipe.
    
    Pull Request resolved: #3493
    
    Reviewed By: mthrok
    
    Differential Revision: D47758072
    
    Pulled By: mpc001
    
    fbshipit-source-id: 4533587776f3a7a74f3f11b0ece773a0934bacdc
    Pingchuan Ma authored and facebook-github-bot committed Jul 25, 2023
    d464479
  6. Fix and update doc deployment (#3495)

    Summary:
    - Fix the condition for adding a new commit to gh-pages
    - Allow deploying docs from workflow dispatch
    
    Pull Request resolved: #3495
    
    Differential Revision: D47767443
    
    Pulled By: mthrok
    
    fbshipit-source-id: 9ca858868f3e822e532c21cde9d7499af9891a51
    mthrok authored and facebook-github-bot committed Jul 25, 2023
    e483a67
  7. Disable some tests that need libsox (#3494)

    Summary:
    In preparation for #3082
    
    Disable the FFmpeg tests that depend on the sox CLI. These tests need to be updated or removed so that they no longer use the sox CLI.
    
    Auto-skip some sox tests if the required decoder/encoder is not available.
    
    Pull Request resolved: #3494
    
    Differential Revision: D47761948
    
    Pulled By: mthrok
    
    fbshipit-source-id: 3a48d7f280f8376a48d223947dd41a7cdc8cbc30
    mthrok authored and facebook-github-bot committed Jul 25, 2023
    49e9ed9

Commits on Jul 26, 2023

  1. Add nightly doc update (#3496)

    Summary:
    Add a scheduled doc update job so that docs are updated at least once a day.
    
    Pull Request resolved: #3496
    
    Differential Revision: D47795577
    
    Pulled By: mthrok
    
    fbshipit-source-id: aba5376ec51f07560014d250a16fef8b8a11b43e
    mthrok authored and facebook-github-bot committed Jul 26, 2023
    f082e6c
  2. Move env util (#3499)

    Summary: Pull Request resolved: #3499
    
    Differential Revision: D47803654
    
    Pulled By: mthrok
    
    fbshipit-source-id: 2b916fa66d84c91c01b4dfe6dd5ee3501159f451
    mthrok authored and facebook-github-bot committed Jul 26, 2023
    da21202
  3. av-asr: move video loading outside detector (#3498)

    Summary:
    This PR moves video loading outside the detector during pre-processing.
    
    Pull Request resolved: #3498
    
    Reviewed By: mthrok
    
    Differential Revision: D47811044
    
    Pulled By: mpc001
    
    fbshipit-source-id: f17839b695b13d3cf2d9db343d7e9a0202eea7d5
    Pingchuan Ma authored and facebook-github-bot committed Jul 26, 2023
    c977afe

Commits on Jul 27, 2023

  1. Add switch to disable sox integration and ffmpeg integration at runtime (#3500)
    
    Summary:
    Since the libsox and ffmpeg extensions now depend on external libraries, their initialization might cause unrecoverable issues, such as a segfault.
    
    This commit adds environment variables to disable them, so that importing torchaudio does not attempt to load these libraries.
    
    Pull Request resolved: #3500
    
    Differential Revision: D47808178
    
    Pulled By: mthrok
    
    fbshipit-source-id: 80c1c6b5f4bc608d4e209473702680db093c95ee
    mthrok authored and facebook-github-bot committed Jul 27, 2023
    29903c5
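The idea of guarding an optional native extension behind an environment variable can be sketched as follows. This is a minimal illustration, not torchaudio's implementation: the commit message does not name the variables, so `MYPKG_USE_SOX`, `is_enabled`, and `load_optional_extension` are hypothetical names.

```python
import os


def is_enabled(var_name: str, default: bool = True) -> bool:
    """Interpret an environment variable as an on/off switch."""
    value = os.environ.get(var_name)
    if value is None:
        return default
    value = value.strip().lower()
    if value in ("1", "on", "true", "yes"):
        return True
    if value in ("0", "off", "false", "no"):
        return False
    raise ValueError(f"Unexpected value for {var_name}: {value!r}")


def load_optional_extension(var_name: str, loader):
    """Call `loader` only when the switch named by `var_name` is on.

    `var_name` is a hypothetical variable such as "MYPKG_USE_SOX".
    When the switch is off, the loader is never called, so a library
    that would crash during initialization is never touched.
    """
    if not is_enabled(var_name):
        return None
    return loader()
```

With a layout like this, setting the variable to `0` before importing the package skips the loader entirely, which matters because a segfault inside a native library cannot be caught with `try`/`except`.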
  2. Replace libsox with stub library (#3497)

    Summary:
    This commit updates the way libsox is integrated into torchaudio:
    
    1. We stop statically linking libsox, so torchaudio will not ship libsox.
    2. We link libsox dynamically. Users are expected to install libsox themselves.
    3. We use a stub library to build torchaudio.
    
    Pull Request resolved: #3497
    
    Differential Revision: D47803706
    
    Pulled By: mthrok
    
    fbshipit-source-id: 31b05495d81069186fa52d67beea360cc7e817a8
    mthrok authored and facebook-github-bot committed Jul 27, 2023
    8588fba
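Because libsox is no longer shipped with the package, it can be useful to check whether the system can resolve it at all. The sketch below uses only Python's standard `ctypes` module. It is an assumption-laden illustration of dynamic loading in general, not the mechanism torchaudio uses, and `try_load` is a hypothetical helper.

```python
import ctypes
import ctypes.util
from typing import Optional


def try_load(name: str) -> Optional[ctypes.CDLL]:
    """Try to locate and open a shared library by its short name.

    Returns the loaded handle, or None when the library cannot be
    found or opened. For libsox the short name would be "sox".
    """
    path = ctypes.util.find_library(name)
    if path is None:
        return None  # the system linker does not know this library
    try:
        return ctypes.CDLL(path)
    except OSError:
        return None  # found, but loading failed (e.g. missing dependency)
```

Calling `try_load("sox")` returns `None` on a machine without libsox installed, which is the situation in which users would need to install it themselves.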
  3. Remove unused files (#3514)

    Summary:
    Removes leftover files from #3497.
    
    Pull Request resolved: #3514
    
    Differential Revision: D47838049
    
    Pulled By: mthrok
    
    fbshipit-source-id: c4b00aba9f4cc887ec595f04d7a2dd673c63b975
    mthrok authored and facebook-github-bot committed Jul 27, 2023
    7368e33

Commits on Jul 28, 2023

  1. Add real-time av-asr tutorial (#3511)

    Summary: Pull Request resolved: #3511
    
    Reviewed By: mthrok
    
    Differential Revision: D47852108
    
    Pulled By: mpc001
    
    fbshipit-source-id: c0ecb4b5bcc8670013dcbe1164e3929f5793c8aa
    Pingchuan Ma authored and facebook-github-bot committed Jul 28, 2023
    d6aeaa7
  2. Move TorchAudio-Squim models to Beta (#3512)

    Summary:
    This PR moves the `SquimObjective` and `SquimSubjective` models, along with the corresponding factory functions and pre-trained pipelines, out of prototype and into the core directory. They will be included in the next official release.
    
    Pull Request resolved: #3512
    
    Reviewed By: mthrok
    
    Differential Revision: D47837434
    
    Pulled By: nateanl
    
    fbshipit-source-id: d0639f29079f7e1afc30f236849e530c8cadffd8
    nateanl authored and facebook-github-bot committed Jul 28, 2023
    b7d2d92
  3. Update documentation about dependencies (#3517)

    Summary: Pull Request resolved: #3517
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47858452
    
    Pulled By: mthrok
    
    fbshipit-source-id: 62ee6c8bb2669dd70f8ca25703a04dc8a9d19aec
    mthrok authored and facebook-github-bot committed Jul 28, 2023
    a051985
  4. Remove ffmpeg fallback from sox_io backend (#3516)

    Summary:
    In #2419, we added ffmpeg as a fallback for the sox_io backend. This was a workaround for the issue caused by the removal of libmad.
    
    Now that we have introduced the `backend` argument to I/O functions, and libsox integration has moved to dynamic binding, where users can use libsox with libmad integration, we no longer need the workaround.
    
    This commit is based on reverting #2416 (fd7ace1).
    
    Pull Request resolved: #3516
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47855272
    
    Pulled By: mthrok
    
    fbshipit-source-id: 5af73af7865f6e545ccb052d478e86588ff2a014
    mthrok authored and facebook-github-bot committed Jul 28, 2023
    2c8665d
  5. Amend amp_to_db docstring (#3519)

    Summary:
    Context: #3448
    
    The documentation of amplitude_to_DB is ambiguous about how cut-off values are computed when the input tensor is 3D.
    
    This commit clarifies that.
    
    Closes: #3448
    
    Pull Request resolved: #3519
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47875505
    
    Pulled By: mthrok
    
    fbshipit-source-id: e06bb997e7a27e2abe35c8e2ac91ddfbded4e641
    mthrok authored and facebook-github-bot committed Jul 28, 2023
    61cbf79

Commits on Jul 29, 2023

  1. Refactor compat (#3518)

    Summary:
    The I/O functions in the `_compat` module were introduced there so that
    everything related to FFmpeg is in torchaudio.io and FFmpeg library
    initialization can be carried out in `torchaudio.io.__init__`.
    
    Now that this constraint is removed (all the initialization happens
    at `torchaudio._extension.__init__`) and `_compat` is only used by
    the FFmpeg dispatcher backend, we move the module to `torchaudio._backend`
    for better locality.
    
    Pull Request resolved: #3518
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47877412
    
    Pulled By: mthrok
    
    fbshipit-source-id: aa18c8cb6e5d5360950df5158c33c653e37c565f
    mthrok authored and facebook-github-bot committed Jul 29, 2023
    8497ee9

Commits on Jul 31, 2023

  1. Set and tweak global matplotlib configuration in tutorials (#3515)

    Summary:
    - Set global matplotlib rc params
    - Fix style check
    - Fix and update FA tutorial plots
    - Add av-asr index cards
    
    Pull Request resolved: #3515
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47894156
    
    Pulled By: mthrok
    
    fbshipit-source-id: b40d8d31f12ffc2b337e35e632afc216e9d59a6e
    mthrok authored and facebook-github-bot committed Jul 31, 2023
    84b1230
  2. Migrate torch.norm to torch.linalg.vector_norm (#3522)

    Summary:
    torch.norm is now deprecated.
    The usages in torchaudio appear to be vector norms, so this commit replaces them with torch.linalg.vector_norm.
    
    Resolves #3484
    
    Pull Request resolved: #3522
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47926659
    
    Pulled By: mthrok
    
    fbshipit-source-id: f7428cf0168109a3d340b8784adc99bb5f781084
    mthrok authored and facebook-github-bot committed Jul 31, 2023
    8a2e12d

Commits on Aug 1, 2023

  1. Add pretrained VGGish inference pipeline (#3491)

    Summary:
    Adds pre-trained VGGish inference pipeline ported from https://github.com/harritaylor/torchvggish and https://github.com/tensorflow/models/tree/master/research/audioset.
    
    Pull Request resolved: #3491
    
    Reviewed By: mthrok
    
    Differential Revision: D47738130
    
    Pulled By: hwangjeff
    
    fbshipit-source-id: 859c1ff1ec1b09dae4e26586169544571657cc67
    hwangjeff authored and facebook-github-bot committed Aug 1, 2023
    cbfde17
  2. Migrate weight_norm (#3523)

    Summary:
    torch.nn.utils.weight_norm is deprecated.
    Replace it with the new API, torch.nn.utils.parametrizations.weight_norm.
    
    Pull Request resolved: #3523
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47932384
    
    Pulled By: mthrok
    
    fbshipit-source-id: 344abfa12bd11da779f7fd13b74a1e009a582b52
    mthrok authored and facebook-github-bot committed Aug 1, 2023
    144cfcf
  3. Add cuctc tutorial, change blank skip threshold into prob (#3297)

    Summary:
    Add a separate tutorial for cuctc.
    Resolves #3096
    
    Pull Request resolved: #3297
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47928400
    
    Pulled By: mthrok
    
    fbshipit-source-id: 8c16492fb4d007b6ea7969ba77c866a51749c0ec
    yuekaizhang authored and facebook-github-bot committed Aug 1, 2023
    732c94a

Commits on Aug 2, 2023

  1. Fix save INT16 sox backend (#3524)

    Summary:
    When passing int16 type tensor to `save(backend="sox")`, the resulting file should be 16-bit signed PCM, but instead is 32-bit signed PCM.
    
    Resolves #3304
    
    Pull Request resolved: #3524
    
    Reviewed By: huangruizhe
    
    Differential Revision: D47941090
    
    Pulled By: mthrok
    
    fbshipit-source-id: 2622b31eb1cbf03969f67ab2b2adec6e2ba677c4
    mthrok authored and facebook-github-bot committed Aug 2, 2023
    3f9b517
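One way to confirm the bit depth of a saved WAV file, independently of torchaudio, is to read its header with Python's standard `wave` module. The sketch below writes a small 16-bit file and reads back the sample width. The helpers `write_pcm16` and `bits_per_sample` are hypothetical names used only for illustration, and the `wave` module handles plain PCM WAV files only.

```python
import struct
import wave


def write_pcm16(path, samples, sample_rate=16000):
    """Write mono integer samples as a 16-bit signed PCM WAV file."""
    with wave.open(str(path), "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)  # 2 bytes per sample, i.e. 16-bit
        f.setframerate(sample_rate)
        # "<h" is a little-endian signed 16-bit integer
        f.writeframes(struct.pack(f"<{len(samples)}h", *samples))


def bits_per_sample(path) -> int:
    """Return the bit depth recorded in the WAV header."""
    with wave.open(str(path), "rb") as f:
        return f.getsampwidth() * 8
```

Running `bits_per_sample` on a file saved from an int16 tensor is a quick way to see whether it was stored as 16-bit or 32-bit PCM, which is the discrepancy this commit addresses.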

Commits on Aug 3, 2023

  1. Relax Conformer RNN-T numerical parity tests (#3525)

    Summary:
    Increases numerical tolerance on Conformer RNN-T TorchScript consistency tests to resolve CI test failures.
    
    Pull Request resolved: #3525
    
    Reviewed By: mthrok
    
    Differential Revision: D48000613
    
    Pulled By: hwangjeff
    
    fbshipit-source-id: 1d35ba58055a8346dc40e2b67f37ccfd2e015894
    hwangjeff authored and facebook-github-bot committed Aug 3, 2023
    72b0917
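The effect of relaxing a numerical tolerance can be illustrated with the usual closeness check, |actual - expected| <= atol + rtol * |expected|. The commit message does not state which tolerances were used, so the values mentioned below are purely illustrative and `is_close` is a hypothetical helper.

```python
def is_close(actual: float, expected: float, rtol: float, atol: float) -> bool:
    """Closeness check combining a relative and an absolute tolerance."""
    return abs(actual - expected) <= atol + rtol * abs(expected)
```

For example, a value of 1.0002 compared against 1.0 fails the check with `rtol=1e-5, atol=1e-8` but passes with `rtol=1e-3, atol=1e-8`, which is the kind of difference that lets a consistency test tolerate small numerical drift between two execution paths.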
  2. Refactor wav2vec2 pipeline misc helper functions (#3527)

    Summary: Pull Request resolved: #3527
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48008822
    
    Pulled By: mthrok
    
    fbshipit-source-id: 4beae2956dfd1f00534832b70a1bf0897cba7812
    mthrok authored and facebook-github-bot committed Aug 3, 2023
    09aabcc

Commits on Aug 4, 2023

  1. Update ctc forced alignment tutorial (#3529)

    Summary:
    - Simplify the step to generate token-level alignment
    
    Pull Request resolved: #3529
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48066787
    
    Pulled By: mthrok
    
    fbshipit-source-id: 452c243d278e508926a59894928e280fea76dcc6
    mthrok authored and facebook-github-bot committed Aug 4, 2023
    b645c07
  2. Revise VGGish pipeline to accept arbitrary state dict function (#3531)

    Summary:
    Pull Request resolved: #3531
    
    Revises VGGish pipeline to accept arbitrary state dict function to accommodate loading weights from any source.
    
    Reviewed By: mthrok
    
    Differential Revision: D48056390
    
    fbshipit-source-id: 2767699b58442ad132b518b4a6435f2772a637c3
    hwangjeff authored and facebook-github-bot committed Aug 4, 2023
    b976c8f

Commits on Aug 7, 2023

  1. Make target_lengths/input_lengths in forced_align optional (#3533)

    Summary:
    Currently, the `torchaudio.functional.forced_align` function requires full information on input/target lengths.
    When performing non-batched alignment, these can be inferred from the sizes of the Tensors.
    
    Pull Request resolved: #3533
    
    Reviewed By: nateanl
    
    Differential Revision: D48111041
    
    Pulled By: mthrok
    
    fbshipit-source-id: fbf07124d3959c5cc5533dcd86296851587082fb
    mthrok authored and facebook-github-bot committed Aug 7, 2023
    cd80976
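The inference described in this commit can be sketched on plain shape tuples. This is an illustration under assumptions, not the library code: it assumes the emission has shape `(batch, frames, num_labels)` and the targets have shape `(batch, target_len)`, and `infer_lengths` is a hypothetical helper.

```python
from typing import List, Optional, Sequence, Tuple


def infer_lengths(
    emission_shape: Sequence[int],
    targets_shape: Sequence[int],
    input_lengths: Optional[List[int]] = None,
    target_lengths: Optional[List[int]] = None,
) -> Tuple[List[int], List[int]]:
    """Fill in missing lengths when the input is not batched.

    Assumed layouts: emission is (batch, frames, num_labels) and
    targets is (batch, target_len).
    """
    batch, frames, _ = emission_shape
    target_batch, target_len = targets_shape
    if batch != target_batch:
        raise ValueError("emission and targets must have the same batch size")
    if input_lengths is None:
        if batch != 1:
            raise ValueError("input_lengths is required for batched input")
        input_lengths = [frames]  # the single sequence spans every frame
    if target_lengths is None:
        if batch != 1:
            raise ValueError("target_lengths is required for batched input")
        target_lengths = [target_len]  # the single target uses every token
    return input_lengths, target_lengths
```

With a batch size of one there is no padding, so the full tensor extent is the only sensible length. With larger batches, padding makes the true lengths ambiguous, which is why the sketch still requires them.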
  2. Add merge_tokens / TokenSpan (#3535)

    Summary:
    This commit adds the `merge_tokens` function, which removes repeated tokens from the CTC token sequences returned by `forced_align`.
    
    Resolving repeated tokens is a necessary and almost universal step, so it makes sense to have such a helper function in torchaudio.
    
    Pull Request resolved: #3535
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48111202
    
    Pulled By: mthrok
    
    fbshipit-source-id: 25354bfa210aa5c03f8c1d3e201f253ca3761b24
    mthrok authored and facebook-github-bot committed Aug 7, 2023
    30668af
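The step of collapsing repeated frame-level tokens into spans can be sketched in pure Python. This is not the `merge_tokens` implementation added by the commit: `Span` and `merge_repeated` are hypothetical names, the inputs are plain lists rather than Tensors, and averaging the per-frame scores is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Span:
    token: int    # token id shared by every frame in the span
    start: int    # first frame index (inclusive)
    end: int      # last frame index (exclusive)
    score: float  # mean of the per-frame scores in the span


def merge_repeated(tokens: List[int], scores: List[float], blank: int = 0) -> List[Span]:
    """Collapse runs of identical tokens into spans and drop blank runs."""
    if len(tokens) != len(scores):
        raise ValueError("tokens and scores must have the same length")
    spans: List[Span] = []
    start = 0
    for i in range(1, len(tokens) + 1):
        # A run ends at the end of the sequence or when the token changes.
        if i == len(tokens) or tokens[i] != tokens[start]:
            if tokens[start] != blank:
                run = scores[start:i]
                spans.append(Span(tokens[start], start, i, sum(run) / len(run)))
            start = i
    return spans
```

Because a blank frame ends a run, the same token appearing on both sides of a blank produces two separate spans, which matches how CTC distinguishes a repeated label from a held one.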
  3. Add MMS FA Bundle (#3521)

    Summary:
    Port the MMS FA model from the tutorial to the library, along with a post-processing module.
    
    Pull Request resolved: #3521
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48038285
    
    Pulled By: mthrok
    
    fbshipit-source-id: 571cf0fceaaab4790983be2719f1a85805b814f5
    mthrok authored and facebook-github-bot committed Aug 7, 2023
    5e211d6
  4. Move alignment code to separate submodule (#3536)

    Summary: Pull Request resolved: #3536
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48120170
    
    Pulled By: mthrok
    
    fbshipit-source-id: dec7575db07734490099b35a8bfc854252952c6e
    mthrok authored and facebook-github-bot committed Aug 7, 2023
    90143e9

Commits on Aug 8, 2023

  1. Add abstraction for download util (#1959)

    Summary: Pull Request resolved: #1959
    
    Reviewed By: hwangjeff
    
    Differential Revision: D32078361
    
    fbshipit-source-id: 50b56bac9593c36197998e89db19cd6d65b793cc
    mthrok authored and facebook-github-bot committed Aug 8, 2023
    3f98fb9
  2. Librispeech RNNT recipe updates for PyTorch Lightning 2.0 (#3336)

    Summary: Pull Request resolved: #3336
    
    Reviewed By: mthrok
    
    Differential Revision: D47846814
    
    Pulled By: huangruizhe
    
    fbshipit-source-id: dc12362bf243c52222dccadec3176e25e43dd652
    huangruizhe authored and facebook-github-bot committed Aug 8, 2023
    e6c8973
  3. Fix FA bundle (#3538)

    Summary: Pull Request resolved: #3538
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48154056
    
    Pulled By: mthrok
    
    fbshipit-source-id: 72f58c501c5302d40f1d059f95bd6fe40d4a52aa
    mthrok authored and facebook-github-bot committed Aug 8, 2023
    7e85f62
  4. Adopt MMS_FA bundle in multilingual FA tutorials (#3534)

    Summary: Pull Request resolved: #3534
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48155817
    
    Pulled By: mthrok
    
    fbshipit-source-id: a3d45fdfd360f9668063a3ecb3b00364290134c9
    mthrok authored and facebook-github-bot committed Aug 8, 2023
    19e9046
  5. Add tutorial link to AVSR recipe (#3532)

    Summary: Pull Request resolved: #3532
    
    Reviewed By: mthrok
    
    Differential Revision: D48165499
    
    Pulled By: mpc001
    
    fbshipit-source-id: c87b3361f0e6282684f218b32888df883d56682b
    Pingchuan Ma authored and facebook-github-bot committed Aug 8, 2023
    f7ab406
  6. Updating CTC FA tutorial (#3542)

    Summary: Pull Request resolved: #3542
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48166025
    
    Pulled By: mthrok
    
    fbshipit-source-id: 29fee7dbf08394993972ec2967f94ce9fcb1c853
    mthrok authored and facebook-github-bot committed Aug 8, 2023
    eab8aa7

Commits on Aug 9, 2023

  1. Revise VGGish inference pipeline test (#3544)

    Summary:
    Pull Request resolved: #3544
    
    Revises VGGish inference pipeline test to support internal testing.
    
    Reviewed By: mthrok
    
    Differential Revision: D48058409
    
    fbshipit-source-id: 045140a0e9d50128d32ef6510bdb2f642a365c83
    hwangjeff authored and facebook-github-bot committed Aug 9, 2023
    9f5fa84

Commits on Aug 10, 2023

  1. Misc tutorial updates (#3546)

    Summary: Pull Request resolved: #3546
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48219274
    
    Pulled By: mthrok
    
    fbshipit-source-id: 6881f039bf70cf7240fbcfeb48443471ef457bd4
    mthrok authored and facebook-github-bot committed Aug 10, 2023
    bc26425
  2. Fix SoundfileBackend method decorators (#3550)

    Summary: Pull Request resolved: #3550
    
    Reviewed By: hwangjeff
    
    Differential Revision: D48219176
    
    Pulled By: mthrok
    
    fbshipit-source-id: 4b11111dd3853cbef4ffe1859ec428ca05394824
    mthrok authored and facebook-github-bot committed Aug 10, 2023
    2d1138c
  3. Move backend initialization to toplevel (#3548)

    Summary:
    The backend dispatcher is implemented in `torchaudio._backend`, while the legacy backend is implemented in `torchaudio.backend`.
    
    The initialization happens in `torchaudio._backend`.
    This commit moves it to `torchaudio.__init__`, so that `backend` and `_backend` are more independent.
    
    Pull Request resolved: #3548
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48219244
    
    Pulled By: mthrok
    
    fbshipit-source-id: e694cb232794f90902a60ee51c7bf11b7f0548a0
    mthrok authored and facebook-github-bot committed Aug 10, 2023
    6fb21ab
  4. [aarch64] Add aarch64 workflow (#3553)

    Summary:
    # Changes
    * Adding workflow for building aarch64 wheels.
    
    Pull Request resolved: #3553
    
    Reviewed By: hwangjeff, osalpekar
    
    Differential Revision: D48239384
    
    Pulled By: atalman
    
    fbshipit-source-id: dfa00edb3fee0acaf2b83fb420eaf12bddc6980e
    Mike Schneider authored and facebook-github-bot committed Aug 10, 2023
    8d858c3
  5. Add Frechet distance function (#3545)

    Summary:
    Pull Request resolved: #3545
    
    Adds function for computing the Fréchet distance between two multivariate normal distributions.
    
    Reviewed By: mthrok
    
    Differential Revision: D48126102
    
    fbshipit-source-id: e4e122b831e1e752037c03f5baa9451e81ef1697
    hwangjeff authored and facebook-github-bot committed Aug 10, 2023
    06301c0
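For two multivariate normal distributions, the quantity usually computed under this name is ||mu_x - mu_y||^2 + Tr(S_x + S_y - 2 (S_x S_y)^(1/2)). The sketch below covers only the special case of diagonal covariances, where the trace term reduces to a sum over per-dimension variances. It is not the function added by the commit; `frechet_distance_diag` is a hypothetical name, and whether the library returns this value or its square root is not stated in the commit message.

```python
import math
from typing import Sequence


def frechet_distance_diag(
    mu_x: Sequence[float],
    var_x: Sequence[float],
    mu_y: Sequence[float],
    var_y: Sequence[float],
) -> float:
    """Squared-form Frechet distance for Gaussians with diagonal covariance.

    `var_x` and `var_y` hold the per-dimension variances, i.e. the
    diagonals of the two covariance matrices.
    """
    if not (len(mu_x) == len(var_x) == len(mu_y) == len(var_y)):
        raise ValueError("all inputs must have the same length")
    # ||mu_x - mu_y||^2
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_x, mu_y))
    # Tr(S_x + S_y - 2 (S_x S_y)^(1/2)) for diagonal S_x, S_y
    cov_term = sum(vx + vy - 2.0 * math.sqrt(vx * vy) for vx, vy in zip(var_x, var_y))
    return mean_term + cov_term
```

In the diagonal case each covariance term equals (sqrt(v_x) - sqrt(v_y))^2, so the result is zero exactly when the means and variances agree. Full covariance matrices need a matrix square root, which this sketch deliberately leaves out.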
  6. Refactor _backend module (#3547)

    Summary:
    * Move Backend implementations to separate files
    
    Pull Request resolved: #3547
    
    Reviewed By: hwangjeff
    
    Differential Revision: D48233538
    
    Pulled By: mthrok
    
    fbshipit-source-id: bcc63fc07a5dfcd48929f0a2fb64bfcb3282eb92
    mthrok authored and facebook-github-bot committed Aug 10, 2023
    1e6a8f9

Commits on Aug 11, 2023

  1. Support writing opus and mp3 with soundfile (#3554)

    Summary: Pull Request resolved: #3554
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48240906
    
    Pulled By: mthrok
    
    fbshipit-source-id: 1936757646f8ebba74e8b65e2ffe2a8b74fdfeeb
    mthrok authored and facebook-github-bot committed Aug 11, 2023
    9bd7ca5
  2. Revise VGGish pipeline test again (#3551)

    Summary:
    Pull Request resolved: #3551
    
    Restores VGGish pipeline test to be a function rather than class.
    
    Reviewed By: mthrok
    
    Differential Revision: D48236197
    
    fbshipit-source-id: 25ac19d87a7a0964a9c3f7552037cd6c21dc38a9
    hwangjeff authored and facebook-github-bot committed Aug 11, 2023
    f2b2f05
  3. Expose AudioMetadata (#3556)

    Summary:
    `torchaudio.info` returns `AudioMetaData`. It should be exposed as a public API, without referring to the `backend` submodule.
    
    Pull Request resolved: #3556
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48267349
    
    Pulled By: mthrok
    
    fbshipit-source-id: 6ccc0c32bf62fbdcb71495fc7d8d4cc29891538a
    mthrok authored and facebook-github-bot committed Aug 11, 2023
    9467fc4

Commits on Aug 14, 2023

  1. Add default use_tmp_hub_dir value for integration tests (#3558)

    Summary:
    Pull Request resolved: #3558
    
    In the event that `use_tmp_hub_dir` isn't specified as an option, pytest shouldn't fail. To resolve such failures, this PR modifies function `temp_hub_dir` to fall back on a default value of `False` for `use_tmp_hub_dir`.
    
    Reviewed By: mthrok
    
    Differential Revision: D48318947
    
    fbshipit-source-id: 5dd692f9202ef37ec3e2c9ea39896156f928d693
    hwangjeff authored and facebook-github-bot committed Aug 14, 2023
    d1d41fd
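The fallback can be sketched with pytest's `config.getoption(name, default=...)`, which returns the default when the option was never registered. To keep the example runnable without pytest, it uses a small stand-in object. `StubConfig` and `should_use_tmp_hub_dir` are hypothetical names, and the actual body of `temp_hub_dir` is not shown in the commit message.

```python
class StubConfig:
    """Stand-in for pytest's Config object, so the sketch runs without pytest."""

    def __init__(self, options):
        self._options = options

    def getoption(self, name, default=None):
        # Mirrors the relevant behavior: return the default when the
        # option is unknown instead of raising.
        return self._options.get(name, default)


def should_use_tmp_hub_dir(config) -> bool:
    """Read the option, falling back to False when it was not specified."""
    return bool(config.getoption("use_tmp_hub_dir", default=False))
```

Inside a real fixture, `config` would be `request.config`, so the same call works whether or not the command-line option was registered.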
  2. Update ffmpeg pre-built binary to 4.4.4 (#3557)

    Summary: Pull Request resolved: #3557
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48326462
    
    Pulled By: mthrok
    
    fbshipit-source-id: c37ae38e28e4514ea284613636604a725829346d
    mthrok authored and facebook-github-bot committed Aug 14, 2023
    a9e38e7
  3. Update integration test CI config (#3502)

    Summary:
    Update the Ubuntu image so that CI is triggered.
    There is an issue with FFmpeg 4, so CI does not succeed yet.
    This will be handled separately.
    
    Pull Request resolved: #3502
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48327431
    
    Pulled By: mthrok
    
    fbshipit-source-id: 5ea639f3e20c3aaf460e6030f6cb1ad2daa00172
    mthrok authored and facebook-github-bot committed Aug 14, 2023
    9d8f76d
  4. Update I/O and backend docs (#3555)

    Summary:
    * Merge backend doc into torchaudio toplevel doc
    * Update backend, dispatcher, installation doc
    
    Pull Request resolved: #3555
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48326812
    
    Pulled By: mthrok
    
    fbshipit-source-id: cc0d7326eacfebd341323b5d613ca1777255748b
    mthrok authored and facebook-github-bot committed Aug 14, 2023
    c0f25f2
  5. Move essential backend implementations to _backend (#3549)

    Summary:
    Move the actual I/O implementation to `_backend` submodule so that the existing `backend` submodule contains only what's related to legacy backend utilities.
    
    Pull Request resolved: #3549
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48253550
    
    Pulled By: mthrok
    
    fbshipit-source-id: c23f1664458c723f63e134c7974b3f7cf17a1e98
    mthrok authored and facebook-github-bot committed Aug 14, 2023
    2e0dfaf

Commits on Aug 15, 2023

  1. [BC-breaking] Update pre-built ffmpeg4 to 4.4.4 (#3561)

    Summary:
    In #3460, we switched the build process for the FFmpeg extension.
    Since it is complicated to install FFmpeg in some environments, pre-built binaries and their headers
    are downloaded at build time and used as scaffolding for the torchaudio build.
    
    Even though we did not change any code or the FFmpeg version, it turned out that this causes a segmentation
    fault on Ubuntu when using system Python and FFmpeg 4.4 installed via aptitude.
    While investigating the issue, I swapped the pre-built FFmpeg scaffolding with FFmpeg 4.4 from aptitude,
    and the segmentation fault did not happen. This indicates that it is a binary compatibility issue.
    
    Before #3460, each binary build job built FFmpeg 4.1.8 using the same compiler used to build torchaudio,
    but after #3460 the environments used to build FFmpeg 4.1.8 and torchaudio are different. My hypothesis is that
    this difference causes some ABI incompatibility when linking against FFmpeg 4.4. (Also, I don't remember well,
    but I read somewhere that 4.4 has a different ABI.)
    
    Experiments showed that upgrading the pre-built FFmpeg scaffolding to 4.4 resolves this,
    so this commit upgrades the pre-built FFmpeg 4 to 4.4.
    The potential (yet unconfirmed) downside is that torchaudio will no longer work with 4.1, 4.2, and 4.3.
    Since FFmpeg 4.4 is what Ubuntu 20.04 and 22.04 support by default, and Google Colab is also on 20.04,
    I think it is more important to support 4.4.
    
    Therefore, we drop support for 4.1-4.3 from the normal build (and official distributions). Those who wish to
    use 4.1-4.3 can build torchaudio from source, linking against a specific FFmpeg.
    
    Pull Request resolved: #3561
    
    Reviewed By: hwangjeff
    
    Differential Revision: D48340201
    
    Pulled By: mthrok
    
    fbshipit-source-id: 7ece82910f290c7cf83f58311c4cf6a384e8795e
    mthrok authored and facebook-github-bot committed Aug 15, 2023
    bf07ea6
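Since this change ties the official builds to FFmpeg 4.4, it can help to check which FFmpeg version is installed. The sketch below parses the first line printed by `ffmpeg -version`. It is a convenience check written for this note, not part of torchaudio. The helper names are hypothetical, and the regular expression assumes a release-style banner such as `ffmpeg version 4.4.2-...`, so development builds with other version strings are reported as unknown.

```python
import re
import shutil
import subprocess
from typing import Optional, Tuple

# Matches release-style banners, e.g. "ffmpeg version 4.4.2-0ubuntu0.22.04.1"
_VERSION_RE = re.compile(r"ffmpeg version n?(\d+)\.(\d+)")


def parse_ffmpeg_version(banner: str) -> Optional[Tuple[int, int]]:
    """Extract (major, minor) from the output of `ffmpeg -version`."""
    match = _VERSION_RE.search(banner)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def installed_ffmpeg_version() -> Optional[Tuple[int, int]]:
    """Return (major, minor) of the ffmpeg on PATH, or None if unavailable."""
    if shutil.which("ffmpeg") is None:
        return None
    result = subprocess.run(["ffmpeg", "-version"], capture_output=True, text=True)
    return parse_ffmpeg_version(result.stdout)
```

A result of `(4, 4)` corresponds to the version the pre-built scaffolding now targets. Results of `(4, 1)` through `(4, 3)` indicate the versions for which, per this commit, building torchaudio from source is the suggested route.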
  2. Use pytorch/manylinuxaarch64-builder:cpu-aarch64 docker image (#3560)

    Summary:
    Use pytorch/manylinuxaarch64-builder:cpu-aarch64
    
    Introduced in pytorch/builder#1472
    
    Pull Request resolved: #3560
    
    Reviewed By: mthrok
    
    Differential Revision: D48366572
    
    Pulled By: atalman
    
    fbshipit-source-id: 6de15f81abb09c737e6a1271226259483141e8f4
    atalman authored and facebook-github-bot committed Aug 15, 2023
    126f9f6

Commits on Aug 18, 2023

  1. Update README.md (#3567)

    Summary:
    Remove mention of backend and quick usage. Those are explained in the documentation in detail.
    
    Pull Request resolved: #3567
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48471832
    
    Pulled By: mthrok
    
    fbshipit-source-id: 467efc1f11f66534c33cf4751de27b08176c31bf
    mthrok authored and facebook-github-bot committed Aug 18, 2023
    1638efe

Commits on Aug 19, 2023

  1. Enable ROCm RNN-T Loss (#2485)

    Summary:
    Added HIPIFY code and small changes for ROCm. Targeting RNN-T loss.
    
    Pull Request resolved: #2485
    
    Reviewed By: huangruizhe
    
    Differential Revision: D43537864
    
    Pulled By: mthrok
    
    fbshipit-source-id: 4bdb1f291dc51a12232ccd072b97ae94ae20cc0c
    jpvillam-amd authored and facebook-github-bot committed Aug 19, 2023
    c593961

Commits on Aug 20, 2023

  1. Add detail about CTC peaky behavior (#3566)

    Summary: Pull Request resolved: #3566
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48499338
    
    Pulled By: mthrok
    
    fbshipit-source-id: 7f837e1a1f8116d7d82411607c91628b729077d8
    mthrok authored and facebook-github-bot committed Aug 20, 2023
    a25bcb6
  2. Fix style check CI job (#3564)

    Summary:
    It seems that the default Python version was updated to 3.11.
    libcst does not have a binary release for 3.11, so the CI attempts to
    build it from source, which fails because building libcst requires a Rust
    compiler.
    
    This commit pins the Python version of the style check job to 3.10 so that
    the issue with the Rust compiler is avoided.
    
    Pull Request resolved: #3564
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48499560
    
    Pulled By: mthrok
    
    fbshipit-source-id: 53ab77268d8143f4946d92e8cd1f96aea55e7b72
    mthrok authored and facebook-github-bot committed Aug 20, 2023
    a5da0a2
  3. Fix I/O test (#3568)

    Summary:
    It turned out that FFmpeg 5 installed via conda reports a video frame rate of -1. FFmpeg 4 and 6 are fine. This is either a regression in FFmpeg or in the underlying decoding library.
    
    Make the reference value adaptive.
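
One way to picture an adaptive reference value is a small helper that picks the expected frame rate from the FFmpeg major version. This is only a sketch under the assumption that FFmpeg 5 alone reports -1; `expected_frame_rate` and its parameters are hypothetical names, not taken from the torchaudio test suite.

```python
def expected_frame_rate(nominal_rate: float, ffmpeg_major_version: int) -> float:
    """Return the frame rate a test should expect for a given FFmpeg major version."""
    # Assumption: only FFmpeg 5 reports -1; versions 4 and 6 report the true rate.
    if ffmpeg_major_version == 5:
        return -1.0
    return nominal_rate
```

A test could then compare the reported frame rate against `expected_frame_rate(...)` instead of a hard-coded constant.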
    
    Pull Request resolved: #3568
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48499621
    
    Pulled By: mthrok
    
    fbshipit-source-id: fb64187bcf0dc57b753cb6c05f04d436238f5c51
    mthrok authored and facebook-github-bot committed Aug 20, 2023
    0688863

Commits on Aug 21, 2023

  1. Fix style (#3569)

    Summary: Pull Request resolved: #3569
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48508244
    
    Pulled By: mthrok
    
    fbshipit-source-id: 6e14267e2dbdf08ea3c25a1dab480cb0e908e0c3
    mthrok authored and facebook-github-bot committed Aug 21, 2023
    3318bce
  2. Use FFmpeg6 in unittest (#3570)

    Summary: Pull Request resolved: #3570
    
    Reviewed By: huangruizhe
    
    Differential Revision: D48518568
    
    Pulled By: mthrok
    
    fbshipit-source-id: 0fdfb8b3988789c7ded0fb336824034bedf6a394
    mthrok authored and facebook-github-bot committed Aug 21, 2023
    9d11563

Commits on Aug 23, 2023

  1. update CUDA to 12.1 U1 (#3563)

    Summary:
    Follow-up of: pytorch/builder#1485
    
    CC atalman
    
    Pull Request resolved: #3563
    
    Reviewed By: kit1980
    
    Differential Revision: D48610200
    
    Pulled By: atalman
    
    fbshipit-source-id: 61c9981da5a343a3cbce97b0a77ab91f37560087
    ptrblck authored and facebook-github-bot committed Aug 23, 2023
    47eaab4

Commits on Aug 29, 2023

  1. Remove random print statement (#3577)

    Summary: Pull Request resolved: #3577
    
    Reviewed By: atalman
    
    Differential Revision: D48763580
    
    Pulled By: mthrok
    
    fbshipit-source-id: 6ab155a5dd4cf11b2a58f26ced369107f0a2f08f
    mthrok authored and facebook-github-bot committed Aug 29, 2023
    5ee254e
  2. Separate Test Token for Conda Uploads (#3582)

    Summary:
    We will use a separate token for uploading test binaries (instead of reusing the nightly token). This PR adds that token to the caller workflow.
    
    Pull Request resolved: #3582
    
    Reviewed By: atalman
    
    Differential Revision: D48803009
    
    Pulled By: osalpekar
    
    fbshipit-source-id: c2af57f6946da51a7b56c975614e60f243e3f6fb
    osalpekar authored and facebook-github-bot committed Aug 29, 2023
    6fb6854

Commits on Aug 30, 2023

  1. Bump Version 2.1 -> 2.2 (#3585)

    Summary: Pull Request resolved: #3585
    
    Reviewed By: atalman
    
    Differential Revision: D48832146
    
    Pulled By: osalpekar
    
    fbshipit-source-id: c36dd79a0c0f407d490a488902207c8066c01383
    osalpekar authored and facebook-github-bot committed Aug 30, 2023
    bf77b2a
  2. Revert "Enable ROCm RNN-T Loss (#2485)" (#3586)

    Summary:
    This reverts commit c593961.
    
    Unblock 2.1.0 rc
    
    Pull Request resolved: #3586
    
    Reviewed By: osalpekar
    
    Differential Revision: D48842032
    
    Pulled By: atalman
    
    fbshipit-source-id: bbdf9e45c9aa5fde00f315a2ff491ed050bc1707
    atalman authored and facebook-github-bot committed Aug 30, 2023
    5cf7d2d

Commits on Sep 1, 2023

  1. Automate Release-Only Changes (#3590)

    Summary:
    First Pass for automating release-only changes.
    
    Tested this in audio to ensure it works.
    
    Pull Request resolved: #3590
    
    Reviewed By: huydhn
    
    Differential Revision: D48921925
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 2920e4d984ffb0c1957dae6fb6bab10344e65ff7
    osalpekar authored and facebook-github-bot committed Sep 1, 2023
    af79867

Commits on Sep 2, 2023

  1. Moving to More Efficient Windows GPU Runner (#3587)

    Summary: Pull Request resolved: #3587
    
    Reviewed By: huydhn
    
    Differential Revision: D48847224
    
    Pulled By: osalpekar
    
    fbshipit-source-id: 15ccae56f0e5e8e366b5feb66fdb42bb5cba97e1
    osalpekar authored and facebook-github-bot committed Sep 2, 2023
    e057d7d

Commits on Sep 4, 2023

  1. Fix doc link (#3593)

    Summary: Pull Request resolved: #3593
    
    Reviewed By: nateanl
    
    Differential Revision: D48933041
    
    Pulled By: mthrok
    
    fbshipit-source-id: cd05d3cf5006206ba441fdc05548bcd922ce0598
    mthrok authored and facebook-github-bot committed Sep 4, 2023
    3e7e696
  2. Add incremental decoding support to CTC decoder (#3594)

    Summary:
    Add incremental decoding support to CTC decoder.
    
    Resolves #3574
    
    Pull Request resolved: #3594
    
    Reviewed By: nateanl
    
    Differential Revision: D48940584
    
    Pulled By: mthrok
    
    fbshipit-source-id: 31871614008cf197cf3900f7183ec6cff34d2905
    mthrok authored and facebook-github-bot committed Sep 4, 2023
    6fbc1e6
  3. Fix decoder call in Device ASR/AVSR tutorials (#3572)

    Summary:
    Fixes decoder calls and related code in Device ASR/AVSR tutorials to account for changes to RNN-T decoder introduced in #3295.
    
    Pull Request resolved: #3572
    
    Reviewed By: mthrok
    
    Differential Revision: D48629428
    
    Pulled By: hwangjeff
    
    fbshipit-source-id: 63ede307fb4412aa28f88972d56dca8405607b7a
    hwangjeff authored and facebook-github-bot committed Sep 4, 2023
    7d37f69
  4. [BC-Breaking] Remove legacy global backend switch (#3559)

    Summary:
    This PR removes the legacy backend switch mechanism.
    The implementation itself is still available.
    
    Merge after v2.1 release
    
    Pull Request resolved: #3559
    
    Reviewed By: nateanl
    
    Differential Revision: D48353764
    
    Pulled By: mthrok
    
    fbshipit-source-id: 4d3924dbe6f334ecebe2b12fcd4591c61c4aa656
    mthrok authored and facebook-github-bot committed Sep 4, 2023
    454418d

Commits on Sep 5, 2023

  1. Fix backward compatibility layer in backend module (#3595)

    Summary:
    The PR #3549 re-organized the backend implementations and deprecated the direct access to torchaudio.backend.
    
    The change was supposed to be BC-compatible while issuing a warning to users, but the implementation of module-level `__getattr__` was not quite right.
    
    See an issue pyannote/pyannote-audio#1456.
    
    This commit fixes it so that the following imports work:
    
    ```python
    from torchaudio.backend.common import AudioMetaData
    
    from torchaudio.backend import sox_io_backend
    from torchaudio.backend.sox_io_backend import save, load, info
    
    from torchaudio.backend import no_backend
    from torchaudio.backend.no_backend import save, load, info
    
    from torchaudio.backend import soundfile_backend
    from torchaudio.backend.soundfile_backend import save, load, info
    ```
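
The mechanism mentioned above is the module-level `__getattr__` hook from PEP 562. The following is a minimal sketch of how such a backward-compatibility shim can behave. It is not torchaudio's actual implementation; `make_deprecated_module`, `legacy_backend`, and the stand-in `load` function are hypothetical names used only for illustration.

```python
import types
import warnings


def make_deprecated_module(name, attributes):
    """Build a module whose attributes resolve lazily and emit a warning.

    Hypothetical helper for illustration only.
    """
    module = types.ModuleType(name)

    def __getattr__(attr):
        if attr in attributes:
            warnings.warn(
                f"{name}.{attr} is deprecated and will be removed.",
                DeprecationWarning,
                stacklevel=2,
            )
            return attributes[attr]
        # Raising AttributeError keeps hasattr() and `from ... import ...`
        # behaving normally for names that do not exist.
        raise AttributeError(f"module {name!r} has no attribute {attr!r}")

    # PEP 562: when normal lookup fails, Python calls the `__getattr__`
    # found in the module's own namespace.
    module.__getattr__ = __getattr__
    return module


legacy_backend = make_deprecated_module(
    "legacy_backend", {"load": lambda path: f"loaded {path}"}
)

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = legacy_backend.load("a.wav")
```

The important detail in a shim like this is the fallthrough: a hook that returns `None` or raises a different exception for unknown names can break `hasattr` checks and `from ... import ...` statements in downstream code.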
    
    Pull Request resolved: #3595
    
    Reviewed By: nateanl
    
    Differential Revision: D48957446
    
    Pulled By: mthrok
    
    fbshipit-source-id: ebb256461dd3032025fd27d0455ce980888f7778
    mthrok authored and facebook-github-bot committed Sep 5, 2023
    931598c
  2. Update FFmpeg arm64 CI (#3578)

    Summary: Pull Request resolved: #3578
    
    Reviewed By: nateanl
    
    Differential Revision: D48971591
    
    Pulled By: mthrok
    
    fbshipit-source-id: a76cf8f1287c076d7b6919d901eda0ce832bdb7d
    mthrok authored and facebook-github-bot committed Sep 5, 2023
    ede4309

Commits on Sep 7, 2023

  1. e756b23
  2. Add CODEOWNERS (#3571)

    mthrok authored Sep 7, 2023
    3e1d8f3

Commits on Sep 8, 2023

  1. Simplify training step in av-asr recipe (#3598)

    * Simplify training step in av-asr recipe
    
    * Run pre-commit
    Pingchuan Ma authored Sep 8, 2023
    5e893d6

Commits on Sep 13, 2023

  1. b4c9aa5
  2. Update README.md (#3609)

    mthrok authored Sep 13, 2023
    4bbf65e

Commits on Sep 19, 2023

  1. Fix nightly doc CI (#3611)

    A change in matplotlib 3.8.0 makes it reject a torch.Tensor passed to the `plot` function.
    mthrok authored Sep 19, 2023
    ac63c45
  2. 402939e
  3. Add wall implementation for RIR ray tracing (#3612)

    Extracted from #3604
    
    Add Wall helper class and C++ unit test
    mthrok authored Sep 19, 2023
    94aafd8

Commits on Sep 20, 2023

  1. replace avsr model used in the tutorial (#3602)

    * replace model used in the tutorial
    
    * Upload a TorchScript model; remove model components
    
    * Update download url
    
    * Switch from download_url_to_file to download_asset
    Pingchuan Ma authored Sep 20, 2023
    0ff6d26

Commits on Sep 21, 2023

  1. Optimize merge_tokens method (#3615)

    Optimizes merge_tokens method as discussed in #3614 
    
    Co-authored-by: Eyal Cohen <[email protected]>
    eyalcohen308 and Eyal Cohen authored Sep 21, 2023
    0a27bb0

Commits on Sep 23, 2023

  1. Refactor PRA test (#3616)

    mthrok authored Sep 23, 2023
    f1a9f79

Commits on Sep 26, 2023

  1. 71104e8
  2. Disable Windows CPP test (#3613)

    The test executable is not found. Need to investigate more later.
    mthrok authored Sep 26, 2023
    4dc06ce

Commits on Sep 28, 2023

  1. acf1c7b

Commits on Sep 29, 2023

  1. Fix typo (#3627)

    orena1 authored Sep 29, 2023
    d07aebf
  2. Remove Python2 style object inheritance (#3597)

    Summary: Python3 makes the use of `(object)` in class inheritance unnecessary. Let's modernize our code by eliminating this.
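
As a brief illustration of why the explicit base is redundant in Python 3, the two class definitions below end up with the same base class. The class names are made up for this example and do not come from the torchaudio code base.

```python
class LegacyStyle(object):  # Python 2 style: explicit `object` base
    pass


class ModernStyle:  # Python 3 style: `object` is the implicit base
    pass


# Both classes have `object` as their only base class.
same_bases = LegacyStyle.__bases__ == ModernStyle.__bases__ == (object,)
```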
    
    Reviewed By: aaronenyeshi
    
    Differential Revision: D48957872
    r-barnes authored Sep 29, 2023
    b7791ea

Commits on Oct 2, 2023

  1. ff266b1
  2. 92ded61

Commits on Oct 3, 2023

  1. change modality naming inconsistency (visual & video) (#3631)

    * Update transforms.py
    
    * Update train.py
    orena1 authored Oct 3, 2023
    47f502a
  2. Remove 2d impl from ray tracing (#3633)

    * Fix some issues
    * Remove 2D implementation from ray tracing
    
    We only add 3D RIR.
    mthrok authored Oct 3, 2023
    d9942ba

Commits on Oct 5, 2023

  1. Add automated doc flow for release (#3634)

    Currently, nightly documentation is pushed automatically for each commit and on a daily basis.
    
    For releases, automating it is tricky because we need to ensure that the version number is properly updated
    (removing the alpha suffix at branch cut time and updating the minor version at a minor release).
    
    For this reason, in release/2.1 we used a manual trigger to build and deploy the docs for the release.
    This commit backports it to main.
    
    For each release, once the version number is updated, running the build_doc job from GHA will deploy the documentation automatically.
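
The two version updates described above can be sketched as follows. This assumes version strings of the form `2.1.0a0`, which is an assumption rather than something stated in the commit; `strip_alpha_suffix` and `bump_minor` are hypothetical helpers and not part of the torchaudio release tooling.

```python
import re

# Assumed format: MAJOR.MINOR.PATCH with an optional alpha suffix such as "a0".
_VERSION = re.compile(r"^(\d+)\.(\d+)\.(\d+)(a\d+)?$")


def _parse(version: str):
    match = _VERSION.match(version)
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    return match.groups()


def strip_alpha_suffix(version: str) -> str:
    """Drop the alpha suffix, as done at branch cut time."""
    major, minor, patch, _ = _parse(version)
    return f"{major}.{minor}.{patch}"


def bump_minor(version: str) -> str:
    """Increase the minor version and reset the patch, keeping any alpha suffix."""
    major, minor, _, alpha = _parse(version)
    return f"{major}.{int(minor) + 1}.0{alpha or ''}"
```

Under these assumptions, the release branch would carry `strip_alpha_suffix("2.1.0a0")` while the main branch moves on to `bump_minor("2.1.0a0")`.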
    mthrok authored Oct 5, 2023
    df44d30
  2. e3b11a8

Commits on Oct 9, 2023

  1. Migrate to src-layout

    Differential Revision: D49965263
    
    Pull Request resolved: #3639
    moto-meta authored Oct 9, 2023
    ec13a81
  2. Add bytes support to StreamReader (#3642)

    Addresses #3640
    mthrok authored Oct 9, 2023
    2994ce2
  3. Fix breadcrumbs for v2.1

    Differential Revision: D50036850
    
    Pull Request resolved: #3637
    carljparker authored Oct 9, 2023
    a8bb397

Commits on Oct 11, 2023

  1. Move libtorchaudio_ffmpeg to dedicated directory

    Differential Revision: D50082877
    
    Pull Request resolved: #3646
    moto-meta authored Oct 11, 2023
    2836a23
  2. Update gitignore (#3649)

    mthrok authored Oct 11, 2023
    03d1270

Commits on Oct 12, 2023

  1. Move libtorchaudio to dedicated directory

    Differential Revision: D50086556
    
    Pull Request resolved: #3648
    moto-meta authored Oct 12, 2023
    e65e472
  2. Remove FFmpeg compat load/info function

    Differential Revision: D50229857
    
    Pull Request resolved: #3652
    moto-meta authored Oct 12, 2023
    57f7f52
  3. Resolve lint issues

    Differential Revision: D50205775
    
    Pull Request resolved: #3651
    moto-meta authored Oct 12, 2023
    d947dee
  4. Simplify the logic to initialize FFmpeg

    Differential Revision: D50193749
    
    Pull Request resolved: #3650
    moto-meta authored Oct 12, 2023
    f62367a

Commits on Oct 13, 2023

  1. Simplify the logic to initialize sox

    Differential Revision: D50197331
    
    Pull Request resolved: #3654
    moto-meta authored Oct 13, 2023
    dde08ba
  2. Add Ray Tracing (#3604) (#2850) (#3655)

    Summary:
    Revamped version of #3234
    (which was also a revamp of #2850)
    mthrok authored Oct 13, 2023
    fa78fb6
  3. 0b6ca28

Commits on Oct 17, 2023

  1. Fix wavlm bundles (#3660)

    The `encoder_layer_norm_first` should be set to True for the Large model of WavLM.
    Address #3347
    nateanl authored Oct 17, 2023
    671261c

Commits on Oct 19, 2023

  1. Support compression level in i/o dispatcher backend

    Differential Revision: D50367721
    
    Pull Request resolved: #3662
    hwangjeff authored Oct 19, 2023
    7ea108f
  2. Do not autoremove prototype in release branch (#3663)

    This way, the release engineering team does not have to manually remove prototype-related things
    in each release.
    mthrok authored Oct 19, 2023
    d00df95
  3. b7b61c2

Commits on Oct 24, 2023

  1. Change namespace to torio

    Differential Revision: D50506299
    
    Pull Request resolved: #3669
    moto-meta authored Oct 24, 2023
    a78ba38
  2. Fix cuctc module

    Differential Revision: D50602496
    
    Pull Request resolved: #3670
    moto-meta authored Oct 24, 2023
    8342795
  3. Update C++ API doc (#3671)

    mthrok authored Oct 24, 2023
    1caa3fc
  4. 7d6e486

Commits on Oct 25, 2023

  1. Prep for restructure (#3676)

    Add torio top-level directory. It's not part of the package yet.
    mthrok authored Oct 25, 2023
    7c988b4
  2. Update library/extension name

    Differential Revision: D50633306
    
    Pull Request resolved: #3675
    moto-meta authored Oct 25, 2023
    478a852

Commits on Oct 26, 2023

  1. Fix doc (#3678)

    mthrok authored Oct 26, 2023
    3ff5e8c
  2. Fix doc on FA (#3679)

    mthrok authored Oct 26, 2023
    2a0f4c0
  3. Swap decoder/encoder implementation

    Differential Revision: D50677606
    
    Pull Request resolved: #3681
    moto-meta authored Oct 26, 2023
    36f5010
  4. Remove unused files (#3680)

    mthrok authored Oct 26, 2023
    0988724
  5. Update StreamReader/Writer name

    Differential Revision: D50696105
    
    Pull Request resolved: #3682
    moto-meta authored Oct 26, 2023
    fcf3894
  6. Fix doc config (#3683)

    mthrok authored Oct 26, 2023
    6e26515

Commits on Oct 30, 2023

  1. Fix backend doc (#3686)

    mthrok authored Oct 30, 2023
    90186bc

Commits on Oct 31, 2023

  1. Update CITATION (#3687)

    mthrok authored Oct 31, 2023
    c5b6933

Commits on Nov 9, 2023

  1. d4cf8d5
  2. Fix vad to return zero output for zero input (#3685)

    When the input is a zero Tensor, the result should be empty.
    wasd96040501 authored Nov 9, 2023
    df2a6bc
  3. Make pytest run ignoring the ones with collection error (#3692)

    PyTorch Lightning is having issues with the nightly PyTorch.
    Let the other tests still run.
    mthrok authored Nov 9, 2023
    4c85f81
  4. Remove global audio backend from test (#3691)

    The global audio backend has been removed, so this is a no-op.
    mthrok authored Nov 9, 2023
    a59e9ab
  5. Fix release doc deployment (#3693)

    Backport from the release/2.1 branch.
    The source code needs to be git-fetched to get the version number dynamically.
    mthrok authored Nov 9, 2023
    65df10b

Commits on Nov 10, 2023

  1. 172260f
  2. ccd78ff

Commits on Dec 1, 2023

  1. 6518fa9

Commits on Jan 3, 2024

  1. Update doc (#3677)

    * Update doc
    
    * Update citation
    mthrok authored Jan 3, 2024
    258169e

Commits on Jan 7, 2024

  1. 639a4e9

Commits on Jan 8, 2024

  1. e3efbc2

Commits on Jan 12, 2024

  1. Remove AWS credentials on workflows (#3723) (#3723)

    Summary:
    This is not needed anymore after pytorch/test-infra#4865.
    
    
    Reviewed By: malfet, jeanschmidt, clee2000, NicolasHug
    
    Differential Revision: D52735187
    
    Pulled By: huydhn
    huydhn authored Jan 12, 2024
    b2d9c3e

Commits on Jan 25, 2024

  1. Doc: references for lfilter implementation (#3734)

    * add golf and dynonet paper
    
    * doc: add references
    
    * add EOF
    
    * fix: line too long
    
    * remove line end space
    
    * remove indentation
    
    Co-authored-by: moto <[email protected]>
    
    yoyololicon and mthrok authored Jan 25, 2024
    d5a9433

Commits on Jan 30, 2024

  1. 02586da

Commits on Feb 9, 2024

  1. Migrate the macOS runners label from macos-m1-12 to macos-m1-stable

    Differential Revision: D53606067
    
    Pull Request resolved: #3740
    DanilBaibak authored Feb 9, 2024
    5286f9f

Commits on Feb 16, 2024

  1. Fix document for forced_align method (#3748)

    The lengths of targets and log_probs should be reversed.
    nateanl authored Feb 16, 2024
    b7b7b5d

Commits on Feb 27, 2024

  1. Remove macos x86 nightly builds

    Differential Revision: D54263224
    
    Pull Request resolved: #3751
    atalman authored Feb 27, 2024
    87aeb55

Commits on Mar 18, 2024

  1. 1bc1479
  2. Update tacotron2_pipeline_tutorial.py (#3759)

    * Update tacotron2_pipeline_tutorial.py
    
    - Fixed typo
    - Clarified what was being done in different sections
    mikeboensel authored Mar 18, 2024
    17a7081

Commits on Mar 24, 2024

  1. Fix windows CI CPU and GPU tests by unpinning mkl version and installing pybind11 (#3766)
    
    Unpin mkl version and install pybind11 to get the windows CI working again
    
    This fixes #3767
    ahmadsharif1 authored Mar 24, 2024
    ea437b3

Commits on May 14, 2024

  1. [codemod] c10::optional -> std::optional in pytorch/audio/src/libtorio/ffmpeg/stream_reader/stream_processor.h +20
    
    Differential Revision: D57294285
    
    Pull Request resolved: #3792
    r-barnes authored May 14, 2024
    b4407e0
  2. [codemod] c10::optional -> std::optional in pyspeech/experimental/csrc/decoders/TransducerDecoder.h +20
    
    Differential Revision: D57294284
    
    Pull Request resolved: #3793
    r-barnes authored May 14, 2024
    9f10306
  3. [codemod] c10::optional -> std::optional in pytorch/audio/src/libtorchaudio/sox/effects.cpp +20
    
    Differential Revision: D57294298
    
    Pull Request resolved: #3791
    r-barnes authored May 14, 2024
    1980f8a

Commits on Jun 6, 2024

  1. b829e93

Commits on Jun 24, 2024

  1. 7f6209b

Commits on Jul 3, 2024

  1. Fix model downloading in bento (#3803)

    Summary:
    Pull Request resolved: #3803
    
    The model checkpoint path cannot be created for Squim models. Use the latest download_asset method to fix it.
    
    Reviewed By: moto-meta
    
    Differential Revision: D59061348
    nateanl authored Jul 3, 2024
    69b2a0a

Commits on Jul 25, 2024

  1. b62dc24