Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Torchtext undefined module when using gpt2bpe tokenizer #4013

Open
jhudsy opened this issue Jun 6, 2024 · 2 comments
Open

Torchtext undefined module when using gpt2bpe tokenizer #4013

jhudsy opened this issue Jun 6, 2024 · 2 comments
Labels
bug Something isn't working dependency llm Large Language Model related waiting for answer Further information is requested

Comments

@jhudsy
Copy link

jhudsy commented Jun 6, 2024

When trying to use the gpt2bpe tokenizer, I get an error that "module torchtext has no attribute util".

Fixed by adding

import torchtext.utils
import torchtext.transforms

to ludwig/utils/tokenizers.py

@alexsherstinsky
Copy link
Collaborator

@jhudsy Would you like to create a pull request with your fix? Thanks a lot.

@alexsherstinsky alexsherstinsky added the llm Large Language Model related label Sep 6, 2024
@mhabedank mhabedank assigned mhabedank and unassigned mhabedank Oct 20, 2024
@mhabedank mhabedank added bug Something isn't working dependency labels Oct 21, 2024
@mhabedank
Copy link
Collaborator

Hi @jhudsy could you try again with torchtext 0.17.2?

@mhabedank mhabedank added the waiting for answer Further information is requested label Oct 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working dependency llm Large Language Model related waiting for answer Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants