You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
During the BPE encoding, subword-nmt is generating the file {codes_file}. Can you please share this file? If it is not possible, can you share the training set for obtaining the {codes_file}?
I would like to use OpenVocabNLM with a certain dataset and compare my results with the ones obtained in your research.
Furthermore, I have another question. Do you run create_subtoken_data.py and non-ascii_sequences_to_unk.py before BPE encoding?
Thank you.
The text was updated successfully, but these errors were encountered:
Hello,
During the BPE encoding, subword-nmt is generating the file {codes_file}. Can you please share this file? If it is not possible, can you share the training set for obtaining the {codes_file}?
I would like to use OpenVocabNLM with a certain dataset and compare my results with the ones obtained in your research.
Furthermore, I have another question. Do you run create_subtoken_data.py and non-ascii_sequences_to_unk.py before BPE encoding?
Thank you.
The text was updated successfully, but these errors were encountered: