Use the custom eos/bos tokens passed to the bytelevel tokenizer
Signed-off-by: John St John <[email protected]>
jstjohn committed Feb 27, 2025
1 parent 8ca01eb commit 44d08b5
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions nemo/collections/common/tokenizers/bytelevel_tokenizers.py
@@ -167,21 +167,21 @@ def pad_id(self):
         """
         Get the padding ID.
         """
-        return 256
+        return self._pad_id

     @property
     def bos_id(self):
         """
         Get the beginning-of-sequence ID.
         """
-        return 257
+        return self._bos_id

     @property
     def eos_id(self):
         """
         Get the end-of-sequence ID.
         """
-        return 258
+        return self._eos_id

     @property
     def unk_id(self):
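
The effect of the change can be illustrated with a minimal sketch. It assumes a simplified stand-in class: `ByteTokenizerSketch`, its constructor parameters, and `text_to_ids_with_specials` are hypothetical names for illustration and are not NeMo's actual `ByteLevelTokenizer` signature. The point is only that the properties return the IDs stored at construction time rather than the constants 256, 257, and 258.

```python
from typing import List


class ByteTokenizerSketch:
    """Hypothetical stand-in for a byte-level tokenizer (not NeMo's class)."""

    def __init__(self, pad_id: int = 256, bos_id: int = 257, eos_id: int = 258):
        # Assumption: the defaults mirror the constants removed by the commit.
        self._pad_id = pad_id
        self._bos_id = bos_id
        self._eos_id = eos_id

    @property
    def pad_id(self) -> int:
        # Return the stored value instead of a hardcoded 256.
        return self._pad_id

    @property
    def bos_id(self) -> int:
        # Return the stored value instead of a hardcoded 257.
        return self._bos_id

    @property
    def eos_id(self) -> int:
        # Return the stored value instead of a hardcoded 258.
        return self._eos_id

    def text_to_ids_with_specials(self, text: str) -> List[int]:
        # Encode to UTF-8 bytes and wrap with the configured BOS/EOS IDs.
        return [self.bos_id, *text.encode("utf-8"), self.eos_id]


# Custom IDs passed by the caller are now what the properties report.
custom = ByteTokenizerSketch(pad_id=1, bos_id=2, eos_id=0)
ids = custom.text_to_ids_with_specials("hi")
```

With hardcoded return values, a caller passing custom IDs would still see 257 and 258 from the properties, so any code that pads or terminates sequences through `bos_id`/`eos_id` would silently disagree with the configuration.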