Hendrik van Antwerpen
Posts Publications Talks

Fast byte-pair encoding

12 December 2024

Together with my colleague Alexander Neubeck we wrote a blog post about a fast byte-pair encoding algorithm he developed. Read it here: So many tokens, so little time: Introducing a faster, more flexible byte-pair tokenizer.

Share on
  • Hendrik van Antwerpen
  • Subscribe via RSS
  • hendrik@van-antwerpen.net
  • hendrikvanantwerpen
  • hendrikvanantwerpen

Everything is either impossible or trivial.