Decasing Transformers for Fun

2 stephantul 1 8/1/2025, 8:05:11 PM stephantul.github.io ↗

Comments (1)

stephantul · 13h ago
Hello!

I recently did some experiments in decasing models. I noticed our users not using casing at all, while the base models we use tend to be cased. So I investigated, and found a nice way to decase transformers tokenizers and make them lowercase internally, which I call decasing. I wrote a blog about it!