Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alea-institute
/
kl3m-tokenizer-003-8k

Fill-Mask
Transformers
English
tokenizer
legal
bpe
byte-pair-encoding
whitespace
kl3m
legal-domain
Model card Files Files and versions
xet
Community
kl3m-tokenizer-003-8k
539 kB
  • 1 contributor
History: 3 commits
alea-institute's picture
alea-institute
Upload KL3M whitespace tokenizer v5 (8K) - Update README
e62f45a verified about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • README.md
    7.77 kB
    Upload KL3M whitespace tokenizer v5 (8K) - Update README about 1 month ago
  • special_tokens_map.json
    189 Bytes
    Upload KL3M whitespace tokenizer v5 (8K) about 1 month ago
  • tokenizer.json
    528 kB
    Upload KL3M whitespace tokenizer v5 (8K) about 1 month ago
  • tokenizer_config.json
    1.55 kB
    Upload KL3M whitespace tokenizer v5 (8K) about 1 month ago