"Tokenizer Choice For LLM Training: Negligible or Crucial?"

Mehdi Ali et al. (2024)

Details and statistics

DOI: 10.18653/V1/2024.FINDINGS-NAACL.247

access: open

type: Conference or Workshop Paper

metadata version: 2024-09-11

dblp key: conf/naacl/AliFTRLLKEDBJWJAJSOWSKF24

persistent URL: https://dblp.org/rec/conf/naacl/AliFTRLLKEDBJWJAJSOWSKF24

Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr:
Tokenizer Choice For LLM Training: Negligible or Crucial? NAACL-HLT (Findings) 2024: 3907-3924