Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP



Rx Registration ID
R0 Hash MD5 (of R3): 884a5f6297216a3c5e883ae639b13721
R1 Registration number (in the domain editorialia.com at WordPress): dmeditorialiawp.31691
R2 Date-p-order (ddmmyyyypx): 27032022p1
R3 Cid (combined id R1+R2): dmeditorialiawp.3169127032022p1
R4 Resource official title: Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
R5 Publisher: arXiv.org
R6 Resource website (1) (#OpenAccess | #Openscience): arxiv.org/abs/2112.10508
R12 Authors (separated by commas): Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan
R14 Keyword (one selected from the labels applied to this entry): NLP
R17 Digital signature URL: Pending signature
