nlg

Description

While using tokenizers.create with the model and vocab file for a custom corpus, the code throws an error and is not able to generate the BERT vocab file

Error Message

ValueError: Mismatch vocabulary! All special tokens specified must be control tokens in the sentencepiece vocabulary.

To Reproduce

from gluonnlp.data import tokenizers
tokenizers.create('spm', model_p

nlg

Here are 167 public repositories matching this topic...

spro / practical-pytorch

dmlc / gluon-nlp

[Error Message] Improve error message in SentencepieceTokenizer when arguments are not expected.

Description

Error Message

To Reproduce

Use official MXNet batchify to implement the batchify functions

NMT Inference: Chunk overlength sequences and translate in sequence

Maluuba / nlg-eval

charlesXu86 / Chatbot_CN

MiuLab / TC-Bot

simplenlg / simplenlg

rodrigopivi / Chatito

patil-suraj / question_generation

santhoshkolloju / Abstractive-Summarization-With-Transfer-Learning

tokenmill / accelerated-text

wyu97 / KENLG-Reading

tokenmill / awesome-nlg

SimGus / Chatette

yongzhuo / nlg-yongzhuo

semiosis / pen.el

gyunggyung / NLP-Papers

CZWin32768 / XNLG

AMontgomerie / question_generator

agaralabs / transformer-drg-style-transfer

google / abstracttext

KaijuML / data-to-text-hierarchical

BSlience / xbot

MiuLab / DuaLUG

Eulring / Text-Generation-Papers

spro / nalgene

DrDub / php-nlgen

rajammanabrolu / C2PO

naver / gdc

SmartDataAnalytics / SemWeb2NL

majumderb / recipe-personalization

Improve this page

Add this topic to your repo