gpt

Here are 96 public repositories matching this topic...

transformers
patrickvonplaten
patrickvonplaten commented Dec 11, 2020

🚀 Feature request

Bart is a seq2seq model, but there might be applications where one would like to use only the pre-trained BartDecoder in an EncoderDecoder setting with a "long" encoder, such as:

from transformers import EncoderDecoderModel

model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "allenai/longformer-large-4096", "facebook/bart-large"
)

# fine-tune model ...

This is already p

transfer-learning-conv-ai
jb33k
jb33k commented Jun 4, 2019

I'm playing around with this wonderful code, but I'm running into a curious issue when I try to train the model with my own data.

I replicated the personachat_self_original.json file structure and added my own data. I deleted the dataset_cache_OpenAIGPTTokenizer file, but when I try to train, I get this error:

INFO:train.py:Pad inputs and convert to Tensor
Traceback (most recent call last):
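Errors like this usually mean the custom file does not quite match the expected layout. Below is a minimal sketch of the PersonaChat-style structure that personachat_self_original.json follows (field names and nesting are assumptions based on that dataset's published format; verify against the original file before relying on them):

```python
import json

# Sketch of the assumed personachat_self_original.json layout: top-level
# "train"/"valid" splits, each dialog carrying a persona and utterances.
dataset = {
    "train": [
        {
            # A few short sentences describing the speaker's persona.
            "personality": ["i like to ski.", "i have two dogs."],
            "utterances": [
                {
                    # Candidate replies; by convention the LAST candidate
                    # is the gold (correct) reply.
                    "candidates": ["i am not sure.", "hello, how are you?"],
                    # Alternating dialog history, most recent turn last.
                    "history": ["hi! how are you doing today?"],
                },
            ],
        },
    ],
    "valid": [],
}

# Round-trip through a file, mimicking what the training script loads.
with open("my_personachat.json", "w") as f:
    json.dump(dataset, f)

with open("my_personachat.json") as f:
    loaded = json.load(f)

print(sorted(loaded.keys()))  # ['train', 'valid']
```

Note that, as the poster did, any cached tokenized copy (the dataset_cache_* file) must be deleted after editing the JSON, since the script otherwise keeps loading the stale cache.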
