Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
52 views

I'm using fine-tuned T5 model for performing spell checks in my dataset of consisting of reviews. However, I'm facing an issue where the model when performing spell checks does not give entire string ...
Anurag Pandey's user avatar
0 votes
1 answer
35 views

After receiving question, my program calls the run_predict function then finds the best paragraph match with the question. After that, my model is constantly reloaded without knowing the reasons. from ...
Ngọc Minh's user avatar
0 votes
1 answer
134 views

I am trying to load and train a ConvAI Model, fairly new to the whole concept. Keep running into two main problems - one on my personal computer, one on a corporate machine. On my personal machine, it ...
sj6266's user avatar
  • 3
0 votes
1 answer
65 views

In this example, I am trying to use overrides as a Params object and I want it to be used as a list of strings. But I am not able to assign its value using the below code. class _AB(Params): ...
300's user avatar
  • 323
-1 votes
2 answers
361 views

wanted to try CUDA (I have an RTX 3070 TI) on my Windows setup, using this code: import pandas as pd from simpletransformers.classification import ClassificationModel from sklearn.model_selection ...
Stefan's user avatar
  • 37
1 vote
0 answers
443 views

I am using the simple transformers library, I have just finished training a model and now I want to load it to try making some predictions. However, I must be doing something wrong because it keeps ...
kelvin's user avatar
  • 163
3 votes
0 answers
977 views

I want to use .generate() functionality of hugging face in my model's predictions. My model is a custom model inehriting from "TFPreTrainedModel" class and has a custom transformer ...
Avani Gupta's user avatar
1 vote
0 answers
40 views

I have a problem while predicting the labels in a token sequence with a SimpleTransformers model trained previously. When I predict the labels in a sequence, the model omits lots of tokens, depending ...
David Muñoz's user avatar
-2 votes
1 answer
173 views

from simpletransformers.classification import ClassificationModel import torch Bert_model = ClassificationModel( "bert", "bert_model") a=Bert_model.predict(["kötü"]) ...
Gokhan Basaran's user avatar
-4 votes
1 answer
213 views

I am trying to install simpletransformers via pip, but it gives me the following error: "ERROR: Exception: Traceback (most recent call last): File "/sw/arch/Debian10/EB_production/2021/...
maarten van hove's user avatar
0 votes
1 answer
658 views

so i was trying to train a chatbot using transformers for my ai assistant , i thought simpletransformer package in python would help me speed up alot of my tasks . I soon gathered a good dataset over ...
Lucid's user avatar
  • 144
2 votes
1 answer
754 views

I have a simple transformers script looking like this. from simpletransformers.seq2seq import Seq2SeqModel, Seq2SeqArgs args = Seq2SeqArgs() args.num_train_epoch=5 model = Seq2SeqModel( "...
DevDog's user avatar
  • 121
0 votes
0 answers
721 views

import logging from simpletransformers.question_answering import QuestionAnsweringModel, QuestionAnsweringArgs i run the above piece of code and get this error > ModuleNotFoundError ...
Saad Mohammed's user avatar
1 vote
1 answer
396 views

My data has 1751 sentences however when training a number appears under the epochs bars. Sometimes it is 1751 which makes sense it's the number of sentences I have, but most of the times it's 50% the ...
IS92's user avatar
  • 730
0 votes
1 answer
242 views

I have a fine tuned simple transformer representation model. Now I want to save the weights of only the pool layer in pickle format and put it in the pool layer of another custom auto encoder that I ...
Parmida Granfar's user avatar
1 vote
0 answers
181 views

so I am using simpletransformers on my classification task, I have 2 columns in my Dataframe (sentence, label) of size 3890 When I train the model it shows that it is only using 8 samples from my ...
IS92's user avatar
  • 730
0 votes
1 answer
1k views

mt5 fine-tuning does not use gpu(volatile gpu utill 0%) Hi, im trying to fine tuning for ko-en translation with mt5-base model. I think the Cuda setting was done correctly(cuda available is True) But ...
Shanha_Kim's user avatar
2 votes
0 answers
1k views

Training MBART in Seq2Seq with SimpleTransformers but getting an error I am not seeing with BART: TypeError: shift_tokens_right() missing 1 required positional argument: 'decoder_start_token_id' So ...
LeOverflow's user avatar
0 votes
1 answer
2k views

Hell all, so let try to describe my problem in the best light possible. I have experience with Python but have never used FME. I would like to delete a full row of data based on criteria in a ...
Hondabear's user avatar
4 votes
2 answers
13k views

I'm currently working on a seminar paper on nlp, summarization of sourcecode function documentation. I've therefore created my own dataset with ca. 64000 samples (37453 is the size of the training ...
ThyR4n's user avatar
  • 105
1 vote
1 answer
126 views

I need to clasify some text in labels of emotions. I'm using Multi-Label Classification because the same text can contain more than one emotion, but I want to implement that some of them be disjoint ...
Andrea NR's user avatar
  • 1,817
1 vote
1 answer
718 views

I'm trying to use SimpleTransformers module for this competition. After successfully installing the package SimpleTransformers in kernel , i'm trying to import NERModel and NERArgs packages from ...
user3001161's user avatar
2 votes
0 answers
1k views

I encountered this error while running simpletransformers on Google Colab. I enabled h/w accelerator as GPU and ran the code. from simpletransformers.classification import ClassificationModel # Create ...
ahmedshahriar's user avatar
8 votes
1 answer
662 views

I am saving and loading a model using torch.save() and torch.load() commands. While loading a fine-tuned simple transformer model in Docker Container, I am facing this error which I am not able to ...
SK Singh's user avatar
  • 163
3 votes
1 answer
5k views

I trained a T5 transformer using simpletransformers library. Here is a code to get the predictions: pred_values = model.predict(input_values) However, it just returns the top or greedy prediction, ...
Ahmad's user avatar
  • 9,816