-
Notifications
You must be signed in to change notification settings - Fork 191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using My own dataset with csv #287
Comments
|
Is there any automated way to convert the data into this format? |
https://github.com/cdqa-suite/cdQA/blob/88a1ff2bb249f24edc427737ccb0b8f8959cf0b6/cdqa/scrapper/bs4_bnpp_newsroom.py |
I will try with this |
The convertors used for pdf does not read in my file, is there any format for the pdf file as well? |
Even I want to do the same. Kindly help on this. |
Hi, Unfortunately, our |
@aqsa27 how does your csv look like? Can you share the format or a sample here? |
@aqsa27 ,@fmikaelian - I would also like to have a look at the csv. Can you please share a sample ? Also once we build the csv , how do you train the model ? If I use the existing QAPipeline : cdqa_pipeline = QAPipeline(reader='./models/bert_qa_vCPU-sklearn.joblib') the results are not matching the questions asked while testing. Any pointers on training the model ? |
Hi, My dataset contains 4 columns, like question, answer, date and additional information. |
I create a new dataframe of my csv and use that to train my model The answer wit this method is not 100% accurate, but its a lot more relevant |
Hi, can you give us a example? about the format? not the one mentioned in the readme, a live example of a csv in the recommend format |
One of our official tutorials (found in our readme and our examples repository): https://colab.research.google.com/github/cdqa-suite/cdQA/blob/master/examples/tutorial-first-steps-cdqa.ipynb If you run this notebook the You can ignore the columns |
#345 can you fix my issue |
Hi, I am trying to build a cdqa with my customized dataset which is in CSV. Can you tell me what format should my dataset be?
and there is only a pdf converter for csv.
is there any way of converting my dataset into the acceptable cdqa dataframe?
The text was updated successfully, but these errors were encountered: