Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset output formatting and bs4 method implementation #8

Open
navan0 opened this issue Oct 1, 2020 · 2 comments
Open

Dataset output formatting and bs4 method implementation #8

navan0 opened this issue Oct 1, 2020 · 2 comments

Comments

@navan0
Copy link
Collaborator

navan0 commented Oct 1, 2020

1.Dataset output formatting to CSV with question , answer,source_verified,manual_verified,url,final_answer
2.bs4 method implementation of URLs and other informations
3.crons

@blacksmithop
Copy link
Contributor

blacksmithop commented Oct 2, 2020

Current progress:
image
Managed to collect question , source_verified (accepted question), url (share link for answer) and the answer itself.
Currently answer is basically a collection of code blocks from the accepted answer (only one considered at present) without the comments/ explanations it might have.
Possible Changes: Include Comments for better readability
Changes

@navan0
Copy link
Collaborator Author

navan0 commented Oct 2, 2020

looks good, including comment is not necessary. because comments might be messy and not fit for the dataset. for the final dataset, we have to manually go through all the data points. so we can refer the thread with url

I think you can make a PR after testing from your side. also, I'm assigned myself to do the dataset formatting and corn job. hope I can do it by tomorrow. then let's have a VC/discussion with the team

thanks

@GopikrishnanSasikumar

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants