"""
There are some slight differences in some of the tags/keys. Specifically, instead of “parent” (the name of the commenter being replied to, if there is one) we have “comment_reply_to_id”.
Can we standardize that going forward? I actually like “comment_reply_to_id” better, if we’re choosing, but I need those fields to be named the same one way or another so we’re not re-writing the script for each new source. For reference, the fields I’m pulling out of these right now are:
For each article:
‘title’, ‘author’, ‘parsed_comments’, and ‘fulltext’
For each comment:
‘commenter_id’, ‘comment_reply_to_id’, ‘comment’, and ‘upvotes’
And of course, if we run into a source where we don’t have one or more of the above fields, definitely let me know.
"""
From @gallupBenRyan
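
One way the standardization requested above could be handled on the ingest side is sketched here. This is only an illustration, not the existing script: the names `KEY_ALIASES`, `normalize_comment`, `normalize_article`, and `missing_fields` are hypothetical, and the sketch assumes each article arrives as a dict whose ‘parsed_comments’ value is a list of comment dicts. It also assumes the value stored under “parent” can be carried over unchanged to “comment_reply_to_id”.

```python
# Field names taken from the message above.
ARTICLE_FIELDS = ("title", "author", "parsed_comments", "fulltext")
COMMENT_FIELDS = ("commenter_id", "comment_reply_to_id", "comment", "upvotes")

# Hypothetical alias table: source-specific key -> standardized key.
KEY_ALIASES = {"parent": "comment_reply_to_id"}


def normalize_comment(raw):
    """Return a copy of one comment dict with aliased keys renamed."""
    return {KEY_ALIASES.get(key, key): value for key, value in raw.items()}


def normalize_article(raw):
    """Return a copy of one article dict with every comment normalized."""
    article = dict(raw)
    if "parsed_comments" in article:
        article["parsed_comments"] = [
            normalize_comment(comment) for comment in article["parsed_comments"]
        ]
    return article


def missing_fields(record, required):
    """List the required field names that are absent from a record."""
    return [name for name in required if name not in record]
```

Running `missing_fields(article, ARTICLE_FIELDS)` and `missing_fields(comment, COMMENT_FIELDS)` after normalization would surface any source that lacks one of the expected fields, so it can be flagged rather than silently skipped.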