Skip to content

gareth-harman/subreddit_nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

subreddit_nlp

Python tool: Parse and examine NLP diagnostics from subreddit titles

This tool allows users to retrieve the 'n' posts from a given subreddit's post titles in order to examine NLP metrics. For example you can return a sentiment score for each title and plot the distribution of sentiment scores for all posts retrieved. The user can also return the 'p' most common words or nouns from the retrieved posts.

Arguments
-id

  • The client id required by PRAW's OAUTH2 client

-secret

  • The client secret required by PRAW's OAUTH2 client

-user_agent

  • The user agent id required by PRAW's OAUTH2 client

-username

  • Your reddit username

-subreddit

  • The subreddit you want to analyze
  • default: gradschool

-sort_by_top

  • If 0 it will retrieve posts by 'new'
  • default: retrieve posts by top of all time

-n_posts

  • The number of posts to retrieve
  • default: 100
  • Max = 1000

-en_path

  • Path the spacey NLP core library to create nlp objects

Required Python Libraries

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages