Reddit-dataset-samples

A sample dataset of 1001 Reddit posts

This sample contains over 1,000 Reddit post records, extracted using the Bright Data API.

Some of the data points that are included in the Reddit dataset:

post_id: Post ID
url: URL of the post
user_posted: Username of the post creator
title: Title of the post
description: Post text description
num_comments: Number of comments
date_posted: Post publication date
community_name: Name of the community
num_upvotes: Number of upvotes
photos: URLs of attached photos
videos: URLs of attached videos
tag: The name of the tag

The dataset includes more data points than those listed here.
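As a rough illustration of working with these fields, the sketch below reads CSV rows into dictionaries and converts the count columns to integers. It assumes the CSV header uses the field names listed above, which has not been checked against the file. `read_posts`, `NUMERIC_FIELDS`, and the in-memory sample rows are hypothetical names and made-up values used only for the example.

```python
import csv
import io

# Column names taken from the field list above; the real CSV may contain more.
NUMERIC_FIELDS = ("num_comments", "num_upvotes")


def read_posts(handle):
    """Yield one dict per post, converting count columns to int where possible."""
    for row in csv.DictReader(handle):
        for field in NUMERIC_FIELDS:
            raw = (row.get(field) or "").strip()
            # Missing or non-numeric values become None instead of raising.
            row[field] = int(raw) if raw.isdigit() else None
        yield row


# Tiny in-memory stand-in for the CSV file (made-up values, not real data).
SAMPLE = io.StringIO(
    "post_id,title,community_name,num_comments,num_upvotes\n"
    "abc123,Example title,python,12,340\n"
    "def456,Another title,datascience,,7\n"
)

posts = list(read_posts(SAMPLE))
print(posts[0]["title"], posts[0]["num_upvotes"])
```

To try this on the sample file in this repository, pass an open file instead of `SAMPLE`, for example `open("Reddit- Posts.csv", newline="", encoding="utf-8")`. The UTF-8 encoding is an assumption.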

This sample is a subset of the "Reddit posts" dataset, which includes more than 404K records.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Files can optionally be compressed as .gz.
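For the compressed line-delimited formats, a minimal sketch of a reader might look like the following. It assumes one JSON object per line and UTF-8 text, neither of which is confirmed by this README. `iter_json_lines` is a hypothetical helper, and the records are made up so the example runs without a real file.

```python
import gzip
import io
import json


def iter_json_lines(binary_handle):
    """Yield one parsed object per non-empty line of a gzip-compressed stream."""
    with gzip.open(binary_handle, mode="rt", encoding="utf-8") as text:
        for line in text:
            line = line.strip()
            if line:
                yield json.loads(line)


# Build a small compressed payload in memory (made-up records, not real data).
records = [
    {"post_id": "abc123", "num_upvotes": 340},
    {"post_id": "def456", "num_upvotes": 7},
]
payload = gzip.compress("\n".join(json.dumps(r) for r in records).encode("utf-8"))

decoded = list(iter_json_lines(io.BytesIO(payload)))
print(decoded)
```

Reading line by line keeps memory use flat, which matters more for the full dataset than for this sample.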

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud Storage, Google Cloud Pub/Sub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enrichment in addition to the extracted data points: Available on request.

Get the full Reddit dataset.

What are the Reddit dataset use cases?

1. Sentiment Analysis

Monitor consumer sentiment by analyzing online conversations on Reddit to track brand reputation and respond to customer feedback.

2. Trend Identification

Identify industry-related trends and topics on Reddit to inform marketing content and campaign development.
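One simple way to start on trend identification is to rank communities by total upvotes. The sketch below assumes rows are dictionaries keyed by the field names listed earlier, with `num_upvotes` already converted to an integer. `upvotes_by_community` is a hypothetical helper and the rows are made-up values, not records from the dataset.

```python
from collections import Counter

# Made-up rows using field names from the field list; not real data.
posts = [
    {"community_name": "python", "title": "Async tips", "num_upvotes": 120},
    {"community_name": "python", "title": "Typing tips", "num_upvotes": 80},
    {"community_name": "rust", "title": "Borrow checker", "num_upvotes": 150},
    {"community_name": "rust", "title": "No count here", "num_upvotes": None},
]


def upvotes_by_community(rows):
    """Sum upvotes per community, skipping rows without a numeric count."""
    totals = Counter()
    for row in rows:
        votes = row.get("num_upvotes")
        if isinstance(votes, int):
            totals[row["community_name"]] += votes
    return totals


ranking = upvotes_by_community(posts).most_common()
print(ranking)
```

Grouping the same totals by `date_posted` would show how interest changes over time, but that requires parsing the date format, which this README does not specify.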

3. Competitor Analysis

Enhance competitive intelligence by analyzing the Reddit activity of similar brands to uncover opportunities for improvement.

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.

