Collect all reddit topics and comments for a specific subreddit

完了済み 投稿 7年前 着払い
完了済み 着払い

Hi,

Looking for someone to write a piece of software that will fetch all reddit posts and comments for a subreddit from the year 2015. Please take the reddit API limits into account and find a workaround for that.

Return format for each topic should be a python dict. Comments are stored nested in the post dict:

posts =

{{'author': {'username': redditusername,

'timestamp':timestamp,

'url': topicurl

'topictext':text,

'comments': {{'comment': text, 'timestamp': timestamp, 'author': {'username': redditusername, 'profileurl': url}, {comment2 etc..}}

}, second post here ... }

Output stored to a text file.

Looking to get this done ASAP. Get in touch to discuss details.

The Reddit API limit is set to a maximum of 1000 items. I think it is possible to get around the API limit by using timestamps, but I'm not sure. Use the reddit search API function to get less than 1000 items within a specific timeframe (which you can specify in the search), then use the timestamp of the last post to create a new time window. Open to other approaches.

API Documentation can be found here: https://www.reddit.com/dev/api

情報処理 Python ソフトウェアアーキテクチャ ウェブ記事のスクラップ

プロジェクトID: #10606322

プロジェクトについて

11個の提案 リモートプロジェクト アクティブ 7年前

アワード:

akprj

Hello Final year CS undergrad at IIT Bombay. Had done a similar crawler project for codechef and stackoverflow a couple of months ago using bs4 and python. Can do this in a few hours. Looking forward to work on this もっと

$250 USD 1日以内
(3レビュー)
3.1

11人のフリーランサーが、平均$342 で、この仕事に入札しています。

e3d

Hi, I can scrape those from reddit with no problem but to properly estimate this project I need to know which subreddit you're talking about.

$263 USD 3日以内
(268件のレビュー)
8.7
mananraja

Hi there, I have read the project & would like to discuss.. I can scrape data from website using custom made scripts in Python.. I have good web scraping reviews as well.. I have experience with APIs as well as like もっと

$250 USD 1日以内
(171件のレビュー)
6.4
mwarrenschultz

Hello! I can get around Reddit's API limit by avoiding the API altogether, and interfacing instead directly through the browser using Selenium. I am a professional programmer with many years of web scraping experience もっと

$444 USD 10日以内
(69件のレビュー)
6.5
DhvanitAkbari

Hello, I am a computer engineer, I have experience in web scrapping using python so we can chat further to discuss project. Thanks

$500 USD 2日以内
(19件のレビュー)
3.9
Fonseca25T

hello, I can automate this tasks considering the Reddit api limitations. This will be up the amount of post the subreddit would have and all the comments too. For example, if the api limitations will reach the top in a もっと

$500 USD 15日以内
(レビュー1件)
1.6
tonygiorgi

Hello, I am currently building my own software and am in need of work to help supplement the costs of building my own business. I am a graduate of UC Berkeley with a breadth of professional experience as a product m もっと

$277 USD 5日以内
(0件のレビュー)
0.0