Here’s my first attempt to explain how RedditRecs works. Core to RedditRecs is its data pipeline that analyzes Reddit data for reviews on products. This is a gist of what the pipeline does: Given a set of products types (e.g. Air purifier, Portable monitor etc) Collect a list of reviews from reddit That can be aggregated by product models Such that the product models can be ranked by sentiment And have shop links for each product model The pipeline can be broken down into 5 main steps:...
