Facebook Reactions Scraper

This project uses Selenium to crawl the reaction list of a Facebook post and parses it into a CSV file with XPath. You provide post URLs as input and get post_url, user, reaction, and user_url as output (in CSV).
If you need to collect the post links of a page, I recommend using rugantio/fbcrawl.
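
The parse-to-CSV step can be sketched roughly as below. This is only an illustration and not the code in scraper.py: the markup, the class names, and the helper names (parse_reactions, rows_to_csv) are hypothetical, and the real Facebook reaction list is structured differently. To stay dependency-free, the sketch swaps in the standard library's xml.etree.ElementTree and csv modules in place of lxml and pandas; lxml accepts the same simple XPath expressions.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Output columns named in this README.
FIELDS = ["post_url", "user", "reaction", "user_url"]

# Hypothetical, simplified markup. The real reaction list looks different.
SAMPLE_MARKUP = """
<ul>
  <li class="reactor"><a href="/alice">Alice</a><span class="reaction">Like</span></li>
  <li class="reactor"><a href="/bob">Bob</a><span class="reaction">Love</span></li>
</ul>
"""


def parse_reactions(markup, post_url):
    """Extract one row per reacting user from the (assumed) markup."""
    root = ET.fromstring(markup)
    rows = []
    # Select every list item that represents one reacting user.
    for item in root.findall('.//li[@class="reactor"]'):
        link = item.find("a")
        reaction = item.find('span[@class="reaction"]')
        rows.append({
            "post_url": post_url,
            "user": link.text,
            "reaction": reaction.text,
            "user_url": link.get("href"),
        })
    return rows


def rows_to_csv(rows):
    """Serialize the rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    rows = parse_reactions(SAMPLE_MARKUP, "https://mbasic.facebook.com/example")
    print(rows_to_csv(rows))
```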

Disclaimer

This software is not authorized by Facebook; use it at your own risk. Scraping Facebook data does not follow Facebook's robots.txt and violates Facebook's terms and conditions. This software is provided for educational purposes only, to show how to scrape a Facebook page.

Installation

Only Windows is supported for now; you can modify the Selenium parameters to support macOS or Linux if needed.
Make sure Python 3 is already installed. The required Python packages are as follows:

selenium
pandas
tqdm
lxml

Or simply install them with pip install -r requirements.txt

This project also uses Chrome as the Selenium browser, so make sure Google Chrome is already installed.

By default, everything should work if you have the latest Chrome installed. However, if the version of the webdriver does not match your Chrome version, replace the webdriver in the project folder.
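
For macOS or Linux, one of the Selenium parameters that likely needs to change is the path to the driver binary. The sketch below is an assumption about how that could be done, not what scraper.py does: driver_path is a hypothetical helper, and it assumes the driver sits in the project folder under the usual names (chromedriver.exe on Windows, chromedriver elsewhere).

```python
import platform
from pathlib import Path


def driver_path(project_dir=".", system=None):
    """Return the assumed chromedriver location for the given platform.

    `system` defaults to platform.system(), which reports "Windows",
    "Linux", or "Darwin" (macOS).
    """
    system = system or platform.system()
    name = "chromedriver.exe" if system == "Windows" else "chromedriver"
    return Path(project_dir) / name


if __name__ == "__main__":
    print(driver_path())
    # With Selenium 4 the path could then be handed to the driver like this
    # (requires selenium and a matching chromedriver to be installed):
    #   from selenium import webdriver
    #   from selenium.webdriver.chrome.service import Service
    #   driver = webdriver.Chrome(service=Service(executable_path=str(driver_path())))
```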

Usage

Make sure your Chrome is already logged in to Facebook and do not remove the Facebook cookies (you need to be logged in to see the reaction list). Also set your Facebook display language to English.
Close all Chrome windows so that they do not prevent Selenium from starting.
Put the links you want to scrape (in txt format) in INPUT_DIR.
- You can split the post URLs across several txt files to estimate scraping speed (via tqdm) or to split the output files.
- The scraper outputs one file after crawling all links in one txt file in INPUT_DIR.
- www.facebook, m.facebook, and mbasic links are supported (see the sample input in the input folder).
python3 scraper.py INPUT_DIR OUTPUT_DIR
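
As a rough illustration of the input handling described above, the sketch below reads every txt file in INPUT_DIR and rewrites each link to a single host. It rests on several assumptions that this README does not confirm: one URL per line, links rewritten to mbasic.facebook.com, and the helper names to_mbasic and read_links, which are hypothetical and not taken from scraper.py.

```python
from pathlib import Path
from urllib.parse import urlsplit, urlunsplit

# The three link styles this README lists as supported.
SUPPORTED_HOSTS = {"www.facebook.com", "m.facebook.com", "mbasic.facebook.com"}


def to_mbasic(url):
    """Rewrite a supported Facebook post URL to the mbasic host (assumption)."""
    parts = urlsplit(url.strip())
    if parts.netloc not in SUPPORTED_HOSTS:
        raise ValueError(f"unsupported host: {parts.netloc!r}")
    return urlunsplit(parts._replace(netloc="mbasic.facebook.com"))


def read_links(input_dir):
    """Map each txt file name in input_dir to its list of normalized URLs."""
    links = {}
    for txt in sorted(Path(input_dir).glob("*.txt")):
        lines = txt.read_text(encoding="utf-8").splitlines()
        # Skip blank lines; assume one URL per remaining line.
        links[txt.name] = [to_mbasic(line) for line in lines if line.strip()]
    return links


if __name__ == "__main__":
    import sys

    for name, urls in read_links(sys.argv[1]).items():
        print(f"{name}: {len(urls)} links")
```

Keeping the links grouped by file name mirrors the one-output-per-input-file behavior described in the list above.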

Known Issue

Data is missing for posts with a large reaction count (this might be a problem with mbasic.facebook).
- For a post with more than about 5k reactions, only 1-2k reactions are crawled.
Scraping slows down when the reaction count is large.

Something Else

You can remove options.add_argument('headless') to see the full scraping progress in Chrome.
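
Instead of deleting the line, the headless argument could be made switchable. The following is a minimal sketch under that assumption; chrome_arguments and make_options are hypothetical helpers that do not exist in scraper.py, and make_options needs selenium installed.

```python
def chrome_arguments(headless=True):
    """Return the Chrome arguments to pass to options.add_argument."""
    args = []
    if headless:
        # Same argument string that this README mentions.
        args.append("headless")
    return args


def make_options(headless=True):
    """Build Selenium Chrome options from the argument list."""
    # Imported lazily so chrome_arguments stays usable without selenium.
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for arg in chrome_arguments(headless):
        options.add_argument(arg)
    return options


if __name__ == "__main__":
    print(chrome_arguments(headless=False))  # visible browser window
    print(chrome_arguments(headless=True))   # no browser window
```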
