I am trying to scrape and save photos from a Facebook group's media page like this one: https://www.facebook.com/groups/Transeurotrail.org/media. I create a new scraper and set it to run on the active tab using infinite scroll. I select "list" and click on the first and second photos on the page to set up the elements I want to capture. Then I click on the first photo and select "click" to open the larger photo with any comments on the right. Next I click the photo and select "save image", then I click the text on the right where there may be a comment about the photo. When I run this, I only get some of the photos, and the image and text are all the same as the first one. Should I instead run a pre-scraper on the media page to pull all the links to the photos, then feed these into a second scraper to process each link and extract the image and text? (Note: some photos do not have any text, so I am unsure how to ignore it if none is found.)
Your statement right at the end is exactly what you need to do to get the data you described. This is how your playbook should be set up:
Scraper 1 - Get the links from the main page: https://www.facebook.com/groups/Transeurotrail.org/media
Scraper 2 - Open each of the links from Scraper 1 in the background, then scrape all comments and the image description. This also needs to be a list scraper, because you are scraping multiple comments per photo.
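For anyone who wants to see the two-scraper idea outside of Bardeen, here is a rough Python sketch of the same pattern. It is not how the playbook is implemented. The markup (a "/photo" marker in the link, and class="description" and class="comment" elements), the fetch callback, and all function names are assumptions made up for illustration. Facebook's real pages need a logged-in browser and look very different. The point is only that a missing description comes back as None and missing comments as an empty list, so photos without text are still kept.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser
from typing import Callable, List, Optional


class _LinkCollector(HTMLParser):
    """Stage 1: collect unique <a href> values that contain a marker substring."""

    def __init__(self, marker: str) -> None:
        super().__init__()
        self.marker = marker
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and self.marker in href and href not in self.links:
            self.links.append(href)


class _PhotoPageParser(HTMLParser):
    """Stage 2: pull the first image, an optional description, and any comments."""

    def __init__(self) -> None:
        super().__init__()
        self.image_url: Optional[str] = None
        self.description: Optional[str] = None
        self.comments: List[str] = []
        self._field: Optional[str] = None  # field currently being captured
        self._tag: Optional[str] = None    # tag that opened the capture
        self._buf: List[str] = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "img" and self.image_url is None:
            self.image_url = a.get("src")
        elif self._field is None and a.get("class") in ("description", "comment"):
            # Hypothetical class names; real pages will use something else.
            self._field, self._tag, self._buf = a["class"], tag, []

    def handle_data(self, data):
        if self._field is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if self._field is None or tag != self._tag:
            return
        text = "".join(self._buf).strip()
        if text:  # empty or whitespace-only text is skipped, not stored
            if self._field == "description":
                self.description = text
            else:
                self.comments.append(text)
        self._field = None


@dataclass
class PhotoRecord:
    link: str
    image_url: Optional[str]
    description: Optional[str]
    comments: List[str] = field(default_factory=list)


def collect_links(media_html: str, marker: str = "/photo") -> List[str]:
    collector = _LinkCollector(marker)
    collector.feed(media_html)
    return collector.links


def scrape_photo(link: str, fetch: Callable[[str], str]) -> PhotoRecord:
    parser = _PhotoPageParser()
    parser.feed(fetch(link))  # fetch is supplied by the caller
    return PhotoRecord(link, parser.image_url, parser.description, parser.comments)


def run_pipeline(media_html: str, fetch: Callable[[str], str]) -> List[PhotoRecord]:
    # One record per link, so each photo gets its own image and text.
    return [scrape_photo(link, fetch) for link in collect_links(media_html)]
```

Because each link is fetched and parsed on its own, the "every row is the same as the first photo" problem cannot occur here: nothing is read from a shared, already-open page.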
I’ve gone ahead and created a playbook with these two scrapers. You’ll just need to add an action at the end to output the results to a location of your choosing (e.g. Google Sheets).
Playbook : https://www.bardeen.ai/playbook/community/Scraping-www.facebook.com-5v1ycmBHit0AIDrvVG
Let me know how you get on with it and if you’re still having issues.
Customer Support - bardeen.ai