Trying to create a scraper template for this site but I can’t select specific element/field, only the website can be collected. Any way to get around this?
Hi @Rye ,
Looks like you’re onto a complex case here. This can be done with Bardeen however, it requires some tweaking of the CSS selectors. We’ve covered this topic in our Masterclass (linked below) .
If you’d like to give it a try, the masterclass below will be a good starting point. Alternatively, you could share the link to the site you are trying to scrape and we can see what is possible
Thanks for this, I’ll check the Masterclass in the meantime. The site is a directory A - C
@vin_bardeen I honestly don’t know what he’s saying, I might as well copy & paste or use something else at this point because this ain’t working for me. Even when I run the automation nothing is going to sheets, than Bardeen logo. Could’ve reached far in building my list but I’m spending too much time trying to figure out a platform.
Hi there,
Sorry you’re facing these issues. We’re working to get you a custom playbook to scrape the website. I’ll be in touch soon
I was able to scrape the details without specific selectors in the below video:
REDS Dentists - Details Scrape
I hope this helps!
Thank you,
Jess
Thanks for the amazing use-case @Rye
This indeed is a super tricky case and in order to scrape the list of items from that page a custom model needs to be built, there are also some additional problems here like table being used for layouting, no common containers and data extraction from the text nodes.
Here is a demo overcoming most of those limitations: Loom | Free Screen & Video Recording Software | Loom
Playbook: Shared Playbook Template
Selectors:
Phone: `./text()[contains(., 'Phone')]`
Fax: `./text()[contains(., 'Fax')]`
Address: `./text()[contains(., 'Address')]`
Email: `./text()[contains(., 'Email')]`
Name: `./preceding-sibling::h2[1]`
Image: `./preceding-sibling::span[1]//img`
@Jess hehe that’s just scraping one person
Ope you’re looking for a list scraper then! Looks like Manuel has you covered here:
@manvel thank you so much brother🙏 Yes, this is definitely tricky, I was getting so frustrated & you made my day.
This topic was automatically closed 5 days after the last reply. New replies are no longer allowed.