I’m trying to scrape the Case# and Parties at the top, the Petitioner Address, and the Respondent Address.
Because some of the pages are a little bit different, it’s not scraping them properly. Sometimes one party or the other will have an address, sometimes one won’t. Sometimes there is one attorney on the right side, sometimes there are two, and sometimes there are none.
I’m not sure whether the scraper looks for items in the exact same spot on the page each time, or whether it uses the website’s code and locates each piece of information by the label next to it.
Any help is greatly appreciated. I have made a handful of different scrapers for this site, and none of them can get through my list of URLs without getting about half of the records wrong.
Hi @thisshirtoffmyback (lol) - could you please provide a few links you are trying to scrape from? I’ll see if we can get you the correct selectors to use for this use case and that should solve the issues you’re running into.
What I have figured out is that for any record that looks like the ones with parties on the left and no attorneys on the right, I have a scraper that will pull them perfectly. However, when I have a list of links that contains both types of pages, I don’t know which ones pulled in the info correctly unless I manually look at everything. I’m trying to figure out a way to run the scraper on all the links I have and have it pull the correct info on every one, or close to every one.
Hope that makes sense.
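One way to handle both page layouts in a single run is to find each value by the label next to it instead of by its position on the page, and to record which fields came back empty so that only those records need a manual check. Below is a minimal Python sketch of that idea using only the standard library. The label strings and field names are assumptions, since the thread does not show the site's actual markup, and the sketch assumes each value is the first piece of text after its label. Treat it as an illustration of the approach, not as the way any particular scraping tool works.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collect the non-empty text chunks of a page in document order."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)


# Hypothetical labels: replace with the exact label text used on the real pages.
LABELS = {
    "case_number": "Case #:",
    "petitioner_address": "Petitioner Address:",
    "respondent_address": "Respondent Address:",
}


def extract_by_label(html):
    """Return the value that follows each known label, plus a list of missing fields."""
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    chunks = collector.chunks
    known_labels = set(LABELS.values())

    record = {}
    for field, label in LABELS.items():
        value = None
        if label in chunks:
            i = chunks.index(label)
            # The value is the next text chunk, unless that chunk is another
            # known label, which means this field was left blank on the page.
            if i + 1 < len(chunks) and chunks[i + 1] not in known_labels:
                value = chunks[i + 1]
        record[field] = value

    # Fields that could not be found; pages with a non-empty list are the
    # ones worth checking by hand.
    record["missing"] = [f for f in LABELS if record[f] is None]
    return record
```

Because the lookup keys on the label text, an extra attorney column or a missing address does not shift the other fields. Running `extract_by_label` over every URL and filtering on a non-empty `missing` list narrows the manual review to the pages where something was actually absent or not recognized.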