Scraping AJAX pages using Python


I've seen this question about scraping AJAX, but Python isn't mentioned there. I considered using Scrapy, and I believe they have docs on the subject, but as far as I can see their website is down, so I don't know what to do. I want to do the following:

I have one URL, example.com. I go from page to page by clicking submit, but the URL doesn't change since they're using AJAX to display the content. I want to scrape the content of each page. How do I do it?

Let's say I want to scrape only the numbers. Is there anything other than Scrapy that can do it? If not, could you give me a snippet showing how to do it, because their website is down and I can't reach the docs.

First of all, the Scrapy docs are available at https://scrapy.readthedocs.org/en/latest/.

As for handling AJAX while web scraping, the idea is basically rather simple:

  • open the browser developer tools and switch to the Network tab
  • go to the target site
  • click the submit button and see which XHR request goes to the server
  • simulate that XHR request in your spider
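
To make the last step concrete, here is a minimal sketch that uses only the Python standard library instead of Scrapy, so it runs without the Scrapy docs at hand. The endpoint `AJAX_URL` and the `page` form field are hypothetical placeholders; the real URL, HTTP method, and parameters are whatever the Network tab shows for the site you are scraping. Extracting "numbers" as plain runs of digits is also an assumption about what the response looks like.

```python
import re
import urllib.parse
import urllib.request

# Hypothetical endpoint; copy the real one from the browser's Network tab.
AJAX_URL = "http://example.com/ajax/results"


def build_xhr_request(url, form_data):
    """Build a POST request that resembles the browser's XHR call."""
    body = urllib.parse.urlencode(form_data).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            # Some AJAX backends check this header; others ignore it.
            "X-Requested-With": "XMLHttpRequest",
            "Content-Type": "application/x-www-form-urlencoded; charset=UTF-8",
        },
        method="POST",
    )


def extract_numbers(text):
    """Return every run of digits found in the response body as an int."""
    return [int(match) for match in re.findall(r"\d+", text)]


def scrape_page(page):
    """Fetch one 'page' of AJAX content and pull the numbers out of it."""
    # "page" is an assumed parameter name, not something the site guarantees.
    request = build_xhr_request(AJAX_URL, {"page": page})
    with urllib.request.urlopen(request, timeout=10) as response:
        body = response.read().decode("utf-8", errors="replace")
    return extract_numbers(body)
```

Calling `scrape_page(1)`, `scrape_page(2)`, and so on would then stand in for clicking submit repeatedly. If the endpoint returns JSON rather than HTML, parsing it with `json.loads` is usually more reliable than a regular expression. The same request can be issued from a Scrapy spider once you know its URL and form fields.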


Hope that helps.

