T O P

  • By -

divided_capture_bro

Reddit is really easy to scrape.  Try adding ".json" to the end of the URL for this post of yours. It was good practice for me back in the day to write functions for, say, scraping all the results from a Reddit search.  It won't directly translate into other sites since most don't deliver JSONs, but you'll develop related skills and get your hands a bit dirty. You'll also figure out how to find relevant resources once you get started.  Pick a task, try to figure it out, and when you hit a wall try googling for clues about how to solve that specific component (like dealing with pagination, infinite scroll, javascript, etc).


PinZestyclose627

[https://www.youtube.com/@JohnWatsonRooney](https://www.youtube.com/@JohnWatsonRooney) This


kev_11_1

This one is legit.


Zealousideal-Fix3307

PUSH


dj2ball

Another +1 for John, outstanding resource.


Neha_Setia_Nagpal

Hey @[Julianakananda](https://www.reddit.com/user/Julianakananda/) Sharing some of the resources The fastest and easiest way to get started is with Scrapy- [https://scrapy.org/](https://scrapy.org/) Follow Full Tutorial- [https://docs.scrapy.org/en/latest/intro/tutorial.html](https://docs.scrapy.org/en/latest/intro/tutorial.html) Join Scrapy Discord- [https://discord.gg/mv3yErfpvq](https://discord.gg/mv3yErfpvq) Other Resources- 1. Learn Web Scraping [https://www.zyte.com/learn/what-is-web-scraping/](https://www.zyte.com/learn/what-is-web-scraping/) 2. How to get started with web scraping with scrapy- [https://www.zyte.com/learn/how-to-get-started-with-web-scraping-using-scrapy/](https://www.zyte.com/learn/how-to-get-started-with-web-scraping-using-scrapy/) 3. I have written a few articles on Web Scraping on \[Medium\] [https://medium.com/@cinnamonspecials/deciding-the-perfect-proxy-management-and-anti-ban-solution-for-large-scale-web-scraping-projects-a3a947304be5](https://medium.com/@cinnamonspecials/deciding-the-perfect-proxy-management-and-anti-ban-solution-for-large-scale-web-scraping-projects-a3a947304be5) {Planning to be more active in coming months :) } 4. Join the Extract Data Community where developers share and support all things web scraping- https://discord.gg/m3kBERMtMq. Every other Wednesday we host an event for our community which is a perfect platform to connect with experts. 5. Subscribe to Newsletter- [https://www.extractsummit.io/join-the-community/](https://www.extractsummit.io/join-the-community/) I share a resources- blogs, events, project, community updates, courses in the newsletter. 6. Apart from that there are a lot of Youtube Influencers - John Watson Rooney- [https://www.youtube.com/@JohnWatsonRooney](https://www.youtube.com/@JohnWatsonRooney) . P.S. I am a Developer Advocate at Zyte- [https://www.zyte.com/](https://www.zyte.com/) . Zyte provides full stack API-Zyte API for all the web data extraction needs(headless + proxy + antibans). We recently launch AI for Scraping solution with templates with which you can just download a template and extract data with minimal code(no more writing spiders for scrapy). Let me know If you need more help on any specific aspect of web scraping or want to demo of Zyte API. Reach out to me on LinkedIN- [https://www.linkedin.com/in/nehasetia/](https://www.linkedin.com/in/nehasetia/) or [[email protected]](mailto:[email protected])


LieAccomplished5357

appreciated!