Damn.
I just be chillin fr
Damn.
Yeah I was using it before I realized I might need a scraper.
What is the USTR blacklist? How do we preserve this data before it's lost?
Very little. I know basic HTML + CSS but I am trying to work with Python
I test with IDLE for Python + use Selenium with geckodriver as the browser driver
I could send it to you privately if you let me know ur discord or something
I don’t like to touch JS so I’ve been going Python only (besides basic HTML & CSS), but I found Puppeteer and didn’t really get it.
The discord thing is a no-go since I don’t really know how to make my issue palatable. That’s why I used lemmy. Thanks again!
I am having to Frankenscript because resources don’t really give out the code for my needs. I am running things from Windows PowerShell and testing with Python IDLE
I’m attempting to make a web scraper that can grab online books that are stored within the site or stored with a direct link to the storage site. I don’t want to reinvent this, but finding one that I can work with and/or build off of is hard due to my lack of experience and vague resources.
This was the original plan but it doesn’t work as well for this on ‘dynamic’ websites
My current script imports bs4 and requests. It also has the Selenium import for geckodriver but I am considering just removing that feature lol
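For anyone following along, here is a rough sketch of the "grab the links out of a page" step that a script like this usually does. It is not the script from this thread. To keep it runnable without installing anything it swaps bs4 for Python's built-in `html.parser`, and it reads a hard-coded HTML string instead of fetching a page. The names `LinkCollector`, `find_book_links`, `SAMPLE_HTML`, the example.com URLs and the `.pdf`/`.epub` filter are all made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags carry the links we care about.
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            # Turn relative links ("/files/x.pdf") into absolute URLs.
            self.links.append(urljoin(self.base_url, href))


def find_book_links(html, base_url, extensions=(".pdf", ".epub")):
    """Return absolute links whose URL ends in one of the given extensions."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return [url for url in parser.links if url.lower().endswith(extensions)]


# Hypothetical page content; a real script would get this from a response body.
SAMPLE_HTML = """
<html><body>
  <a href="/files/book-one.pdf">Book one</a>
  <a href="about.html">About</a>
  <a href="https://storage.example.org/book-two.EPUB">Book two</a>
</body></html>
"""

if __name__ == "__main__":
    for link in find_book_links(SAMPLE_HTML, "https://example.com/library/"):
        print(link)
```

This only sees links that are already in the HTML the server sends back. If the page builds its links with JavaScript after loading, the HTML string will not contain them, which is the case where a browser driver like geckodriver comes in.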
I wouldn’t mind taking it up if I could just focus on what I’m interested in working on. Python seems simple enough after spending 9 hours trying to get this to work lol. I don’t want to “reinvent the wheel” as much as I just want to be able to understand and work with tools that already exist.
I don’t want a point and click scraper, just a guide that doesn’t assume I have a background and is written in layman’s terms for easier reading. Thanks for believing in me to be able to build the basic skills necessary! Much appreciated :3
I recommend talking to an LLM
Any recommendations? Not ChatGPT
Also thanks for the help so far!
I don’t have programming experience. What sorts of software can “drive” the driver?
use the megathread and go by what has a 🐐 beside it
HydraHD (on the megathread)
https://ustr.gov/sites/default/files/2023_Review_of_Notorious_Markets_for_Counterfeiting_and_Piracy_Notorious_Markets_List_final.pdf
Anna’s Archive, Libgen, etc. are all on here. This is just 2023