[ / / / / / / / / / / / / / ] [ dir / canada / htg / jewess / lovelive / marx / russian / sw / u ][Options][ watchlist ]

/prog/ - Programming

Programming board
You can now write text to your AI-generated image at https://aiproto.com It is currently free to use for Proto members.
Name
Email
Subject
Comment *
File
Select/drop/paste files here
* = required field[▶ Show post options & limits]
Confused? See the FAQ.
Expand all images

File (hide): 4f07b46a2083a82⋯.png (174.03 KB, 1231x652, 1231:652, Screenshot from 2017-10-29….png) (h) (u)

[–]

5f5be6 (1) No.4867[Watch Thread][Show All Posts]

Looking for some general guidelines on web scraping. I made a shitty web scraper in python, but it does the job. I just put in a few links into a file, and then it goes through each link and gets all the links in the web page and then keeps going through them searching for words or whatever.. but is there any guidelines? Like.. should I test the connection or site before I request a page? I got a bunch of shit the first time, so had to also filter out certain words within the links.



[Return][Go to top][Catalog][Screencap][Nerve Center][Cancer][Update] ( Scroll to new posts) ( Auto) 5
0 replies | 0 images | 1 UIDs | Page ?
[Post a Reply]
[ / / / / / / / / / / / / / ] [ dir / canada / htg / jewess / lovelive / marx / russian / sw / u ][ watchlist ]