Lazarus
Miscellaneous => Other => Topic started by: nouzi on August 01, 2021, 10:49:12 pm
-
Is this a real number, or is it robots?
https://forum.lazarus.freepascal.org/index.php?action=stats
Sorry, I can't attach an image from my mobile.
-
Screenshot attached.
-
Maybe somebody with multiple personalities?
Bart
-
Maybe somebody with multiple personalities?
Bart
Or a person who stuttered?
Fre;D
-
https://forum.lazarus.freepascal.org/index.php/topic,55641.msg414029
might be relevant: somebody's been experimenting with scraping the forum to get an offline copy.
I think https://www.eevblog.com/forum/chat/migrating-the-forum-to-discourse/?all touches somewhere on the feasibility of scraping an offline copy using wget or similar, but it needs considerably more subtlety than simply grabbing what the user sees. To all intents and purposes, that sort of manipulation is best left to the forum administrator.
I think a fair summary is that if the text shows up when you view the source of a forum page, then it can be scraped, although useful metadata might be lost. If the text is only displayed after custom JavaScript has requested it, then forget it.
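As a rough illustration of that rule of thumb, here is a minimal Python sketch that checks whether a phrase is present in the static markup of a page, ignoring anything inside script or style blocks. The names VisibleText and text_in_static_html are made up for this example, and the sample markup is invented; it is not how any particular scraper works.

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collect text nodes from static HTML, skipping <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0  # > 0 while inside a script or style element

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def text_in_static_html(html, phrase):
    """Return True if `phrase` appears in the text of the raw page source.

    A True result suggests a plain fetch (wget or similar) would capture it.
    A False result suggests the text is filled in later by JavaScript.
    """
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace so line breaks in the markup don't matter.
    text = " ".join(" ".join(parser.chunks).split())
    return phrase in text


# Invented sample pages: one with the post in the markup, one loaded by script.
static_page = '<html><body><div class="post">Maybe somebody with multiple personalities?</div></body></html>'
dynamic_page = '<html><body><div id="posts"></div><script>loadPosts("/api/posts")</script></body></html>'

print(text_in_static_html(static_page, "multiple personalities"))
print(text_in_static_html(dynamic_page, "multiple personalities"))
```

For a real page you would pass in the body returned by a plain HTTP fetch and search for a phrase you can see in the browser.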
And in any event, a forum's administrator and users would be justified in being suspicious of anybody who attempted to do this, since a corpus of this type has value to Google et al... not to mention to anybody even less salubrious.
MarkMLl
-
Thanks for all the replies.
-
https://forum.lazarus.freepascal.org/index.php/topic,55641.msg414029
might be relevant: somebody's been experimenting with scraping the forum to get an offline copy.
Funny, I was thinking about an offline copy of these forums yesterday, when my internet connection reset and I wanted to read a thread 8-).
I just tried this: fetching pages with wget alone doesn't seem to increase the guest counter. It might if you scraped through Tor, where you get a new IP every 10 minutes (or sooner, if you restart the Tor router after every request), but I see no point in anyone doing that.
My guess is that this forum was linked somewhere. The admin could check the web request logs for referrers, if such logs are stored.
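If such logs exist, a check along these lines could show whether one outside site accounts for the spike. This is only a sketch: it assumes the server writes the common Apache/nginx "combined" log format, which may not be what this forum uses, and count_referrer_hosts, the sample lines, addresses and hostnames are all made up for illustration.

```python
import re
from collections import Counter
from urllib.parse import urlsplit

# Tail of a "combined" format line: "request" status bytes "referrer" "user-agent"
LINE_RE = re.compile(r'"[^"]*" \d{3} \S+ "(?P<referrer>[^"]*)" "[^"]*"\s*$')


def count_referrer_hosts(lines, own_host):
    """Count requests per external referrer host, ignoring internal links."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.search(line)
        if not match:
            continue  # not in the assumed format
        referrer = match.group("referrer")
        if referrer in ("", "-"):
            continue  # no referrer was sent
        host = urlsplit(referrer).hostname
        if host and host != own_host:
            counts[host] += 1
    return counts


# Invented sample log lines (documentation IP range, example.org hostnames).
sample = [
    '203.0.113.5 - - [01/Aug/2021:22:40:01 +0000] "GET /index.php?action=stats HTTP/1.1" 200 5120 "https://news.example.org/item?id=1" "Mozilla/5.0"',
    '203.0.113.9 - - [01/Aug/2021:22:40:07 +0000] "GET /index.php HTTP/1.1" 200 8042 "https://news.example.org/item?id=1" "Mozilla/5.0"',
    '203.0.113.7 - - [01/Aug/2021:22:41:12 +0000] "GET /index.php/board,3.0.html HTTP/1.1" 200 7311 "https://forum.lazarus.freepascal.org/index.php" "Mozilla/5.0"',
    '203.0.113.8 - - [01/Aug/2021:22:41:30 +0000] "GET /index.php HTTP/1.1" 200 8042 "-" "Wget/1.21"',
]

referrers = count_referrer_hosts(sample, "forum.lazarus.freepascal.org")
for host, hits in referrers.most_common():
    print(host, hits)
```

A single external host dominating the counts around the time of the spike would support the "linked somewhere" guess. Many requests with no referrer at all would point more towards bots or scrapers.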