Topic: Is there any library that would make the process of making a website scraper easier?
Rave, August 09, 2024, 11:19:30 pm
Basically, I want to make a web scraper for one of the sites I love, which has an awful UI, so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Thaddy, Reply #1, August 10, 2024, 01:50:13 pm
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
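```pascal
uses
  simpleinternet, xquery;

var
  a: IXQValue;

begin
  // process() fetches the page and evaluates the XPath query against it;
  // the loop visits every href attribute the query matched.
  for a in process('https://freepascal.org', '//a/@href') do
    writeln(a.toString);
end.
```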
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
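As a rough sketch of what "undefine" can look like in practice: this assumes internettoolsconfig.inc enables the symbol with an ordinary `{$DEFINE ...}` line, which I have not checked against the current sources, so the surrounding contents of the file may differ. Putting a dot after the opening brace turns a Free Pascal compiler directive into a plain comment, which disables it without deleting the line.

```pascal
// internettoolsconfig.inc (fragment; the exact contents of the file are assumed)

// Before, the symbol would be switched on like this:
//   {$DEFINE USE_PASDBLSTRUTILS_FOR_JSON}

// After: the dot makes the compiler treat the directive as a comment,
// so the symbol is no longer defined.
{.$DEFINE USE_PASDBLSTRUTILS_FOR_JSON}
```

An alternative with the same effect is to leave the original line alone and add `{$UNDEF USE_PASDBLSTRUTILS_FOR_JSON}` after it. Either way, rebuild the package afterwards so the change is picked up.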