Topic: Is there any library that would make the process of making a website scraper easier? (Read 1480 times)
Rave (Full Member, Posts: 170)
on: August 09, 2024, 11:19:30 pm
Basically, I want to make a web scraper for one of the sites I love, which has an awful UI, so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Thaddy (Hero Member, Posts: 18305)
Reply #1 on: August 10, 2024, 01:50:13 pm
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
```pascal
uses
  simpleinternet, xquery;
var
  a: IXQValue;
begin
  // Fetch the page and print the value of every href attribute on an <a> element
  for a in process('https://freepascal.org', '//a/@href') do
    writeln(a.toString);
end.
```
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
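As a rough sketch of what "undefine" could look like in practice: assuming the shipped internettoolsconfig.inc enables the symbol with a `{$define ...}` line (the file's exact contents are not shown in this thread, so treat that as an assumption), you can either comment that line out or cancel it with an `{$undef}` placed after it.

```pascal
// internettoolsconfig.inc (sketch only; the rest of the real file is omitted)
// Assumption: the file turns the option on with a define directive
// for USE_PASDBLSTRUTILS_FOR_JSON somewhere above this point.
// Either comment that directive out, or cancel it afterwards like this:
{$undef USE_PASDBLSTRUTILS_FOR_JSON}
```

After changing the include file, rebuild the units that include it so the change takes effect.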
(Last Edit: August 10, 2024, 01:52:43 pm by Thaddy)