Lazarus
Home
Help
TinyPortal
Search
Login
Register
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
Free Pascal
Website
Downloads
Wiki
Documentation
Bugtracker
Mailing List
Lazarus
Website
Downloads (Laz+FPC)
Packages (OPM)
FAQ
Wiki
Documentation (RTL/FCL/LCL)
Bugtracker
CCR Bugs
GIT
Mailing List
Other languages
Foundation
Website
Useful Wiki Links
Project Roadmap
Getting the Source
Screenshots
How to use the forum
About donations (wiki)
Bookstore
Computer Math and Games in Pascal
(preview)
Lazarus Handbook
Search
Advanced search
Recent
Why renaming must be done...
by
Remy Lebeau
[
Today
at 06:53:13 am]
Move a window between Wor...
by
Aruna
[
Today
at 03:46:12 am]
[SOLVED] Getting started ...
by
rushfan
[
Today
at 02:33:39 am]
Game Music Emulator
by
Fred vS
[
Today
at 01:00:26 am]
Lazarus IDE defaults when...
by
Aruna
[
Today
at 12:39:54 am]
Str declaration
by
MarkMLl
[
Today
at 12:39:52 am]
Beeping sound when cursor...
by
carl_caulkett
[
Today
at 12:20:27 am]
DBGRID | Picklist on fiel...
by
TRon
[September 18, 2024, 10:25:50 pm]
Stack Overflow in RtlVirt...
by
af0815
[September 18, 2024, 09:32:09 pm]
Setting the fpc path
by
TRon
[September 18, 2024, 08:02:50 pm]
How to add float value s...
by
LBoxPO
[September 18, 2024, 07:25:39 pm]
Compiler error when check...
by
Thaddy
[September 18, 2024, 07:11:38 pm]
Loading text data from a ...
by
alpine
[September 18, 2024, 06:47:01 pm]
Simple multithreading cod...
by
MarkMLl
[September 18, 2024, 06:10:58 pm]
Recommended way to build ...
by
carl_caulkett
[September 18, 2024, 06:07:42 pm]
Incorrect FPC/Lazarus bui...
by
TRon
[September 18, 2024, 05:26:36 pm]
Vectors to find Center of...
by
Boleeman
[September 18, 2024, 03:48:26 pm]
7zip DLL is super broken
by
domasz
[September 18, 2024, 03:47:13 pm]
Gridprinter Newby
by
Zvoni
[September 18, 2024, 03:03:35 pm]
QuasiClever Fractal
by
Boleeman
[September 18, 2024, 02:23:56 pm]
Bitmap 16bit R5G6B5
by
wp
[September 18, 2024, 11:50:20 am]
"Show Compiler Dialog"
by
MarkMLl
[September 18, 2024, 10:21:49 am]
AI, NLP and CAI: Text Gen...
by
schuler
[September 18, 2024, 09:48:18 am]
Hashids that you need but...
by
Thaddy
[September 18, 2024, 06:10:34 am]
C++ conversion : what is ...
by
Thaddy
[September 18, 2024, 05:44:34 am]
« previous
next »
Print
Pages: [
1
]
Author
Topic: Is there any library that would make process of making a website scrapper easier (Read 1038 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
«
on:
August 09, 2024, 11:19:30 pm »
Basically I want to make a web scraper for one of the sites I love that has awful UI so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Logged
Thaddy
Hero Member
Posts: 15553
Censorship about opinions does not belong here.
Re: Is there any library that would make process of making a website scrapper easier
«
Reply #1 on:
August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
[Select]
[+]
[-]
uses
simpleinternet
,
xquery
;
var
a
:
IXQValue
;
begin
for
a
in
process
(
'https://freepascal.org'
,
'//a/@href'
)
do
writeln
(
a
.
toString
)
;
end
.
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
«
Last Edit: August 10, 2024, 01:52:43 pm by Thaddy
»
Logged
If I smell bad code it usually is bad code and that includes my own code.
Print
Pages: [
1
]
« previous
next »
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
TinyPortal
© 2005-2018