Lazarus
Home
Help
TinyPortal
Search
Login
Register
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
Free Pascal
Website
Downloads
Wiki
Documentation
Bugtracker
Mailing List
Lazarus
Website
Downloads (Laz+FPC)
Packages (OPM)
FAQ
Wiki
Documentation (RTL/FCL/LCL)
Bugtracker
CCR Bugs
GIT
Mailing List
Other languages
Foundation
Website
Useful Wiki Links
Project Roadmap
Getting the Source
Screenshots
How to use the forum
About donations (wiki)
Bookstore
Computer Math and Games in Pascal
(preview)
Lazarus Handbook
Search
Advanced search
Recent
TMemo issue?
by
rvk
[
Today
at 10:12:20 pm]
how to check type of valu...
by
PascalDragon
[
Today
at 09:48:39 pm]
Circle bare metal environ...
by
MarkMLl
[
Today
at 09:13:43 pm]
If you want to learn how ...
by
Fred vS
[
Today
at 08:26:29 pm]
Ability to use the Win ke...
by
furious programming
[
Today
at 08:23:40 pm]
FPC on Web3
by
Warfley
[
Today
at 07:44:36 pm]
Integer maths > 16 bits
by
MarkMLl
[
Today
at 07:44:33 pm]
memory management
by
srvaldez
[
Today
at 06:50:59 pm]
[SOLVED] How can I track ...
by
furious programming
[
Today
at 06:48:23 pm]
Has QT Richmemo acquired ...
by
zeljko
[
Today
at 06:38:33 pm]
scrolling Tmemos simultan...
by
LV
[
Today
at 06:24:22 pm]
dcpcrypt trojan report
by
Thaddy
[
Today
at 06:22:31 pm]
[SOLVED] How to query the...
by
Hartmut
[
Today
at 06:06:42 pm]
Tprocess capturing progre...
by
domasz
[
Today
at 05:38:59 pm]
Another question about cr...
by
carl_caulkett
[
Today
at 05:18:11 pm]
Common File Dialogs Have ...
by
msintle
[
Today
at 05:05:44 pm]
Problem building trunk An...
by
Ats
[
Today
at 04:40:07 pm]
can't compile fpgui on AR...
by
Fred vS
[
Today
at 04:34:42 pm]
[Solved] Could not find O...
by
Thaddy
[
Today
at 04:22:45 pm]
TChart intervals nicestep...
by
eldonfsr
[
Today
at 03:32:41 pm]
Ignoring viewBox TBGRASVG
by
AsleyCruz
[
Today
at 03:10:14 pm]
Translation and ShowMessa...
by
mig-31
[
Today
at 02:35:52 pm]
TMemo scroll down (stupid...
by
Weiss
[
Today
at 02:19:26 pm]
Lamw_manager on Ubuntu 24...
by
nobody.00
[
Today
at 02:02:47 pm]
combobox inside a treevie...
by
jamie
[
Today
at 01:54:14 pm]
« previous
next »
Print
Pages: [
1
]
Author
Topic: Is there any library that would make process of making a website scrapper easier (Read 1106 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
«
on:
August 09, 2024, 11:19:30 pm »
Basically I want to make a web scraper for one of the sites I love that has awful UI so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Logged
Thaddy
Hero Member
Posts: 15747
Censorship about opinions does not belong here.
Re: Is there any library that would make process of making a website scrapper easier
«
Reply #1 on:
August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
[Select]
[+]
[-]
uses
simpleinternet
,
xquery
;
var
a
:
IXQValue
;
begin
for
a
in
process
(
'https://freepascal.org'
,
'//a/@href'
)
do
writeln
(
a
.
toString
)
;
end
.
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
«
Last Edit: August 10, 2024, 01:52:43 pm by Thaddy
»
Logged
If I smell bad code it usually is bad code and that includes my own code.
Print
Pages: [
1
]
« previous
next »
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
TinyPortal
© 2005-2018