Lazarus
Home
Help
TinyPortal
Search
Login
Register
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
Free Pascal
Website
Downloads
Wiki
Documentation
Bugtracker
Mailing List
Lazarus
Website
Downloads (Laz+FPC)
Packages (OPM)
FAQ
Wiki
Documentation (RTL/FCL/LCL)
Bugtracker
CCR Bugs
GIT
Mailing List
Other languages
Foundation
Website
Useful Wiki Links
Project Roadmap
Getting the Source
Screenshots
How to use the forum
About donations (wiki)
Bookstore
Computer Math and Games in Pascal
(preview)
Lazarus Handbook
Search
Advanced search
Recent
LazCAD – First Release!
by
jcarlos77
[
Today
at 05:27:40 pm]
Class designer/editor
by
Aruna
[
Today
at 05:16:37 pm]
Listview how to custom or...
by
VisualLab
[
Today
at 04:59:00 pm]
Broken Icon Display on ma...
by
msintle
[
Today
at 04:46:35 pm]
Browser User Agent & Frie...
by
MarkMLl
[
Today
at 04:26:29 pm]
TListView.OnChange Event ...
by
msintle
[
Today
at 04:14:58 pm]
programming with Lazarus
by
srvaldez
[
Today
at 04:03:11 pm]
Extended Module Player
by
Gigatron
[
Today
at 03:08:43 pm]
Change execution order wh...
by
Khrys
[
Today
at 03:02:06 pm]
7zip DLL is super broken
by
rvk
[
Today
at 01:55:47 pm]
Searching for a safe alte...
by
marcov
[
Today
at 01:28:57 pm]
Hints.
by
seghele0
[
Today
at 12:29:35 pm]
CLI | Wildcard parameters...
by
Bart
[
Today
at 11:21:42 am]
TDBF Recordcount
by
rvk
[
Today
at 11:19:09 am]
Error code 5
by
ccrause
[
Today
at 11:14:19 am]
Font draw in customdraw i...
by
mikutu
[
Today
at 08:02:02 am]
Problems with Gitlab
by
Joanna
[
Today
at 07:35:08 am]
Cannot open Access databa...
by
Khrys
[
Today
at 06:59:12 am]
TMemo and paste from clip...
by
Aruna
[
Today
at 05:58:24 am]
environment variables
by
dbannon
[
Today
at 01:59:07 am]
TDBDateEdit component def...
by
wp
[September 11, 2024, 11:56:30 pm]
TStateMachine
by
matthius
[September 11, 2024, 11:40:00 pm]
Any support for Perpendic...
by
circular
[September 11, 2024, 09:05:59 pm]
Local Vars often inspect ...
by
Martin_fr
[September 11, 2024, 08:58:22 pm]
Problem with pasting text...
by
artist
[September 11, 2024, 08:55:36 pm]
« previous
next »
Print
Pages: [
1
]
Author
Topic: Is there any library that would make process of making a website scrapper easier (Read 898 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
«
on:
August 09, 2024, 11:19:30 pm »
Basically I want to make a web scraper for one of the sites I love that has awful UI so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Logged
Thaddy
Hero Member
Posts: 15505
Censorship about opinions does not belong here.
Re: Is there any library that would make process of making a website scrapper easier
«
Reply #1 on:
August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
[Select]
[+]
[-]
uses
simpleinternet
,
xquery
;
var
a
:
IXQValue
;
begin
for
a
in
process
(
'https://freepascal.org'
,
'//a/@href'
)
do
writeln
(
a
.
toString
)
;
end
.
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
«
Last Edit: August 10, 2024, 01:52:43 pm by Thaddy
»
Logged
My great hero has found the key to the highway. Rest in peace John Mayall.
Playing: "Broken Wings" in your honour. As well as taking out some mouth organs.
Print
Pages: [
1
]
« previous
next »
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
TinyPortal
© 2005-2018