Lazarus
Home
Help
TinyPortal
Search
Login
Register
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
Free Pascal
Website
Downloads
Wiki
Documentation
Bugtracker
Mailing List
Lazarus
Website
Downloads (Laz+FPC)
Packages (OPM)
FAQ
Wiki
Documentation (RTL/FCL/LCL)
Bugtracker
CCR Bugs
GIT
Mailing List
Other languages
Foundation
Website
Useful Wiki Links
Project Roadmap
Getting the Source
Screenshots
How to use the forum
About donations (wiki)
Bookstore
Computer Math and Games in Pascal
(preview)
Lazarus Handbook
Search
Advanced search
Recent
Lazarus IDE Windows 64 lo...
by
StuPot
[
Today
at 12:01:38 am]
LazCAD – First Release!
by
wp
[
Today
at 12:00:21 am]
My first Attempt making a...
by
jamie
[September 09, 2024, 11:43:32 pm]
Unable install latest fpc...
by
marcov
[September 09, 2024, 11:03:47 pm]
Extended Module Player
by
Gigatron
[September 09, 2024, 09:11:48 pm]
IDE odd behaviour
by
Martin_fr
[September 09, 2024, 09:08:17 pm]
How to disable the compat...
by
Jonas Maebe
[September 09, 2024, 08:59:30 pm]
Problem with pasting text...
by
LV
[September 09, 2024, 08:46:58 pm]
How to create webhook in ...
by
Prakash
[September 09, 2024, 07:59:41 pm]
New Component jZCamView
by
Guser979
[September 09, 2024, 07:49:22 pm]
Forum upgrade?
by
MarkMLl
[September 09, 2024, 07:47:00 pm]
Is there a maximum length...
by
Remy Lebeau
[September 09, 2024, 07:35:30 pm]
PATCH method in fphttpcli...
by
JohnnieK
[September 09, 2024, 05:57:38 pm]
THTTPRouter Documentation
by
JohnnieK
[September 09, 2024, 05:54:48 pm]
TryStrToFloat
by
jcmontherock
[September 09, 2024, 05:25:30 pm]
Having difficulty buidlin...
by
MarkMLl
[September 09, 2024, 03:49:23 pm]
Move a window between Wor...
by
dbannon
[September 09, 2024, 02:05:28 pm]
Loosing mouse clicks when...
by
Wilko500
[September 09, 2024, 12:51:41 pm]
TStateMachine
by
matthius
[September 09, 2024, 12:51:16 pm]
Forum is so SLOOOW... Aga...
by
Marc
[September 09, 2024, 12:29:35 pm]
[SOLVED] Need help to use...
by
marcov
[September 09, 2024, 10:13:45 am]
AI, NLP and CAI: Text Gen...
by
schuler
[September 09, 2024, 06:24:11 am]
Advice/Guidance Needed Wi...
by
Aruna
[September 09, 2024, 06:05:58 am]
Using Lua 5.4 with Free P...
by
Roland57
[September 09, 2024, 05:14:59 am]
How to open Folder and se...
by
Remy Lebeau
[September 09, 2024, 01:57:02 am]
« previous
next »
Print
Pages: [
1
]
Author
Topic: Is there any library that would make process of making a website scrapper easier (Read 789 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
«
on:
August 09, 2024, 11:19:30 pm »
Basically I want to make a web scraper for one of the sites I love that has awful UI so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Logged
Thaddy
Hero Member
Posts: 15494
Censorship about opinions does not belong here.
Re: Is there any library that would make process of making a website scrapper easier
«
Reply #1 on:
August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
[Select]
[+]
[-]
uses
simpleinternet
,
xquery
;
var
a
:
IXQValue
;
begin
for
a
in
process
(
'https://freepascal.org'
,
'//a/@href'
)
do
writeln
(
a
.
toString
)
;
end
.
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
«
Last Edit: August 10, 2024, 01:52:43 pm by Thaddy
»
Logged
My great hero has found the key to the highway. Rest in peace John Mayall.
Playing: "Broken Wings" in your honour. As well as taking out some mouth organs.
Print
Pages: [
1
]
« previous
next »
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
TinyPortal
© 2005-2018