Topic: Is there any library that would make process of making a website scrapper easier (Read 837 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
« on: August 09, 2024, 11:19:30 pm »
Basically, I want to make a web scraper for one of the sites I love that has an awful UI, so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Thaddy
Hero Member
Posts: 15496
Censorship about opinions does not belong here.
Re: Is there any library that would make process of making a website scrapper easier
« Reply #1 on: August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
    uses
      simpleinternet, xquery;
    var
      a: IXQValue;
    begin
      for a in process('https://freepascal.org', '//a/@href') do
        writeln(a.toString);
    end.
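For experimenting with queries offline, the same process call can be pointed at an HTML string instead of a URL. This is only a minimal sketch: it assumes that simpleinternet treats input starting with '<' as markup rather than as an address, and the program name and the SampleHtml constant are made up for illustration.

```pascal
program LinkDemo;
{$mode objfpc}{$H+}
uses
  simpleinternet, xquery;
const
  // Hypothetical sample markup, used instead of a live page.
  SampleHtml = '<html><body>' +
               '<a href="/downloads">Downloads</a>' +
               '<a href="/docs">Docs</a>' +
               '</body></html>';
var
  link: IXQValue;
begin
  // Same XPath as in the example above: select the href attribute of every <a>.
  for link in process(SampleHtml, '//a/@href') do
    writeln(link.toString);
end.
```

Once the query returns what you expect on the sample, swap SampleHtml for the real URL.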
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
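One common way to undefine such a symbol is to disable its directive in the include file. The fragment below is a sketch only: it assumes the symbol is set by a plain {$DEFINE ...} line, and the surrounding content of internettoolsconfig.inc in your copy may look different.

```pascal
// internettoolsconfig.inc (sketch, assumed layout)
//
// Before, the symbol is defined by a line like:
//   {$DEFINE USE_PASDBLSTRUTILS_FOR_JSON}
//
// After, a dot in front of the $ turns the directive into an ordinary
// comment, so the symbol is no longer defined:
{.$DEFINE USE_PASDBLSTRUTILS_FOR_JSON}
```

Rebuild the units that include this file afterwards so the change takes effect.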
« Last Edit: August 10, 2024, 01:52:43 pm by Thaddy »
My great hero has found the key to the highway. Rest in peace John Mayall.
Playing: "Broken Wings" in your honour. As well as taking out some mouth organs.