Lazarus
Home
Help
TinyPortal
Search
Login
Register
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
Free Pascal
Website
Downloads
Wiki
Documentation
Bugtracker
Mailing List
Lazarus
Website
Downloads (Laz+FPC)
Packages (OPM)
FAQ
Wiki
Documentation (RTL/FCL/LCL)
Bugtracker
CCR Bugs
GIT
Mailing List
Other languages
Foundation
Website
Useful Wiki Links
Project Roadmap
Getting the Source
Screenshots
How to use the forum
Forum Rules
About donations (wiki)
Bookstore
Computer Math and Games in Pascal
(preview)
Lazarus Handbook
Search
Advanced search
WIKI Timeout issues
Please read here if you have trouble connecting to the wiki
Recent
new private messages fly ...
by
Martin_fr
[
Today
at 05:13:47 pm]
Fpcupdeluxe
by
DonAlfredo
[
Today
at 04:45:54 pm]
[SOLVED] SQLite query to ...
by
DanishMale
[
Today
at 04:37:24 pm]
Has anyone installed TeeB...
by
Nicole
[
Today
at 04:29:53 pm]
ChatGPT and ObjectPascal ...
by
Martin_fr
[
Today
at 04:22:09 pm]
Compare two text lines an...
by
cdbc
[
Today
at 04:19:13 pm]
Pulsating Radial Segments...
by
Boleeman
[
Today
at 04:11:52 pm]
Object Pascal Playing Car...
by
marcov
[
Today
at 02:53:25 pm]
Fast Canvas Library V1.05...
by
Gigatron
[
Today
at 02:49:43 pm]
Trouchet Koch (Snowflake ...
by
Boleeman
[
Today
at 02:34:31 pm]
Extended Module Player
by
Gigatron
[
Today
at 02:27:09 pm]
Cirumvent the assignment ...
by
Thaddy
[
Today
at 01:29:52 pm]
Setting up an ARM embedde...
by
DonAlfredo
[
Today
at 12:43:09 pm]
controls.lpr(731,15) Erro...
by
Martin_fr
[
Today
at 11:44:05 am]
Amigo programming languag...
by
paxscript
[
Today
at 08:48:11 am]
[Solved]Garbled web page ...
by
PeterHu
[
Today
at 08:34:38 am]
Rainbowed Rotating Square...
by
Boleeman
[
Today
at 07:33:00 am]
[Solved]Retrieving wrong ...
by
PeterHu
[
Today
at 03:19:47 am]
I read this somewhere... ...
by
1HuntnMan
[
Today
at 12:06:28 am]
TCustomApplication.DoRun
by
cdbc
[July 11, 2025, 09:31:55 pm]
Searching for current wor...
by
fiscraliger
[July 11, 2025, 09:05:52 pm]
Forte Report CE Questions
by
PascalProg
[July 11, 2025, 08:26:14 pm]
Pascal Conference 2025
by
Jorg3000
[July 11, 2025, 04:17:06 pm]
problem by export and imp...
by
paule32
[July 11, 2025, 04:04:56 pm]
Message
by
Alexandr R
[July 11, 2025, 03:31:31 pm]
« previous
next »
Print
Pages: [
1
]
Author
Topic: Is there any library that would make process of making a website scrapper easier (Read 1364 times)
Rave
Full Member
Posts: 170
Is there any library that would make process of making a website scrapper easier
«
on:
August 09, 2024, 11:19:30 pm »
Basically I want to make a web scraper for one of the sites I love that has awful UI so that I can interact with it better. Is there any Lazarus/Free Pascal library that would make this process easier? I'd rather not parse HTML by hand.
Logged
Thaddy
Hero Member
Posts: 17410
Ceterum censeo Trumpum esse delendum (Tnx Charlie)
Re: Is there any library that would make process of making a website scrapper easier
«
Reply #1 on:
August 10, 2024, 01:50:13 pm »
Our member benibela has a good library for that:
https://www.benibela.de/sources_en.html#internettools
Simple example that extracts all hrefs from a page:
Code: Pascal
[Select]
[+]
[-]
uses
simpleinternet
,
xquery
;
var
a
:
IXQValue
;
begin
for
a
in
process
(
'https://freepascal.org'
,
'//a/@href'
)
do
writeln
(
a
.
toString
)
;
end
.
You need to undefine USE_PASDBLSTRUTILS_FOR_JSON in internettoolsconfig.inc.
«
Last Edit: August 10, 2024, 01:52:43 pm by Thaddy
»
Logged
Due to censorship, I changed this to "Nelly the Elephant". Keeps the message clear.
Print
Pages: [
1
]
« previous
next »
Lazarus
»
Forum
»
Programming
»
Networking and Web Programming
»
Is there any library that would make process of making a website scrapper easier
TinyPortal
© 2005-2018