PyKHTML site-scraping library

Description:
From the PyKHTML website at http://paul.giannaros.org/pykhtml:

"PyKHTML is...
A Python module for writing website scrapers/spiders. Whereas traditional methods focus on writing the code to parse HTML/forms themselves, PyKHTML uses the excellent KHTML engine to do all the trudge work. It therefore handles webpages very well (even the severely crufty ones) and is pretty darn fast (implemented in C++). As a bonus the module handles JavaScript and cookies transparently. Hurrah!"
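
For contrast, the "traditional" approach the quote mentions, writing the HTML parsing code yourself, might look like the minimal sketch below. It uses only Python's standard-library html.parser and deliberately does not use PyKHTML, whose API is not shown on this page. The names LinkCollector and extract_links and the sample markup are hypothetical, chosen for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper, not part of PyKHTML)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag and attribute
        # names arrive lower-cased from the parser.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text):
    """Return every link target found in html_text, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


# Made-up sample markup for illustration only.
SAMPLE = '<p><a href="/download.htm">Download</a> <A HREF="/changelog.htm">Changes</A></p>'
print(extract_links(SAMPLE))
```

A hand-written parser like this only sees the static markup. It does not run JavaScript or manage cookies, which is the work the quote says PyKHTML hands off to the KHTML engine.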
Last changelog:

The PyKHTML changelog is available at http://paul.giannaros.org/pykhtml/changelog.htm


Ratings & Comments

5 Comments

Gogast

Is it possible to create thumbnails of web pages?

cerulean

Not at the moment, but that should be easily accomplished. Is it something you'd find useful? If so, I could have a go at implementing it.

Gogast

Yes. What do you think about PyKDE? Are all KDE libraries already ported to Python?

cerulean

PyKDE is excellent. It wraps pretty much all of the functionality of kdelibs and is a pleasure to work with; I would highly recommend it. PyKDE4 is not ready yet. It will be released when the KDE4 API is stable and finalised.

cerulean

Thumbnail generation is now more or less implemented in the development repository (though, for the moment, it requires that you're in GUI debug mode). You can get instructions on how to check it out at http://paul.giannaros.org/pykhtml/download.htm

Details
version 0.2
