Module orchid :: Class UrlHandler
[show private | hide private]
[frames | no frames]

Class UrlHandler


A class responsible for parsing a url and retrieving it's contents.
Method Summary
  __init__(self)
A constructor for the url handler.
  getSite(self)
Returns the url object which was opened by setCurrentUrl.
  processUrl(self, stringUrl)
Sets the url that the parser is working on.
  __canVisitSite(self, stringUrl)
Checks whether we are allowed by robots.txt to visit some page.
  __createRequest(self, stringUrl)

Method Details

__init__(self)
(Constructor)

A constructor for the url handler. Should be followed by calls to setCurrentUrl and getSite.

getSite(self)

Returns the url object which was opened by setCurrentUrl. The returned object acts just like a file object.

processUrl(self, stringUrl)

Sets the url that the parser is working on. Raises an exception if we can't open it.

__canVisitSite(self, stringUrl)

Checks whether we are allowed by robots.txt to visit some page. Returns true if we can, false otherwise.

Generated by Epydoc 2.1 on Mon Dec 12 14:30:34 2005 http://epydoc.sf.net