Module malcontent :: Class Malcontent |
|
object
 +-- _Verbose
      +-- Thread
           +-- NaiveAnalyzer
                +-- Malcontent
Method Summary | |
---|---|
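__init__(self, linksToFetchAndCond, siteQueueAndCond, db, rules): Creates a new malicious content analyzer. | |
addSiteToFetchQueue(self, lfs): Adds the sites extracted in analyzeSite to the "to fetch" queue. | |
analyzeSite(self, db, site): Applies all the available rules to the given site and extracts the links that we intend to crawl. | |
report(self): Logs the results of the crawl. | |
selectNextUrl(self): Selects the next URL to crawl. | |
__checkSiteEvilness(self, site): Applies all the rules to the given site and records the results. | |
__checkSiteEvilnessWithRules(self, site): Applies all the rules to the given site. | |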
Inherited from NaiveAnalyzer | |
Returns the number of sites this analyzer has processed. | |
Returns a map which maps domain names to links inside the domain. | |
Performs the main function of the analyzer. | |
Sets the stop condition to the specified value. | |
Inherited from Thread | |
| |
| |
| |
| |
| |
| |
| |
| |
| |
Inherited from _Verbose | |
| |
Inherited from object | |
x.__delattr__('name') <==> del x.name | |
x.__getattribute__('name') <==> x.name | |
x.__hash__() <==> hash(x) | |
T.__new__(S, ...) -> a new object with type S, a subtype of T | |
helper for pickle | |
helper for pickle | |
x.__setattr__('name', value) <==> x.name = value | |
x.__str__() <==> str(x) |
Class Variable Summary | |
---|---|
Inherited from Thread | |
bool | _Thread__initialized = False |
Method Details |
---|
__init__(self, linksToFetchAndCond, siteQueueAndCond, db, rules)
Creates a new malicious content analyzer.
|
addSiteToFetchQueue(self, lfs)
Adds the sites extracted in analyzeSite to the "to fetch" queue.
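The constructor parameter name linksToFetchAndCond suggests a queue paired with a condition variable. The following is a minimal sketch of how such a hand-off might look, not the actual malcontent implementation. It assumes the pair is a (deque, threading.Condition) tuple; the function name add_links_to_fetch_queue and the tuple layout are hypothetical.

```python
import threading
from collections import deque


def add_links_to_fetch_queue(links_to_fetch_and_cond, links):
    """Append links to a shared queue and wake any waiting fetcher threads.

    Assumption: links_to_fetch_and_cond is a (deque, threading.Condition)
    pair; this layout is a guess based on the parameter name only.
    """
    queue, cond = links_to_fetch_and_cond
    with cond:                 # hold the lock while mutating the queue
        queue.extend(links)
        cond.notify_all()      # wake consumers blocked in cond.wait()


# Usage: one shared pair, filled by the analyzer thread.
shared = (deque(), threading.Condition())
add_links_to_fetch_queue(shared, ["http://example.com/a", "http://example.com/b"])
```

A consumer thread would acquire the same condition, wait while the queue is empty, and pop from the left once notified.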
|
analyzeSite(self, db, site)
Applies all the available rules to the given site and extracts the links that we intend to crawl. Currently we follow regular ('<a...'), frame, iframe and script links.
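The link extraction described above could be sketched with the standard library's html.parser module. This is an illustration only: the original code predates Python 3 and its parsing approach is not documented here. The names LINK_ATTRS, LinkExtractor, and extract_links are hypothetical.

```python
from html.parser import HTMLParser

# Tag -> attribute that carries the link target, covering the four
# link kinds named in the description (regular, frame, iframe, script).
LINK_ATTRS = {"a": "href", "frame": "src", "iframe": "src", "script": "src"}


class LinkExtractor(HTMLParser):
    """Collects link targets from a, frame, iframe and script tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        wanted = LINK_ATTRS.get(tag)   # tag names arrive lowercased
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                self.links.append(value)


def extract_links(html):
    """Return the link targets found in html, in document order."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links
```

The returned values are raw attribute strings; a crawler would still need to resolve relative URLs against the page address, for example with urllib.parse.urljoin.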
|
report(self)
Logs the results of the crawl.
|
selectNextUrl(self)
Selects the next URL to crawl. This is done by selecting a random domain and then taking one page from its queue.
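The selection strategy can be sketched as follows. This is a minimal illustration and assumes the per-domain queues are held in a dict that maps a domain name to a list of pending URLs; the function name select_next_url and that data layout are hypothetical.

```python
import random


def select_next_url(domain_queues, rng=random):
    """Pick a random domain with pending pages and pop one URL from its queue.

    Assumption: domain_queues maps a domain name to a list of pending URLs.
    Returns None when nothing is left to crawl.
    """
    candidates = [d for d, urls in domain_queues.items() if urls]
    if not candidates:
        return None
    domain = rng.choice(candidates)        # every domain is equally likely
    url = domain_queues[domain].pop(0)     # take the oldest page of that domain
    if not domain_queues[domain]:
        del domain_queues[domain]          # drop domains with no pages left
    return url
```

Choosing the domain first, rather than a random URL from one global list, gives each domain the same chance of being picked regardless of how many pages it has queued, which tends to spread requests across hosts.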
|
__checkSiteEvilness(self, site)
Applies all the rules to the given site and records the results.
|
__checkSiteEvilnessWithRules(self, site)
Applies all the rules to the given site. Helper method.
|
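The form of the rules passed to the constructor is not documented on this page. As a rough sketch of what "applying all the rules" might mean, the example below assumes each rule is a (name, predicate) pair whose predicate receives the page text; the function name check_site_with_rules and both sample rules are hypothetical.

```python
def check_site_with_rules(site, rules):
    """Apply every rule to the site and return the names of the rules that matched.

    Assumption: each rule is a (name, predicate) pair and site is the page
    text; the real rule objects may look quite different.
    """
    return [name for name, predicate in rules if predicate(site)]


# Two made-up rules, for illustration only.
sample_rules = [
    ("hidden-iframe", lambda page: "<iframe" in page and "display:none" in page),
    ("eval-call", lambda page: "eval(" in page),
]

sample_page = '<iframe src="x" style="display:none"></iframe>'
matched = check_site_with_rules(sample_page, sample_rules)
```

A recording step like the one __checkSiteEvilness describes would then store the matched rule names alongside the site.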
Generated by Epydoc 2.1 on Mon Dec 12 14:30:34 2005 | http://epydoc.sf.net |