I have submitted some modifications to the monkey-spider honeyclient in a separate project branch. Here is what has been done in particular:
- Converted stand alone source files into OO classes/methods: msProcessFolder.py ( depends on msExtractArc.py, msScannerClamav.py, msStatsCollector.py )
- Improved Error handling when adding results into database
- Added functionlity to log all queries sent to the database into separate log files, so in case database gets corrupt it can be recovered from the log.
- Migrated from PostgreSQL to MySQL database
- Added queries to complete the unused database tables. Now 5 tables are being filled up with msProcessFolder.py: malware, mw_output, mw_scanner, extractorstats, scannerstats
- Added functionality to log extraction and scanning times into the database for performance analysis
- Added a script to analyze malware extractor/scanner statistics ( msAnalyzer.py )
- Added a script to gather URLs from Google searches ( ms-seeder-ws-google.py )
You can find the code in the new branch here.