Pavuk
Encyclopedia
Pavuk is a GPL opensource web mirror (recursive download) software, with both command line and X Window GUI. Win32 ports are also available.

The most significant feature compared to similar software wget
Wget
GNU Wget is a computer program that retrieves content from web servers, and is part of the GNU Project. Its name is derived from World Wide Web and get...

 and httrack
HTTrack
HTTrack is a free and open source Web crawler and offline browser, developed by Xavier Roche and licensed under the GNU General Public License....

are advanced regular expression based filtering capabilities, filename creation rules, filtering based on HTML tag patterns, proper stop and resume. Unlike httrack, Javascripts are not interpreted to extract URL links, but they can be processed to a certain extent with regular expression matching.

The project has been dead since 2008 and suffers a lot segmentation fault.
The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK