Xtract homepage

Download Xtract

Versiontgzi386i486i586i686src
Current            
           

Status for Xtract

  Xtract currently supports only the HTML language. The HTML functionallity is fully implemented. I do not intend to add support for any other language by m

Updates

(build nr. )
  • The generic package Xtract is included in all language-specific module-packages. To get Xtract, download one of those.
  Xtract is a generic module for extracting, removing or replacing parts of a document in any block-oriented language (Non-regular, context-free), such as Html or LaTeX. The part to extract remove or replace is pointed out by a path through the nested blocks. Blocks are identified by their name and (a subset of) their parameters. If more than one indistinguishable occurance of a block (names and parameters are the same), the two blocks are identified by their order (The first occurance gets index 1, the second index 2, and so on).   The module is implemented as a class which must be derived (Instansiation of the base-class results in an exception). The derived class should implement all the language-specific functions.
Note: The version naming system used for Xtract. Please follow this when developing modules for, and deriving works from, Xtract.

Xtract modules

Homepage Source code Text util: HtmlXtract

Status for HtmlXtract

  The HTML functionallity is fully implemented. I do not intend to add any more features. Which would that be? :)
  This module implements a derived class with the language-specific functions for HTML. In addition, it includes a command-line interface for this derived class.