Fulltext handling

Status: idea

Proposal: put the actual fulltext management in a separate module (let's call it Library), which can operate on a filesystem or on any other, more specific kind of repository, and add to pyblio an attribute that identifies the document unambiguously (say, a secure hash).
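As an illustration of the "secure hash" attribute, here is a minimal sketch that derives an identifier from a file's content. The function name `document_digest` and the choice of SHA-256 are assumptions made for the example; the proposal does not name a specific hash.

```python
import hashlib


def document_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of the file at `path`.

    Hypothetical helper: the proposal only says "a secure hash".
    The file is read in chunks so large documents are not loaded
    into memory all at once.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Because the identifier depends only on the content, it stays the same when the file is renamed or moved, which is what lets a pyblio record refer to the document without storing a path.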

For local files, this module should be able to keep track of where the files are located (i.e., it is not a black box into which you put your files, but rather an overlay that observes the files on disk).
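One possible reading of the "overlay" idea is sketched below: the files stay where they are, and the module only maintains an index from content hash to the paths currently holding that content. The class `FileOverlay` and its methods are hypothetical, and rescanning the whole tree is just the simplest way to observe the disk; nothing here is an existing pyblio API.

```python
import hashlib
import os


def _digest(path):
    """SHA-256 hex digest of a file's content (assumed hash choice)."""
    with open(path, "rb") as handle:
        return hashlib.sha256(handle.read()).hexdigest()


class FileOverlay:
    """Hypothetical index: content digest -> paths found on disk."""

    def __init__(self, root):
        self.root = root
        self.locations = {}

    def scan(self):
        """Rebuild the index from the files currently under `root`."""
        found = {}
        for dirpath, _dirnames, filenames in os.walk(self.root):
            for name in filenames:
                path = os.path.join(dirpath, name)
                found.setdefault(_digest(path), []).append(path)
        self.locations = found

    def locate(self, digest):
        """Return the known paths for a digest (empty list if none)."""
        return list(self.locations.get(digest, []))
```

After a file is moved by hand, a new `scan()` picks up its new location under the same digest, so the user remains free to reorganise the directory.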

For remote files (basically, URLs), the module is in charge of possibly keeping local copies, checking for updates, reporting access errors, and so on.

Things to consider:

The API should make it possible to ingest (batch-register) a set of local files, to fetch URLs, and to view a resource.
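To make the three operations concrete, here is a rough sketch of what such an API could look like. The class name `Library` comes from the proposal; the method names `register`, `fetch` and `view`, the `cache_dir` parameter, and the use of SHA-256 are all assumptions for the example. Update checks and error notification are left out.

```python
import hashlib
import os
import urllib.request


class Library:
    """Hypothetical fulltext library; all method names are illustrative."""

    def __init__(self, cache_dir):
        self.cache_dir = cache_dir
        self.resources = {}  # digest -> local path
        os.makedirs(cache_dir, exist_ok=True)

    def register(self, paths):
        """Batch-register local files; return their digests in input order."""
        digests = []
        for path in paths:
            with open(path, "rb") as handle:
                digest = hashlib.sha256(handle.read()).hexdigest()
            # The file is left in place; only its location is recorded.
            self.resources[digest] = os.path.abspath(path)
            digests.append(digest)
        return digests

    def fetch(self, url):
        """Download `url`, keep a local copy in the cache, return its digest."""
        with urllib.request.urlopen(url) as response:
            data = response.read()
        digest = hashlib.sha256(data).hexdigest()
        target = os.path.join(self.cache_dir, digest)
        with open(target, "wb") as handle:
            handle.write(data)
        self.resources[digest] = target
        return digest

    def view(self, digest):
        """Return the local path for a digest (raises KeyError if unknown)."""
        return self.resources[digest]
```

In this sketch `view` only returns a path; handing that path to an external viewer would be the caller's job. The digest returned by `register` or `fetch` is the value a pyblio record would store.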

Use cases

Related Stuff

Roadmap/Fulltext handling (last edited 2007-09-08 18:01:19 by FredericGobry)