About the Harvest Search System
See the Harvest Homepage for more
information about Harvest.
This page is part of an effort to use Zebra as a full-text indexer for
Harvest. It is still a work in progress. The following issues need to be
addressed:
- Ranking and proximity don't work as expected.
- We currently work with well-formed XML files. Would it be useful to
require valid XML files instead? Do we need an XML DTD or Schema?
- Create a query interface that exposes the rich features of Zebra
but is not too complicated for casual users.
- Build an XSL for Harvest's XML files?
- Build an abstract syntax file (xsoif.abs) for Zebra?
- Put the file name into the XML file, so that we can add a link to the raw
XML file? This would help in evaluating the quality of the summarizer and
the result ranking algorithm.
- Move the functionality of soif2xml.pl into the broker.
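
To illustrate the well-formed versus valid question raised above, here is a
minimal sketch in Python (not part of Harvest or Zebra). It only checks
well-formedness: the standard library parser does not validate against a DTD
or Schema, so validity would need a validating parser. The element names in
the sample records are hypothetical and do not reflect the actual output of
soif2xml.pl.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses as well-formed XML.

    This does NOT check validity against a DTD or Schema.
    """
    try:
        ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return True


# Hypothetical records; the element names are illustrative only.
good = "<record><title>Example</title></record>"
bad = "<record><title>Example</record>"  # <title> is never closed

print(is_well_formed(good))
print(is_well_formed(bad))
```

A record can pass this check and still be invalid, for example if it omits an
element that a DTD would require. That is the gap a DTD or Schema would close.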