Ticket #8 (closed enhancement: fixed)
Text search of available material
| Reported by: | rgrp | Owned by: | rgrp |
|---|---|---|---|
| Priority: | major | Milestone: | 0.6 |
| Component: | shakespeare | Version: | |
| Keywords: | Cc: |
Description (last modified by rgrp) (diff)
As A
Visitor
I Want To
Do a text search of the available material
So That
I can find where words and phrases that i am interested in are used
Notes
- This facility should be available through the web interface
- This facility would essentially replace the concordance (a concordance is a poor man's search facility)
Implementation
Best bet is probably xapian which has some reasonable python bindings available:
http://xapian.org/docs/bindings/python/
Examples of usage can be found in:
http://www.xapian.org/docs/bindings/python/examples/
See in particular:
http://www.xapian.org/docs/bindings/python/examples/simpleindex.py
Which indexes each para of a text file as a separate Xapian document (perhaps what we want for shakespeare texts?).
Change History
Note: See
TracTickets for help on using
tickets.
