Ticket #8 (closed enhancement: fixed)

Opened 4 years ago

Last modified 2 years ago

Text search of available material

Reported by: rgrp
Owned by: rgrp
Priority: major
Milestone: 0.6
Component: shakespeare
Version:
Keywords:
Cc:

Description (last modified by rgrp)

As A

Visitor

I Want To

Do a text search of the available material

So That

I can find where the words and phrases that I am interested in are used

Notes

  • This facility should be available through the web interface
  • This facility would essentially replace the concordance (a concordance is a poor man's search facility)

Implementation

The best bet is probably Xapian, which has reasonable Python bindings available:

 http://xapian.org/docs/bindings/python/

Examples of usage can be found in:

 http://www.xapian.org/docs/bindings/python/examples/

See in particular:

 http://www.xapian.org/docs/bindings/python/examples/simpleindex.py

That example indexes each paragraph of a text file as a separate Xapian document (perhaps what we want for the Shakespeare texts?).
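
A rough sketch of the paragraph-per-document idea, loosely modelled on simpleindex.py. The function names (split_paragraphs, index_paragraphs, search), the database path argument and the choice of the English stemmer are illustrative assumptions, not the code that was eventually committed. The xapian import is kept inside the functions so that the paragraph splitting can be tried without the bindings installed.

```python
import re


def split_paragraphs(text):
    """Split text into paragraphs separated by one or more blank lines."""
    parts = re.split(r"\n\s*\n", text.strip())
    return [p.strip() for p in parts if p.strip()]


def index_paragraphs(db_path, text):
    """Add each paragraph of `text` to a Xapian database as its own document.

    Hypothetical helper: requires the xapian Python bindings.
    """
    import xapian  # imported lazily; assumed to be installed

    database = xapian.WritableDatabase(db_path, xapian.DB_CREATE_OR_OPEN)
    indexer = xapian.TermGenerator()
    indexer.set_stemmer(xapian.Stem("english"))  # assumption: English stemming

    count = 0
    for para in split_paragraphs(text):
        doc = xapian.Document()
        doc.set_data(para)          # store the paragraph for display in results
        indexer.set_document(doc)
        indexer.index_text(para)    # generate the searchable terms
        database.add_document(doc)
        count += 1
    return count


def search(db_path, query_string, limit=10):
    """Return (docid, percent, paragraph) tuples for the best matches.

    Hypothetical helper: requires the xapian Python bindings.
    """
    import xapian  # imported lazily; assumed to be installed

    database = xapian.Database(db_path)
    parser = xapian.QueryParser()
    parser.set_stemmer(xapian.Stem("english"))
    parser.set_database(database)
    parser.set_stemming_strategy(xapian.QueryParser.STEM_SOME)

    enquire = xapian.Enquire(database)
    enquire.set_query(parser.parse_query(query_string))

    results = []
    for match in enquire.get_mset(0, limit):
        data = match.document.get_data()
        if isinstance(data, bytes):  # newer bindings return bytes
            data = data.decode("utf-8")
        results.append((match.docid, match.percent, data))
    return results
```

With something like this, a web view could call search() with the visitor's query and render the returned paragraphs, which is roughly the role the concordance plays now.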

Change History

Changed 4 years ago by rgrp

  • milestone changed from 0.4 to 0.5

Changed 2 years ago by rgrp

  • description modified

Changed 2 years ago by rgrp

  • milestone changed from 0.5 to 0.6

Changed 2 years ago by rgrp

  • status changed from new to closed
  • resolution set to fixed

Xapian-based search implemented in r169, r191, etc.
