Ticket #8 (closed enhancement: fixed)

Opened 3 years ago

Last modified 10 months ago

Text search of available material

Reported by: rgrp
Owned by: rgrp
Priority: major
Milestone: 0.6
Component: shakespeare
Version:
Keywords:
Cc:

Description (last modified by rgrp)

As A

Visitor

I Want To

Do a text search of the available material

So That

I can find where the words and phrases that I am interested in are used

Notes

  • This facility should be available through the web interface
  • This facility would essentially replace the concordance (a concordance is a poor man's search facility)

Implementation

The best bet is probably Xapian, which has reasonable Python bindings available:

http://xapian.org/docs/bindings/python/

Examples of usage can be found in:

http://www.xapian.org/docs/bindings/python/examples/

See in particular:

http://www.xapian.org/docs/bindings/python/examples/simpleindex.py

This script indexes each paragraph of a text file as a separate Xapian document (perhaps what we want for the Shakespeare texts?).
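As a rough illustration of the paragraph-per-document approach, here is a minimal sketch in the spirit of simpleindex.py. The function names (split_paragraphs, index_paragraphs, search) and the database path are hypothetical and are not taken from the shakespeare code base; the sketch assumes the Xapian Python bindings are installed and that paragraphs are separated by blank lines.

```python
import re


def split_paragraphs(text):
    """Split text into paragraphs on blank lines, collapsing inner whitespace."""
    blocks = re.split(r"\n\s*\n", text)
    return [" ".join(block.split()) for block in blocks if block.strip()]


def index_paragraphs(db_path, text):
    """Index each paragraph of `text` as a separate Xapian document.

    Hypothetical helper: `db_path` is any writable directory path.
    """
    import xapian  # imported lazily so split_paragraphs works without Xapian

    db = xapian.WritableDatabase(db_path, xapian.DB_CREATE_OR_OPEN)
    indexer = xapian.TermGenerator()
    indexer.set_stemmer(xapian.Stem("english"))
    for para in split_paragraphs(text):
        doc = xapian.Document()
        doc.set_data(para)         # store the paragraph so it can be shown in results
        indexer.set_document(doc)
        indexer.index_text(para)   # generate the search terms for this paragraph
        db.add_document(doc)
    db.commit()


def search(db_path, query_string, limit=10):
    """Return (docid, percent, data) tuples for the best-matching paragraphs."""
    import xapian

    db = xapian.Database(db_path)
    parser = xapian.QueryParser()
    parser.set_stemmer(xapian.Stem("english"))
    parser.set_database(db)
    parser.set_stemming_strategy(xapian.QueryParser.STEM_SOME)
    enquire = xapian.Enquire(db)
    enquire.set_query(parser.parse_query(query_string))
    return [(m.docid, m.percent, m.document.get_data())
            for m in enquire.get_mset(0, limit)]
```

With something like this, a call such as index_paragraphs("/tmp/shakespeare.db", play_text) followed by search("/tmp/shakespeare.db", "slings arrows") would return the paragraphs in which those words occur. Whether a paragraph is the right unit for the plays (as opposed to a speech or a scene) remains an open question.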

Change History

Changed 3 years ago by rgrp

  • milestone changed from 0.4 to 0.5

Changed 16 months ago by rgrp

  • description modified

Changed 11 months ago by rgrp

  • milestone changed from 0.5 to 0.6

Changed 10 months ago by rgrp

  • status changed from new to closed
  • resolution set to fixed

Xapian-based search implemented in r169, r191, r191 etc.
