Seeker code update
I updated Seeker a few minutes. This is my code that wraps Lucene functionality. If that sounds like a type of mouthwash to you - just think of Lucene as a search engine, much like Verity, except that Lucene is free and open source. It also runs just fine on OSX.
The updates I included in Seeker are just bug fixes, but pretty critical bug fixes. Later this week I hope to have the ColdFusion Administrator pages build in to make it even easier to use. I'll be mimicking the Verity admin UI (pretty much) but will also include a search tool (like my Verity one) that will let you search indexes directly from the administrator.
p.s. And while I have your attention - my work on wrapping SVNKit as a possible replacement for the front end SVN stuff for RIAForge is close to being done. I'll be releasing that code as well (most likely).
Comments
Does it work in a similar way: Kinda. :) Like Verity, you have 2 main parts. Part one is creating and maintaining the index. Part 2 is the searching. I tried to make things very much like the Verity API in CF.
Would I recommend it? My code has had VERY little usage. I think about 2 people have used it. To me - that's a bit scary. But - we got to start someplace. ;)
There are ways of dealing with this, to a degree, but it sure is nice that Verity handles the document conversions out of the box.
This is definitely something that I will be looking into using, along with your FeedBurner CFC!
@Gus - There is another project at Apache that helps with this, but I haven't worked much with it. I built Seeker though so that it is easy to extend. Download it and look at how I built the readers. To add support for format X, you just add a CFC. Todd Sharp is going to share some PPT code with me soon.
The latest updates are looking promising for file based searches which were not working properly in the previous version. I really look forward to seeing the cfadmin stuff and will continue to test the code. Adding some more file readers will be really useful although pdf's and htm files are covered and these are the most common ones we index.
Keep up the excellent work Ray, I seriously don't know how you find the time.
I will give Lucene a test anyway - I'm only interested in query based indexing so it looks ideal.
I've got a new release of Seeker coming out later today. It just adds the ability to search N fields (thanks to AJ Mercer) and cleans up the zip a bit.
I also need to look into index operations like update/delete. It's going to suck if you have to blow away your index for every update.
I would like to know what file formats can currently be read and indexed?
Thanks
Would it be possible to index metadata from images? Or would that be something that could be added?
I basically need to create an indexes for multiple tables
I assume that would be true for file indexing too?
Thanks for creating it!

So does it work in a very similar way and would you recommend it as a good Verity alternative.