Search outside database!

Submitted by chris on Wed, 2008-12-24 20:10.

The strong feature of DBSight is searching databases, but because DBSight is only for database, it has been the shortcoming also.

But thing's changed! Now DBSight can search outside database also!

Basically you just need to implement a simple abstract java class, where you can retrieve content from any data source, like files on disk, XML feeds via HTTP, etc.

The API is simple, yet able to handle incremental indexing and a re-creating index. You just need to select the content and submit it to DBSight. The content is processed in a streaming fashion, without consuming too much memory. DBSight will handle uniqueness checking and content updating in the background.

Be sure to check it out. http://wiki.dbsight.com/index.php?title=Search_outside_of_database

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.
Submitted by path on Tue, 2009-01-13 04:57.

Hi!

I'd like to know how I can implement this on my dbsight server. So far there there is no readme provided, read the wiki but didn't say how to set it up.

Thanks in advance.

Submitted by will on Tue, 2009-01-13 10:33.

The sample fetcher has all the information you will need. You can start with the build.xml, to adjust 3 key properties:

 dir : the directory name. Should be unique among all fetchers.
 DBSIGHT_HOME : the directory you installed DBSIGHT
 jarName : Give you jar file a name.

Then, just use the sample as a starting point, rename the DumbFetcher and TestFetcher to your own class name/package.

After these, you can start writing your own code. When it can run with your TestFetcher, do an "ant", and it'll be deployed to your DBSight instance.

Do let us know where you found confusing, and we will explain in more details. Thanks!