Tag: search

Using a FSIS CTS flow to generate text files and compile as FAST ESP dictionaries

Posted by – December 23, 2011

After have some trouble trying while create my FSIS CTS flow, I could finally read the data need from SQL Server instance and push to FAST ESP for indexing, once I got everything working as expected I started to think in what else I could easily to in a CTS flow.

My next challenge was create a FAST ESP dictionary for use on query completion server of FAST ESP, the majority of built in operators doesn’t provide a easy way to do it, hopefully I found the RunCode operator that I can use to perform anything that I need with a custom C# or VB code.
More

Share

Running a FSIS CTS flow from command line

Posted by – December 21, 2011

FSIS is a powerfull solution, with CTS you can easily consume and process data to be used on ESP for search or not, recently I wrote a very small powershell script that just run a CTS flow to index database records on ESP.

Here’s how this script looks like:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
Add-PSSnapin HostControllerPSSnapIn
Add-PSSnapin EnginePSSnapin
Add-PSSnapin JunoPSSnapin

write-host "`nRunning flow`n" `
        -foregroundcolor green

Connect-System

Connect-Engine

Execute-Flow -FlowName MyFlow

write-host "`nFinished running`n" `
        -foregroundcolor green

And then I just run the script with:

powershell.exe -Command “& c:\scripts\process_database.ps1″

The above command line example can is ready to be used on Task Scheduler, enjoy!

Share

Web Content Extractor for Web Scrapping

Posted by – August 2, 2008

The scrapy is a very nice software that allows to extract only specific portions of html page, with web scrapping tools you can extract from the web pages the data that you really need, dropping out all layout markup, this software is very easy to use, it has similiar features to kapow but very less expensive.
You can learn more about this amazing framework by clicking here.

Share

Working a little with FAST ESP

Posted by – June 22, 2008

I’ve been stopped my work on mobile and web development for a couple days because now I’m working on a search engine backend project, this isn’t a easy task because you must try some different configuration settings in order to get the search engine components working properly.

I hope post good information about this work very soon and maybe I can help you someday with this, who knows? :D

Share

Information Retrieval and Search solutions for Java platform

Posted by – April 3, 2008

After few days reading Lucene and Solr mailing lists archives I discovered a large range of frameworks and tools related with information retrieval and natural language processing, at first moment we can find the most important search engines components at Lucene site as Lucene subprojects, but there are several other initiatives related with this topic.
More

Share