APACHE STANBOL

Apache Stanbol - Website - https://stanbol.apache.org/docs/trunk/tutorial.html

Stanbol helps to model a semantic relationship around NLP. Given a document it can find the main concepts like NER and gives link to these entities into DBPedia or Enterprise database.

The steps to follow to use Stanbol :

1) Use RESTFul aPI
2) Use Java API

Using RestFul API
----------------------------------

Step 1: export MAVEN_OPTS="-Xmx1024M -XX:MaxPermSize=256M"
Step 2 : svn co http://svn.apache.org/repos/asf/stanbol/trunk stanbol
Step 3: mvn clean install (From downloaded stanbol directory)
Step 4: java -Xmx1g -jar stable/target/org.apache.stanbol.launchers.stable-{snapshot-version}-SNAPSHOT.jar (give your corresponding stanbol jar name)
Step 5 : Open http://localhost:8080in web browser
Step 6 : The stanbol options are available now. For ex. enhancer we can use as we click on that and give a text , we will get the corresponding NERs and its related DBPedia links.

Otherwise Step 7 : curl -X POST -H "Accept: text/turtle" -H "Content-type: text/plain" \ --data "The Stanbol enhancer can detect famous cities such as Paris and people such as Bob Marley." \ http://localhost:8080/enhancer

We will get the results.

Java API :
----------------
We can download and integrate Apache Stanbol Client API into Java from
https://github.com/zaizi/apache-stanbol-client .

after downloading the file and unzipping import into eclipse as java maven project. The we can use the enhance from the code below :

public class Sample {

public static void main(String[] args) throws StanbolServiceException, StanbolClientException {
    Sample sample = new Sample();
    sample.SimpleContentEnhancement();
}

public void SimpleContentEnhancement() throws StanbolServiceException, StanbolClientException{
    final StanbolClientFactory factory = new StanbolClientFactory("http://localhost:8080");
    final Enhancer client = factory.createEnhancerClient();
    EnhancerParameters parameters = EnhancerParameters.
               builder().
               buildDefault("Paris is the capital of France");
    EnhancementStructure eRes = client.enhance(parameters);
    eRes.getBestAnnotations();

    for(TextAnnotation ta: eRes.getTextAnnotations()){
        System.out.println("********************************************");
        System.out.println("Selection Context: " + ta.getSelectionContext());
        System.out.println("Selected Text: " + ta.getSelectedText());
        System.out.println("Engine: " + ta.getCreator());
        System.out.println("Candidates: ");
        for(EntityAnnotation ea:eRes.getEntityAnnotations(ta))
              System.out.println("\t" + ea.getEntityLabel() + " - " + ea.getEntityReference());
    }
}
}

(U can refer to the actual documents in this link : -
https://github.com/zaizi/apache-stanbol-client )

The above pgm will give the output as : -

TechVision

Search This Blog

APACHE STANBOL

Comments

Post a Comment

Popular posts from this blog

Explore python Libraries - Numpy, Scipy, Matplotlib

A Rule Based Question Answering System in Malayalam corpus Using Vibhakthi and POS Tag Analysis

List of Computer Vision APIs