SUMMA

Overview

SUMMA is a toolkit for the development of text summarization systems.

SUMMA Multi-extract Exporter

Functionality

Writes a multi-document summary, built from the sentences selected across the documents of a corpus, to a file on disk.

Parameters of the Resource

  • annotationSet: the annotation set that contains the summary sentences.
  • annotationType: the annotation type (e.g. Sentence) that marks the selected sentences.
  • dirName: the directory where the summaries are written.
  • summaryName: the file name for the summary.
  • corpus: the corpus whose documents provide the summary sentences.

Restriction

Use this resource in a GATE pipeline, not in a Corpus Pipeline: it works on the whole corpus passed in the corpus parameter, so running it once per document does not make sense. At least some of the documents must contain annotations marking the sentences selected for the multi-document extract. The component assumes that the order of the documents in the corpus is the order in which the sentences should be presented: sentences from the first document come first, sentences from the last document come last.
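
The ordering rule above can be illustrated with a minimal Python sketch. It does not use the GATE or SUMMA API: documents are modelled as plain dictionaries, and export_multi_extract and make_doc are hypothetical helpers whose arguments only mirror the parameter names listed above. The set name EXTRACT, the one-sentence-per-line output format, and the offset ordering of sentences within a single document are assumptions made for the example, not documented behaviour of the resource.

```python
from pathlib import Path
import tempfile


def make_doc(text, selected_sentences, set_name="EXTRACT", ann_type="Sentence"):
    """Build a toy document (hypothetical structure, not a GATE Document)."""
    annotations = []
    for sentence in selected_sentences:
        start = text.index(sentence)  # offset of the sentence in the text
        annotations.append(
            {"type": ann_type, "start": start, "end": start + len(sentence)}
        )
    return {"text": text, "annotation_sets": {set_name: annotations}}


def export_multi_extract(corpus, annotation_set, annotation_type,
                         dir_name, summary_name):
    """Hypothetical stand-in for the exporter; not SUMMA's implementation."""
    lines = []
    # Corpus order is presentation order: first document first.
    for doc in corpus:
        annotations = doc["annotation_sets"].get(annotation_set, [])
        selected = [a for a in annotations if a["type"] == annotation_type]
        # Assumption: within one document, sentences keep their text order.
        selected.sort(key=lambda a: a["start"])
        lines.extend(doc["text"][a["start"]:a["end"]] for a in selected)
    out_path = Path(dir_name) / summary_name
    # Assumption: one selected sentence per line.
    out_path.write_text("".join(line + "\n" for line in lines), encoding="utf-8")
    return out_path


if __name__ == "__main__":
    corpus = [
        make_doc("Floods hit the coast. Roads were closed.",
                 ["Floods hit the coast."]),
        make_doc("No sentence was selected here.", []),
        make_doc("Aid arrived on Monday.", ["Aid arrived on Monday."]),
    ]
    with tempfile.TemporaryDirectory() as dir_name:
        path = export_multi_extract(corpus, "EXTRACT", "Sentence",
                                    dir_name, "summary.txt")
        print(path.read_text(encoding="utf-8"))
```

In this sketch a document without selected sentences contributes nothing, which matches the requirement that only some of the documents need to carry the annotations. Reordering the documents in the corpus list is the only way to change the order of the sentences in the output.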

Copyright 2002-2014 Universitat Pompeu Fabra