SUMMA

 

Overview

SUMMA is a toolkit for the development of text summarization systems.

 

 


 

SUMMA Simple Summarizer

Functionality

This component generates the summary. It combines features producing a score, ranks sentences by score, and annotates top ranked sentences. It also generates a stand-alone summary in the user interface.

Parameters of the Resource

  • annSetName: the annotation set where the annotations live
  • compression: an integer value representing either a percent of sentences to extrat or an absolut number of words to extract from the document.
  • newDocument: a boolean idicating if you want a new document gerenated for your summary
  • scoreOnly: a boolean indicating if you want to
  • sentAnn: the name of the sentence annotation where the features for scoring are going to be found and where the final score will be stored.
  • sentCompression: a boolean indicating if sentence compression is going to be used (e.g. proportion of sentences to extract) or absolute number of words.
  • sumFeatures: the features to include in the computation of the score
  • sumWeigths: the weights to be used to combine the features
  • wordAnn: the token annotation to count number of words for computing compression
  • sumSetName: the name of the annotation where sentences are going to be annotated for the summary.

Restriction

Features should have been computed for each sentence in the document.

 

 

 

 

 

Copyright 2002-2014 Universitat Pompeu Fabra