SUMMA

 

Overview

SUMMA is a toolkit for the development of text summarization systems.

 

 


 

SUMMA Vector Computation

Functionality

Creates a vector for a annotations in a document (for example sentences, paragraph or the whole document). The vector is a map of terms and weights and is stored in the document.

Parameters of the Resource

Initialization parameters

  • stopFeature: the feature to filter stop words
  • stopWordLoc: location of table of stop words
  • stopTag: the feature to filter certain word categories
  • stopTagLoc: the location of the tabloe with thye categories to stop
  • lowercase: whether the feature for stop words will be lowercased before verifying if present in the table.

Runtime parameters

  • annSetName: the annotation set to process
  • ending: the encoding of the strings
  • sentAnn: the annotation name for which vectors will be created
  • tokenAnn: the annotation token used to extrat the terms for the vectors
  • statistics: the statistic to use for as value in the vector
  • tokenFeature: the feature of the token (or other annotation) to use as the term in the vector

Restriction

None. But to obtain the desired behaviour the appropriate annotations and statistics should have been computed.

 

 

 

 

 

Copyright 2002-2014 Universitat Pompeu Fabra