Class CasDataToXCas

java.lang.Object
org.apache.uima.cas_data.impl.CasDataToXCas

public class CasDataToXCas extends Object
Takes a CasData and generates XCAS SAX events.
  • Field Details

    • mDocumentTextTypeName

      private String mDocumentTextTypeName
    • mDocumentTextFeatureName

      private String mDocumentTextFeatureName
    • mXCasDocTextTag

      private String mXCasDocTextTag
    • mIncludeAnnotationSpannedText

      private boolean mIncludeAnnotationSpannedText
    • mTypesToFilter

      private List<String> mTypesToFilter
    • mHandler

      private ContentHandler mHandler
  • Constructor Details

    • CasDataToXCas

      public CasDataToXCas()
  • Method Details

    • getDocumentTextTypeName

      public String getDocumentTextTypeName()
      Gets the name of the CASData FeatureStructure Type that stores the document text.
      Returns:
      the document text type name
    • setDocumentTextTypeName

      public void setDocumentTextTypeName(String aDocumentTextTypeName)
      Sets the name of the CASData FeatureStructure Type that stores the document text.
      Parameters:
      aDocumentTextTypeName - the document text type name
    • getDocumentTextFeatureName

      public String getDocumentTextFeatureName()
      Gets the name of the CASData Feature that stores the document text.
      Returns:
      the document text feature name
    • setDocumentTextFeatureName

      public void setDocumentTextFeatureName(String aDocumentTextFeatureName)
      Sets the name of the CASData Feature that stores the document text.
      Parameters:
      aDocumentTextFeatureName - the document text feature name
    • setXCasDocumentTextTagName

      public void setXCasDocumentTextTagName(String aXCasDocTextTag)
      Sets the name of the XCAS tag that will contain the document text.
      Parameters:
      aXCasDocTextTag - the document text tag
    • setIncludeAnnotationSpannedText

      public void setIncludeAnnotationSpannedText(boolean aIncludeAnnotationSpannedText)
      Parameters:
      aIncludeAnnotationSpannedText - -
    • setTypesToFilter

      public void setTypesToFilter(String[] aTypesToFilter)
      Specifies names of types that will not be included in the XCAS
      Parameters:
      aTypesToFilter - -
    • setContentHandler

      public void setContentHandler(ContentHandler aHandler)
      Sets the ContentHandler to receive the SAX events.
      Parameters:
      aHandler - -
    • generateXCas

      public void generateXCas(CasData aCasData) throws SAXException
      Generates XCAS for a CasData. SAX events representing the XCAS will be sent to the ContentHandler registered via setContentHandler(ContentHandler).
      Parameters:
      aCasData - the CasData from which XCAS will be generated
      Throws:
      SAXException - if the ContentHandler throws a SAX Exception
    • generateXCas

      public void generateXCas(CasData aCasData, String aUEID) throws SAXException
      Special form of generateXCas(CasData) that allows a UEID (Universal Entity ID) element to be added as the first element in the XCAS.
      Parameters:
      aCasData - the CasData from which XCAS will be generated
      aUEID - the UEID to add to the XCAS
      Throws:
      SAXException - if the ContentHandler throws a SAX Exception
    • generateXCas

      public void generateXCas(CasData aCasData, String aUEID, boolean aSendStartAndEndDocEvents) throws SAXException
      Special form of generateXCas(CasData) that allows a UEID (Universal Entity ID) element to be added as the first element in the XCAS and also allows start/end document SAX calls to be supressed.
      Parameters:
      aCasData - the CasData from which XCAS will be generated
      aUEID - the UEID to add to the XCAS
      aSendStartAndEndDocEvents - true to send SAX events for start and end of document, false to supress them.
      Throws:
      SAXException - if the ContentHandler throws a SAX Exception
    • _generate

      private void _generate(FeatureStructure aFS, CasDataToXCas.DocTextHolder aDocTextHolder) throws SAXException
      Throws:
      SAXException
    • getXCasElementName

      private String getXCasElementName(FeatureStructure aFS)
      Gets the XCAS element name for a FS. This is usually the same as the type name, but the sequences _colon_ and _dash_ are translated to the characters : and -, respectively.
      Parameters:
      aFS - feature structures
      Returns:
      XCAS element name for this feature structure
    • _generateDocFS

      private void _generateDocFS(FeatureStructure aFS, CasDataToXCas.DocTextHolder aDocTextHolder) throws SAXException
      Parameters:
      aFS -
      Throws:
      SAXException