keywords: semi-structured text, unstructured text, structure recognition
- Retrieving Hierarchical Text Structure from Typeset : Scientific Articles – a Prerequisite for E-Science Text Mining
- Indexing Real-World Data using Semi-Structured Documents
- Inferring Structure Information from Typography
- Dr. Rolf Brugger
- Modeling Documents for Structure Recognition Using Generalized N-Grams
- A DTD Extension for Document Structure Recognition
- Jedi: Extracting and Synthesizing Information from the Web
- MarkItUp! An incremental approach to document structure recognition
- Lightweight Structure in Text
- A Maximum-Entropy Partial Parser for Unrestricted Text
- FeasPar: A Feature Structure Parser Learning to Parse Spontaneous Speech