Papers
- Converting From PDF To XML & MS Word: Avoiding The Pitfalls – Part 1, Part 2
- Converting PDF to XML with Publication-Specific Profiles
- Extracting metadata from PDF file (view in HTML)
- A Comparative Analysis Framework for Semi-structured Government Regulations (very long and only partly related to PDF; it is actually a PhD dissertation about legal documents)
Tools
- pdftohtml — PDF to HTML/XML conversion utility
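As a rough illustration of how pdftohtml can be driven from a script, the sketch below wraps its XML mode in Python. The helper names `build_pdftohtml_command` and `convert_pdf_to_xml` are hypothetical, and the sketch assumes the Poppler build of pdftohtml is on the PATH and that it writes its output to the given base name with `.xml` appended.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftohtml_command(pdf_path, output_base):
    """Return the argument list for an XML conversion with pdftohtml."""
    # -xml requests XML output instead of HTML;
    # -i ignores images so only the text layer is written.
    return ["pdftohtml", "-xml", "-i", str(pdf_path), str(output_base)]


def convert_pdf_to_xml(pdf_path, output_base):
    """Run pdftohtml and return the path of the XML file it should produce."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml is not installed or not on PATH")
    subprocess.run(build_pdftohtml_command(pdf_path, output_base), check=True)
    # Assumption: the tool appends ".xml" to the output base name.
    return Path(str(output_base) + ".xml")
```

Calling `convert_pdf_to_xml("paper.pdf", "paper")` would then be expected to leave the result in `paper.xml`. Building the command in a separate function keeps the flags easy to inspect without running the tool.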