-
KICSS 2006 : Knowledge, Information and Creativity Support Systems
The 1st International Conference on Knowledge, Information and Creativity Support Systems, August 1-4, 2006, Ayutthaya, Thailand (abstract submission deadline: May 15, 2006). http://kind.siit.tu.ac.th/kicss2006/ The conference will cover a broad range of research topics in the fields of knowledge engineering and science, information technology, creativity support systems and complex system modeling. They include, but are not…
-
Character set detection in Java
jchardet — a Java port of Mozilla’s automatic charset detection algorithm.
-
TIGER API 1.8 released
TIGER API is a library that lets Java programmers easily access the structure of any corpus given as a TIGER-XML file. oeze, one of the authors of TIGER API, left us a message today: BTW, Tiger API has moved. This is the new URL: TIGER API. We have also included a section…
-
Looking for Structures
keywords: semi-structured text, unstructured text, structure recognition Retrieving Hierarchical Text Structure from Typeset : Scientific Articles – a Prerequisite for E-Science Text Mining Indexing Real-World Data using Semi-Structured Documents Inferring Structure Information from Typography Dr. Rolf Brugger Modeling Documents for Structure Recognition Using Generalized N-Grams A DTD Extension for Document Structure Recognition Jedi: Extracting and…
-
Island Grammars / Parsing
Water of uncertainty. Islands of certainty. Island Grammars and Island Parsing + Document Structure Parsing What is a Topic Map? (Durusau & O’Donnell, 2002) Semantic Role Parsing: Adding Semantic Structure to Unstructured Text (Pradhan, 2003) Adding Structure to Unstructured Text (Maletic & Collard, 2005) Island Parsing and Bidirectional Charts (Stock, 1988) (CiteSeer) Generating Robust Parsers…
-
Parsing Parsing
Natural Language Parsing (course) @ Uni Heidelberg The Program Transformation Wiki ANTLR tutorial @ The University of Birmingham (+ many other Java-related tutorials) Parsing books: by Dick Grune Modern Compiler Design, Parsing Techniques – A Practical Guide, Parsing Techniques – 2nd Edition Formalism / Tools SDF – Modular Syntax Definition Formalism TXL – The TXL…
-
NLP and Semantic Grid
Having to sit down and piece together this and that (components that were never designed to work with each other) is really tedious. It would be nice if each piece could behave as a service / resource. Grid-Enabling Natural Language Engineering By Stealth by Baden Hughes and Steven Bird Semantic Grid GridNLP (nothing there, really) Try searching the web for natural+language+grid or nlp+grid.
-
(Better, Faster,) Lighter NLTK
NLTK-Lite is a substantially simplified and streamlined version of NLTK (Natural Language Toolkit). NLTK is no longer supported. NLTK-Lite is a new collection of lightweight NLP modules designed for maximum simplicity and efficiency. NLTK-Lite only covers the simple variants of standard data structures and tasks. Simplicity and efficiency are valued over generality and extensibility. Key differences…
-
Ruby NLP
NLP software in Ruby (old link), Ruby Linguistics Framework, WordNet for Ruby, Ruby on AI. Ruby Linguistics is interesting in the way you use it: it feels very Ruby-ish. Examples:
"box".en.plural #=> "boxes"
"mouse".en.plural #=> "mice"
"ruby".en.plural #=> "rubies"
"book".en.a #=> "a book"
"article".en.a #=> "an article"
"runs".en.present_participle #=> "running"
"eats".en.present_participle #=> "eating"
"spies".en.present_participle #=> "spying"
"leaving".en.infinitive #=> "leave"…
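To show why this calling style feels natural in Ruby, here is a toy sketch of how a `.en` proxy on String could be built. It is not the Linguistics library's actual implementation: the `EnglishProxy` class, its tiny rule set, and the irregular-plural table are hypothetical and cover only a few of the examples above.

```ruby
# Toy illustration only; NOT how the Linguistics library is implemented.
# EnglishProxy and its rules are hypothetical names made up for this sketch.
class EnglishProxy
  # Hand-written exceptions; a real library would need a far larger table.
  IRREGULAR_PLURALS = { "mouse" => "mice" }.freeze

  def initialize(word)
    @word = word
  end

  # Pluralize using a handful of naive spelling rules.
  def plural
    return IRREGULAR_PLURALS[@word] if IRREGULAR_PLURALS.key?(@word)
    return @word.sub(/y\z/, "ies") if @word.match?(/[^aeiou]y\z/)
    return @word + "es" if @word.match?(/(s|x|z|ch|sh)\z/)
    @word + "s"
  end

  # Prefix an indefinite article, chosen from the first letter only.
  def a
    article = @word.match?(/\A[aeiou]/i) ? "an" : "a"
    "#{article} #{@word}"
  end
end

class String
  # Return a proxy object so the English-specific methods
  # do not have to be defined on String itself.
  def en
    EnglishProxy.new(self)
  end
end

puts "box".en.plural
puts "mouse".en.plural
puts "ruby".en.plural
puts "article".en.a
```

The proxy keeps language-specific behaviour in its own class, so only one short method (`en`) is added to String, and another language could in principle get its own proxy in the same way.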
-
2006 NLP workshop & symposium
TALN 2006: Workshop on Multilingual Lexical Semantics ISDD 2006: International Symposium : Discourse and Document