-
Topic Clustering
There is more out there than Vivisimo; read about topic clustering at Fagan Finder.
-
C|NET News: Search may be Microsoft’s next target, court told
Microsoft may be unlawfully wielding its desktop dominance to put the squeeze on search engines and on document formats like Adobe Acrobat, the state of Massachusetts claimed on Friday. [READ THE NEWS]
-
Web Graphs and P2P
Web Graphs: Everyone in computer science, and in some fields of engineering (e.g. industrial engineering?), is very familiar with graphs: those nodes and arcs. In fact, we can represent the web as a [huge] graph, where each node is a web page and each arc is a (hyper)link. This representation gives us a way to better understand the characteristics of the web…
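To make the node=webpage, arc=(hyper)link idea concrete, here is a minimal sketch in Python. The page names and the helper `build_web_graph` are made up for illustration; a real web graph would be built from crawled link data.

```python
def build_web_graph(links):
    """Build out-link and in-link adjacency sets from (source, target) pairs.

    Each page is a node; each (source, target) pair is a directed arc,
    i.e. a hyperlink from the source page to the target page.
    """
    out_links = {}
    in_links = {}
    for source, target in links:
        # Register both endpoints so that every page appears as a node,
        # even if it has no outgoing (or no incoming) links.
        for page in (source, target):
            out_links.setdefault(page, set())
            in_links.setdefault(page, set())
        out_links[source].add(target)
        in_links[target].add(source)
    return out_links, in_links


# Hypothetical four-link web, for illustration only.
links = [
    ("a.html", "b.html"),
    ("a.html", "c.html"),
    ("b.html", "c.html"),
    ("c.html", "a.html"),
]

out_links, in_links = build_web_graph(links)

# Simple characteristics of the graph: how many links leave and enter each page.
out_degree = {page: len(targets) for page, targets in out_links.items()}
in_degree = {page: len(sources) for page, sources in in_links.items()}
```

Degree counts like these are among the simplest characteristics one can read off such a graph; link-based ranking methods start from the same representation.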
-
Summarization for Search Engine
On the subject of document clustering, categorization, and classification: another approach to helping users get through mountains of pages is summarization. Instead of showing only the page title, the URL, and the first few (often meaningless) paragraphs of the page, short summaries may help users decide which pages are what they want and which are not. Besides grouping the retrieved documents so that users can more easily pick out which ones to keep and which to skip, having a short summary of each page should make that decision easier and faster. Read the papers below…
-
PageRank explained
A Survey of Google’s PageRank: PageRank is one of the algorithms used by the Google search engine. If you want to know how PageRank works, this is the site to read.
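For a rough feel of the idea, here is a minimal sketch of the commonly described textbook form of PageRank, computed by repeated iteration over a small link graph. It is not Google's production algorithm. The function name, the example pages, the damping factor of 0.85, and the fixed iteration count are all assumptions made for illustration.

```python
def pagerank(out_links, damping=0.85, iterations=50):
    """Approximate PageRank scores for a small graph by power iteration.

    out_links maps each page to the list of pages it links to.
    damping=0.85 and iterations=50 are illustrative choices.
    """
    # Collect every page that appears as a source or as a target.
    pages = sorted(set(out_links) | {t for ts in out_links.values() for t in ts})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start from a uniform distribution

    for _ in range(iterations):
        # Every page receives a small baseline share (the "random jump").
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = out_links.get(page, [])
            if targets:
                # A page splits its damped rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no out-links spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical three-page web, for illustration only.
example_links = {
    "a.html": ["b.html", "c.html"],
    "b.html": ["c.html"],
    "c.html": ["a.html"],
}

ranks = pagerank(example_links)
best_page = max(ranks, key=ranks.get)
```

In this toy graph `c.html` is linked from both other pages and ends up with the highest score, which matches the intuition that a page is important when other pages link to it.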
-
Information Retrieval (and related) research groups in Thailand
CRL Thai Computational Linguistics Lab
CU Machine Intelligence and Knowledge Discovery Lab
KU NAiST Lab
KU Intelligent Information Retrieval and Database Lab
NECTEC RD-I4: Text Processing Technology Group
SIIT Knowledge Information & Data Management Lab
SIIT KIND IRDM