This book [R. Belew. Finding Out About: A Cognitive Perspective on Search Engines and the WWW. Cambridge University Press, Cambridge, 2000.] investigates and tries to describe IR from a cognitive perspective (how humans/users think, perceive, behave, ...).
“Everything you want and don’t want to know about Google” 🙂
Thumbshots.org. Features a small picture of each webpage, so users have more of a clue whether it is the site they are looking for or not.
Everyone in computer science and some fields of engineering (e.g. industrial engineering?) is very familiar with “graphs”: those nodes and arcs. And, actually, we can represent the web as a [huge] graph, where node = webpage and arc = (hyper)link.
This representation gives us a way to better understand the characteristics of the web (as we already do well with normal graphs).
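As a rough illustration of the node = webpage, arc = (hyper)link idea, here is a minimal Python sketch. The page names and the helpers `build_web_graph` and `in_degrees` are made up for this example; a real crawl would produce a far larger graph, but the shape is the same.

```python
# Hypothetical pages and hyperlinks, invented purely for illustration.
# Each pair (src, dst) means "page src contains a link to page dst".
links = [
    ("a.html", "b.html"),
    ("a.html", "c.html"),
    ("b.html", "c.html"),
    ("c.html", "a.html"),
]


def build_web_graph(link_pairs):
    """Build an adjacency map: page -> set of pages it links to."""
    graph = {}
    for src, dst in link_pairs:
        graph.setdefault(src, set()).add(dst)
        # Make sure pages that are only linked *to* also appear as nodes.
        graph.setdefault(dst, set())
    return graph


def in_degrees(graph):
    """Count how many pages link to each page (the node's in-degree)."""
    counts = {page: 0 for page in graph}
    for targets in graph.values():
        for target in targets:
            counts[target] += 1
    return counts


web = build_web_graph(links)
incoming = in_degrees(web)
```

Once the web is in this form, ordinary graph questions (which pages have the most incoming links, which pages are reachable from which) can be answered with standard graph techniques.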
Speaking of representing a document/site as a node in a graph, peer-to-peer people have been doing this since their early days.
Instead of only the page title, URL, and the first few (nonsense) paragraphs of the page.
Short summaries may help users decide which pages are what they want and which are not.
Besides clustering the retrieved documents so that it is easy to search further (with the user deciding which ones to keep and which to skip),
PageRank is one of the algorithms used by the Google search engine.
If you want to know how PageRank works, this is the site.
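For a quick feel of the idea, here is a minimal sketch of the simplified, textbook form of PageRank computed by power iteration. It is not Google's actual implementation. The tiny three-page graph, the function name `pagerank`, the damping factor of 0.85, and the fixed iteration count are all assumptions chosen for illustration.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration.

    graph maps each page to the list of pages it links to; every
    link target must also appear as a key of graph.
    """
    pages = list(graph)
    n = len(pages)
    # Start with rank spread evenly over all pages.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page receives a small baseline share (the "random jump").
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, targets in graph.items():
            if targets:
                # A page splits its damped rank evenly among its out-links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no out-links spreads its rank over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical three-page web: a links to b and c, b links to c, c links to a.
toy_web = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(toy_web)
```

In this toy graph, page `c` ends up with the highest rank because both other pages link to it, and `a` comes next because it receives all of `c`'s rank.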