Cite: Y. Lin, H. Sundaram, Y. Chi, J. Tatemura, B. Tseng. Discovery of Blog Communities based on Mutual Awareness, in Third Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, at the 15th Annual World Wide Web Conference - WWW 2006, also AME-TR-2006-03. May, 2006.
Abstract: Blogs have many fast growing communities on the Internet. Discovering such communities in the blogosphere is important for sustaining and encouraging new blogger participation. We focus on extracting communities based on two key insights – (a) communities form due to individual blogger actions that are mutually observable; (b) semantics of the hyperlink structure are different from traditional web analysis problems. Our approach involves developing computational models for mutual awareness that incorporates the specific action type, frequency and time of occurrence. We use the mutual awareness feature with a rankingbased community extraction algorithm to discover communities. To validate our approach, four performance measures are used on the WWW2006 Blog Workshop dataset and the NEC focused blog dataset with excellent quantitative results. The extracted communities also demonstrate to be semantically cohesive with respect to their topics of interest.
[ download ]
Yu-Ru Lin
Hari Sundaram