Emergent Semantics from Folksonomies: A Quantitative Study
2006, Journal on Data Semantics
https://doi.org/10.1007/11803034_8Abstract
Defining and using ontology to annotate web resources with semantic markups is generally perceived as the primary way to implement the vision of the Semantic Web. The ontology provides a shared and machine understandable semantics for web resources that agents and applications can utilize. This top-down approach (in the sense that an ontology is defined first on top of existing web resources and then used later to markup them), however, has a high barrier to entry and is difficult to scale up. In this paper, we investigate using a bottom-up approach for semantically annotating web resources as supported by the now widely popular social bookmarks services on the web where users can annotate and categorize web resources using “tags” freely choosen by the user without any pre-existing global semantic model. This kind of informal social categories is coined as “folksonomies”. We show how global semantics can be statistically inferred from the folksonomies to semantically annotate the web resources. The global semantic model also disambiguate the tags and group synonymous tags together. Finally, we show that there indeed are hierarchical relations among the emerged concepts in the folksonomy and it is plausible to further identify them if we use more advanced probabilistic models.
References (40)
- Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284 (2001) 34-43
- Manola, F., Miller, E.: RDF Primer. W3C Recommendation (2004)
- McGuinness, D.L., van Harmelen, F.: OWL Web ontology language overview. W3C Recommendation (2004)
- H.Gennari, J., A.Musen, M., W.Fergerson, R., E.Grosso, W., Crubézy, M., Eriks- son, H., F.Noy, N., W.Tu, S.: The evolution of Protégé: An environment for knowledge-based systems development. Technical Report SMI-2002-0943, Stan- ford Medical Informatics (2002)
- Bechhofer, S., Horrocks, I., Goble, C., Stevens, R.: OilEd: a reason-able ontol- ogy editor for the semantic web. In: Proceedings of the Joint German/Austrian Conference on AI. LNCS 2174 (2001) 396-408
- Corcho, O., López, M.F., Pérez, A.G., Vicente, O.: WebODE: An integrated work- bench for ontology representation, reasoning, and exchange. In: Proceedings of EKAW 2002. LNCS 2473 (2002) 138-153
- Zhang, L., Yu, Y., Lu, J., Lin, C., Tu, K., Guo, M., Zhang, Z., Xie, G., Su, Z., Pan, Y.: ORIENT: Integrate ontology engineering into industry tooling environment. In: Proc. of the 3rd Intl. Semantic Web Conference (ISWC2004). (2004)
- Kalyanpur, A., Sirin, E., Parsia, B., Hendler, J.: Hypermedia inspired ontology engineering environment: SWOOP. In: Proc. of the 3rd Intl. Semantic Web Con- ference (ISWC2004). (2004)
- Heflin, J., Hendler, J.: Dynamic ontologies on the web. In: Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), Menlo Park, CA, USA, AAAI/MIT Press (2000) 443-449
- F.Noy, N., Klein, M.: Ontology evolution: Not the same as schema evolution. Knowledge and Information Systems 5 (2003)
- Kiryakov, A., Ognyanov, D.: Tracking changes in RDF(S) repositories. In: Pro- ceedings of the EKAW 2002, Siguenza, Spain, Springer (2002) 373-378
- Noy, N.F., Kunnatur, S., Klein, M., Musen, M.A.: Tracking changes during ontol- ogy evolution. In: Proc. of the 3rd Intl. Semantic Web Conference (ISWC2004). (2004)
- Klein, M., Fensel, D.: Ontology versioning for the semantic web. In: Proceedings of the 1st International Semantic Web Working Symposium (SWWS'01), Stanford University (2001) 75-91
- Klein, M., Fensel, D., Kiryakov, A., Ognyanov, D.: Ontology versioning and change detection on the web. In: Proceedings of the EKAW 2002, Siguenza, Spain, Springer (2002) 197-212
- Stojanovic, L., Maedche, A., Motik, B., Stojanovic, N.: User-driven ontology evo- lution management. In: Proceedings of the EKAW 2002, Siguenza, Spain, Springer (2002) 285-300
- N.F.Noy, M.Sintek, S.Decker, M.Crubezy, R.W.Fergerson, M.A.Musen: Creating semantic web contents with Protege-2000. IEEE Intelligent Systems 2 (2001) 60-71
- S.Handschuh, S.Staab: Authoring and annotation of web pages in CREAM. In: Proc. of the 11th Intl. World Wide Web Conference (WWW2002). (2002)
- Kiryakov, A., Popov, B., Ognyanoff, D., Manov, D., Kirilov, A., Goranov, M.: Semantic annotation, indexing, and retrieval. In: Proc. of the 2nd Intl. Semantic Web Conference (ISWC2003). (2003)
- Handschuh, S., Staab, S., Volz, R.: On deep annotation. In: Proc. of the 12th Intl. World Wide Web Conference (WWW2003). (2003) 431-438
- Blythe, J., Gil, Y.: Incremental formalization of document annotations through ontology-based paraphrasing. In: Proc. of the 13th conference on World Wide Web (WWW2004), ACM Press (2004) 455-461
- Cimiano, P., Handschuh, S., Staab, S.: Towards the self-annotating web. In: Proc. of the 13th Intl. World Wide Web Conference (WWW2004). (2004)
- Dill, S., Eiron, N., Gibson, D., Gruhl, D., R.Guha, Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A., A.Tomlin, J., Y.Zien, J.: SemTag and Seeker: Boot- strapping the semantic web via automated semantic annotation. In: Proc. of the 12th Intl. World Wide Web Conference (WWW2003). (2003) 178-186
- Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A.M., Shaked, T., Soderland, S., S.Weld, D., Yates, A.: Web-scale information extraction in KnowItAll (preliminary results). In: Proc. of the 13th Intl. World Wide Web Conf.(WWW2004). (2004)
- Cimiano, P., Ladwig, G., Staab, S.: Gimme the context: Context-driven automatic semantic annotation with C-PANKOW. In: Proc. of the 14th Intl. World Wide Web Conference (WWW2005). (2005)
- Maedche, A.: Emergent semantics for ontologies. IEEE Intelligent Systems 17 (2002)
- Aberer, K., et.al: Emergent semantics principles and issues. In: Proc. of Database Systems for Advanced Applications. LNCS 2973 (2004)
- Kahan, J., Koivunen, M.R., Prud'Hommeaux, E., Swick, R.R.: Annotea: An open RDF infrastructure for shared web annotations. In: Proc. of the 10th Intl. World Wide Web Conference. (2001)
- Hammond, T., Hannay, T., Lund, B., Scott, J.: Social bookmarking tools (i) -a general review. D-Lib Magazine 11 (2005)
- Mathes, A.: Folksonomies -cooperative classification and communication through shared metadata. Computer Mediated Communication, LIS590CMC (Doctoral Seminar), Graduate School of Library and Information Science, University of Illi- nois Urbana-Champaign (2004)
- Udell, J.: Collaborative knowledge gardening. InfoWorld, August 20 (2004) 31. Merholz, P.: Metadata for the masses. http://www.adaptivepath.com/ publications/essays/archives/000361.php, accessed at May, 2005. (2004)
- Adamic, L.A., Huberman, B.A.: The web's hidden order. Communications of the ACM 44 (2001)
- Hofmann, T., Puzicha, J.: Statistical models for co-occurrence data. Technical report, A.I.Memo 1635, MIT (1998)
- G.A.Miller: WordNet: A lexical database for english. Communications of the ACM 2 (1995)
- A.Maedche, S.Staab: Ontology learning for the semantic web. IEEE Intelligent Systems 16 (2001)
- M.Shamsfard M, A.: The state of the art in ontology learning: a framework for comparison. Knowledge Engineering Review 18 (2003)
- J.J.Jung, Y.H.Yu, S.S.Jo: Collaborative web browsing based on ontology learn- ing from bookmarks. In: Proc. of the Intl. Conference of Computational Science (ICCS2004). (2004)
- W.I.Grosky, D.V.Sreenath, F.Fotouhi: Emergent semantics and the multimedia semantic web. SIGMOD Record 31 (2002)
- Aberer, K., Cudre-Mauroux, P., Hauswirth, M.: The chatty web: Emergent se- mantics through gossiping. In: Proc. of 12th Intl. Conf. on World Wide Web (WWW2003). (2003)
- Howe, B., Tanna, K., Turner, P., Maier, D.: Emergent semantics: Towards self- organizing scientific metadata. In: Proc. of the 1st Intl. IFIP Conference on Seman- tics of a Networked World: Semantics for Grid Databases (ICSNW 2004). LNCS 3226 (2004)
- W.Furnas, G., Deerwester, S., T.Dumais, S., K.Landauer, T., A.Harshman, R., A.Streeter, L., E.Lochbaum, K.: Information retrieval using a singular value de- composition model of latent semantic structure. In: Proc. of the ACM SIGIR'88, Grenoble, France (1988) 465-480