Balby Marinho, L.; Hotho, A.; Jäschke, R.; Nanopoulos, A.; Rendle, S.; Schmidt-Thieme, L.; Stumme, G. & Symeonidis, P.: Recommender Systems for Social Tagging Systems. Springer, 2012SpringerBriefs in Electrical and Computer Engineering
[Volltext]
[Kurzfassung]
[BibTeX]
Social Tagging Systems are web applications in which users upload resources (e.g., bookmarks, videos, photos, etc.) and annotate it with a list of freely chosen keywords called tags. This is a grassroots approach to organize a site and help users to find the resources they are interested in. Social tagging systems are open and inherently social; features that have been proven to encourage participation. However, with the large popularity of these systems and the increasing amount of user-contributed content, information overload rapidly becomes an issue. Recommender Systems are well known applications for increasing the level of relevant content over the ?noise? that continuously grows as more and more content becomes available online. In social tagging systems, however, we face new challenges. While in classic recommender systems the mode of recommendation is basically the resource, in social tagging systems there are three possible modes of recommendation: users, resources, or tags. Therefore suitable methods that properly exploit the different dimensions of social tagging systems data are needed. In this book, we survey the most recent and state-of-the-art work about a whole new generation of recommender systems built to serve social tagging systems. The book is divided into self-contained chapters covering the background material on social tagging systems and recommender systems to the more advanced techniques like the ones based on tensor factorization and graph-based models.
Jäschke, R.; Hotho, A.; Mitzlaff, F. & Stumme, G.: Challenges in Tag Recommendations for Collaborative Tagging Systems. In: Pazos Arias, J. J.; Fernández Vilas, A. & Díaz Redondo, R. P. (Hrsg.): Recommender Systems for the Social Web. Berlin/Heidelberg: Springer, 2012 (Intelligent Systems Reference Library 32), S. 65-87
[Volltext] [Kurzfassung]
[BibTeX]
Originally introduced by social bookmarking systems, collaborative tagging, or social tagging, has been widely adopted by many web-based systems like wikis, e-commerce platforms, or social networks. Collaborative tagging systems allow users to annotate resources using freely chosen keywords, so called tags . Those tags help users in finding/retrieving resources, discovering new resources, and navigating through the system. The process of tagging resources is laborious. Therefore, most systems support their users by tag recommender components that recommend tags in a personalized way. The Discovery Challenges 2008 and 2009 of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) tackled the problem of tag recommendations in collaborative tagging systems. Researchers were invited to test their methods in a competition on datasets from the social bookmark and publication sharing system BibSonomy. Moreover, the 2009 challenge included an online task where the recommender systems were integrated into BibSonomy and provided recommendations in real time. In this chapter we review, evaluate and summarize the submissions to the two Discovery Challenges and thus lay the groundwork for continuing research in this area.
Kluegl, P.; Toepfer, M.; Lemmerich, F.; Hotho, A. & Puppe, F.: Stacked Conditional Random Fields Exploiting Structural Consistencies. In: Carmona, P. L.; Sánchez, J. S. & Fred, A. (Hrsg.): Proceedings of 1st International Conference on Pattern Recognition Applications and Methods ICPRAM. Vilamoura, Algarve, Portugal: SciTePress, 2012, S. 240-248
[Volltext] [Kurzfassung]
[BibTeX]
Conditional Random Fields CRF are popular methods for labeling unstructured or textual data. Like many machine learning approaches these undirected graphical models assume the instances to be independently distributed. However, in real world applications data is grouped in a natural way, e.g., by its creation context. The instances in each group often share additional structural consistencies. This paper proposes a domain-independent method for exploiting these consistencies by combining two CRFs in a stacked learning framework. The approach incorporates three successive steps of inference: First, an initial CRF processes single instances as usual. Next, we apply rule learning collectively on all labeled outputs of one context to acquire descriptions of its specific properties. Finally, we utilize these descriptions as dynamic and high quality features in an additional stacked CRF. The presented approach is evaluated with a real-world dataset for the segmentation of references and achieves a significant reduction of the labeling error.
Analysis of Social Media and Ubiquitous Data - International Workshops MSM 2010, Toronto, Canada, June 13, 2010, and MUSE 2010, Barcelona, Spain, September 20, 2010, Revised Selected Papers. Lecture Notes in Computer Science , 2011
[Volltext]
[BibTeX]
Atzmueller, M.; Benz, D.; Doerfel, S.; Hotho, A.; Jäschke, R.; Macek, B. E.; Mitzlaff, F.; Scholz, C. & Stumme, G.: Enhancing Social Interactions at Conferences. In: it - Information Technology 53 (2011), Nr. 3, S. 101-107
[Volltext]
[BibTeX]
Atzmueller, M.; Doerfel, S.; Hotho, A.; Mitzlaff, F. & Stumme, G.: Face-to-Face Contacts during a Conference: Communities, Roles, and Key Players. Proc. Workshop on Mining Ubiquitous and Social Environments (MUSE 2011) at ECML/PKDD 2011. 2011
[BibTeX]
Atzmueller, M.; Doerfel, S.; Hotho, A.; Mitzlaff, F. & Stumme, G.: Face-to-Face Contacts during LWA 2010 - Communities, Roles, and Key Players. Working Notes of the LWA 2011 - Learning, Knowledge, Adaptation. 2011
[BibTeX]
Atzmueller, M. & Hotho, A. (Hrsg.): Proceedings of the 2011 International Workshop on Mining Ubiquitous and Social Environments (MUSE 2011). Athens, Greece: ECML/PKDD 2011, 2011
[BibTeX]
Atzmueller, M.; Benz, D.; Hotho, A. & Stumme, G.: Towards Mining Semantic Maturity in Social Bookmarking Systems. In: Passant, A.; Fernández, S.; Breslin, J. & Boj?rs, U. (Hrsg.): Proceedings of the 4th international workshop on Social Data on the Web (SDoW2011). 2011
[Volltext]
[BibTeX]
Benz, D.; Körner, C.; Hotho, A.; Stumme, G. & Strohmaier, M.: One Tag to Bind Them All : Measuring Term Abstractness in Social Metadata. In: Antoniou, G.; Grobelnik, M.; Simperl, E.; Parsia, B.; Plexousakis, D.; Pan, J. & Leenheer, P. D. (Hrsg.): Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011). Heraklion, Crete: 2011
[Volltext] [Kurzfassung]
[BibTeX]
Recent research has demonstrated how the widespread adoption of collaborative tagging systems yields emergent semantics. In recent years, much has been learned about how to harvest the data produced by taggers for engineering light-weight ontologies. For example, existing measures of tag similarity and tag relatedness have proven crucial step stones for making latent semantic relations in tagging systems explicit. However, little progress has been made on other issues, such as understanding the different levels of tag generality (or tag abstractness), which is essential for, among others, identifying hierarchical relationships between concepts. In this paper we aim to address this gap. Starting from a review of linguistic definitions of word abstractness, we first use several large-scale ontologies and taxonomies as grounded measures of word generality, including Yago, Wordnet, DMOZ and Wikitaxonomy. Then, we introduce and apply several folksonomy-based methods to measure the level of generality of given tags. We evaluate these methods by comparing them with the grounded measures. Our results suggest that the generality of tags in social tagging systems can be approximated with simple measures. Our work has implications for a number of problems related to social tagging systems, including search, tag recommendation, and the acquisition of light-weight ontologies from tagging data.
Benz, D.; Körner, C.; Hotho, A.; Stumme, G. & Strohmaier, M.: One Tag to Bind Them All: Measuring Term Abstractness in Social Metadata. Working Notes of the LWA 2011 - Learning, Knowledge, Adaptation. 2011
[BibTeX]
Bullock, B. N.; Lerch, H.; Ro A.; Hotho, A. & Stumme, G.: Privacy-aware spam detection in social bookmarking systems. Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies. New York, NY, USA: ACM, 2011i-KNOW '11 , S. 15:1-15:8
[Volltext] [Kurzfassung]
[BibTeX]
With the increased popularity of Web 2.0 services in the last years data privacy has become a major concern for users. The more personal data users reveal, the more difficult it becomes to control its disclosure in the web. However, for Web 2.0 service providers, the data provided by users is a valuable source for offering effective, personalised data mining services. One major application is the detection of spam in social bookmarking systems: in order to prevent a decrease of content quality, providers need to distinguish spammers and exclude them from the system. They thereby experience a conflict of interests: on the one hand, they need to identify spammers based on the information they collect about users, on the other hand, they need to respect privacy concerns and process as few personal data as possible. It would therefore be of tremendous help for system developers and users to know which personal data are needed for spam detection and which can be ignored. In this paper we address these questions by presenting a data privacy aware feature engineering approach. It consists of the design of features for spam classification which are evaluated according to both, performance and privacy conditions. Experiments using data from the social bookmarking system BibSonomy show that both conditions must not exclude each other.
Bullock, B. N.; Jäschke, R. & Hotho, A.: Tagging data as implicit feedback for learning-to-rank. Proceedings of the ACM WebSci'11. 2011
[Volltext]
[BibTeX]
Burke, R.; Gemmell, J.; Hotho, A. & Jäschke, R.: Recommendation in the Social Web. In: AI Magazine 32 (2011), Nr. 3, S. 46-56
[Volltext]
[Kurzfassung]
[BibTeX]
Recommender systems are a means of personalizing the presentation of information to ensure that users see the items most relevant to them. The social web has added new dimensions to the way people interact on the Internet, placing the emphasis on user-generated content. Users in social networks create photos, videos and other artifacts, collaborate with other users, socialize with their friends and share their opinions online. This outpouring of material has brought increased attention to recommender systems, as a means of managing this vast universe of content. At the same time, the diversity and complexity of the data has meant new challenges for researchers in recommendation. This article describes the nature of recommendation research in social web applications and provides some illustrative examples of current research directions and techniques. It is difficult to overstate the impact of the social web. This new breed of social applications is reshaping nearly every human activity from the way people watch movies to how they overthrow governments. Facebook allows its members to maintain friendships whether they live next door or on another continent. With Twitter, users from celebrities to ordinary folks can launch their 140 character messages out to a diverse horde of ??followers.? Flickr and YouTube users upload their personal media to share with the world, while Wikipedia editors collaborate on the world?s largest encyclopedia.
Freyne, J.; Anand, S. S.; Guy, I. & Hotho, A.: 3rd workshop on recommender systems and the social web. Proceedings of the fifth ACM conference on Recommender systems. New York, NY, USA: ACM, 2011RecSys '11 , S. 383-384
[Volltext] [Kurzfassung]
[BibTeX]
The exponential growth of the social web poses challenges and new opportunities for recommender systems. The social web has turned information consumers into active contributors creating massive amounts of information. Finding relevant and interesting content at the right time and in the right context is challenging for existing recommender approaches. At the same time, social systems by their definition encourage interaction between users and both online content and other users, thus generating new sources of knowledge for recommender systems. Web 2.0 users explicitly provide personal information and implicitly express preferences through their interactions with others and the system (e.g. commenting, friending, rating, etc.). These various new sources of knowledge can be leveraged to improve recommendation techniques and develop new strategies which focus on social recommendation. The Social Web provides huge opportunities for recommender technology and in turn recommender technologies can play a part in fuelling the success of the Social Web phenomenon. The goal of this one day workshop was to bring together researchers and practitioners to explore, discuss, and understand challenges and new opportunities for Recommender Systems and the Social Web. The workshop consisted both of technical sessions, in which selected participants presented their results or ongoing research, as well as informal breakout sessions on more focused topics. Papers discussing various aspects of recommender system in the Social Web were submitted and selected for presentation and discussion in the workshop in a formal reviewing process: Case studies and novel fielded social recommender applications; Economy of community-based systems: Using recommenders to encourage users to contribute and sustain participation.; Social network and folksonomy development: Recommending friends, tags, bookmarks, blogs, music, communities etc.; Recommender systems mash-ups, Web 2.0 user interfaces, rich media recommender systems; Collaborative knowledge authoring, collective intelligence; Recommender applications involving users or groups directly in the recommendation process; Exploiting folksonomies, social network information, interaction, user context and communities or groups for recommendations; Trust and reputation aware social recommendations; Semantic Web recommender systems, use of ontologies or microformats; Empirical evaluation of social recommender techniques, success and failure measures Full workshop details are available at http://www.dcs.warwick.ac.uk/~ssanand/RSWeb11/index.htm
Illig, J.; Hotho, A.; Jäschke, R. & Stumme, G.: A Comparison of Content-Based Tag Recommendations in Folksonomy Systems. In: Wolff, K. E.; Palchunov, D. E.; Zagoruiko, N. G. & Andelfinger, U. (Hrsg.): Knowledge Processing and Data Analysis. Berlin/Heidelberg: Springer, 2011 (Lecture Notes in Computer Science 6581), S. 136-149
[Volltext] [Kurzfassung]
[BibTeX]
Recommendation algorithms and multi-class classifiers can support
users of social bookmarking systems in assigning tags to their
bookmarks. Content based recommenders are the usual approach for
facing the cold start problem, i.e., when a bookmark is uploaded for
the first time and no information from other users can be exploited.
In this paper, we evaluate several recommendation algorithms in a
cold-start scenario on a large real-world dataset.
Marinho, L. B.; Nanopoulos, A.; Schmidt-Thieme, L.; Jäschke, R.; Hotho, A.; Stumme, G. & Symeonidis, P.: Social Tagging Recommender Systems.. In: Ricci, F.; Rokach, L.; Shapira, B. & Kantor, P. B. (Hrsg.): Recommender Systems Handbook. Springer, 2011, S. 615-644
[Volltext]
[BibTeX]
Mitzlaff, F.; Atzmueller, M.; Benz, D.; Hotho, A. & Stumme, G.: Community Assessment using Evidence Networks. Analysis of Social Media and Ubiquitous Data. 2011 (LNAI 6904)
[BibTeX]
Scholz, C.; Doerfel, S.; Atzmueller, M.; Hotho, A. & Stumme, G.: Resource-Aware On-Line RFID Localization Using Proximity Data. ECML/PKDD (3). 2011, S. 129-144
[BibTeX]
Scholz, C.; Doerfel, S.; Atzmueller, M.; Hotho, A. & Stumme, G.: Resource-Aware On-Line RFID Localization Using Proximity Data. Working Notes of the LWA 2011 - Learning, Knowledge, Adaptation. 2011
[BibTeX]
Toepfer, M.; Kluegl, P.; Hotho, A. & Puppe, F.: Segmentation of References with Skip-Chain Conditional Random Fields for Consistent Label Transitions. Workshop Notes of the LWA 2011 - Learning, Knowledge, Adaptation. 2011
[Volltext]
[BibTeX]
Atzmueller, M. & Hotho, A. (Hrsg.): Proceedings of the 2010 Workshop on Mining Ubiquitous and Social Environments (MUSE 2010). Barcelona, Spain: ECML/PKDD 2010, 2010
[BibTeX]
Atzmueller, M.; Benz, D.; Hotho, A. & Stumme, G. (Hrsg.): Proceedings of the LWA 2010 - Lernen, Wissen, Adaptivität. Department of Electrical Engineering/Computer Science, Kassel University, 2010Technical report (KIS), 2010-10
[BibTeX]
Benz, D.; Hotho, A.; Jäschke, R.; Stumme, G.; Halle, A.; Lima, A. G. S.; Steenweg, H. & Stefani, S.: Academic Publication Management with PUMA - collect, organize and share publications. In: Lalmas, M.; Jose, J.; Rauber, A.; Sebastiani, F. & Frommholz, I. (Hrsg.): Proceedings of the European Conference on Research and Advanced Technology for Digital Libraries (ECDL) 2010. Berlin/Heidelberg: Springer, 2010 (Lecture Notes in Computer Science 6273), S. 417-420
[Kurzfassung]
[BibTeX]
The PUMA project fosters the Open Access movement und aims at a better support of the researcher?s publication work. PUMA stands for an integrated solution, where the upload of a publication results automatically in an update of both the personal and institutional homepage, the creation of an entry in a social bookmarking systems like BibSonomy, an entry in the academic reporting system of the university, and its publication in the institutional repository. In this poster, we present the main features of our solution.
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B. & Stumme, G.: Query Logs as Folksonomies. In: Datenbank-Spektrum 10 (2010), Nr. 1, S. 15-24
[Volltext]
[Kurzfassung]
[BibTeX]
Query logs provide a valuable resource for preference information in search. A user clicking on a specific resource after
submitting a query indicates that the resource has some relevance with respect to the query. To leverage the information ofquery logs, one can relate submitted queries from specific users to their clicked resources and build a tripartite graph ofusers, resources and queries. This graph resembles the folksonomy structure of social bookmarking systems, where users addtags to resources. In this article, we summarize our work on building folksonomies from query log files. The focus is on threecomparative studies of the system?s content, structure and semantics. Our results show that query logs incorporate typicalfolksonomy properties and that approaches to leverage the inherent semantics of folksonomies can be applied to query logsas well.
Benz, D.; Hotho, A. & Stumme, G.: Semantics made by you and me: Self-emerging ontologies can capture the diversity of shared knowledge. Proceedings of the 2nd Web Science Conference (WebSci10). Raleigh, NC, USA: 2010
[BibTeX]
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B.; Mitzlaff, F.; Schmitz, C. & Stumme, G.: The Social Bookmark and Publication Management System BibSonomy. In: The VLDB Journal 19 (2010), Nr. 6, S. 849-875
[Volltext]
[Kurzfassung]
[BibTeX]
Social resource sharing systems are central elements of the Web 2.0 and use the same kind of lightweight knowledge representation, called folksonomy. Their large user communities and ever-growing networks of user-generated content have made them an attractive object of investigation for researchers from different disciplines like Social Network Analysis, Data Mining, Information Retrieval or Knowledge Discovery. In this paper, we summarize and extend our work on different aspects of this branch of Web 2.0 research, demonstrated and evaluated within our own social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind. We structure this presentation along the different interaction phases of a user with our system, coupling the relevant research questions of each phase with the corresponding implementation issues. This approach reveals in a systematic fashion important aspects and results of the broad bandwidth of folksonomy research like capturing of emergent semantics, spam detection, ranking algorithms, analogies to search engine log data, personalized tag recommendations and information extraction techniques. We conclude that when integrating a real-life application like BibSonomy into research, certain constraints have to be considered; but in general, the tight interplay between our scientific work and the running system has made BibSonomy a valuable platform for demonstrating and evaluating Web 2.0 research.
Berendt, B.; Hotho, A. & Stumme, G.: Bridging the Gap-Data Mining and Social Network Analysis for Integrating Semantic Web and Web 2.0. In: Web Semantics: Science, Services and Agents on the World Wide Web 8 (2010), Nr. 2-3, S. 95 - 96
[Volltext]
[BibTeX]
Hotho, A.: Data Mining on Folksonomies. In: Armano, G.; de Gemmis, M.; Semeraro, G. & Vargiu, E. (Hrsg.): Intelligent Information Access. Berlin / Heidelberg: Springer, 2010 (Studies in Computational Intelligence 301), S. 57-82
[Volltext] [Kurzfassung]
[BibTeX]
Social resource sharing systems are central elements of the Web 2.0 and use all the same kind of lightweight knowledge representation, called folksonomy. As these systems are easy to use, they attract huge masses of users. Data Mining provides methods to analyze data and to learn models which can be used to support users. The application and adaptation of known data mining algorithms to folksonomies with the goal to support the users of such systems and to extract valuable information with a special focus on the Semantic Web is the main target of this paper. In this work we give a short introduction into folksonomies with a focus on our own system BibSonomy. Based on the analysis we made on a large folksonomy dataset, we present the application of data mining algorithms on three different tasks, namely spam detection, ranking and recommendation. To bridge the gap between folksonomies and the Semantic Web, we apply association rule mining to extract relations and present a deeper analysis of statistical measures which can be used to extract tag relations. This approach is complemented by presenting two approaches to extract conceptualizations from folksonomies.
Hotho, A.; Benz, D.; Eisterlehner, F.; Jäschke, R.; Krause, B.; Schmitz, C. & Stumme, G.: Publikationsmanagement mit BibSonomy - ein Social-Bookmarking-System für Wissenschaftler. In: HMD -- Praxis der Wirtschaftsinformatik Heft 271 (2010), S. 47-58
[Kurzfassung]
[BibTeX]
Kooperative Verschlagwortungs- bzw. Social-Bookmarking-Systeme wie Delicious, Mister Wong oder auch unser eigenes System BibSonomy erfreuen sich immer größerer Beliebtheit und bilden einen zentralen Bestandteil des heutigen Web 2.0. In solchen Systemen erstellen Nutzer leichtgewichtige Begriffssysteme, sogenannte Folksonomies, die die Nutzerdaten strukturieren. Die einfache Bedienbarkeit, die Allgegenwärtigkeit, die ständige Verfügbarkeit, aber auch die Möglichkeit, Gleichgesinnte spontan in solchen Systemen zu entdecken oder sie schlicht als Informationsquelle zu nutzen, sind Gründe für ihren gegenwärtigen Erfolg. Der Artikel führt den Begriff Social Bookmarking ein und diskutiert zentrale Elemente (wie Browsing und Suche) am Beispiel von BibSonomy anhand typischer Arbeitsabläufe eines Wissenschaftlers. Wir beschreiben die Architektur von BibSonomy sowie Wege der Integration und Vernetzung von BibSonomy mit Content-Management-Systemen und Webauftritten. Der Artikel schließt mit Querbezügen zu aktuellen Forschungsfragen im Bereich Social Bookmarking.
Kluegl, P.; Hotho, A. & Puppe, F.: Local Adaptive Extraction of References. In: Dillmann, R.; Beyerer, J.; Hanebeck, U. D. & Schultz, T. (Hrsg.): KI 2010: Advances in Artificial Intelligence, 33rd Annual German Conference on AI. Springer, 2010 LNAI 6359 , S. 40-47
[Volltext] [Kurzfassung]
[BibTeX]
The accurate extraction of scholarly reference information from scientific publications is essential for many useful applications like BibTeX management systems or citation analysis. Automatic extraction methods suffer from the heterogeneity of reference notation, no matter wether the extraction model was handcrafted or learnt from labeled data. However, references of the same paper or journal are usually homogeneous. We exploit this local consistency with a novel approach. Given some initial information from such a reference section, we try to derived generalized patterns. These patterns are used to create a local model of the current document. The local model helps to identify errors and to improve the extracted information incrementally during the extraction process. Our approach is implemented with handcrafted transformation rules working on a meta-level being able to correct the information independent of the applied layout style. The experimental results compete very well with the state of the art methods and show an extremely high performance on consistent reference sections.
Krause, B.; Lerch, H.; Hotho, A.; Roßnagel, A. & Stumme, G.: Datenschutz im Web 2.0 am Beispiel des sozialen Tagging-Systems BibSonomy. In: Informatik-Spektrum (2010), S. 1-12
[Volltext]
[Kurzfassung]
[BibTeX]
Soziale Tagging-Systeme gehören zu den in den vergangenen Jahren entstandenen Web2.0-Systemen. Sie ermöglichen es Anwendern, beliebige Informationen in das Internet einzustellen und untereinander auszutauschen. Je nach Anbieter verlinken Nutzer Videos, Fotos oder Webseiten und beschreiben die eingestellten Medien mit entsprechenden Schlagwörtern (Tags). Die damit einhergehende freiwillige Preisgabe oftmals persönlicher Informationen wirft Fragen im Bereich der informationellen Selbstbestimmung auf. Dieses Grundrecht gewährleistet dem Einzelnen, grundsätzlich selbst über die Preisgabe und Verwendung seiner persönlichen Daten zu bestimmen. Für viele Funktionalitäten, wie beispielsweise Empfehlungsdienste oder die Bereitstellung einer API, ist eine solche Kontrolle allerdings schwierig zu gestalten. Oftmals existieren keine Richtlinien, inwieweit Dienstanbieter und weitere Dritte diese öffentlichen Daten (und weitere Daten, die bei der Nutzung des Systems anfallen) nutzen dürfen. Dieser Artikel diskutiert anhand eines konkreten Systems typische, für den Datenschutz relevante Funktionalitäten und gibt Handlungsanweisungen für eine datenschutzkonforme technische Gestaltung.
Körner, C.; Benz, D.; Strohmaier, M.; Hotho, A. & Stumme, G.: Stop Thinking, start Tagging - Tag Semantics emerge from Collaborative Verbosity. Proceedings of the 19th International World Wide Web Conference (WWW 2010). Raleigh, NC, USA: ACM, 2010
[Volltext] [Kurzfassung]
[BibTeX]
Recent research provides evidence for the presence of emergent semantics in collaborative tagging systems. While several methods have been proposed, little is known about the factors that influence the evolution of semantic structures in these systems. A natural hypothesis is that the quality of the emergent semantics depends on the pragmatics of tagging: Users with certain usage patterns might contribute more to the resulting semantics than others. In this work, we propose several measures which enable a pragmatic differentiation of taggers by their degree of contribution to emerging semantic structures. We distinguish between categorizers, who typically use a small set of tags as a replacement for hierarchical classification schemes, and describers, who are annotating resources with a wealth of freely associated, descriptive keywords. To study our hypothesis, we apply semantic similarity measures to 64 different partitions of a real-world and large-scale folksonomy containing different ratios of categorizers and describers. Our results not only show that ?verbose? taggers are most useful for the emergence of tag semantics, but also that a subset containing only 40% of the most ?verbose? taggers can produce results that match and even outperform the semantic precision obtained from the whole dataset. Moreover, the results suggest that there exists a causal link between the pragmatics of tagging and resulting emergent semantics. This work is relevant for designers and analysts of tagging systems interested (i) in fostering the semantic development of their platforms, (ii) in identifying users introducing ?semantic noise?, and (iii) in learning ontologies.
Lerch, H.; Krause, B.; Hotho, A.; Roßnagel, A. & Stumme, G.: Social Bookmarking-Systeme ? die unerkannten Datensammler - Ungewollte personenbezogene Datenverabeitung?. In: MultiMedia und Recht 7 (2010), S. 454-458
[BibTeX]
Mitzlaff, F.; Atzmüller, M.; Benz, D.; Hotho, A. & Stumme, G.: Community Assessment using Evidence Networks. Proceedings of the Workshop on Mining Ubiquitous and Social Environments (MUSE2010). Barcelona, Spain: 2010
[Volltext] [Kurzfassung]
[BibTeX]
Community mining is a prominent approach for identifying (user) communities in social and ubiquitous contexts. While there are a variety of methods for community mining and detection, the effective evaluation and validation of the mined communities is usually non-trivial. Often there is no evaluation data at hand in order to validate the discovered groups. This paper proposes evidence networks using implicit information for the evaluation of communities. The presented evaluation approach is based on the idea of reconstructing existing social structures for the assessment and evaluation of a given clustering. We analyze and compare the presented evidence networks using user data from the real-world social
bookmarking application BibSonomy. The results indicate that the evidence
networks reflect the relative rating of the explicit ones very well.
Mitzlaff, F.; Benz, D.; Stumme, G. & Hotho, A.: Visit me, click me, be my friend: An analysis of evidence networks of user relationships in Bibsonomy. Proceedings of the 21st ACM conference on Hypertext and hypermedia. Toronto, Canada: 2010
[BibTeX]
Toepfer, M.; Kluegl, P.; Hotho, A. & Puppe., F.: Conditional Random Fields For Local Adaptive Reference Extraction. In: Atzmüller, M.; Benz, D.; Hotho, A. & Stumme, G. (Hrsg.): Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet. Kassel, Germany: 2010
[Volltext] [Kurzfassung]
[BibTeX]
The accurate extraction of bibliographic information from scientific publications is an active field of research. Machine learning and sequence labeling approaches like Conditional Random Fields (CRF) are often applied for this reference extraction task, but still suffer from the ambiguity of reference notation. Reference sections apply a predefined style guide and contain only homogeneous references. Therefore, other references of the same paper or journal often provide evidence how the fields of a reference are correctly labeled. We propose a novel approach that exploits the similarities within a document. Our process model uses information of unlabeled documents directly during the extraction task in order to automatically adapt to the perceived style guide. This is implemented by changing the manifestation of the features for the applied CRF. The experimental results show considerable improvements compared to the common approach. We achieve an average F1 score of 96.7% and an instance accuracy of 85.4% on the test data set.
Atzmueller, M.; Lemmerich, F.; Krause, B. & Hotho, A.: Towards Understanding Spammers - Discovering Local Patterns for Concept Characterization and Description. In: Knobbe, J. F. A. (Hrsg.): Proc. LeGo-09: From Local Patterns to Global Models, Workshop at the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. 2009
[Volltext]
[BibTeX]
Atzmüller, M.; Lemmerich, F.; Krause, B. & Hotho, A.: Who are the Spammers - Understandable Local Patterns for Concept Description. 7th Conference on Computer Methods and Systems. Krakow, Poland: 2009
[Volltext]
[BibTeX]
Benz, D.; Krause, B.; Kumar, G. P.; Hotho, A. & Stumme, G.: Characterizing Semantic Relatedness of Search Query Terms. Proceedings of the 1st Workshop on Explorative Analytics of Information Networks (EIN2009). Bled, Slovenia: 2009
[BibTeX]
Benz, D.; Eisterlehner, F.; Hotho, A.; Jäschke, R.; Krause, B. & Stumme, G.: Managing publications and bookmarks with BibSonomy. In: Cattuto, C.; Ruffo, G. & Menczer, F. (Hrsg.): HT '09: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia. New York, NY, USA: ACM, 2009, S. 323-324
[Volltext] [Kurzfassung]
[BibTeX]
In this demo we present BibSonomy, a social bookmark and publication sharing system.
Hotho, A.; Jäschke, R.; Benz, D.; Grahl, M.; Krause, B.; Schmitz, C. & Stumme, G.: Social Bookmarking am Beispiel BibSonomy. In: Blumauer, A. & Pellegrini, T. (Hrsg.): Social Semantic Web. Berlin, Heidelberg: Springer, 2009X.media.press , S. 363-391
[Volltext] [Kurzfassung]
[BibTeX]
BibSonomy ist ein kooperatives Verschlagwortungssystem (Social Bookmarking System), betrieben vom Fachgebiet Wissensverarbeitung
der Universität Kassel. Es erlaubt das Speichern und Organisieren von Web-Lesezeichen und Metadaten für wissenschaftlichePublikationen. In diesem Beitrag beschreiben wir die von BibSonomy bereitgestellte Funktionalität, die dahinter stehende Architektursowie das zugrunde liegende Datenmodell. Ferner erläutern wir Anwendungsbeispiele und gehen auf Methoden zur Analyse der in BibSonomy und ähnlichen Systemen enthaltenen Daten ein.
Jäschke, R.; Eisterlehner, F.; Hotho, A. & Stumme, G.: Testing and Evaluating Tag Recommenders in a Live System. In: Benz, D. & Janssen, F. (Hrsg.): Workshop on Knowledge Discovery, Data Mining, and Machine Learning. 2009, S. 44 -51
[Volltext] [Kurzfassung]
[BibTeX]
The challenge to provide tag recommendations for collaborative tagging systems has attracted quite some attention of researchers lately. However, most research focused on evaluation and
development of appropriate methods rather than tackling the practical challenges of how to integrate recommendation methods into real tagging systems, record and evaluate their performance.
In this paper we describe the tag recommendation framework we developed for our social bookmark and publication sharing system BibSonomy. With the intention to develop, test, and evaluate recommendation algorithms and supporting cooperation with researchers, we designed the framework to be easily extensible,
open for a variety of methods, and usable independent from BibSonomy. Furthermore, this paper presents an evaluation of two exemplarily deployed recommendation methods, demonstrating
the power of the framework.
Markines, B.; Cattuto, C.; Menczer, F.; Benz, D.; Hotho, A. & Stumme, G.: Evaluating Similarity Measures for Emergent Semantics of Social Tagging. 18th International World Wide Web Conference. 2009, S. 641-641
[Volltext] [Kurzfassung]
[BibTeX]
Social bookmarking systems and their emergent information structures, known as folksonomies, are increasingly important data sources for Semantic Web applications. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as navigation support, semantic search, and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures derived from established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focus on similarity among tags and resources, considering different ways to aggregate annotations across users. After comparing how tag similarity measures predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory. We also investigate the issue of scalability. We ?nd that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable; per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity.
Voss, J.; Hotho, A. & Jäschke, R.: Mapping Bibliographic Records with Bibliographic Hash Keys. In: Kuhlen, R. (Hrsg.): Information: Droge, Ware oder Commons?. Verlag Werner Hülsbusch, 2009Proceedings of the ISI
[Volltext] [Kurzfassung]
[BibTeX]
This poster presents a set of hash keys for bibliographic records called bibkeys. Unlike other methods of duplicate detection, bibkeys can directly be calculated from a set of basic metadata fields (title, authors/editors, year). It is shown how bibkeys are used to map similar bibliographic records in BibSonomy and among distributed library catalogs and other distributed databases.
Benz, D.; Grobelnik, M.; Hotho, A.; Jäschke, R.; Mladenic, D.; Servedio, V. D. P.; Sizov, S. & Szomszor, M.: Analyzing Tag Semantics Across Collaborative Tagging Systems. In: Alani, H.; Staab, S. & Stumme, G. (Hrsg.): Social Web Communities. Dagstuhl, Germany: Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2008Dagstuhl Seminar Proceedings
[Volltext] [Kurzfassung]
[BibTeX]
The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, capturing and representing differences in tagging behavior and vocabulary of various folksonomies, with the overall aim to better understand the semantics of tags and the tagging process. Therefore we analyze the semantic content of tags in the Flickr and Delicious folksonomies. We find that: tag context similarity leads to meaningful results in Flickr, despite its narrow folksonomy character; the comparison of tags across Flickr and Delicious shows little semantic overlap, being tags in Flickr associated more to visual aspects rather than technological as it seems to be in Delicious; there are regions in the tag-tag space, provided with the cosine similarity metric, that are characterized by high density; the order of tags inside a post has a semantic relevance.
Berendt, B.; Glance, N. & Hotho, A. (Hrsg.): Wikis, Blogs, Bookmarking Tools - Mining the Web 2.0 Workshop. Workshop at 18th Europ. Conf. on Machine Learning (ECML'08) / 11th Europ. Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD'08), 2008
[Volltext]
[BibTeX]
Cattuto, C.; Benz, D.; Hotho, A. & Stumme, G.: Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems. , 2008
[Volltext] [Kurzfassung]
[BibTeX]
Social bookmarking systems allow users to organise collections of resources on the Web in a collaborative fashion. The increasing popularity of these systems as well as first insights into their emergent semantics have made them relevant to disciplines like knowledge extraction and ontology learning. The problem of devising methods to measure the semantic relatedness between tags and characterizing it semantically is still largely open. Here we analyze three measures of tag relatedness: tag co-occurrence, cosine similarity of co-occurrence distributions, and FolkRank, an adaptation of the PageRank algorithm to folksonomies. Each measure is computed on tags from a large-scale dataset crawled from the social bookmarking system del.icio.us. To provide a semantic grounding of our findings, a connection to WordNet (a semantic lexicon for the English language) is established by mapping tags into synonym sets of WordNet, and applying there well-known metrics of semantic similarity. Our results clearly expose different characteristics of the selected measures of relatedness, making them applicable to different subtasks of knowledge extraction such as synonym detection or discovery of concept hierarchies.
Cattuto, C.; Benz, D.; Hotho, A. & Stumme, G.: Semantic Grounding of Tag Relatedness in Social Bookmarking Systems. The Semantic Web - ISWC 2008. Springer Berlin / Heidelberg, 2008 (Lecture Notes in Computer Science 5318), S. 615-631
[Volltext] [Kurzfassung]
[BibTeX]
Collaborative tagging systems have nowadays become important data sources for populating semantic web applications. For tasks
like synonym detection and discovery of concept hierarchies, many researchers introduced measures of tag similarity. Eventhough most of these measures appear very natural, their design often seems to be rather ad hoc, and the underlying assumptionson the notion of similarity are not made explicit. A more systematic characterization and validation of tag similarity interms of formal representations of knowledge is still lacking. Here we address this issue and analyze several measures oftag similarity: Each measure is computed on data from the social bookmarking system del.icio.us and a semantic grounding isprovided by mapping pairs of similar tags in the folksonomy to pairs of synsets in Wordnet, where we use validated measuresof semantic distance to characterize the semantic relation between the mapped tags. This exposes important features of theinvestigated similarity measures and indicates which ones are better suited in the context of a given semantic application.
Hotho, A.; Benz, D.; Jäschke, R. & Krause, B. (Hrsg.): ECML PKDD Discovery Challenge 2008 (RSDC'08). Workshop at 18th Europ. Conf. on Machine Learning (ECML'08) / 11th Europ. Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD'08), 2008
[Volltext]
[BibTeX]
Jäschke, R.; Hotho, A.; Schmitz, C.; Ganter, B. & Stumme, G.: Discovering Shared Conceptualizations in Folksonomies. In: Web Semantics: Science, Services and Agents on the World Wide Web 6 (2008), Nr. 1, S. 38-53
[Volltext]
[Kurzfassung]
[BibTeX]
Social bookmarking tools are rapidly emerging on the Web. In such systems users are setting up lightweight conceptual structures called folksonomies. Unlike ontologies, shared conceptualizations are not formalized, but rather implicit. We present a new data mining task, the mining of all frequent tri-concepts, together with an efficient algorithm, for discovering these implicit shared conceptualizations. Our approach extends the data mining task of discovering all closed itemsets to three-dimensional data structures to allow for mining folksonomies. We provide a formal definition of the problem, and present an efficient algorithm for its solution. Finally, we show the applicability of our approach on three large real-world examples.
Jäschke, R.; Krause, B.; Hotho, A. & Stumme, G.: Logsonomy ? A Search Engine Folksonomy. Proceedings of the Second International Conference on Weblogs and Social Media(ICWSM 2008). AAAI Press, 2008
[Volltext] [Kurzfassung]
[BibTeX]
In social bookmarking systems users describe bookmarks
by keywords called tags. The structure behind
these social systems, called folksonomies, can be
viewed as a tripartite hypergraph of user, tag and resource
nodes. This underlying network shows specific
structural properties that explain its growth and the possibility
of serendipitous exploration.
Search engines filter the vast information of the web.
Queries describe a user?s information need. In response
to the displayed results of the search engine, users click
on the links of the result page as they expect the answer
to be of relevance. The clickdata can be represented as a
folksonomy in which queries are descriptions of clicked
URLs. This poster analyzes the topological characteristics
of the resulting tripartite hypergraph of queries,
users and bookmarks of two query logs and compares it
two a snapshot of the folksonomy del.icio.us.
Jäschke, R.; Marinho, L.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag Recommendations in Social Bookmarking Systems. In: AI Communications 21 (2008), Nr. 4, S. 231-247
[Volltext]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems allow users to assign keywords - so called "tags" - to resources. Tags are used for navigation, finding resources and serendipitous browsing and thus provide an immediate benefit for users. These systems usually include tag recommendation mechanisms easing the process of finding good tags for a resource, but also consolidating the tag vocabulary across users. In practice, however, only very basic recommendation strategies are applied.
In this paper we evaluate and compare several recommendation algorithms on large-scale real life datasets: an adaptation of
user-based collaborative filtering, a graph-based recommender built on top of the FolkRank algorithm, and simple methods based on counting tag occurences. We show that both FolkRank and Collaborative Filtering provide better results than non-personalized baseline methods. Moreover, since methods based on counting tag occurrences are computationally cheap, and thus usually preferable for real time scenarios, we discuss simple approaches for improving the performance of such methods. We show, how a simple recommender based on counting tags from users and resources can perform almost as good as the best recommender.
Krause, B.; Hotho, A. & Stumme, G.: A Comparison of Social Bookmarking with Traditional Search. Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008. Springer, 2008 (4956), S. 101-113
[Volltext] [Kurzfassung]
[BibTeX]
Social bookmarking systems allow users to store links to internet resources on a web page. As social bookmarking systems are growing in popularity, search algorithms have been developed that transfer the idea of link-based rankings in the Web to a social bookmarking system?s
data structure. These rankings differ from traditional search engine rankings in that they incorporate the rating of users.
In this study, we compare search in social bookmarking systems with traditionalWeb search. In the first part, we compare the user activity and behaviour in both kinds of systems, as well as the overlap of the underlying sets of URLs. In the second part,we compare graph-based and vector space rankings for social bookmarking systems with commercial search engine rankings.
Our experiments are performed on data of the social bookmarking system Del.icio.us and on rankings and log data from Google, MSN, and AOL. We will show that part of the difference between the systems is due to different behaviour (e. g., the concatenation of multi-word lexems
to single terms in Del.icio.us), and that real-world events may trigger similar behaviour in both kinds of systems. We will also show that a graph-based ranking approach on folksonomies yields results that are closer to the rankings of the commercial search engines than vector space
retrieval, and that the correlation is high in particular for the domains that are well covered by the social bookmarking system.
Krause, B.; Jäschke, R.; Hotho, A. & Stumme, G.: Logsonomy - social information retrieval with logdata. HT '08: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia. New York, NY, USA: ACM, 2008, S. 157-166
[Volltext] [Kurzfassung]
[BibTeX]
Social bookmarking systems constitute an established part of the Web 2.0. In such systems users describe bookmarks by keywords called tags. The structure behind these social systems, called folksonomies, can be viewed as a tripartite hypergraph of user, tag and resource nodes. This underlying network shows specific structural properties that explain its growth and the possibility of serendipitous exploration.
Today's search engines represent the gateway to retrieve information from the World Wide Web. Short queries typically consisting of two to three words describe a user's information need. In response to the displayed results of the search engine, users click on the links of the result page as they expect the answer to be of relevance.
This clickdata can be represented as a folksonomy in which queries are descriptions of clicked URLs. The resulting network structure, which we will term logsonomy is very similar to the one of folksonomies. In order to find out about its properties, we analyze the topological characteristics of the tripartite hypergraph of queries, users and bookmarks on a large snapshot of del.icio.us and on query logs of two large search engines. All of the three datasets show small world properties. The tagging behavior of users, which is explained by preferential attachment of the tags in social bookmark systems, is reflected in the distribution of single query words in search engines. We can conclude that the clicking behaviour of search engine users based on the displayed search results and the tagging behaviour of social bookmarking users is driven by similar dynamics.
Krause, B.; Schmitz, C.; Hotho, A. & Stumme, G.: The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems. AIRWeb '08: Proceedings of the 4th international workshop on Adversarial information retrieval on the web. New York, NY, USA: ACM, 2008, S. 61-68
[Volltext]
[BibTeX]
May, M.; Berendt, B.; Cornuéjols, A.; Gama, J.; Giannotti, F.; Hotho, A.; Malerba, D.; Menesalvas, E.; Morik, K.; Pedersen, R.; Saitta, L.; Saygin, Y.; Schuster, A. & Vanhoof, K.: Research Challenges in Ubiquitous Knowledge Discovery. Next Generation of Data Mining (Chapman & Hall/Crc Data Mining and Knowledge Discovery Series). 1. Aufl. Chapman & Hall/CRC, 2008
[Volltext]
[BibTeX]
Völker, J.; Vrande?i?, D.; Sure, Y. & Hotho, A.: AEON - An approach to the automatic evaluation of ontologies. In: Applied Ontology 3 (2008), Nr. 1-2, S. 41-62
[Volltext]
[Kurzfassung]
[BibTeX]
OntoClean is an approach towards the formal evaluation of taxonomic relations in ontologies. The application of OntoClean consists of two main steps. First, concepts are tagged according to meta-properties known as rigidity, unity, dependency and identity. Second, the tagged concepts are checked according to predefined constraints to discover taxonomic errors. Although OntoClean is well documented in numerous publications, it is still used rather infrequently due to the high costs of application. Especially, the manual tagging of concepts with the correct meta-properties requires substantial efforts of highly experienced ontology engineers. In order to facilitate the use of OntoClean and to enable the evaluation of real-world ontologies, we provide AEON, a tool which automatically tags concepts with appropriate OntoClean meta-properties and performs the constraint checking. We use the Web as an embodiment of world knowledge, where we search for patterns that indicate how to properly tag concepts. We thoroughly evaluated our approach against a manually created gold standard. The evaluation shows the competitiveness of our approach while at the same time significantly lowering the costs. All of our results, i.e. the tool AEON as well as the experiment data, are publicly available.
Benz, D. & Hotho, A.: Position Paper: Ontology Learning from Folksonomies.. In: Hinneburg, A. (Hrsg.): LWA 2007: Lernen - Wissen - Adaption, Halle, September 2007, Workshop Proceedings (LWA). Martin-Luther-University Halle-Wittenberg, 2007, S. 109-112
[Volltext]
[BibTeX]
Berendt, B.; Hotho, A.; Mladenic, D. & Semeraro, G. (Hrsg.): From Web to Social Web: Discovering and Deploying User and Content Profiles . Springer, 2007 (LNCS 4736)
[Volltext]
[Kurzfassung]
[BibTeX]
This book constitutes the refereed proceedings of the Workshop on Web Mining, WebMine 2006, held in Berlin, Germany, September 18th, 2006. Topics included are data mining based on analysis of bloggers and tagging, web mining, XML mining and further techniques of knowledge discovery. The book is especially valuable for those interested in the aspects of the Social Web (Web 2.0) and its inherent dynamic and diversity of user-generated content.
Cattuto, C.; Schmitz, C.; Baldassarri, A.; Servedio, V. D. P.; Loreto, V.; Hotho, A.; Grahl, M. & Stumme, G.: Network Properties of Folksonomies. In: AI Communications 20 (2007), Nr. 4, S. 245 - 262
[Volltext]
[BibTeX]
Proceedings of the First International Workshop on Emergent Semantics and Ontology Evolution, ESOE 2007, co-located with ISWC 2007 + ASWC 2007, Busan, Korea, November 12th, 2007. CEUR Workshop Proceedings , 2007
[Volltext]
[BibTeX]
Grahl, M.; Hotho, A. & Stumme, G.: Conceptual Clustering of Social Bookmark Sites. In: Hinneburg, A. (Hrsg.): Workshop Proceedings of Lernen - Wissensentdeckung - Adaptivität (LWA 2007). Martin-Luther-Universität Halle-Wittenberg, 2007, S. 50-54
[Volltext]
[BibTeX]
Grahl, M.; Hotho, A. & Stumme, G.: Conceptual Clustering of Social Bookmarking Sites. 7th International Conference on Knowledge Management (I-KNOW '07). Graz, Austria: Know-Center, 2007, S. 356-364
[Kurzfassung]
[BibTeX]
Currently, social bookmarking systems provide intuitive support for browsing locally their content. A global view is usually presented by the tag cloud of the
system, but it does not allow a conceptual drill-down, e. g., along a conceptual hierarchy. In this paper, we present a clustering approach for computing such a conceptual hierarchy for a given folksonomy. The hierarchy is complemented with ranked lists of users and resources most related to each cluster. The rankings are computed using our FolkRank algorithm. We have evaluated our approach on large scale data from the del.icio.us bookmarking system.
Hotho, A. & Stumme, G.: Mining the World Wide Web. In: Künstliche Intelligenz (2007), Nr. 3, S. 5-8
[Volltext]
[BibTeX]
Jäschke, R.; Hotho, A.; Schmitz, C. & Stumme, G.: Analysis of the Publication Sharing Behaviour in BibSonomy. Proceedings of the 15th International Conference on Conceptual Structures. Sheffield, England: 2007 (LNCS 4604)
[Kurzfassung]
[BibTeX]
BibSonomy is a web-based social resource sharing system which allows users to organise and share bookmarks and publications in a collaborative manner. In this paper we present the system, followed by a description of the insights in the structure of its bibliographic data that we gained by applying techniques we developed in the area of Formal Concept Analysis.
Jäschke, R.; Grahl, M.; Hotho, A.; Krause, B.; Schmitz, C. & Stumme, G.: Organizing Publications and Bookmarks in BibSonomy. In: Alani, H.; Noy, N.; Stumme, G.; Mika, P.; Sure, Y. & Vrandecic, D. (Hrsg.): Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at WWW 2007. Banff, Canada: 2007
[Volltext]
[BibTeX]
Jäschke, R.; Marinho, L.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag Recommendations in Folksonomies. In: Hinneburg, A. (Hrsg.): Workshop Proceedings of Lernen - Wissensentdeckung - Adaptivität (LWA 2007). Martin-Luther-Universität Halle-Wittenberg, 2007, S. 13-20
[Volltext]
[BibTeX]
Jäschke, R.; Marinho, L. B.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag Recommendations in Folksonomies. In: Kok, J. N.; Koronacki, J.; de Mántaras, R. L.; Matwin, S.; Mladenic, D. & Skowron, A. (Hrsg.): Knowledge Discovery in Databases: PKDD 2007, 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw, Poland, September 17-21, 2007, Proceedings. Springer, 2007 (Lecture Notes in Computer Science 4702), S. 506-514
[Volltext]
[BibTeX]
Schmitz, C.; Grahl, M.; Hotho, A.; Stumme, G.; Catutto, C.; Baldassarri, A.; Loreto, V. & Servedio, V. D. P.: Network Properties of Folksonomies. Proc. WWW2007 Workshop ``Tagging and Metadata for Social Information Organization''. Banff: 2007
[BibTeX]
Völker, J.; Vrandecic, D.; Sure, Y. & Hotho, A.: Learning Disjointness. In: Franconi, E.; Kifer, M. & May, W. (Hrsg.): Proceedings of the European Semantic Web Conference, ESWC2007. Springer-Verlag, 2007 (Lecture Notes in Computer Science 4519)
[Volltext]
[BibTeX]
Ackermann, M.; Berendt, B.; Grobelnik, M.; Hotho, A.; Mladenic, D.; Semeraro, G.; Spiliopoulou, M.; Stumme, G.; Svatek, V. & van Someren, M.: Semantics, Web and Mining. 2006
[Volltext]
[BibTeX]
Bloehdorn, S. & Hotho, A.: Boosting for Text Classification with Semantic Features. Advances in Web Mining and Web Usage Analysis. Springer, 2006 (LNCS 3932), S. 149-166
[Volltext]
[BibTeX]
Bloehdorn, S.; Cimiano, P. & Hotho, A.: Learning Ontologies to Improve Text Clustering and Classification. From Data and Information Analysis to Knowledge Engineering. Springer Berlin Heidelberg, 2006, S. 334-341
[Volltext] [Kurzfassung]
[BibTeX]
Recent work has shown improvements in text clustering and classification tasks by integrating conceptual features extracted from ontologies. In this paper we present text mining experiments in the medical domain in which the ontological structures used are acquired automatically in an unsupervised learning process from the text corpus in question. We compare results obtained using the automatically learned ontologies with those obtained using manually engineered ones. Our results show that both types of ontologies improve results on text clustering and classification tasks, whereby the automatically acquired ontologies yield a improvement competitive with the manually engineered ones.
ER -
Haase, P.; Ehrig, M.; Hotho, A. & Schnizler, B.: Personalized Information Access in a Bibliographic Peer-to-Peer System. In: Staab, S. & Stuckenschmidt, H. (Hrsg.): Peer-to-Peer and SemanticWeb, Decentralized Management and Exchange of Knowledge and Information. Springer , 2006, S. 143-158
[BibTeX]
Hoser, B.; Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: Semantic Network Analysis of Ontologies. Proceedings of the 3rd European Semantic Web Conference. Budva, Montenegro: Springer, 2006 (LNCS 4011), S. 514-529
[Volltext]
[BibTeX]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: BibSonomy: A Social Bookmark and Publication Sharing System. Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures. 2006, S. 87-102
[Volltext]
[BibTeX]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: Emergent Semantics in BibSonomy. Proc. Workshop on Applications of Semantic Technologies, Informatik 2006. Dresden: 2006 (P-94)
[Volltext]
[BibTeX]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: FolkRank: A Ranking Algorithm for Folksonomies. Proc. FGIR 2006. 2006, S. 111-114
[Volltext] [Kurzfassung]
[BibTeX]
In social bookmark tools users are setting up
lightweight conceptual structures called folksonomies. Currently,
the information retrieval support is limited. We present a formal
model and a new search algorithm for folksonomies, called
FolkRank, that exploits the structure of the folksonomy. The
proposed algorithm is also applied to find communities within the
folksonomy and is used to structure search results. All findings are
demonstrated on a large scale dataset. A long version of this paper
has been published at the European Semantic Web Conference
2006.
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: Information Retrieval in Folksonomies: Search and Ranking. Proceedings of the 3rd European Semantic Web Conference . Budva, Montenegro: Springer, 2006 (LNCS 4011), S. 411-426
[Volltext]
[BibTeX]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: Trend Detection in Folksonomies. In: Avrithis, Y. S.; Kompatsiaris, Y.; Staab, S. & O'Connor, N. E. (Hrsg.): Proc. First International Conference on Semantics And Digital Media Technology (SAMT). Springer, 2006 (Lecture Notes in Computer Science 4306), S. 56-70
[Volltext]
[BibTeX]
Jäschke, R.; Hotho, A.; Schmitz, C.; Ganter, B. & Stumme, G.: TRIAS - An Algorithm for Mining Iceberg Tri-Lattices. Proc. 6th ICDM conference. Hong Kong: 2006
[BibTeX]
Jäschke, R.; Hotho, A.; Schmitz, C. & Stumme, G.: Wege zur Entdeckung von Communities in Folksonomies. In: Braß, S. & Hinneburg, A. (Hrsg.): Proc. 18. Workshop Grundlagen von Datenbanken. Halle-Wittenberg: Martin-Luther-Universität , 2006, S. 80-84
[BibTeX]
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G.: Content Aggregation on Knowledge Bases using Graph Clustering. Proceedings of the 3rd European Semantic Web Conference. Budva, Montenegro: Springer, 2006 (LNCS 4011), S. 530-544
[Volltext]
[BibTeX]
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G.: Kollaboratives Wissensmanagement. In: Pellegrini, T. & Blumauer, A. (Hrsg.): Semantic Web - Wege zur vernetzten Wissensgesellschaft. Springer, 2006, S. 273-290
[Volltext] [Kurzfassung]
[BibTeX]
Wissensmanagement in zentralisierten Wissensbasen erfordert
einen hohen Aufwand für Erstellung und Wartung, und es entspricht nicht
immer den Anforderungen der Benutzer. Wir geben in diesem Kapitel einen Überblick
über zwei aktuelle Ansätze, die durch kollaboratives Wissensmanagement
diese Probleme lösen können. Im Peer-to-Peer-Wissensmanagement unterhalten
Benutzer dezentrale Wissensbasen, die dann vernetzt werden können, um
andere Benutzer eigene Inhalte nutzen zu lassen. Folksonomies versprechen, die
Wissensakquisition so einfach wie möglich zu gestalten und so viele Benutzer in
den Aufbau und die Pflege einer gemeinsamen Wissensbasis einzubeziehen.
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G.: Mining Association Rules in Folksonomies. In: Batagelj, V.; Bock, H.-H.; Ferligoj, A. & ?iberna, A. (Hrsg.): Data Science and Classification (Proc. IFCS 2006 Conference). Berlin/Heidelberg: Springer, 2006Studies in Classification, Data Analysis, and Knowledge Organization , S. 261-270
[Volltext]
[BibTeX]
Stumme, G.; Hotho, A. & Berendt, B.: Semantic Web Mining - State of the Art and Future Directions. In: Journal of Web Semantics 4 (2006), Nr. 2, S. 124-143
[Volltext]
[Kurzfassung]
[BibTeX]
SemanticWeb Mining aims at combining the two fast-developing research areas SemanticWeb andWeb Mining.
This survey analyzes the convergence of trends from both areas: an increasing number of researchers is working on
improving the results ofWeb Mining by exploiting semantic structures in theWeb, and they make use ofWeb Mining
techniques for building the Semantic Web. Last but not least, these techniques can be used for mining the Semantic
Web itself.
The Semantic Web is the second-generation WWW, enriched by machine-processable information which supports
the user in his tasks. Given the enormous size even of today?s Web, it is impossible to manually enrich all of
these resources. Therefore, automated schemes for learning the relevant information are increasingly being used.
Web Mining aims at discovering insights about the meaning of Web resources and their usage. Given the primarily
syntactical nature of the data being mined, the discovery of meaning is impossible based on these data only. Therefore,
formalizations of the semantics of Web sites and navigation behavior are becoming more and more common.
Furthermore, mining the Semantic Web itself is another upcoming application. We argue that the two areas Web
Mining and Semantic Web need each other to fulfill their goals, but that the full potential of this convergence is not
yet realized. This paper gives an overview of where the two areas meet today, and sketches ways of how a closer
integration could be profitable.
Proc. of the European Web Mining Forum 2005. , 2005
[BibTeX]
Berendt, B.; Hotho, A. & Stumme, G.: Semantic Web Mining and the Representation, Analysis, and Evolution of Web Space. In: Svatek, V. & Snasel, V. (Hrsg.): Proc. of the 1st Intl. Workshop on Representation and Analysis of Web Space. Technical University of Ostrava, 2005, S. 1-16
[Volltext]
[BibTeX]
Bloehdorn, S.; Cimiano, P.; Hotho, A. & Staab, S.: An Ontology-based Framework for Text Mining. In: LDV Forum - GLDV Journal for Computational Linguistics and Language Technology 20 (2005), Nr. 1, S. 87-112
[BibTeX]
Proceedings of the Workshop on Learning in Web Search (LWS 2005) . , 2005
[Volltext]
[BibTeX]
Cimiano, P.; Hotho, A. & Staab, S.: Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis. In: Journal of Artificial Intelligence Research (JAIR) 24 (2005), S. 305-339
[Volltext]
[BibTeX]
Haase, P.; Hotho, A.; Schmidt-Thieme, L. & Sure, Y.: Collaborative and Usage-Driven Evolution of Personal Ontologies.. In: Gómez-Pérez, A. & Euzenat, Jé. (Hrsg.): ESWC. Springer, 2005 (Lecture Notes in Computer Science 3532), S. 486-499
[BibTeX]
Hotho, A.; Nürnberger, A. & Paaß, G.: A Brief Survey of Text Mining. In: LDV Forum - GLDV Journal for Computational Linguistics and Language Technology 20 (2005), Nr. 1, S. 19-62
[Volltext]
[BibTeX]
Hotho, A.: Text Clustern mit Hintergrundwissen (Dissertationsbeschreibung). In: Künstliche Intelligenz (KI) 1 (2005), S. 62-64
[Volltext]
[BibTeX]
Berendt, B.; Hotho, A. & Stumme, G.: Usage Mining for and on the Semantic Web.. In: Kargupta, H.; Joshi, A.; Sivakumar, K. & Yesha, Y. (Hrsg.): Data Mining Next Generation Challenges and Future Directions. Boston: AAAI Press, 2004, S. 461-481
[BibTeX]
Berendt, B.; Hotho, A.; Mladenic, D.; van Someren, M.; Spiliopoulou, M. & Stumme, G. (Hrsg.): Web Mining: From Web to Semantic Web. Heidelberg: Springer, 2004 (LNAI 3209)
[BibTeX]
Bloehdorn, S. & Hotho, A.: Boosting for Text Classification with Semantic Features. Proceedings of the MSW 2004 workshop at the 10th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2004, S. 70-87
[Volltext]
[BibTeX]
Bloehdorn, S. & Hotho, A.: Boosting for Text Classification with Semantic Features (reprint). Proceedings of the Workshop on Text-based Information Retrieval (TIR-04) at the 27th German Conference on Artificial Intelligence. 2004
[Volltext]
[BibTeX]
Bloehdorn, S. & Hotho, A.: Text Classification by Boosting Weak Learners based on Terms and Concepts. Proceedings of the Fourth IEEE International Conference on Data Mining. IEEE Computer Society Press, 2004, S. 331-334
[Volltext]
[BibTeX]
Cimiano, P.; Hotho, A. & Staab, S.: Clustering Ontologies from Text. Proceedings of the Conference on Languages Resources and Evaluation (LREC). Lisbon, Portugal: ELRA - European Language Ressources Association, 2004
[Volltext]
[BibTeX]
Cimiano, P.; Hotho, A. & Staab, S.: Comparing Conceptual, Divise and Agglomerative Clustering for Learning Taxonomies from Text. In: de Mántaras, R. Ló. & Saitta, L. (Hrsg.): Proceedings of the European Conference on Artificial Intelligence (ECAI'04). Valencia, Spain: IOS Press, 2004, S. 435-439
[Volltext]
[BibTeX]
Cimiano, P.; Hotho, A.; Stumme, G. & Tane, J.: Conceptual Knowledge Processing with Formal Concept Analysis and Ontologies. Proceedings of the The Second International Conference on Formal Concept Analysis (ICFCA 04). Springer, 2004 (LNCS 2961)
[Volltext]
[BibTeX]
Cimiano, P.; Hotho, A. & Staab, S.: Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis. , 2004
[Volltext]
[BibTeX]
Haase, P.; Ehrig, M.; Hotho, A. & Schnizler, B.: Personalized Information Access in a Bibliographic Peer-to-Peer System. Proceedings of the AAAI Workshop on Semantic Web Personalization, 2004. AAAI Press, 2004, S. 1-12
[Volltext]
[BibTeX]
Hotho, A.; Sure, Y. & Getoor, L.: A workshop report: mining for and from the Semantic Web at KDD 2004.. In: SIGKDD Explorations 6 (2004), Nr. 2, S. 142-143
[BibTeX]
Hotho, A.: Clustern mit Hintergrundwissen. Berlin: Akademische Verlagsgesellschaft Aka GmbH, 2004 (Diski 286)
[Volltext]
[BibTeX]
Hotho, A.: Clustern mit Hintergrundwissen. Universität Karlsruhe (TH), Institut AIFB, D-76128 Karlsruhe, University of Karlsruhe, 2004
[BibTeX]
International Workshop on Mining for and from the Semantic Web (MSW2004). , 2004
[Volltext]
[BibTeX]
Semantic Web Personalization. , 2004
[BibTeX]
Proceedings of the 1st European Web Mining Forum (EWMF 2003). Cavtat/Dubrovnik, Croatia, 2003
[BibTeX]
Hotho, A.; Staab, S. & Stumme, G.: Explaining Text Clustering Results using Semantic Structures. Proc. of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD. 2003 (LNCS 2838), S. 217-228
[BibTeX]
Lehren - Lernen - Wissen - Adaptivität (LLWA 2003). Universität Karlsruhe, 2003
[BibTeX]
Hotho, A.; Maedche, A.; Staab, S. & Zacharias, V.: On Knowledgeable Unsupervised Text Mining. Text Mining. 2003, S. 131-152
[BibTeX]
Hotho, A.; Staab, S. & Stumme, G.: Ontologies Improve Text Document Clustering. Proc. of the ICDM 03, The 2003 IEEE International Conference on Data Mining. 2003, S. 541-544
[Volltext]
[BibTeX]
Hotho, A.; Staab, S. & Stumme, G.: Text Clustering Based on Background Knowledge. , 2003
[Volltext]
[BibTeX]
Hotho, A.; Staab, S. & Stumme, G.: WordNet improves text document clustering. Proc. of the SIGIR 2003 Semantic Web Workshop. Toronto, Canada: 2003
[Volltext]
[BibTeX]
Lauser, B. & Hotho, A.: Automatic multi-label subject indexing in a multilingual environment. Proc. of the 7th European Conference in Research and Advanced Technology for Digital Libraries, ECDL 2003. Springer, 2003 (LNCS 2769), S. 140-151
[BibTeX]
Oberle, D.; Berendt, B.; Hotho, A. & Gonzalez, J.: Conceptual User Tracking. In: Ruiz, E. M.; Segovia, J. & Szczepaniak, P. S. (Hrsg.): Advances in Web Intelligence, First International Atlantic Web Intelligence Conference, AWIC 2003, Madrid, Spain, May 5-6, 2003, Proceedings. Springer, 2003 (Lecture Notes in Artificial Intelligence 2663), S. 142-154
[Volltext]
[BibTeX]
Staab, S. & Hotho, A.: Ontology-based Text Document Clustering.. Intelligent Information Processing and Web Mining, Proceedings of the International IIS: IIPWM'03 Conference held in Zakopane. 2003, S. 451-452
[Volltext]
[BibTeX]
Studer, R.; Stumme, G.; Handschuh, S.; Hotho, A. & Motik, B.: Building and Using the Semantic Web. New Trends in Knowledge Processing - Data Mining, Semantic Web and Computational Science. Proc. 6th Sanken International Symposium. Osaka, Japan: 2003, S. 31-34
[BibTeX]
Studer, R.; Volz, R.; Stumme, G. & Hotho, A.: Semantic Web - State of the art and future directions. In: KI Heft, Special Issue on the Semantic Web 3 (2003), S. 5-9
[BibTeX]
Semantic Web Mining. Helsinki, 2002
[BibTeX]
Berendt, B.; Hotho, A. & Stumme, G.: Towards Semantic Web Mining. In: Horrocks, I. & Hendler, J. A. (Hrsg.): Proceedings of the First International Semantic Web Conference: The Semantic Web (ISWC 2002). Sardinia, Italy: Springer, 2002 (Lecture Notes in Computer Science (LNCS) 2342), S. 264-278
[BibTeX]
Bozsak, E.; Ehrig, M.; Handschuh, S.; Hotho, A.; Maedche, A.; Motik, B.; Oberle, D.; Schmitz, C.; Staab, S.; Stojanovic, L.; Stojanovic, N.; Studer, R.; Stumme, G.; Sure, Y.; Tane, J.; Volz, R. & Zacharias, V.: KAON - Towards a Large Scale Semantic Web. In: Bauknecht, K.; Tjoa, A. M. & Quirchmayr, G. (Hrsg.): E-Commerce and Web Technologies, Third International Conference, EC-Web 2002, Proceedings. Berlin: Springer, 2002 (LNCS 2455), S. 304-313
[BibTeX]
Ehrig, M.; Handschuh, S.; Hotho, A.; Maedche, A.; Motik, B.; Oberle, D.; Schmitz, C.; Staab, S.; Stojanovic, L.; Stojanovic, N.; Studer, R.; Stumme, G.; Sure, Y.; Tane, J.; Volz, R. & Zacharias, V.: KAON - Towards a large scale Semantic Web. In: Bauknecht, K.; Tjoa, A. M. & Quirchmayr, G. (Hrsg.): Proc. E-Commerce and Web Technologies, Third International Conference, EC-Web 2002. Aix-en-Provence %, France : Springer, 2002LNCS
[BibTeX]
Hartmann, J.; Hotho, A. & Stumme, G.: Semantic Web Mining for Building Information Portals (Position Paper). Proc. Arbeitskreistreffen Knowledge Discovery, Oldenburg, Sept. 2002. 2002, S. 34-38
[BibTeX]
Hotho, A. & Stumme, G.: Conceptual Clustering of Text Clusters. Proceedings of FGML Workshop. Special Interest Group of German Informatics Society (FGML --- Fachgruppe Maschinelles Lernen der GI e.V.), 2002, S. 37-45
[Volltext]
[BibTeX]
Hotho, A.; Maedche, A.; Staab, S. & Zacharias, V.: On Knowledgeable Unsupervised Text Mining . Proc. of Text Mining Workshop. 2002
[Volltext]
[BibTeX]
Hotho, A.; Maedche, A. & Staab, S.: Text Clustering Based on Good Aggregations. In: Künstliche Intelligenz (KI) 16 (2002), Nr. 4, S. 48-54
[Volltext]
[BibTeX]
Stumme, G.; Berendt, B. & Hotho, A.: Usage Mining for and on the Semantic Web. Next Generation Data Mining. Proc. NSF Workshop, Baltimore, Nov. 2002. 2002, S. 77-86
[Volltext]
[BibTeX]
Hotho, A.: Analyse von Wettbewerbsverlusten im Telekommunikationsmarkt und mögliche Gegenmaßnahmen. , 2001
[BibTeX]
Hotho, A.; Maedche, A. & Staab, S.: Ontology-based Text Clustering. Proc. of the Workshop ``Text Learning: Beyond Supervision'' at IJCAI 2001. Seattle, WA, USA, August 6, 2001. 2001
[BibTeX]
Hotho, A.; Maedche, A.; Staab, S. & Studer, R.: SEAL-II -- The Soft Spot between Richly Structured and
Unstructured Knowledge. In: Journal of Universal Computer Science (J.UCS) 7 (2001), Nr. 7, S. 566-590
[BibTeX]
Hotho, A.; Maedche, A. & Staab, S.: Text Clustering Based on Good Aggregations. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining. Washington, DC, USA: IEEE Computer Society, 2001, S. 607-608
[Volltext]
[BibTeX]
Semantic Web Mining. Freiburg, 2001
[BibTeX]
Hotho, A.: Analyse von Wettbewerbsverlusten im Telekommunikationsmarkt und mögliche Gegenmaßnahmen. , 2000
[BibTeX]
Maedche, A.; Hotho, A. & Wiese, M.: Enhancing Preprocessing in Data-Intensive Domains using
Online-Analytical Processing. Data Warehousing and Knowledge Discovery, Second International Conference, DaWaK 2000, London, UK. Springer, 2000 (LNCS 1874), S. 258-264
[BibTeX]
Staab, S.; Angele, Jü.; Decker, S.; Hotho, A.; Maedche, A.; Schnurr, H.-P.; Studer, R. & Sure, Y.: AI for the Web - Ontology-based Community Web Portals. AAAI 2000/IAAI 2000 - Proceedings of the 17th National Conference on Artificial Intelligence and 12th Innovative Applications of Artificial Intelligence Conference, Austin/TX, USA, July 30-August 3, 2000. AAAI Press/MIT Press, 2000
[Volltext]
[BibTeX]
Staab, S.; Angele, J.; Decker, S.; Erdmann, M.; Hotho, A.; Maedche, A.; Schnurr, H.-P.; Studer, R. & Sure, Y.: Semantic Community Web Portals. WWW9 -- Proceedings of the 9th International World Wide Web Conference, Amsterdam, The Netherlands. Elsevier, 2000, S. 473-491
[BibTeX]
|
|
|