Workshop on Web Mining 2006 (WebMine)

September 18th, 2006, Berlin, Germany

The World Wide Web is a rich source of information about human behavior. It contains large amount of data organized via interconnected Web pages, traces of information search, user feedback on items of interest, etc. One of the important characteristics of the Web in addition to large data volumes is its dynamics, where content, structure and usage is changing over time. This shows up in the rise of related research areas like communities of practice, knowledge management, web communities, and peer to peer. In particular the notion of collaborative work and thus the need of its systematic analysis become more and more important. For instance, to develop effective web applications, it is essential to analyze patterns hidden in the usage of web resources, their contents and their interconnections. Machine Learning and Data mining methods have been used extensively to find patterns in usage of the network by exploiting both contents and link structures.

The topics of the workshop include:

analysis of weblogs and blogs
semantic association identification by link analysis
discovering social structures and communities
predicting trends and user behavior
analysis of dynamic networks
using prior knowledge and the semantic web for user modelling
using content of the web for modelling
applications in e-commerce, education
discovering misuse and fraud
combining the web with data from other sources
network or link analysis of social resources sharing systems
analysis of folksonomies or web 2.0 application
deriving profiles from usage
personalized delivery of news and journals
semantic web personalization
semantic web technologies for recommender systems

Submission

Important Dates:
-----------------
Submissions: June 28
Notifications: August 2
Camera ready: August 9
Workshop day: September 18

We invite two types of submissions for this workshop:

Technical papers (maximum 12 pages)
Short position papers (maximum 6 pages)

Submitted papers will be peer-reviewed and selected on the basis of these reviews. Accepted papers will be presented at the workshop. All submissions should be made electronically if possible, by email attachment and preferably in Postscript or PDF format.

All submissions must be sent to . Although not required for the initial submission, we recommend to follow the format guidelines of ECML/PKDD (Springer LNCS -- LaTeX Style File), as this will be the required format for accepted papers.

The workshop proceedings will be distributed during the workshop. We plan to issue a post workshop publication of selected papers by Springer Lecture Notes.

Accepted Papers

XPath-Wrapper Induction by generalizing tree traversal patterns (Tobias Anton, Technical University of Darmstadt, Germany)
An Analysis of Bloggers and Topics for a Blog Recommender System (Conor Hayes, Paolo Avesani, Sriharsha Veeramachaneni, ITC-IRST, Trento, Italy)
Web Usage Mining and XML Mining: a real case study (Federico Michele Facca, Politecnico di Milano, Italy)
Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web (Katharina Probst, Rayid Ghani, Marko Krema, Andy Fano, Yan Liu, Accenture Technology Labs, Chicago, USA/Carnegie Mellon University, Pittsburgh, PA, USA)
Mining Associations from Web Query Logs (Benjamin Rey, Pradhuman Jhala, Yahoo! Research, Burbank, USA)
Collaborative Filtering: Fallacies and Insights in Measuring Similarity (Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, and Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece)
Discovering User Profiles from Papers by Using Word Sense Disambiguation (G. Semeraro, P. Basile, M. Degemmis, and P. Lops, Universita di Bari, Italy)

Workshop Program

09:30-09:45	Welcome and opening
Session 1
09:45-10:30	Invited Talk Web Usage Mining & Personalization in Noisy, Dynamic, and Ambiguous Environments Olfa Nasraoui, University of Louisville
10:30-11:00	Coffee Break
Session 2
11:00-11:30	Collaborative Filtering: Fallacies and Insights in Measuring Similarity (Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, and Yannis Manolopoulos, Aristotle University of Thessaloniki, Greece)
11:30-12:00	An Analysis of Bloggers and Topics for a Blog Recommender System (Conor Hayes, Paolo Avesani, Sriharsha Veeramachaneni, ITC-IRST, Trento, Italy)
12:00-12:30	Semi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web (Katharina Probst, Rayid Ghani, Marko Krema, Andy Fano, Yan Liu, Accenture Technology Labs, Chicago, USA/Carnegie Mellon University, Pittsburgh, PA, USA)
12:30-14:00	Lunch Break
Session 3
14:00-14:30	*Web Usage Mining and XML Mining: a real case study* (Federico Michele Facca, Politecnico di Milano, Italy)
14:30-15:00	Mining Associations from Web Query Logs (Benjamin Rey, Pradhuman Jhala, Yahoo! Research, Burbank, USA)
15:00-15:30	XPath-Wrapper Induction by generalizing tree traversal patterns (Tobias Anton, Technical University of Darmstadt, Germany)
15:30-16:00	Coffee Break
Session 4
16:00-16:30	Discovering User Profiles from Papers by Using Word Sense Disambiguation (G. Semeraro, P. Basile, M. Degemmis, and P. Lops, University of Bari, Italy)
16:30-17:00	Discussion, Closing

Proceedings

The electronic workshop proceedings are available as

Organization

Workshop Chairs:

Bettina Berendt, Institute of Information Systems of Humboldt University Berlin, Germany
Andreas Hotho, Group of Knowledge and Data Engineering, University of Kassel, Germany
Dunja Mladenic, J. Stefan Institute, Ljubljana, Slovenia
Giovanni Semeraro, Department of Informatics, University of Bari, Italy

Program Committee (to be extended)

Sarabjot Anand, UK
Mathias Bauer, Germany
Bettina Berendt, Germany
Janez Brank, Slovenia
Michelangelo Ceci, Italy
Marco Degemmis, Italy
Miha Grcar, Slovenia
Marko Grobelnik, Slovenia
Andreas Hotho, Germany
Pasquale Lops, Italy
Dunja Mladenic, Slovenia
Bamshad Mobasher, USA
Ion Muslea, USA
Giovanni Semeraro, Italy
Myra Spiliopoulou, Germany
Gerd Stumme, Germany
Maarten van Someren, Netherlands

Related Events

Kontakt: