The World Wide Web is a rich source of information
about human behavior. It contains large amount of data organized via
interconnected Web pages, traces of information search, user feedback
on items of interest, etc. One of the important characteristics of the
Web in addition to large data volumes is its dynamics, where content,
structure and usage is changing over time. This shows up in the rise of
related research areas like communities of practice, knowledge
management, web communities, and peer to peer. In particular the notion
of collaborative work and thus the need of its systematic analysis
become more and more important. For instance, to develop effective web
applications, it is essential to analyze patterns hidden in the usage
of web resources, their contents and their interconnections. Machine
Learning and Data mining methods have been used extensively to find
patterns in usage of the network by exploiting both contents and link
structures.
The topics of the workshop include:
analysis of weblogs and blogs
semantic association identification by link
analysis
discovering social structures and communities
predicting trends and user behavior
analysis of dynamic networks
using prior knowledge and the semantic web for
user modelling
using content of the web for modelling
applications in e-commerce, education
discovering misuse and fraud
combining the web with data from other sources
network or link analysis of social resources
sharing systems
Important Dates:
-----------------
Submissions: June 28
Notifications: August 2
Camera ready: August 9
Workshop day: September 18
We invite two types of submissions for this
workshop:
Technical papers (maximum 12 pages)
Short position papers (maximum 6 pages)
Submitted papers will be peer-reviewed and
selected on the basis of these reviews. Accepted papers will be
presented at the workshop. All submissions should be made
electronically if possible, by email attachment and preferably in
Postscript or PDF format.
All submissions must be sent to
. Although not required for the initial submission, we recommend to
follow the format guidelines of ECML/PKDD
(Springer LNCS -- LaTeX
Style File), as this will be the required format for accepted
papers.
The workshop proceedings will be distributed
during the workshop. We plan to issue a post workshop publication of
selected papers by Springer Lecture Notes.
Accepted Papers
XPath-Wrapper Induction by
generalizing tree traversal patterns
(Tobias Anton, Technical University of Darmstadt, Germany)
An Analysis of Bloggers and Topics
for a Blog Recommender System
(Conor Hayes, Paolo Avesani, Sriharsha Veeramachaneni, ITC-IRST,
Trento, Italy)
Web Usage Mining and XML Mining: a
real case study
(Federico Michele Facca, Politecnico di Milano, Italy)
Semi-Supervised Learning to Extract
Attribute-Value Pairs from Product Descriptions on the Web
(Katharina Probst, Rayid Ghani, Marko Krema, Andy Fano, Yan Liu,
Accenture Technology Labs, Chicago, USA/Carnegie Mellon University,
Pittsburgh, PA, USA)
Mining Associations from Web Query
Logs
(Benjamin Rey, Pradhuman Jhala, Yahoo! Research, Burbank, USA)
Collaborative Filtering: Fallacies
and Insights in Measuring Similarity
(Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos N.
Papadopoulos, and Yannis Manolopoulos, Aristotle University of
Thessaloniki, Greece)
Discovering User Profiles from Papers
by Using Word Sense Disambiguation
(G. Semeraro, P. Basile, M. Degemmis, and P. Lops, Universita di Bari,
Italy)
Workshop Program
09:30-09:45
Welcome and opening
Session 1
09:45-10:30
Invited Talk Web Usage Mining &
Personalization in Noisy, Dynamic, and Ambiguous Environments
Olfa Nasraoui, University of Louisville
Collaborative Filtering: Fallacies and Insights
in Measuring Similarity (Panagiotis Symeonidis, Alexandros Nanopoulos,
Apostolos N. Papadopoulos, and Yannis Manolopoulos, Aristotle
University of Thessaloniki, Greece)
11:30-12:00
An Analysis of Bloggers and Topics for a Blog
Recommender System (Conor Hayes, Paolo Avesani, Sriharsha
Veeramachaneni, ITC-IRST, Trento, Italy)
12:00-12:30
Semi-Supervised Learning to Extract
Attribute-Value Pairs from Product Descriptions on the Web (Katharina Probst, Rayid Ghani, Marko Krema,
Andy Fano, Yan Liu, Accenture Technology Labs, Chicago, USA/Carnegie
Mellon University, Pittsburgh, PA, USA)
12:30-14:00
Lunch Break
Session 3
14:00-14:30
Web Usage Mining and XML Mining: a real case
study (Federico Michele Facca, Politecnico di Milano, Italy)
14:30-15:00
Mining Associations from Web Query Logs
(Benjamin Rey, Pradhuman Jhala, Yahoo! Research, Burbank, USA)
15:00-15:30
XPath-Wrapper Induction by generalizing tree
traversal patterns
(Tobias Anton, Technical University of Darmstadt, Germany)
15:30-16:00
Coffee Break
Session 4
16:00-16:30
Discovering User Profiles from Papers by Using
Word Sense Disambiguation (G. Semeraro, P. Basile, M. Degemmis, and P.
Lops, University of Bari, Italy)
16:30-17:00
Discussion, Closing
Proceedings
The electronic workshop proceedings are available as