To be Held in Conjunction with the First SIAM International Conference on Data Mining (SDM 2001)
Extracting content from text continues to be an important research problem for information processing and management. Approaches to capturing the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. We have clearly moved beyond the bag-of-words notion of text documents to exploit not only the patterns but also the structure of term usage. As digital libraries and the World Wide Web continue to add to the enormous volume of online textual material, effective yet scalable approaches to text mining will be needed. How can we know what a document is about without having to read it? How can we classify or categorize documents from diverse sources? These are some of the simple yet fundamental questions we must be able to address.
A one-day workshop on Text Mining is being held in conjunction with SDM 2001 in Chicago (April '01) to bring together researchers from a variety of disciplines to present their current approaches and results in text mining.
Algorithms and Models
- Bayesian Models
- Concept Decomposition
- Orthogonal Decomposition
- Probabilistic Models
- Vector Space Models
- Latent Semantic Indexing
- Graph-based Models
- Software and Toolkits
Applications
- Factor Analysis
- Text Classification
- Text Parsing
- Text Segmentation
- Text Summarization
- Software and Toolkits
- Query Structures
- Stemming
Attendees must register for SDM 2001; no separate registration is needed for this workshop.
To submit a paper for consideration, send four copies of the manuscript to Ms. Peggy Stewart (see address below). Electronic submissions (PostScript or PDF versions printable on 8.5 x 11 paper only) are strongly encouraged. To guarantee consideration, manuscripts must be received by January 25, 2001, and must be no more than 10 pages excluding figures, tables, and references. Submission of work in progress is also encouraged.
Send all submissions to:
Ms. Peggy Stewart
Attn: Text Mining Workshop
Army High Performance Computing Research Center
1100 South Washington Avenue
Minneapolis, MN 55415
Tel: (612) 626-8079
Fax: (612) 626-1596
stewart@arc.umn.edu
All papers accepted and received by the March 13, 2001 deadline (see
below) will appear in the Workshop Proceedings, which will be bound by
the Army High Performance Computing Research Center (AHPCRC)
and distributed to workshop attendees.
Cliff Behrens, Telcordia Technologies, Inc.
Katy Börner, Indiana
Murray Browne, Internet Pictures Corp.
Inderjit Dhillon, Texas
Chris Ding, Lawrence Berkeley National Lab. (NERSC)
Kyle Gallivan, Florida State
Liz Jessup, Colorado
Jacob Kogan, Univ. of Maryland, Baltimore County
Dian Martin, Tennessee
Haesun Park, Minnesota
Dulce Ponceleon, IBM Almaden
Bill Pottenger, Lehigh
Padma Raghavan, Penn State
Flavio Sartoretto, Univ. of Venezia (Italy)
Peiling Wang, Tennessee
Layne Watson, Virginia Tech
Jason Wu, Boeing
Organizer
Michael W. Berry
Department of Computer Science
203 Claxton Complex
University of Tennessee
Knoxville, TN 37996-3450
Phone: (865) 974-3838
Fax: (865) 974-4404

Assistant
Susan (Ling) Wo
Department of Computer Science
203 Claxton Complex
University of Tennessee
Knoxville, TN 37996-3450
Phone: (865) 974-0995
Fax: (865) 974-4404
Last modified on Feb 21, 2001.