KDD Nugget 94:12, e-mailed 94-06-15

Contents:
* X. Wu, AI'94 Tutorial on Intelligent Learning Database Systems
  (Nov 94, Queensland, Australia)
* A. Freitas, Query: Are there LARGE KDD benchmark databases?
* D. Fisher, CFP: AI & Statistics 1995 workshop (June 30 deadline!)
* A. Motro, CFP: Next Generation Information Technologies and Systems

The KDD Nuggets is a moderated list for the exchange of information
relevant to Knowledge Discovery in Databases (KDD, also known as Data
Mining), e.g. application descriptions, conference announcements,
tool reviews, information requests, interesting ideas, clever opinions, etc.
It has been coming out about every two to three weeks, depending on the
quantity and urgency of submissions.

Back issues, FAQ, and other KDD-related information are now available
via Mosaic, URL http://info.gte.com/~kdd/
or by anonymous ftp to ftp.gte.com, cd /pub/kdd, get README

Email contributions to kdd@gte.com; add/delete requests to kdd-request@gte.com

-- Gregory Piatetsky-Shapiro (moderator)

********************* Official disclaimer ***********************************
* All opinions expressed herein are those of the writers (or the moderator) *
* and not necessarily of their respective employers (or GTE Laboratories)   *
*****************************************************************************

~~~~~~~~~~~~ Quotable Quote ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The old saying: statistics are like a bikini--what they reveal
is tantalizing, what they cover up is vital.
        from usenet
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

-----------------------------
From: Xindong Wu
Subject: AI'94 Tutorial on Intelligent Learning Database Systems
Date: Thu, 2 Jun 1994 09:57:44 +1000 (+1000)

Seventh Australian Joint Conference on Artificial Intelligence (AI'94)
"Sowing the Seeds for the Future"

C A L L   F O R   P A R T I C I P A T I O N

AI'94 Tutorial on Intelligent Learning Database Systems

by Dr Xindong Wu
Dept.
of Computer Science, James Cook University,
Townsville, Queensland 4811, Australia
xindong@coral.cs.jcu.edu.au

22 November 1994

ABSTRACT

Knowledge acquisition from databases is a research frontier for both
database technology and machine learning (ML) techniques, and has seen
sustained research over recent years. It also acts as a link between
the two fields, thus offering a dual benefit. Firstly, since database
technology has already found wide application in many fields, ML
research stands to gain from this greater exposure and established
technological foundation. Secondly, ML techniques can augment the
ability of existing database systems to represent, acquire, and process
expertise such as that which forms part of the semantics of many
advanced applications (e.g. CAD/CAM).

This full-day tutorial will present and discuss techniques for the
following 3 interconnected phases in constructing intelligent learning
database systems:
(1) Translation of standard database information into a form suitable
    for use by a rule-based system;
(2) Using machine learning techniques to produce rule bases from
    databases; and
(3) Interpreting the rules produced to solve users' problems and/or
    reduce data spaces.

It will suit a wide audience (including postgraduate students and
people from industry) working in databases, expert systems, and
machine learning.

CONTENTS

1. Knowledge Acquisition from Databases: Problem and Domain
   1.1 Problems in Conventional Databases
   1.2 Research Topics in Intelligent Databases
   1.3 Requirements for Knowledge Discovery in Databases
2. Typical Inductive Learning Algorithms
   2.1 The ID3 Family
   2.2 The AQ Family
   2.3 The HCV Family
3. Integrating More Semantic Information into Data Models
   3.1 The E-R Model
   3.2 Deductive and Object-Oriented Databases
   3.3 More Expressive Representations
4. An Intelligent Learning Database System
   To introduce a PC shell developed at Edinburgh
5. Conclusions and Research Directions in the Field
6. A Practical Component Using a PC Lab

COURSE MATERIAL

- Xindong Wu, Research Issues in Intelligent Learning Database Systems,
  Proceedings of the Seventh Annual Florida AI Research Symposium,
  Pensacola Beach, Florida, U.S.A., May 5-7, 1994, 137-141.
- Xindong Wu, Inductive Learning: Algorithms and Frontiers,
  Artificial Intelligence Review, 7(1993), 2: 93-108.
- Xindong Wu, KEshell2: An Intelligent Learning Data Base System,
  Research and Development in Expert Systems IX, M.A. Bramer and
  R.W. Milne (Eds.), Cambridge University Press, U.K., 1992, 253-272.

BIO-DATA OF THE PRESENTER

Dr Xindong Wu received his first and Master's degrees in Computer
Science from Hefei University of Technology, China, and his Ph.D. in
Artificial Intelligence from the University of Edinburgh, Britain.
He has authored 2 technical books, Expert Systems Technology (1988)
and Constructing Expert Systems (1990). He has also published over 60
papers in various periodicals (such as Expert Systems: The
International Journal of Knowledge Engineering, Artificial Intelligence
Review, Informatica, and the Journal of Computer Science and
Technology) and in conference proceedings (e.g., Research and
Development in Expert Systems IX, and the 21st ACM Computer Science
Conference). His technical interests include machine learning, expert
systems, intelligent database systems, and knowledge-based software
engineering. He is an editor on the Editorial Board of the Europe-based
Informatica: An International Journal of Computing and Informatics, and
a member of the Editorial Board of the U.S.A.-based International
Journal of Computers and Their Applications. He has taught courses in
Combinatorial Mathematics, Expert Systems, Knowledge Representation and
Inference, Machine Learning, Advanced Data Structures and Databases,
Introduction to Computer Science, and Artificial Intelligence.

PREREQUISITES

Databases, Expert Systems, and (preferably) Prolog.
-------------------------------
From: Freitas A A
Date: Thu, 2 Jun 94 10:30:50 BST
Subject: Looking for KDD benchmark

I wonder if there are some databases used as a kind of 'benchmark'
for KDD. The University of California at Irvine's repository of machine
learning databases seems to contain relatively small data sets, rather
than the large data sets associated with KDD.

Alex Alves Freitas
PhD student, University of Essex, UK
e-mail: freial@essex.ac.uk

--------------------------------
Date: Tue, 7 Jun 1994 10:59:05 -0400
From: dfisher@vuse.vanderbilt.edu (Douglas H. Fisher)
Subject: AI Stats list (V. 1, N. 6)

Call For Papers

Fifth International Workshop on
Artificial Intelligence and Statistics

January 4-7, 1995
Ft. Lauderdale, Florida

PURPOSE:
This is the fifth in a series of workshops which has brought together
researchers in Artificial Intelligence and in Statistics to discuss
problems of mutual interest. The exchange has broadened research in
both fields and has strongly encouraged interdisciplinary work. This
workshop will have as its primary theme:

    ``Learning from data''

Papers on other aspects of the interface between AI & Statistics are
*strongly* encouraged as well (see TOPICS below).

FORMAT:
To encourage interaction and a broad exchange of ideas, the
presentations will be limited to about 20 discussion papers in single
session meetings over three days (Jan. 5-7). Focussed poster sessions
will provide the means for presenting and discussing the remaining
research papers. Papers for poster sessions will be treated equally
with papers for presentation in publications. Attendance at the
workshop will *not* be limited.

The three days of research presentations will be preceded by a day of
tutorials (Jan. 4). These are intended to expose researchers in each
field to the methodology used in the other field. The Tutorial Chair
is Prakash Shenoy. Suggestions on tutorial topics can be sent to him
at pshenoy@ukanvm.bitnet.

LANGUAGE:
The language will be English.
TOPICS OF INTEREST:
The fifth workshop has a primary theme of

    ``Learning from data''

At least one third of the workshop schedule will be set aside for
papers with this theme. Other themes will be developed according to
the strength of the papers in other areas, including but not limited to:

- integrated man-machine modeling methods
- empirical discovery and statistical methods for knowledge acquisition
- probability and search
- uncertainty propagation
- combined statistical and qualitative reasoning
- inferring causation
- quantitative programming tools and integrated software for data
  analysis and modeling
- discovery in databases
- meta data and design of statistical data bases
- automated data analysis and knowledge representation for statistics
- cluster analysis

SUBMISSION REQUIREMENTS:
Three copies of an extended abstract (up to four pages) should be sent to

    H. Lenz, Program Chair
    5th Int'l Workshop on AI & Stats
    Free University of Berlin
    Department of Economics
    Institute for Statistics and Econometrics
    14185 Berlin, Garystr 21
    Germany

or

    D. Fisher, General Chair
    5th Int'l Workshop on AI & Stats
    Box 1679, Station B
    Department of Computer Science
    Vanderbilt University
    Nashville, Tennessee 37235
    USA

or electronically (postscript or latex documents preferred) to

    ai-stats-95@vuse.vanderbilt.edu

Submissions for discussion papers (and poster presentations) will be
considered if *postmarked* by June 30, 1994. If the submission is
electronic (e-mail), then it must be *received* by midnight June 30,
1994. Abstracts postmarked after this date but *before* July 31, 1994,
will be considered for poster presentation *only*.

Please indicate which topic(s) your abstract addresses and include an
electronic mail address for correspondence. Receipt of all submissions
will be confirmed via electronic mail. Acceptance notices will be
mailed by September 1, 1994. Preliminary papers (up to 20 pages) must
be returned by November 1, 1994.
These preliminary papers will be copied and distributed at the workshop.

PROGRAM COMMITTEE:

General Chair:  D. Fisher         Vanderbilt U., USA
Program Chair:  H. Lenz           Free U. Berlin, Germany

Members:
    W. Buntine        NASA (Ames), USA
    J. Catlett        AT&T Bell Labs, USA
    P. Cheeseman      NASA (Ames), USA
    P. Cohen          U. of Mass., USA
    D. Draper         UCLA, USA
    Wm. Dumouchel     Columbia U., USA
    A. Gammerman      U. of London, UK
    D. J. Hand        Open U., UK
    P. Hietala        U. Tampere, Finland
    R. Kruse          TU Braunschweig, Germany
    S. Lauritzen      Aalborg U., Denmark
    W. Oldford        U. of Waterloo, Canada
    J. Pearl          UCLA, USA
    D. Pregibon       AT&T Bell Labs, USA
    E. Roedel         Humboldt U., Germany
    G. Shafer         Rutgers U., USA
    P. Smythe         JPL, USA
    D. Spiegelhalter  Cambridge U., UK

MORE INFORMATION:
For more information write dfisher@vuse.vanderbilt.edu or write to
ai-stats-request@watstat.uwaterloo.ca to subscribe to the AI and
Statistics mailing list.

---------------------------------------------------------
Date: Wed, 15 Jun 94 08:41:33 -0500
From: ami@aviv.isse.gmu.edu (Amihai Motro)
Subject: [DBWORLD:286] NGITS-95 Call for Papers

Call for Papers

NGITS '95
The Second International Workshop on
Next Generation Information Technologies and Systems

27 - 29 June 1995
Hotel Carlton, Naharia, ISRAEL

Supported by the Technion - Israel Institute of Technology
and the Neaman Institute

As information technology advances, requirements of and expectations
from information systems change rapidly. This requires researchers and
developers to continuously focus on the next generation of information
systems. The NGITS Workshop provides an international forum for
discussing issues and solutions related exclusively to next generation
information systems and the technologies that would make them possible.
These issues include, but are not limited to:

o Data and knowledge base challenges: advanced models and languages,
  data integrity and quality, management of uncertainty and
  inconsistency, information security and privacy, management of
  spatial and temporal data

o Software architectures for information systems: object-orientation,
  agent-orientation, extensibility, groupware, software repositories,
  application generators

o Integration: intelligent integration and interchange of information,
  interoperability and cooperation among heterogeneous information
  systems, information mediation and brokering, standardization

o AI techniques: knowledge management, knowledge representation and
  reasoning, knowledge discovery, information extraction and filtering,
  coordination technologies and agent architectures

o Human-computer interaction: advanced user interfaces, human-computer
  cooperation

o The impact of new technologies: multi-media, mobile computing, very
  high speed networks, etc.

o Challenging applications: services and tools to support information
  infrastructure ("information super-highways"), digital libraries,
  large scientific and geographical databases, health care (medical)
  information systems, information systems for advanced manufacturing

We solicit contributions of three kinds:

* Full research papers
* Short position papers
* Proposals for panel discussions

All contributions must emphasize their relevance to issues of next
generation information technologies and systems, and must be addressed
to an audience of diverse backgrounds and interests. The category of
"research papers" is intended for technical papers describing research
accomplishments. The category of "position papers" is intended for
papers that discuss new challenges and visionary solutions. Proposals
for panels should include an abstract of the subject and likely
participants.

The workshop will feature paper sessions, panel discussions and talks
by invited speakers.
All accepted papers will appear in the workshop proceedings. Selected
papers will be published in a special issue of the Journal of
Intelligent Information Systems.

________________________
Program Committee Chairs
________________________

Ami Motro
Department of Information and Software Systems Engineering
George Mason University
Fairfax, Virginia 20030
USA
ngits@isse.gmu.edu

Moshe Tennenholtz
Faculty of Industrial Engineering and Management
Technion - Israel Institute of Technology
Haifa, 32000
ISRAEL
ngits@ie.technion.ac.il

_______________________
Information For Authors
_______________________

For research papers, authors should submit extended abstracts of 2000
words or less; the full papers that will appear in the proceedings are
limited to 5000 words. Position papers are limited to 2000 words.

We shall attempt to handle the submission and review processes by
electronic mail and request that you submit your contributions by
mailing a PostScript version of your paper to BOTH co-chairs at the
above e-mail addresses. Otherwise, please send 3 copies of your paper
to BOTH co-chairs at the above postal addresses.

The organizers request advance notification of your intention to submit
a paper: please send an e-mail message to both co-chairs giving the
names of the authors and the title or subject of the submission.

_______________
Important dates
_______________

31 October 1994     Intent-to-submit statements due
31 December 1994    Extended abstracts (for full papers), position
                    papers, and panel proposals due
28 February 1995    Notification of acceptance
15 April 1995       Camera-ready manuscripts due
27-29 June 1995     The workshop

_________________________
Location and Travel Funds
_________________________

The workshop will take place in Naharia, a picturesque resort town on
the Mediterranean sea, 30 kilometers north of Haifa. Limited travel
funds will be available to assist some of the participants.
______________
General Chairs
______________

Opher Etzion
Technion - Israel Institute of Technology
ISRAEL

Arie Segev
University of California at Berkeley
and Lawrence Berkeley Laboratories
USA

_________________
Program Committee
_________________

Serge Abiteboul, INRIA, France
Ron Ashany, National Science Foundation, USA
Hagit Attiya, Technion, Israel
Dan Berry, Carnegie Mellon U., USA and Technion, Israel
Elisa Bertino, U. Milano, Italy
Yitzhak Birk, Technion, Israel
Yuri Breitbart, U. Kentucky, USA
Alex Brodsky, George Mason University, USA
Peter Buneman, U. Pennsylvania, USA
Wesley Chu, U. California, Los Angeles, USA
Alessandro D'Atri, U. L'Aquila, Italy
Dov Dori, Technion, Israel
Oren Etzioni, U. Washington, USA
Christos Faloutsos, U. Maryland, USA
Mark Fox, U. Toronto, Canada
Ophir Frieder, George Mason University, USA
Les Gasser, U. Southern California, USA
Yossi Gil, Technion, Israel
Tomasz Imielinski, Rutgers University, USA
Alfons Kemper, Aachen, Germany
Fred Lochovsky, U. of Science and Technology, Hong Kong
Dennis McLeod, U. Southern California, USA
John Mylopoulos, U. Toronto, Canada
Gregory Piatetsky-Shapiro, GTE Laboratories, USA
Jeff Rosenschein, Hebrew University, Israel
Doron Rotem, Lawrence Berkeley Laboratories, USA
Amit Sheth, Bellcore, USA
Peretz Shoval, Ben Gurion U., Israel
Avi Silberschatz, U. Texas, Austin and AT&T Bell Labs, USA
Ouri Wolfson, U. Illinois, Chicago, USA

______________________________________________________________________________
-------------------------------------------------------------------------------
REPLIES TO THIS MESSAGE WILL GO ONLY TO THE SENDER

The dbworld alias reaches many people, and should only be used for
messages of general interest to the database community.

Requests to get on or off dbworld should go to listproc@cs.wisc.edu.
to subscribe send
    subscribe dbworld
to unsubscribe send
    unsubscribe dbworld
if your address is going to change
    send an unsubscribe request from the old address (before the change)
    send a subscribe request from the new address (after the change)
to find out more options send
    help
-------------------------------------------------------------------------------