In this paper we present an infrastructure for conducting data and text mining over distributed data and computational resources. Our approach is based on extending the Discovery Net infrastructure, a gridcomputing
environment for data mining, to allow end users to construct complex
distributed text and data mining workflows. We describe our architecture, data model and visual programming approach and also present a number of text mining examples conducted over biological data to highlight the advantages of our system.
pubs.doc.ic.ac.uk: built & maintained by Ashok Argent-Katwala.