This glossary attempts to provide unambiguous definitions for terms used in the project.


A robust runtime environment that is capable of executing process definitions created for the Business Process Execution Language (BPEL) standard. See for more information.
Advanced Data Mining and Integration Research for Europe, a vision and novel set of concepts introduced by this project.
An implementation of the SOAP ("Simple Object Access Protocol") submission to W3C. See for more information.
Advanced Models, Applications and Software Systems for High Performance Computing. See for more information.
Austrian Grid
Consortium combines Austria's leading researchers in advanced computing technologies with well-recognized partners in grid-dependant application areas. See for more information.


BSD License
Berkeley Software Distribution License. See for more information.


In a service-oriented architecture, a client is a software component or other program unit that makes use of the capabilities offered by a service.
One of the computational items involved in DMI processes, namely: data collections, data resources, functions, gateways, libraries, PE, PE instances and (named) types.
A description of the six phases of data mining recognised by the CRoss Industry Standard Process for Data Mining, a forum of data mining systems vendors and application developers. This process model is applicable in diverse industry sectors and the six phases are usually undertaken iteratively. See for more information.


Database Access and Integration Services Working Group. A group at OGF which develops standards for managing structured data resources via web services, See for more information.
Data collection
A collection of data, e.g. a file, list of files, a relational table, a set of relational tables, and XML document, an RDF database, etc. that is intended to be used together.
Data integration
Data integration is the process of combining data residing at different sources and providing the user with a unified view of these data. This process emerges in a variety of situations both commercial (when two similar companies need to merge their databases) and scientific (combining research results from different bioinformatics repositories). See for more information.
Data mining
Data mining, also called Knowledge-Discovery in Databases (KDD) is the process of automatically searching large volumes of data for patterns using tools such as classification, association rule mining, clustering, etc. See for more information.
A past EU project. See for more information.
Data resource
A computational service that stores and manages a set of data collections, e.g. a file system or DBMS.
Dynamic Bit Encoding ­ a new method based on BESS (bit encoded sparse structure) for indexing multidimensional data. Used in data cube construction developed within the GridMiner Project.
A commercial data mining system for interactive mining of multiple-level knowledge in large relational databases. See for more information.
Database Management System, computer software designed for the purpose of managing databases.
Data Format Description Language Working Group. See for more information.
Data Integration Applications: Linking Organizations to Gain Understanding and Experience, a series of international workshops, inter-site visits, and research sharing to realise broad standards and interoperating core software toolkits designed to enable global data access and integration (DAI). See for more information.
DIDC engineer
A person whose primary interest is delivering better distributed computational systems by taking into account, often dynamically, the properties of the data they are processing.
Data Intensive Systems Process Engineering Language, ADMIRE's canonical form of DMQL.
A U.K. project developing a service-oriented data mining system.
Data Mining Group. An independent, vendor led group which develops data mining standards, such as the Predictive Model Markup Language (PMML). See for more information.
Data Mining and Integration ­ an integrated approach to specification and implementation of data mining and integration processes introduced by ADMIRE.
DMI expert
A person who specialises in inventing, improving or delivering data mining and data integration capabilities to domain experts and other DMI experts.
DMI Gateway
A computational resource, typically a web service, that will receive requests for DMI-process enactment and will conduct a sequence of steps arranging that enactment or rejecting it.
DMI Portal
A tool designed for a particular group of domain experts that can be used via their browsers; it enables them to choose a DMI-process template, supply its parameters and arrange its enactment.
DMI process
A sequence of computational supported steps required at any phase of the CRISP-DM approach. Generally, such processes are designed using a DMI tool or language by composing DMI-processing elements. They are then sent via a DMI gateway to a DMI-enactment service.
DMI process template
A partially specified DMI process that has been prepared by DMI experts that could fufifil the deployment phase of CRISP-DM. It would be used by a domain expert to submit, e.g. via a DMI portal, a DMI process to a DMI gateway.
DMI processing element (DPE) or processing element (PE)
A primitive or composite software component encapsulating a DMI algorithm and providing for its use in DMI processes. Multiple instances may be used in one process. It has a specified structure of inputs and outputs.
DMI language query processor
Data Mining Query Language; allows declarative specification of data mining goals.
Data Mining Extension, a core component of Microsoft's SQL 2005 server data mining platform.
Domain expert
A person whose primary expertise is in a discipline that uses data mining and integration.
Data Pre-Processing, a phase of the knowledge discovery process.
A Distributed Query Processing (DQP) system is an example of a data integration component and can potentially provide effective declarative support for service orchestration as well as data integration. See for more information.
D3P Framework
Data statistics, Data understanding, Data Preprocessing ­ a novel framework for data quality management on the Grid developed by the University of Vienna.


Enterprise Grid Alliance, EGA previously sponsored the development of solutions for Grid Computing in the enterprise. See for more information. EGA merged with GGF to form the OGF.
An EU project on Semantic Web technologies. See for more details.



An EU project demonstrating how Grid technologies can be used to transform healthcare and enable Europe to lead that transformation. See for more information.
Gene Ontology
The Gene Ontology project provides a controlled vocabulary to describe gene and gene product attributes in any organism. See for more information.
Global Grid Forum, a community forum that promotes and supports the development, deployment, and implementation of Grid technologies. See for more information. The GGF has been succeeded by the Open Grid Forum (OGF).
Light Weight Middleware for Grid Computing. See for more information.
Globus Alliance
The Globus Alliance is an association dedicated to developing fundamental technologies needed to build grid computing infrastructures. See for more information.
GridMiner Assistant, an ontology based tool for automated DMI workflow construction at high abstraction level.
Grid Resources for Industrial Applications. See for more information.
A system that is concerned with the integration, virtualization, and management of services and resources in a distributed, heterogeneous environment that supports collections of users and resources (virtual organizations) across traditional administrative and organizational domains (real organisations).
Grid Computing
Grid Computing is an emerging computing model that treats all resources as a collection of manageable entities with common interfaces to such functionality as lifetime management, discoverable properties and accessibility via open protocols. See for more information.
Grid-based data-mining infrastructure including software services for sequential, parallel and distributed data and text mining, On-Line Analytical Processing, data integration on mediator/wrapper principles, data quality monitoring and improving based on advanced data statistics, and visualisation of the results of data analysis and surveying tasks. See for more information.
Grid Service
The formal definition of this term is deprecated. In general use, a Grid service is a Web service that is designed to operate in a Grid environment, and meets the requirements of the Grid(s) in which it participates.
A semantic registry extending the capabilities of the UDDI standard by enabling the registration of non-web-service 'invocable activities' (e.g. scripts and java methods) and the attachment of information (annotations).



Information Integrator
See for more information.
See for more information.
Infrastructure Service Bus
In Silico Proteome Integrated Data Environment Resource. See for more information.


Job Submission Description Language. See for more information.


Knowledge Discovery in Databases. See Data Mining
Open-source scientific workflow system that allows scientists to design scientific workflows and execute them efficiently using emerging Grid-based approaches to distributed computation. See for more information.
Knowledge Web
A 4 year Network of Excellence project funded by the European Commission 6th Framework Programme. See for more information.
K-WF Grid
The main objective of this project is to enable the knowledge-based support of workflow construction and execution in a Grid computing environment. See for more information.


A named collection of PE, functions and types designed to work well together to perform a coherent set of DMI tasks.


Data that describes data. Metadata may include references to schemas, provenance, and information quality.
A project at the University of Georgia, U.S.A. addressing Semantic Web technologies. See for more information.
The MKBEEM project (Multilingual Knowledge Based European Electronic Marketplace) develops a mediation system which adapts the language and the trading conditions of an Internet sales point according to its international customership. See for more details.
mock objects
A technique for improving the design of code within test-driven development. See for more information.
A UK eScience project. See for more information.


The aim of the NeOn project is to create the service-oriented, open infrastructure, and associated methodology, to support the development life-cycle of such a new generation of semantic applications. See for more information.
See for more information.


Organization for the Advancement of Structured Information Standards. See for more information.
Is an environment for the annotation of pre-existing Grid Services and development of new complex Semantic Grid Services. See for more information.
Open Grid Forum. See for more information.
Open Grid Services Architecture.
OGSA Basic Execution Services Working Group. See for more information.
The objective of the GGF OGSA ByteIO working group is to define a minimal Web Service interface for providing "POSIX-like" file functionality.
Open Grid Service Architecture Data Access and Integration. The aim of the OGSA-DAI project is to develop middleware to assist with access and integration of data from separate sources via the Grid. See for more information.
The OGSA Data WG is producing a data architecture as part of the larger OGSA effort. This data architecture describes the data services in the OGSA architecture and explains how they can be orchestrated to implement a range of data-oriented capabilities. See for more information.
OGSA Secure Channel
See for more information.
OGSA WS-RF Basic Profile
See for more information.
See DQP.
On-Line Analytical Processing, is an approach to quickly providing answers to analytical queries that are dimensional in nature. See for more information.
OLE DB for Data Mining
Object Linking and Embedding Database. Defines a data mining API to Microsoft's OLE DB environment and defines the basis of data mining as the data mining model virtual object. See for more information.
The Open Middleware Infrastructure Institute UK aim to provide supported and sustainable software for the UK e-Science community and its collaborators. See for more information.
A project aiming to producing the technological infrastructure for the rapid prototyping and development of knowledge-intensive distributed open services for the Semantic Grid. See for more information.
Web Ontology Language. See for more information.
Semantic Markup for Web Services. See for more information.
A java-based system, which assists researchers in managing, searching and sharing ontology metadata in a peer-to-peer network. See for more information.


A commonly recurring structure detectable after abstraction that occurs in DMI processes, these may receive special treatment.
PE instance
An instance of a primitive or composite PE that may be used, i.e. executed, during a DMI-process enactment.
Predictive Model Markup Language is an XML-based language which provides a way for applications to define statistical and data mining models and to share models between PMML compliant applications. See for more information.


Quality of Service. See for more information.


A free software environment for statistical computing and graphics. See for more information.
Resource Description Framework, a language for representing information about resources in the World Wide Web. See for more information.
Reference Model WG
Reference Model Working Group. See for more information.
A service that stores descriptions of DMI components. These descriptions include information used by humans and information used by DMI tools and enactment services.
A service that accepts DMI process definitions, preserves them and supports their shared use.
In OGSA, a resource is an entity that is useful in a Grid environment. The term usually encompasses entities that are pooled (e.g. hosts, software licenses, IP addresses, etc.) or that provide a given capacity (e.g. disks, networks, memory, databases). However, entities such as processes, print jobs, database query results and virtual organizations may also be represented and handled as resources.See for more information.
RDF(S) Grid Access Bridge ­ a component of the OntoGrid project platform.


SAS Enterprise Miner
A commercial data mining system. See for m ore information.
Semantic Annotations for WSDL and XML Schema. See for more information.
The Science Environment for Ecological Knowledge (SEEK) is a system designed to facilitate not only data acquisition and archiving, but integrating, transforming, analyzing, and synthesizing ecological and biodiversity data that was previously intractable. See for more information.
Single European Employment Market Place. EU project enhancing the already provided e-Employment services by developing a highly advanced, ontology-based, peer-to-peer (P2P) based interoperability infrastructure that will be based on Service Oriented Architecture and will offer access to semantically enriched employment services.
Semantic annotation
Used for providing formal descriptions of the entities involved in the DMI processes.
Semantic Discovery
Mechanisms aiming to improving the completeness and accuracy of search techniques using semantic technologies.
Semantic Grid
Is an extension of the current Grid in which information and services are given well-defined meaning through machine-processable descriptions which maximize the potential for sharing and reuse. See for more information.
Semantic Matching
Exploits the properties of formal descriptions for solving a particular problem associated to the usage of the entities involved in the DMI processes.
Semantic mediation
Techniques aiming to hide a logical model behind a conceptual model (ontology) to which it is explicitly mapped.
Semantic registries
Components that provide mechanisms for publishing semantic annotated information.
Semantic Web
An evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a form that can be understood, interpreted and used by software agents, thus permitting them to find, share and integrate information more easily. See for more information.
Service Oriented Architecture
This term is increasingly used to refer to an architectural style of building reliable distributed systems that deliver functionality as services, with the additional emphasis on loose coupling between interacting services. An SOA can be based on Web services (which provide basic interoperability), but it may use other technologies instead.
A dynamically created service that allows access to information about the enactment of a DMI process.
EU project providing the basis for a cost-effective, global information system for meteorology and the environment. See for more information.
Semantic-OGSA. See for more information.
Query Language for RDF. See for more information.
The underlying mechanism used for composing DMI-processing elements to generate composite DPE, DMI-process templates and DMI-processes. A stream takes a sequence of values of a specfified type from an output of one DPE to an input of another DPE preserving order. It may move data with any granularity and may provide buffering.
Semantic Web Services Language. See for more information.
Semantic Web Services Ontology. See for more information.


Aims to provide a language and software tools to facilitate easy use of workflow and distributed compute technology within the eScience community. See for more information.
An open source problem solving environment developed at Cardiff University that combines an intuitive visual interface with powerful data analysis tools. See for more information.
A formal description of the structure of a value that enables a limited set of operations on the value. ADMIRE deals simultaneously with three type systems: structural types defining the representation of DMI-process definitions, DMI types that describe values that are used in data mining and application domain types that are used in a specfic application domain.


See for more information.
See for more information.



A gateway registry for selecting, retrieving and exploring semantic data and ontologies.
Web Service
A software system designed to support interoperable machine- or application-oriented interaction over a network. A Web service has an interface described in a machine-processable format (specifically WSDL). Other systems interact with the Web service in a manner prescribed by its description using SOAP messages, typically conveyed using HTTP with an XML serialization in conjunction with other Web-related standards.
Workflow Enactment Engine Project is an easy to use and manage workflow enactment service for WS-I/WS-RF services. See for more information.
Is a design pattern where a piece of code allows classes to work together that normally could not because of incompatible interfaces. See for more information.
Web Services Business Process Execution Language. See for more information.
A specification defining core operations required to access structured data resources in a Grid environment using web services. See
A framework for defining ontology access service interfaces using the WS-DAI vocabulary and for defining the set of messages, properties and behaviors needed to provide ontology access to ontologies implemented in RDF(S)
A rendering of the WS-DAI specification for accessing relational data resources. See
A rendering of the WS-DAI specification for accessing XML data resources. See
Web Services Description Language. See for more information.
Web Service Semantics. See for more information.
Web Service Modeling Language. See for more inforamtion.
Web Service Modeling Ontology. See for more information.
Web Service Execution Environment. See for more information.
Web Services Notification. See for more information.
WS-Naming WG
The objective of this working group is to work on two specifications (RNS and WS-NR) to realize a three level name space for OGSA and to produce WS-Naming naming specification based on WS-Addressing. Thus, both RNS and WS-Naming must be combinable with OGSA Basic Profile. See for more information.
Web Services Resource Framework. See for more information.


A query language able to handle and join in a single query data sources exposing XML, relational or legacy data formats. See for more information.