<source data query>
To train a data mining model and create predictions from a mining model, you need to access data that is external to the Microsoft SQL Server 2005 Analysis Services (SSAS) database. You use the <source data query> clause in Data Mining Extensions (DMX) to define the external data. The INSERT, SELECT FROM <model> PREDICTION JOIN (DMX), and SELECT FROM NATURAL PREDICTION JOIN statements all use <source data query>.
Query types
The three most common ways to specify source data are:
OPENQUERY (DMX)
This statement queries data that is external to an instance of Analysis Services, by using an existing data source.While OPENQUERY is similar in function to OPENROWSET, OPENQUERY has the following benefits:
- A DMX query is much easier to write with OPENQUERY. Instead of creating a new connection string every time that you write a query, you can take advantage of the existing connection string in the data source. The data source object can also control data access for individual users.
- The administrator has more control over how the data on the server is accessed. For example, the administrator can manage which providers are loaded into the server and which external data can be accessed.
- OPENROWSET (DMX)
This statement queries data that is external to an instance of Analysis Services, by using an existing data source.
- SHAPE (DMX)
This statement queries multiple data sources to create a nested table. By using SHAPE, you can combine data from multiple sources into a single hierarchical table. This lets you take advantage of the ability of Analysis Services to nest tables by imbedding a table within a table.
To specify the source data, you can also use the following options:
- Any valid DMX statement
- Any valid Multidimensional Expressions (MDX) statement
- A table that returns a stored procedure
- An XMLA rowset
- A rowset parameter
See Also
Reference
Data Mining Extensions (DMX) Data Manipulation Statements
Data Mining Extensions (DMX) Statement Reference