Data Catalog Specifics
Much as we try to make the interaction with all Data Catalogs identical, there are some oddities.
OpenDataDiscovery
This catalog doesn't really have a hierarchy of database → schema → table. It contains the concept of DataSource, but not all of these even have associated data sets (i.e. things with a schema that PACE can interpret).
In order to provide a useful hierarchy, we interpret every Data Source that has at least one Data Set as a database. Schemas don't really exist in OpenDataDiscovery, but the PACE hierarchy needs them, so we create one dummy schema per database, with an id of "schema" and a name identical to the Data Source name.
The tables work as expected.
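The mapping described above can be sketched in a few lines of Python. This is only an illustration of the rules, not PACE's actual implementation: the DataSource, Database and Schema classes, their field names, and the to_pace_hierarchy function are all hypothetical and chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class DataSource:
    """Hypothetical stand-in for an OpenDataDiscovery Data Source."""
    id: str
    name: str
    # Names of the Data Sets (things with a schema) attached to this source.
    data_sets: list[str] = field(default_factory=list)


@dataclass
class Schema:
    id: str
    name: str
    tables: list[str]


@dataclass
class Database:
    id: str
    name: str
    schemas: list[Schema]


def to_pace_hierarchy(sources: list[DataSource]) -> list[Database]:
    """Map Data Sources onto a database -> schema -> table hierarchy."""
    databases = []
    for source in sources:
        if not source.data_sets:
            # Data Sources without any Data Set are not exposed as databases.
            continue
        # One dummy schema per database: fixed id "schema",
        # name identical to the Data Source name.
        dummy = Schema(id="schema", name=source.name, tables=list(source.data_sets))
        databases.append(Database(id=source.id, name=source.name, schemas=[dummy]))
    return databases
```

For instance, given one Data Source with two Data Sets and one with none, only the first would appear as a database, carrying a single schema whose name matches the database name.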
An example (using the OpenDataDiscovery sample data). I've used jq to simplify the output somewhat.
List Databases
This lists all Data Sources with at least one OpenDataDiscovery Data Set.
List Schemas
There's just one schema in each database, because OpenDataDiscovery doesn't have schemas as such.
List Tables
To list the tables (same as in any other catalog), we need the database and the schema. OpenDataDiscovery uses numeric entity ids, and the one for the BookShop Transactional postgres database is 3. The query below shows that the name of the schema is identical to the name of the database.
Get Data Policy
Retrieving a blueprint data policy works the same as with any other catalog. The table named