Data Hub Central Community Edition (DHCCE)
Data Hub Central Community Edition provides a visual way to interact with and experience your true multi-model data hub. You can go from loading data, to integrating, exploring, validating, and exporting curated results using the following components:
This project was formerly known as Envision. This project has been renamed and enhanced to be compatible with Data Hub 5.4.
- Upload to load source csv files as-is into the data hub via drag and drop
- Connect to visually model the specification for how you want to express your integrated business concepts as entities and relationships
- Integrate to map your source data to your model as well as deduplicate and master it
- Explore to view your harmonized data graph along with provenance
- Export to export your newly haromnized entities as .csv files for consumption by BI tools
- Know to search and navigate your semantics ontology.
Integration with MarkLogic Data Hub
Note: If you are using MarkLogic Data Hub 5.2.x then continue to use DHCCE v2.0.5.
You can use Connect to create models that you can visualize in Explore with new as well as existing data hubs. Integrate also works with new as well as existing data hubs. Know provides a visualization for Semantics triples only.
Run the jar
The jar is configured so that you can drop it into a DHF project directory and run it there.
java -jar hub-central-community.jar
Non-standard ports or different User
On first run DHCCE will install its modules into your DHF modules database. This happens before DHCCE tries to read your hub configuration. If you are using non-standard ports you’ll need to make a few changes to an application.properties file. Simply create application.properties next to hub-central-community.jar.
# Change these as needed marklogic.username=admin marklogic.password=admin marklogic.port=8011 marklogic.managePort=8002 marklogic.adminPort=8001
Note: Models will be saved in a directory called ./conceptConnectorModels, which can be found sibling to the hub-central-community.jar file by default. You can move your existing models here or see below for how to specify a different models directory.
If you need to point it at another folder where the DHF is installed, run like so:
java -DdhfDir=/full/path/to/your/datahub -jar hub-central-community.jar
If you have existing Connect models you’d like to use you can also specify a different models directory:
java -DdhfDir=/full/path/to/your/datahub -DmodelsDir=/full/path/to/your/models/dir -jar hub-central-community.jar
And if you need to specify the DHF environment (it defaults to local):
java -DdhfEnv=prod -DdhfDir=/full/path/to/your/datahub -DmodelsDir=/full/path/to/your/models/dir -jar hub-central-community.jar