CluedIn enricher for Danish CVR, which supplies information on companies registered in Denmark. The CVR service allows searching by either a company's CVR number or name.
This repository contains the code and associated tests for enriching Entities and Clues that represent Danish companies.
The enricher creates searches for Organization entity properties which fulfill the following:
- Its vocabulary is mapped to the CluedInOrganization.CodesCVR core vocabulary.
- Its vocabulary is mapped to the CluedInOrganization.OrganizationName core vocabulary, and another property mapped to the CluedInOrganization.AddressCountryCode core vocabulary is either "dk", "danmark" or "denmark".
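The matching rules above can be sketched in a few lines of Python. This is only an illustration, not the enricher's actual implementation: the flat dictionary of properties, the function name `build_cvr_search`, the preference for a CVR number over a name, and the case-insensitive country comparison are all assumptions made for the example.

```python
from typing import Mapping, Optional, Tuple

# Country values the README lists as identifying a Danish organization.
DANISH_COUNTRY_VALUES = {"dk", "danmark", "denmark"}


def build_cvr_search(props: Mapping[str, str]) -> Optional[Tuple[str, str]]:
    """Return ("cvr", number) or ("name", name), or None if no rule matches.

    `props` is a hypothetical flat mapping from core vocabulary key to value.
    """
    # Rule 1: a property mapped to the CVR code vocabulary.
    cvr = (props.get("CluedInOrganization.CodesCVR") or "").strip()
    if cvr:
        return ("cvr", cvr)

    # Rule 2: an organization name plus a Danish country code.
    name = (props.get("CluedInOrganization.OrganizationName") or "").strip()
    country = (props.get("CluedInOrganization.AddressCountryCode") or "").strip()
    # Assumption: the comparison ignores case and surrounding whitespace.
    if name and country.lower() in DANISH_COUNTRY_VALUES:
        return ("name", name)

    return None
```

Under these assumptions, an entity with only a name and the country code "SE" produces no search, while the same name with "Danmark" produces a name search.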
In order to connect to the CVR API, you need to obtain a username and password for the service. A guide in Danish for how to do this can be found at https://data.virk.dk/datakatalog/erhvervsstyrelsen/system-til-system-adgang-til-cvr-data.
To register, send an email to cvrselvbetjening@erst.dk requesting access to the service on behalf of the company that wishes to use the CVR data. CluedIn has a login for development purposes, but in production the customer's own login should be used.
Once this login is obtained, set the app setting Providers.ExternalSearch.CVR.LiveEndPoint to the endpoint of the CVR search API, with your username and password included in the URL.
The endpoint has the following format:
http://<username>:<password>@distribution.virk.dk/cvr-permanent/_search
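Credentials that contain characters such as `@`, `:` or `/` would break a URL of this shape if inserted verbatim. A minimal sketch of assembling the value with percent-encoded credentials follows; the helper name `build_cvr_endpoint` is hypothetical, and whether the enricher decodes percent-encoded credentials is an assumption that should be checked against your environment.

```python
from urllib.parse import quote


def build_cvr_endpoint(username: str, password: str) -> str:
    """Build the endpoint URL in the format shown above.

    Percent-encodes both credentials so reserved characters
    cannot be mistaken for URL delimiters.
    """
    user = quote(username, safe="")
    pwd = quote(password, safe="")
    return f"http://{user}:{pwd}@distribution.virk.dk/cvr-permanent/_search"
```

The resulting string would then be used as the value of Providers.ExternalSearch.CVR.LiveEndPoint.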
To use the CVR External Search with the CluedIn server, add the CluedIn.Enricher.CVR NuGet package to your environment.
A mocked environment is required to run integration and acceptance tests. The mocked environment can be built and run using the following Docker command:
docker-compose up --build -d
Use the following commands to run all unit and integration tests within the repository:
dotnet test .\ExternalSearch.CVR.sln --filter Unit
dotnet test .\ExternalSearch.CVR.sln --filter Integration
To run the Pester acceptance tests:
Invoke-Pester
To review the WireMock HTTP proxy logs:
docker-compose logs wiremock
CluedIn is the Cloud-native Master Data Management Platform that brings data teams together, enabling them to deliver the foundation of high-quality, trusted data that empowers everyone to make a difference.
We're different because we use enhanced data management techniques like Graph and Zero Upfront Modelling to accelerate the time taken to prepare data to deliver insight by as much as 80%. Installed in as little as 20 minutes from the Azure Marketplace, CluedIn is fully integrated with Microsoft Purview and the full Microsoft Fabric suite, making it the preferred choice for Azure customers.
To learn more about CluedIn, contact the team today.