datagov-catalog

New Data.gov catalog UI

Local Development

Copy the sample environment file before starting the app: cp .env.sample .env.
Update values in .env as needed for your local services; the file is ignored by Git.

Local Accessibility Testing

We use pa11y-ci for accessibility testing, which uses node to run the tests. In order to properly run the tests the container must be running and have the test data loaded in.

Install the dependencies: npm install
Load the test data if it hasn't been loaded yet: make load-test-data
Run pa11y tests: make test-pa11y

Poetry version used in CI

CI uses the latest Poetry release.
- Update your local Poetry to the latest to match CI:
  - make poetry-update

Harvest Database Configuration

This application reuses the harvest database defined in the datagov-harvester repository. The SQLAlchemy models have been duplicated locally in app/models.py for isolation.
Interact with the shared DB through CatalogDBInterface (app/database/interface.py), which mirrors the logic in the harvester repo and keeps query semantics consistent between apps.

table "dataset_view_count" seeding

This table stores the view count records for each dataset slug, and joined on dataset view refresh to fill the popularity column.
The data is primarily populated from external sources such as Google Analytics.
For testing purposes, the table can be seeded using this SQL script.

CREATE OR REPLACE FUNCTION public.generate_popularity()
RETURNS integer
LANGUAGE plpgsql
VOLATILE
AS $$
BEGIN
  RETURN CASE
    WHEN random() < 0.80 THEN (random() * 51)::integer                    -- 80%: 0-50
    WHEN random() < 0.90 THEN (51 + random() * 50)::integer               -- 10%: 51-100
    WHEN random() < 0.95 THEN (101 + random() * 900)::integer             -- 5%: 101-1000
    ELSE (1001 + random() * 4000)::integer                                -- 5%: 1001-5000
  END;
END;
$$;

-- To seed the table with fake view count,
-- delete all rows first
TRUNCATE TABLE dataset_view_count;

-- seed the table
INSERT INTO dataset_view_count (id, dataset_slug, view_count)
SELECT
    gen_random_uuid()::VARCHAR(36)  AS id,
    slug                            AS dataset_slug,
    generate_popularity()           AS view_count
FROM dataset;

Name		Name	Last commit message	Last commit date
Latest commit History 769 Commits
.github		.github
app		app
config		config
docs		docs
proxy		proxy
shared		shared
tests		tests
tools		tools
.env		.env
.env.sample		.env.sample
.gitignore		.gitignore
.pa11yci		.pa11yci
.profile		.profile
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
app-start.sh		app-start.sh
docker-compose.yml		docker-compose.yml
docker-compose_debug.yml		docker-compose_debug.yml
docker-compose_prod.yml		docker-compose_prod.yml
gunicorn.conf.py		gunicorn.conf.py
manifest.yml		manifest.yml
package.json		package.json
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run.py		run.py
runtime.txt		runtime.txt
vars.development.yml		vars.development.yml
vars.prod.yml		vars.prod.yml
vars.staging.yml		vars.staging.yml
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

datagov-catalog

Local Development

Local Accessibility Testing

Poetry version used in CI

Harvest Database Configuration

table "dataset_view_count" seeding

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 8

Languages

GSA/datagov-catalog

Folders and files

Latest commit

History

Repository files navigation

datagov-catalog

Local Development

Local Accessibility Testing

Poetry version used in CI

Harvest Database Configuration

table "dataset_view_count" seeding

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 8

Languages

Packages