Development#

This page describes the development of sdmx. Contributions are welcome!

Code style#

  • Apply the following to new or modified code:

    isort -rc . && black . && mypy . && flake8
    

    Respectively, these:

    • isort: sort import lines at the top of code files in a consistent way, using isort.

    • black: apply black code style.

    • mypy: check typing using mypy.

    • flake8: check code style against PEP 8 using flake8.

  • Follow the 7 rules of a great Git commit message.

  • Write docstrings in the numpydoc style.

Test specimens#

New in version 2.0.

A variety of specimens—example files from real web services, or published with the standards—are used to test that sdmx correctly reads and writes the different SDMX message formats. Since v2.0, specimens are stored in the separate sdmx-test-data repository.

Running the test suite requires these files. To retrieve them, use one of the following methods:

  1. Obtain the files by one of two methods:

    1. Clone khaeru/sdmx-test-data:

      $ git clone git@github.com:khaeru/sdmx-test-data.git
      
    2. Download https://github.com/khaeru/sdmx-test-data/archive/.zip

  2. Indicate where pytest can find the files, by one of two methods:

    1. Set the SDMX_TEST_DATA environment variable:

      # Set the variable only for one command
      $ SDMX_TEST_DATA=/path/to/files pytest
      
      # Export the variable to the environment
      $ export SDMX_TEST_DATA
      $ pytest
      
    2. Give the option --sdmx-test-data=<PATH> when invoking pytest:

      $ pytest --sdmx-test-data=/path/to/files
      

The files are:

  • Arranged in directories with names matching particular sources in sources.json.

  • Named with:

    • Certain keywords:

      • -structure: a structure message, often associated with a file with a similar name containing a data message.

      • ts: time-series data, i.e. with a TimeDimensions at the level of individual Observations.

      • xs: cross-sectional data arranged in other ways.

      • flat: flat DataSets with all Dimensions at the Observation level.

      • ss: structure-specific data messages.

    • In some cases, the query string or data flow/structure ID as the file name.

    • Hyphens - instead of underscores _.

Releasing#

Before releasing, check:

Address any failures before releasing.

  1. Create a new branch:

    $ git checkout -v release/X.Y.Z
    
  2. Edit doc/whatsnew.rst. Comment the heading “Next release”, then insert another heading below it, at the same level, with the version number and date.

  3. Make a commit with a message like “Mark vX.Y.Z in doc/whatsnew”.

  4. Tag the version as a release candidate, i.e. with a rcN suffix, and push:

    $ git tag vX.Y.ZrcN
    $ git push --tags --set-upstream origin release/X.Y.Z
    
  5. Open a pull request with the title “Release vX.Y.Z” using this branch. Check:

    If needed, address any warnings or errors that appear and then continue from step (3), i.e. make (a) new commit(s) and tag, incrementing the release candidate number, e.g. from rc1 to rc2.

  6. Merge the PR using the “rebase and merge” method.

  7. (optional) Tag the release itself and push:

    $ git tag vX.Y.Z
    $ git push --tags origin main
    

    This step (but not step (3)) can also be performed directly on GitHub; see (7), next.

  8. Visit https://github.com/khaeru/sdmx/releases and mark the new release: either using the pushed tag from (7), or by creating the tag and release simultaneously.

  9. Check at https://github.com/khaeru/sdmx/actions?query=workflow:publish and https://pypi.org/project/sdmx1/ that the distributions are published.

Internal code reference#

testing: Testing utilities#

class sdmx.testing.MessageTest[source]#

Bases: object

Base class for tests of specific specimen files.

directory: Union[str, pathlib.Path] = PosixPath('.')#
filename: str#
msg(path)[source]#
path(test_data_path)[source]#
class sdmx.testing.SpecimenCollection(base_path)[source]#

Bases: object

Collection of test specimens.

as_params(format=None, kind=None, marks={})[source]#

Generate pytest.param() from specimens.

One param() is generated for each specimen that matches the format and kind arguments (if any). Marks are attached to each param from marks, wherein the keys are partial paths.

expected_data(path)[source]#

Return the expected to_pandas() result for the specimen path.

sdmx.testing.XFAIL = {'unsupported': MarkDecorator(mark=Mark(name='xfail', args=(), kwargs={'strict': True, 'reason': 'Not implemented by service', 'raises': (<class 'requests.exceptions.HTTPError'>, <class 'NotImplementedError'>, <class 'ValueError'>)})), 503: MarkDecorator(mark=Mark(name='xfail', args=(), kwargs={'raises': <class 'requests.exceptions.HTTPError'>, 'reason': '503 Server Error: Service Unavailable'}))}#

Marks for use below.

sdmx.testing.assert_pd_equal(left, right, **kwargs)[source]#

Assert equality of two pandas objects.

sdmx.testing.generate_endpoint_tests(metafunc)[source]#

pytest hook for parametrizing tests that need an “endpoint” fixture.

This function relies on the DataSourceTest base class defined in test_sources. It:

  • Generates one parametrization for every Resource (= REST API endpoint).

  • Applies pytest “xfail” (expected failure) marks according to:

    1. Source.supports, i.e. if the particular source is marked as not supporting certain endpoints, the test is expected to fail.

    2. DataSourceTest.xfail, any other failures defined on the source test class (e.g. DataSourceTest subclass).

    3. DataSourceTest.xfail_common, common failures.

sdmx.testing.parametrize_specimens(metafunc)[source]#

Handle @pytest.mark.parametrize_specimens(…).

sdmx.testing.pytest_addoption(parser)[source]#

Add the --sdmx-test-data command-line option to pytest.

sdmx.testing.pytest_configure(config)[source]#

Handle the --sdmx-test-data command-line option.

sdmx.testing.pytest_generate_tests(metafunc)[source]#

Generate tests.

Calls both parametrize_specimens() and generate_endpoint_tests().

sdmx.testing.specimen(pytestconfig)[source]#

Fixture: the SpecimenCollection.

sdmx.testing.test_data_path(pytestconfig)[source]#

Fixture: the Path given as –sdmx-test-data.

sdmx.testing.testsource()[source]#

Fixture: the Source.id of a non-existent data source.

util: Utilities#

class sdmx.util.BaseModel[source]

Bases: pydantic.main.BaseModel

Common settings for pydantic.BaseModel in sdmx.

class Config[source]

Bases: object

copy_on_model_validation = 'none'
validate_assignment = True
sdmx.util.summarize_dictlike(dl, maxwidth=72)[source]

Return a string summary of the DictLike contents.

sdmx.util.validate_dictlike(cls)[source]

Adjust cls so that its DictLike members are validated.

This is necessary because DictLike is a subclass of dict, and so pydantic fails to call __get_validators__() and register those on BaseModels which include DictLike members.

Inline TODOs#

Todo

Support selection of language for conversion of InternationalString.

(The original entry is located in /home/docs/checkouts/readthedocs.org/user_builds/sdmx1/checkouts/v2.7.0/doc/api/writer.rst, line 61.)

Todo

Currently other functions in writer.xml all pass the style argument to this function. As an enhancement, allow user or automatic selection of different reference styles.

(The original entry is located in /home/docs/checkouts/readthedocs.org/user_builds/sdmx1/envs/v2.7.0/lib/python3.7/site-packages/sdmx/writer/xml.py:docstring of sdmx.writer.xml.reference, line 3.)