_images/nccidxclean_logo_dark_300.png _images/nccidxclean_logo_300.png

Welcome to NCCIDxClean’s Documentation#

Introduction#

Welcome to NCCIDxClean (NCCID eXtended Clean), a pipeline to further enhance quality, integrity and reusability of the UK’s National COVID Chest Imaging Database (NCCID) clinical data. This is an extension to the original NHSx cleaning pipeline for the clinical data of the database. It has been expanded and adjusted to correct additional systematic inconsistencies in the raw data, such as patient sex, oxygen levels and date values.

Our paper is now available:

Breger, A.*, Selby, I.*, Roberts, M. et al. A pipeline to further enhance quality, integrity and reusability of the NCCID clinical data. Sci Data 10, 493 (2023). https://doi.org/10.1038/s41597-023-02340-7

* Joint first authors

Comparison of the original NCCID cleaning pipeline and NCCIDxClean.

A visual comparison of the original NCCID cleaning pipeline and NCCIDxClean [#myfootnote]_. This figure is a modified version of a figure from our paper, which may be requested above.#

User Guide#

Information about this package, how to customise it, and how if differs from the original NHSx cleaning package.

API Reference#

A reference for the modules and functions in this package.

Indices and Tables#