AI Operations

Datasets

Curated, de-identified imaging corpora used for training and validation. Lineage tracked end-to-end.

Total datasets
48
Total volume
94.2 TB
Annotated images
12.4M
IRB approvals
32
Active datasets
DatasetModalityStudiesSizeLabel coverageSplitLicense
Chest CT Curated v8
ds-chest-ct-v8
CT412,88418.4 TB
98.2%
train/val/test 80/10/10BAA + IRB
Wrist X-Ray v4
ds-wrist-xr-v4
XR84,212412 GB
99.1%
75/15/10BAA + IRB
Brain MRI Multi-site v6
ds-brain-mr-v6
MR94,1188.2 TB
96.8%
80/10/10BAA + IRB
Mammography Screening v3
ds-mammo-v3
MG208,4023.1 TB
99.4%
80/10/10BAA + IRB
Abdomen CT Segmentation v2
ds-abd-ct-v2
CT32,5082.8 TB
94.0%
70/15/15BAA + IRB
Demographics balance · Chest CT v8
Female48%
Male51%
Non-binary1%
<40 yrs18%
40-6552%
65+30%
Asian24%
Black18%
Hispanic21%
White33%
Other4%
Lineage · last 90 days
  1. Today
    ds-chest-ct-v8 → FractureNet v4.2.1 training
  2. May 18
    Added 4,212 studies from Singapore General
  3. May 12
    Re-labeled 1,801 priors after RADPEER session
  4. May 04
    Compliance scan: 0 PHI leaks, attestation #2401 signed
  5. Apr 28
    Bias audit: gender parity within 4pp, ethnicity within 6pp
  6. Apr 19
    IRB amendment approved (UCSF #2024-3082)