Tables Overview
Database Structure
The SALUS healthcare research database consists of the following core tables:
1. Patients Table
- Description: Contains patient demographic information
- Key Fields: Patient ID, Sex, Date of Birth
- Record Count: 3466137
Information
- Patient IDs are anonymized and unique across all tables
- All dates are stored in YYYY-MM-DD format
- Missing values are coded as NA
Final Cleaning Notes
- Patient IDs should be anonymized and standardized to SALUS ID (SID) format
- Column names should be mapped to standardized naming conventions for research use:
- BRTHDD -> DATE_BIRTH
- ORDDD -> DATE_VISIT
- INQRFLAG -> SURVEY_TYPE
- TESTCD -> TEST_CODE
- Missing values standardized to NA (., -, _, etc.)
2. Visits Table
- Description: Tracks all healthcare visit encounters
- Key Fields: Patient ID, Date of Visit, Count of Visit
- Record Count: ##,### records
3. Survey Table
- Description: Contains patient-reported outcomes and survey responses
- Primary Key: survey_id
- Record Count: 25,000 records
- Key Fields: Survey type, responses, completion dates, scores
4. Measure Table
- Description: Stores clinical measurements and vital signs
- Primary Key: measure_id
- Record Count: 100,000 records
- Key Fields: Vital signs, BMI, blood pressure, temperature
5. Lab Table
- Description: Records laboratory test results and diagnostics
- Primary Key: lab_id
- Record Count: 75,000 records
- Key Fields: Test types, results, reference ranges, collection dates
6. Doctor Table
- Description: Contains healthcare provider information
- Primary Key: doctor_id
- Record Count: 500 records
- Key Fields: Specialties, credentials, departments, contact details
Each table is carefully structured to maintain referential integrity while capturing comprehensive healthcare data. The tables are interconnected through foreign key relationships to enable complex queries and analysis.