Load and return the IDB radiology reports dataset (PHI).
Returns:

| Name | Type | Description |
|------|------|-------------|
| data | Bunch | Dictionary-like object with the following attributes. - data : pandas DataFrame. The data matrix. |
Examples
from msk_cdm.datasets import connect_to_db
from msk_cdm.datasets.impact import load_radiology_reports_idb
# Connect to the database
auth_file = 'path/to/config.txt'
connect_to_db(auth_file=auth_file)
# Load the dataset
df_radiology_reports = load_radiology_reports_idb()
# Access the data
df = df_radiology_reports['data']
# Display the first few rows of the data
print(df.head())
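The return value is described as a dictionary-like object with a `data` attribute. The sketch below illustrates that access pattern without a database connection. It does not use the library's own `Bunch`: `SimpleBunch`, `get_data`, and the `report_id` field are hypothetical names introduced only for illustration, and a plain list of rows stands in for the pandas DataFrame so the sketch runs on its own.

```python
class SimpleBunch(dict):
    """Hypothetical stand-in for a Bunch: a dict whose keys are also attributes."""

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails, so dict methods still work.
        try:
            return self[name]
        except KeyError as exc:
            raise AttributeError(name) from exc


def get_data(bunch):
    """Return the 'data' entry, failing with a clear message if it is absent."""
    if "data" not in bunch:
        raise KeyError("expected a 'data' entry in the returned object")
    return bunch["data"]


# Placeholder rows standing in for the DataFrame (assumption; no real report content).
rows = [{"report_id": 1}, {"report_id": 2}]
reports = SimpleBunch(data=rows)

# Key access and attribute access reach the same object.
assert get_data(reports) is reports.data
print(len(get_data(reports)))
```

Checking for the `data` key before using it keeps a missing or renamed entry from surfacing later as a less obvious error; whether the real return value supports attribute access in exactly this way should be confirmed against the library.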
Source code in msk_cdm/datasets/phi/idb/datasets_phi_idb.py
def load_radiology_reports_idb() -> Bunch:
    """Load and return the IDB radiology reports dataset (PHI).

    Returns:
        data: Dictionary-like object, with the following attributes.

            - **data** : pandas DataFrame
                The data matrix.

    Examples
    --------
    ```python
    from msk_cdm.datasets import connect_to_db
    from msk_cdm.datasets.impact import load_radiology_reports_idb

    # Connect to the database
    auth_file = 'path/to/config.txt'
    connect_to_db(auth_file=auth_file)

    # Load the dataset
    df_radiology_reports = load_radiology_reports_idb()

    # Access the data
    df = df_radiology_reports['data']

    # Display the first few rows of the data
    print(df.head())
    ```
    """
    df = _loader._load_phi_idb_radiology_reports()
    data = Bunch(data=df)
    return data