Load and return the MSK-IMPACT treatment timeline dataset (deidentified).
Returns:
Name | Type |
Description |
data |
Bunch
|
Dictionary-like object, with the following attributes.
- data : pandas DataFrame
The data matrix.
- description_columns (Future release) : list
The names of the dataset columns.
- description_dataset (Future release) : str
The full description of the dataset.
- filename (Future release) : str
The path to the location of the data.
|
Examples
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15 | from msk_cdm.datasets import connect_to_db
from msk_cdm.datasets.impact import load_data_timeline_treatment
# Connect to the database
auth_file = 'path/to/config.txt'
connect_to_db(auth_file=auth_file)
# Load the dataset
df_timeline_treatment = load_data_timeline_treatment()
# Access the data
df_treat = df_timeline_treatment['data']
# Display the first few rows of the data
print(df_treat.head())
|
Source code in msk_cdm/datasets/impact/datasets_impact.py
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394 | def load_data_timeline_treatment() -> Bunch:
"""Load and return the MSK-IMPACT treatment timeline dataset (deidentified).
Returns:
data: Dictionary-like object, with the following attributes.
- **data** : pandas DataFrame
The data matrix.
- **description_columns** (Future release) : list
The names of the dataset columns.
- **description_dataset** (Future release) : str
The full description of the dataset.
- **filename** (Future release) : str
The path to the location of the data.
Examples
--------
```python
from msk_cdm.datasets import connect_to_db
from msk_cdm.datasets.impact import load_data_timeline_treatment
# Connect to the database
auth_file = 'path/to/config.txt'
connect_to_db(auth_file=auth_file)
# Load the dataset
df_timeline_treatment = load_data_timeline_treatment()
# Access the data
df_treat = df_timeline_treatment['data']
# Display the first few rows of the data
print(df_treat.head())
```
"""
df = _loader._load_impact_data_timeline_treatment()
output = Bunch(data=df)
return output
|