Primoz
cdff4da930
Merge branch 'ml_pipeline' of https://repo.ijs.si/junoslukan/straw2analysis into ml_pipeline
2022-10-17 22:15:17 +02:00
Primoz
ad5f50babe
Correctly imputed data uploaded on STRAW (all targets)
2022-10-12 12:48:10 +02:00
Primoz
466cd3dc23
Processing of a newly cleaned script. Addition of two ML models. And modifications with one hot encoding.
2022-10-10 16:47:00 +02:00
Primoz
27b2282ee0
Datasets (phone&E4 features) and Jupyter script of regression models.
2022-08-24 16:18:40 +02:00
junos
a8fd96d2f1
Add analysis using RAPIDS.
2022-08-23 16:41:41 +02:00
junos
e33a49c9fc
Add a demo of pipeline.
2021-11-17 10:44:49 +01:00
junos
d34c2ec5e9
Merge branch 'ambient' into ml_pipeline
2021-11-17 10:39:55 +01:00
junos
005b09cfdf
[WIP] Fix tests to use pyprojroot.
2021-10-29 12:07:12 +02:00
junos
6fc0d962ae
Remove low values of pressure.
2021-10-22 18:09:17 +02:00
junos
92fbda242b
Explore barometer and temperature data.
...
Add docstrings to models.
2021-10-14 17:59:33 +02:00
junos
6302a0f0d9
Merge ambient sensors into one file.
...
Explore barometer sensor data for one phone.
2021-10-13 16:57:38 +02:00
junos
a63a7eac99
[WIP] Add a test for SensorFeatures.
...
Additional analysis for adherence.
Small corrections.
2021-10-13 13:39:58 +02:00
junos
b8c7606664
Add an option to read cached labels from a file.
2021-09-15 15:45:49 +02:00
junos
ed062d25ee
Add export capabilities to labels.py.
2021-09-15 15:36:36 +02:00
junos
20748890a8
Further refactor by moving helper functions.
2021-09-15 15:14:54 +02:00
junos
28699a0fdf
Enable reading features from csv files.
2021-09-14 17:42:34 +02:00
junos
af9e81fe40
Document the SensorFeatures class and its __init__ method.
2021-09-13 17:43:47 +02:00
junos
b19eebbb92
Refactor machine_learning/pipeline.py by defining one class by file.
2021-09-13 11:41:57 +02:00
junos
c1bb4ddf0f
Save calculated features to csv files.
2021-08-23 16:36:26 +02:00
junos
0152fbe4ac
Delete the leftover class.
...
Add more prints.
2021-08-23 16:09:23 +02:00
junos
3611fc76f7
Fill NaNs after merging all features.
2021-08-21 19:48:57 +02:00
junos
ee30c042ea
Fill NaNs introduced in merge for proximity.
2021-08-21 19:40:42 +02:00
junos
a71e132edf
Prepare the first full pipeline.
2021-08-21 19:04:09 +02:00
junos
24c4bef7e2
Print some more messages.
2021-08-21 19:03:44 +02:00
junos
11381d6447
Add some print statements for monitoring progress.
2021-08-21 18:54:02 +02:00
junos
d19995385d
Account for the case when there is no data for days with labels.
2021-08-21 18:49:57 +02:00
junos
f73f86486a
Fill communication features with appropriate values.
2021-08-21 18:28:22 +02:00
junos
aed73bb7ed
Add fill values for communication for rows with no calls/smses.
2021-08-21 18:17:58 +02:00
junos
8507ff5761
Check for NaNs in the data, since sklearn.LinearRegression cannot handle them.
2021-08-21 17:46:00 +02:00
junos
0b85ee8fdc
Merge branch 'master' into ml_pipeline
2021-08-21 17:37:45 +02:00
junos
e2e268148d
Fill in 0.5 for undefined ratio.
...
When there are no calls and no smses (of a particular type), the ratio is undefined. But since their number is the same, I argue that the ratio can represent that with a 0.5, similarly to the case where no_calls_all = no_sms_all != 0.
2021-08-21 17:33:31 +02:00
junos
00015a3b8d
Fill in zeroes when joining or unstacking.
...
If there are no calls or smses for a particular day, there is no corresponding row in the features dataframe. When joining these, however, NaNs were introduced. Since a value of 0 is meaningful for all of these features, replace NaNs with 0's.
2021-08-21 17:31:15 +02:00
junos
065cd4347e
[WIP] Add a class for model validation.
2021-08-20 19:44:50 +02:00
junos
0b98d59aad
Aggregate labels using grouping_variable.
2021-08-20 19:17:22 +02:00
junos
08fdec34f1
Merge features into a common df.
...
But first, group communication by the grouping_variable.
2021-08-20 17:59:00 +02:00
junos
72b16af75c
Make group_by consistent with communication.
2021-08-20 17:52:31 +02:00
junos
d6337e82ac
Merge branch 'master' into ml_pipeline
2021-08-20 17:43:53 +02:00
junos
9a319ac6e5
Add an option to group on other than just participant_id.
2021-08-20 17:41:12 +02:00
junos
6592612db7
Add a similar class for labels.
2021-08-19 17:44:04 +02:00
junos
97c693d252
Add a getter for communication data.
2021-08-19 17:36:26 +02:00
junos
93f136b080
Add a method to get communication features.
2021-08-19 17:32:02 +02:00
junos
5be3e82797
Accept nested feature configuration.
...
To do this, pass a dict as parameters to SensorFeatures class, rather than actually reading the object from yaml file.
2021-08-19 17:23:23 +02:00
junos
429aa43bd1
Add communication features to pipeline.
2021-08-19 17:05:44 +02:00
junos
0ed34e97b3
Convert the class into a YAML object.
...
Add an example config file and demonstrate its usage in ex_ml_pipeline.ipynb.
2021-08-19 16:31:42 +02:00
junos
52664eb40b
Implement getters.
2021-08-19 11:47:59 +02:00
junos
de92e1309d
Merge branch 'master' into ml_pipeline
2021-08-18 17:30:36 +02:00
junos
777e6f0a58
calls_sms_features() now returns all communication features.
2021-08-18 15:41:47 +02:00
junos
2d78aacd18
Compile a list of contact features and add a test.
2021-08-18 15:35:42 +02:00
junos
c88336481e
Add a test for SMS features.
2021-08-18 15:28:46 +02:00
junos
1bc996413e
Clarify names for no_all calls/sms feature.
...
Add another test.
2021-08-18 15:23:30 +02:00