399 Commits (master)

Author SHA1 Message Date
junos 38a405d378 Add index when inserting one row. 2023-05-17 18:13:20 +02:00
junos 2c5a0b4157 Label plot axes. 2023-05-17 16:32:27 +02:00
junos 0409c9e982 Fix format specification. 2023-05-16 17:22:09 +02:00
junos a7446cc34a Specify columns to aggregate and save figures as pdfs. 2023-05-16 17:05:43 +02:00
junos 118e686491 Specify format directly as infer_datetime_format was deprecated. 2023-05-16 17:04:48 +02:00
junos 9417a1b9f1 Do not break markdown lines. 2023-05-16 16:37:34 +02:00
junos 7b5db88f1d Remove and ignore results. 2023-05-16 16:22:29 +02:00
junos 0f8f0b0fb6 Update URL call. 2023-05-16 16:17:53 +02:00
junos 26c7d22b83 Add an option to save figures. 2023-05-16 16:17:06 +02:00
junos 87781840d4 Use concat instead of append which was deprecated. 2023-05-12 16:32:08 +02:00
junos 3091328fc5 Format comments. 2023-05-11 16:51:38 +02:00
junos 055e87dbac Return scores for classification. 2023-05-10 23:51:12 +02:00
junos f58d20ffc2 Update classification runner. 2023-05-10 23:17:44 +02:00
junos 075fdab9ea Select segment and save results. 2023-05-10 23:00:03 +02:00
junos 91e7352480 Thoroughly refactor classification runner. 2023-05-10 22:50:00 +02:00
junos 35c09374dd Free up memory during model building. 2023-05-10 21:44:40 +02:00
junos b505fb2b6a Thoroughly refactor regression runner. 2023-05-10 20:30:51 +02:00
junos 47b1ecdbb9 First format with black and then check with flake8. 2023-05-10 15:29:32 +02:00
junos 24744c288d Extract one step of preparation into a separate function. 2023-05-10 15:28:09 +02:00
junos caeaf03239 Provide data instead of csv input. 2023-05-10 15:20:33 +02:00
junos cd5d8b6a10 Update rapids and add regex=True. Reformat debug_heatmap. 2023-05-10 15:12:27 +02:00
junos 3e38b64b45 Merge branch 'ml_pipeline' 2023-05-10 15:02:17 +02:00
junos 76071fd550 Start using pre-commit hooks. 2023-04-24 15:38:54 +02:00
junos c66e046014 Use methods in helper.py. 2023-04-21 21:41:00 +02:00
junos 48118f125d Reformat ml_pipeline_regression.py 2023-04-21 21:34:54 +02:00
junos 583ee82e80 Add xgboost to dependencies and reformat helper.py. 2023-04-21 21:33:06 +02:00
junos 59552c18a9 Update python to 3.11. 2023-04-21 18:08:54 +02:00
Primoz 26804cf8ea Repair preprocessing one hot encoding of test set. 2023-04-21 13:24:31 +02:00
Primoz 865225994b Added testing section after feature selection. 2023-04-20 13:29:14 +02:00
Primoz 259be708aa Improve the feature selection method with validations etc. 2023-04-20 13:26:20 +02:00
Primoz 0594993133 Add GroupKFold to feature selection CV. Start with generic metric calculation procedure. 2023-04-20 11:20:26 +02:00
Primoz 1cbc743cf7 Add kBest method to initially filter out the worst performing features. Update comments. 2023-04-20 10:12:16 +02:00
Primoz 2a8f1ee613 Merge branch 'ml_pipeline' of https://repo.ijs.si/junoslukan/straw2analysis into ml_pipeline 2023-04-19 15:56:52 +02:00
Primoz ce13a9e13b Implement feature selection method which is used in ML pipeline. 2023-04-19 15:56:34 +02:00
junos a4ad4c3200 Check the relationship between is_system_app and System category. 2023-04-19 11:06:33 +02:00
junos 7e565c34db Only reclassify apps not found on Play Store. Update coding files for app categories. 2023-04-19 11:01:00 +02:00
junos d6eea0fc00 Completely classify unknown applications. 2023-04-19 10:54:46 +02:00
junos 711b451eff Start to better classify system apps. 2023-04-19 09:48:57 +02:00
junos 0e66a5a963 Completely remove PACKAGE_NAMES_HASHED and instead provide a differently structured file. 2023-04-19 09:29:40 +02:00
junos c88cecc063 Categorize applications in config.yaml. 2023-04-18 20:39:58 +02:00
junos 66754a24aa Create and save catalogue. 2023-04-18 16:10:11 +02:00
junos 297eb45933 Merge branch 'ml_pipeline' 2023-04-18 15:55:03 +02:00
junos 0b16aa6fe4 Clean up categories. 2023-04-18 15:49:33 +02:00
junos d092e17e33 Explore saved categories. 2023-04-18 15:34:06 +02:00
junos d000551884 Ignore only some files in presentation. 2023-04-18 14:57:59 +02:00
junos 5b9a1dba1a Revert "Ignore presentation." This reverts commit ec7cd09a09. 2023-04-18 14:54:35 +02:00
Primoz 10ca47583c Implement feature selection methods (WIP). 2023-04-14 17:20:22 +02:00
junos c0904dd681 Update rapids and various PyCharm settings. 2023-04-12 19:38:15 +02:00
junos f672709ea6 JupyText config. 2023-04-05 20:14:47 +02:00
junos 633d029dc2 Merge remote-tracking branch 'origin/ml_pipeline' into ml_pipeline 2023-03-01 13:37:18 +01:00