Commit Graph

369 Commits (112d96871590aa792168d0640fd1aff23cc9da84)

Author SHA1 Message Date
Primoz 0594993133 Add GroupKFold to feature selection CV. Start with generic metric calculation procedure. 2023-04-20 11:20:26 +02:00
Primoz 1cbc743cf7 Add kBest method to initially filter out the worst performing features. Update comments. 2023-04-20 10:12:16 +02:00
Primoz 2a8f1ee613 Merge branch 'ml_pipeline' of https://repo.ijs.si/junoslukan/straw2analysis into ml_pipeline 2023-04-19 15:56:52 +02:00
Primoz ce13a9e13b Implement feature selection method which is used in ML pipeline. 2023-04-19 15:56:34 +02:00
junos a4ad4c3200 Check the relationship between is_system_app and System category. 2023-04-19 11:06:33 +02:00
junos 7e565c34db Only reclassify apps not found on Play Store. Update coding files for app categories. 2023-04-19 11:01:00 +02:00
junos d6eea0fc00 Completely classify unknown applications. 2023-04-19 10:54:46 +02:00
junos 711b451eff Start to better classify system apps. 2023-04-19 09:48:57 +02:00
junos 0e66a5a963 Completely remove PACKAGE_NAMES_HASHED and instead provide a differently structured file. 2023-04-19 09:29:40 +02:00
junos c88cecc063 Categorize applications in config.yaml. 2023-04-18 20:39:58 +02:00
junos 66754a24aa Create and save catalogue. 2023-04-18 16:10:11 +02:00
junos 297eb45933 Merge branch 'ml_pipeline' 2023-04-18 15:55:03 +02:00
junos 0b16aa6fe4 Clean up categories. 2023-04-18 15:49:33 +02:00
junos d092e17e33 Explore saved categories. 2023-04-18 15:34:06 +02:00
junos d000551884 Ignore only some files in presentation. 2023-04-18 14:57:59 +02:00
junos 5b9a1dba1a Revert "Ignore presentation." This reverts commit ec7cd09a09. 2023-04-18 14:54:35 +02:00
Primoz 10ca47583c Implement feature selection methods (WIP). 2023-04-14 17:20:22 +02:00
junos c0904dd681 Update rapids and various PyCharm settings. 2023-04-12 19:38:15 +02:00
junos f672709ea6 JupyText config. 2023-04-05 20:14:47 +02:00
junos 633d029dc2 Merge remote-tracking branch 'origin/ml_pipeline' into ml_pipeline 2023-03-01 13:37:18 +01:00
junos ec7cd09a09 Ignore presentation. 2023-03-01 13:37:10 +01:00
Primoz 8a532fa95a Add an ML pipeline script to develop a whole pipeline. 2023-02-23 10:41:36 +01:00
Primoz bccc1cd1de Clean and fix Preprocessing module. 2023-02-23 10:40:58 +01:00
Primoz 9ed863b7a1 Add a CrossValidation module with all the required methods. 2023-02-23 10:40:17 +01:00
Primoz f69cb25266 Add planning comments. 2023-02-22 18:12:52 +01:00
Primoz 7f6ae9b323 Add imputation and One-Hot Encoding Methods. 2023-02-22 18:05:01 +01:00
Primoz 8f6cb3f444 Add preprocessing class. 2023-02-22 13:44:03 +01:00
Primoz ef12f64fe5 Add feature selection Class skeleton. 2023-02-20 11:51:34 +01:00
Primoz 63741c3627 Add feature selection pipeline script with initial plan. 2023-02-15 12:27:39 +01:00
Primoz 806ca1b37d Added excel intermediate scores file 2023-02-06 14:02:24 +01:00
Primoz d06da18c26 Fix error with method name. 2023-02-06 11:54:54 +01:00
Primoz 93a34986d9 Comment code sections and change to pd.concat method. 2023-02-06 11:31:21 +01:00
Primoz 08e81610a9 Fix method params 2023-02-06 11:21:17 +01:00
Primoz ef78f179cd Add necessary parameters to plot method. 2023-02-06 11:19:04 +01:00
Primoz afeb7b4872 Remove data_yield from features. 2023-02-06 11:16:53 +01:00
Primoz ea3f805ba7 Change ML model and add CV to sequential feat_select. Add std lines to plots. 2023-02-06 11:09:15 +01:00
Primoz e3aef2dae7 Add save to file code, and todo comment 2023-02-01 15:13:57 +01:00
Primoz b286753696 Add visualization sections for sequential addition of sensors' features. 2023-02-01 13:51:56 +01:00
Primoz 07ef72dec5 Implement algorithm for sequential adding of the most important features. 2023-01-25 14:19:29 +01:00
Primoz 85e572fca0 Expand analysis of the features (individually and by sensor groups). 2023-01-23 16:32:07 +01:00
Primoz 6a98c8cdcf Prepare scripts for feature importance analysis. 2023-01-19 16:20:43 +01:00
Primoz d263b32564 Temp: remove stratified logo from ml pipeline. 2023-01-19 09:26:55 +01:00
Primoz ad2fab133f Explore features with Entropy and IG. 2023-01-13 17:08:56 +01:00
junos 72fdd9c5ec Use stratified downsampling. And run all models with a method from machine_learning.helper. 2023-01-04 21:48:37 +01:00
junos b0b9edccc4 Unhide jupyter code cells and outputs. 2023-01-04 21:25:12 +01:00
junos 61d786b2ca Describe LOSO results. 2023-01-04 20:38:05 +01:00
junos 37eada4a2e Present results for stressful events. 2023-01-04 20:00:08 +01:00
junos af6843634c Update RAPIDS submodule. 2023-01-04 18:22:53 +01:00
junos 8bbe0b2ba8 Merge branch 'ml_pipeline' of repo.ijs.si:junoslukan/straw2analysis into ml_pipeline (Conflicts: .gitignore, exploration/ml_pipeline_classification_with_clustering.py) 2023-01-04 18:19:43 +01:00
junos 1516d1c000 Cluster by demand_control_ratio_quartile. 2023-01-04 18:16:14 +01:00