Study of synthesis images noise detection using 26 attributes

2 Ramas

Jérôme BUISINE 7cc4631e5d Add test of RFE performances and n_jobs		hace 4 años
OpenML_datasets	f94ba1ea70 update openML datasets used	hace 4 años
analysis	166ba8d97e Add study with classical optimisation	hace 4 años
features_selection	48c4349333 sorted open ml problems in order to well restart	hace 4 años
generate	c409191b1c update prediction script for new dataset structure	hace 5 años
modules @ cebf2adbf1	0a05939b74 Update of optimization process using backups	hace 6 años
optimization	c3a4e96cac avoid train every modulo 0	hace 4 años
prediction	c409191b1c update prediction script for new dataset structure	hace 5 años
rnn @ 8c2fc8888c	427d117327 Update use of surrogate	hace 4 años
utils	2fc4db3bfb use of wsao module for accelerate ILS (using surrogate)	hace 5 años
wsao @ a92ca5a285	c41293b6fa computation of mae and save it	hace 4 años
.gitignore	475f2851d1 Add of ocurences displayed	hace 4 años
.gitmodules	41659bc1c2 add of training using rnn models	hace 4 años
LICENSE	dc0463b6b5 Project initialization	hace 6 años
README.md	30d257e0f7 update of the whole project to enable use of new dataset	hace 5 años
check_random_forest_perfomance.py	8dc3803465 update number of features as input	hace 4 años
check_random_forest_perfomance_rfe.py	7cc4631e5d Add test of RFE performances and n_jobs	hace 4 años
custom_config.py	e4f5839e36 Use surrogate from scract as proposed framework	hace 5 años
data_attributes.py	a4119a186e update kolmogorov attributes	hace 5 años
find_best_attributes.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
find_best_attributes_from.py	a2fb893050 svm can now be used for selector	hace 5 años
find_best_attributes_surrogate.py	38ca51bff9 use of random forest	hace 4 años
find_best_attributes_surrogate_dl.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
find_best_attributes_surrogate_openML.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
find_best_attributes_surrogate_openML_multi.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
find_best_attributes_surrogate_openML_multi_specific.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
find_best_filters.py	6032efa1b1 Use of population for rendering surrogate	hace 4 años
models.py	65849b6228 reduce of number of params	hace 4 años
requirements.txt	c52c6fae6c now use of macop Python package for optimization process	hace 5 años
run_openML_surrogate.py	48c4349333 sorted open ml problems in order to well restart	hace 4 años
run_openML_surrogate_multi.py	c41293b6fa computation of mae and save it	hace 4 años
run_openML_surrogate_multi_specific.py	33cf98b131 enable run of commands	hace 4 años
run_surrogate_rendering.sh	b111f51621 Reduce pop size	hace 4 años
train_model.py	c409191b1c update prediction script for new dataset structure	hace 5 años
train_model_attributes.py	b73b27ab44 add balanced data into SVC	hace 5 años
train_model_filters.py	b73b27ab44 add balanced data into SVC	hace 5 años

Noise detection using 26 attributes

Description

Noise detection on synthesis images with 26 attributes obtained using few filters.

Filters list:

average
wiener
median
gaussian
wavelet

Requirements

pip install -r requirements.txt

Project structure

Link to your dataset

You need database which respects this structure:

dataset/
- Scene1/
- Scene1_00050.png
- Scene1_00070.png
- ...
- Scene1_01180.png
- Scene1_01200.png
- Scene2/
- ...
- ...

Code architecture description

modules/*: contains all modules usefull for the whole project (such as configuration variables)
analysis/*: contains all jupyter notebook used for analysis during thesis
generate/*: contains python scripts for generate data from scenes (described later)
data_processing/*: all python scripts for generate custom dataset for models
prediction/*: all python scripts for predict new threshold from computed models
data_attributes.py: files which contains all extracted features implementation from an image.
custom_config.py: override the main configuration project of modules/config/global_config.py
train_model.py: script which is used to run specific model available.

Generated data directories:

data/*: folder which will contain all generated .train & .test files in order to train model.
data/saved_models/*: all scikit learn or keras models saved.
data/models_info/*: all markdown files generated to get quick information about model performance and prediction obtained after running run/runAll_*.sh script.
data/results/: This folder contains model_comparisons.csv file used for store models performance.

License

The MIT license

README.md