Study of synthesis images noise detection using 26 attributes

Jérôme BUISINE a57efee223 Merge branch 'release/v0.3.6' hace 3 años
OpenML_datasets f94ba1ea70 update openML datasets used hace 3 años
analysis cb6026f2c7 Add of 26 features metric hace 4 años
features_selection 48c4349333 sorted open ml problems in order to well restart hace 3 años
generate c409191b1c update prediction script for new dataset structure hace 3 años
modules @ cebf2adbf1 0a05939b74 Update of optimization process using backups hace 4 años
optimization 9d71b77f3f new version of surrogate hace 3 años
prediction c409191b1c update prediction script for new dataset structure hace 3 años
rnn @ 8c2fc8888c 427d117327 Update use of surrogate hace 3 años
utils 2fc4db3bfb use of wsao module for accelerate ILS (using surrogate) hace 3 años
wsao @ a92ca5a285 c41293b6fa computation of mae and save it hace 3 años
.gitignore 0a05939b74 Update of optimization process using backups hace 4 años
.gitmodules 41659bc1c2 add of training using rnn models hace 3 años
LICENSE dc0463b6b5 Project initialization hace 4 años
README.md 30d257e0f7 update of the whole project to enable use of new dataset hace 3 años
custom_config.py e4f5839e36 Use surrogate from scract as proposed framework hace 3 años
data_attributes.py a4119a186e update kolmogorov attributes hace 3 años
find_best_attributes.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
find_best_attributes_from.py a2fb893050 svm can now be used for selector hace 3 años
find_best_attributes_surrogate.py 908bbd7f91 use of max_samples param hace 3 años
find_best_attributes_surrogate_dl.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
find_best_attributes_surrogate_openML.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
find_best_attributes_surrogate_openML_multi.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
find_best_attributes_surrogate_openML_multi_specific.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
find_best_filters.py 6032efa1b1 Use of population for rendering surrogate hace 3 años
models.py 2fc4db3bfb use of wsao module for accelerate ILS (using surrogate) hace 3 años
requirements.txt c52c6fae6c now use of macop Python package for optimization process hace 3 años
run_openML_surrogate.py 48c4349333 sorted open ml problems in order to well restart hace 3 años
run_openML_surrogate_multi.py c41293b6fa computation of mae and save it hace 3 años
run_openML_surrogate_multi_specific.py 33cf98b131 enable run of commands hace 3 años
run_surrogate_rendering.sh 7b37ee926c multi run script hace 3 años
train_model.py c409191b1c update prediction script for new dataset structure hace 3 años
train_model_attributes.py b73b27ab44 add balanced data into SVC hace 3 años
train_model_filters.py b73b27ab44 add balanced data into SVC hace 3 años

README.md

Noise detection using 26 attributes

Description

Noise detection on synthesis images with 26 attributes obtained using few filters.

Filters list:

  • average
  • wiener
  • median
  • gaussian
  • wavelet

Requirements

pip install -r requirements.txt

Project structure

Link to your dataset

You need database which respects this structure:

  • dataset/
    • Scene1/
    • Scene1_00050.png
    • Scene1_00070.png
    • ...
    • Scene1_01180.png
    • Scene1_01200.png
    • Scene2/
    • ...
    • ...

Code architecture description

  • modules/*: contains all modules usefull for the whole project (such as configuration variables)
  • analysis/*: contains all jupyter notebook used for analysis during thesis
  • generate/*: contains python scripts for generate data from scenes (described later)
  • data_processing/*: all python scripts for generate custom dataset for models
  • prediction/*: all python scripts for predict new threshold from computed models
  • data_attributes.py: files which contains all extracted features implementation from an image.
  • custom_config.py: override the main configuration project of modules/config/global_config.py
  • train_model.py: script which is used to run specific model available.

Generated data directories:

  • data/*: folder which will contain all generated .train & .test files in order to train model.
  • data/saved_models/*: all scikit learn or keras models saved.
  • data/models_info/*: all markdown files generated to get quick information about model performance and prediction obtained after running run/runAll_*.sh script.
  • data/results/: This folder contains model_comparisons.csv file used for store models performance.

License

The MIT license