Urban_Sounds_Classifier

Basic sound classifier using a PyTorch RNN with GRU or LSTM

This is a basic demo of a sound classifier. The main idea is to train a recurrent neural network with PyTorch and to benchmark the differences in accuracy and training time between GRU and LSTM.
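As a rough illustration of the GRU versus LSTM comparison, here is a minimal sketch of a classifier whose recurrent cell can be swapped by name. It is not the code from this repository: the class name `RNNClassifier`, the helper `time_one_step`, the layer sizes, the 10 output classes, and the random input batch are all assumptions made for the example.

```python
import time

import torch
from torch import nn


class RNNClassifier(nn.Module):
    """Hypothetical classifier: one recurrent layer followed by a linear head."""

    def __init__(self, cell, n_features=40, hidden_size=64, n_classes=10):
        super().__init__()
        # Pick the recurrent layer by name; both share the same constructor arguments.
        rnn_cls = {"gru": nn.GRU, "lstm": nn.LSTM}[cell]
        self.rnn = rnn_cls(n_features, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, n_classes)

    def forward(self, x):
        out, _ = self.rnn(x)          # out: (batch, frames, hidden_size)
        return self.head(out[:, -1])  # classify from the last time step


def time_one_step(cell, batch):
    """Time a single forward/backward/update step on random labels."""
    model = RNNClassifier(cell)
    optimizer = torch.optim.Adam(model.parameters())
    labels = torch.randint(0, 10, (batch.shape[0],))
    start = time.perf_counter()
    loss = nn.functional.cross_entropy(model(batch), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return time.perf_counter() - start


# Random stand-in for a batch of MFCC sequences: (batch, frames, n_features)
batch = torch.randn(8, 44, 40)
for cell in ("gru", "lstm"):
    print(cell, f"{time_one_step(cell, batch):.4f}s")
```

A single step on random data only shows the mechanics. A meaningful benchmark would average over many batches of real data and report accuracy on a held-out set as well.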

I use a Docker image so that PyTorch can run on my AMD GPU through ROCm. That PyTorch build is customized and may not work with torchaudio, so I used librosa here to preprocess the audio (normalization, MFCC extraction).

Improvements:

better logs, possibly using TensorBoard or a similar tool (ClearML?)
tune the hyperparameters to improve the scores
add a K-fold option for training
try the same algorithm with Lightning (for training purposes)
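The K-fold option mentioned in the list above could start from something like this sketch, which only produces the index splits and uses the standard library. The function name `k_fold_indices` and its parameters are assumptions for illustration, not part of this repository.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, val_indices) for each of k shuffled folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        val = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, val


for fold, (train_idx, val_idx) in enumerate(k_fold_indices(10, k=5)):
    print(fold, len(train_idx), len(val_idx))
```

Each pair of index lists could then be wrapped with `torch.utils.data.Subset` to build the training and validation loaders for that fold.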

Repository files: 002_RNN_LSTM_GRU.py, README.md, dataset.py

DavidNativ/Urban_Sounds_Classifier
