SER (Speech Emotion Recognition)

음성 감정 인식 인공지능 모델 제작

음성을 통해 화자의 감정을 인식할 수 있는 모델을 제작

Dataset

RAVDESS Dataset

Libraries

torch : 1.6.0
torchaudio : 0.6.0

Conditions

모든 음원은 고정길이를 가지고 있음

Functions

utils/features.py

feature를 추출할때 사용할 기능들을 담고 있음

extract_spectrogram

torchaudio 라이브러리를 사용해서 음원의 spectrogram을 추출

source
sample_rate
n_fft : None (win_length와 동일)
window_size : 0.025
window_stride : 0.01

return type

Dimension (…, freq, time)

extract_mel_spectrogram

torchaudio 라이브러리를 사용해서 음원의 mel spectrogram을 추출

source
sample_rate
n_mels : 80
n_fft : None (win_length와 동일)
window_size : 0.025
window_stride : 0.01

return type

Dimension (…, freq, time)

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.idea		.idea
configs		configs
old		old
utils		utils
README.md		README.md
dataset.py		dataset.py
playground.py		playground.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SER (Speech Emotion Recognition)

음성 감정 인식 인공지능 모델 제작

Dataset

Libraries

Conditions

Functions

utils/features.py

extract_spectrogram

return type

extract_mel_spectrogram

return type

About

Releases

Packages

Languages

waverdeep/SER-Speech-Emotion-Recognition

Folders and files

Latest commit

History

Repository files navigation

SER (Speech Emotion Recognition)

음성 감정 인식 인공지능 모델 제작

Dataset

Libraries

Conditions

Functions

utils/features.py

extract_spectrogram

return type

extract_mel_spectrogram

return type

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages