Audio-Feat-Extraction
所属分类:特征抽取
开发工具:Jupyter Notebook
文件大小:0KB
下载次数:0
上传日期:2023-10-13 17:41:25
上 传 者:
sh-1993
说明: 在Python中使用Librosa进行音频处理和特征提取。波形可视化、频谱图和提取特征,如频谱...,
(Audio Processing and Feature Extraction using Librosa in Python. Waveform visualization, spectrograms, and extract features like spectral centroid and MFCCs.)
文件列表:
extras/ (0, 2023-10-13)
extras/test-gb.wav (2993162, 2023-10-13)
extras/tone-gb.wav (748316, 2023-10-13)
main.py (152, 2023-10-13)
main_AFE(FEFA).ipynb (5982922, 2023-10-13)
# Audio Processing Feature Extraction
## Overview
This project provides code snippets for audio processing and feature extraction using the Librosa library in Python. Librosa is a powerful tool for music and audio analysis, offering functionalities for loading audio files, visualizing waveforms, creating spectrograms, and extracting features like spectral centroid and MFCCs.
## Credits
- Code snippets and inspiration from:
- [Towards Data Science: Extract Features of Music](https://towardsdatascience.com/extract-features-of-music-75a3f9bc265d)
- [Kaggle Notebook: Feature Extraction from Audio](https://www.kaggle.com/code/ashishpatel26/feature-extraction-from-audio/notebook)
- [extras/test-gb.mav (excerpt from the track "Gentle Blade" from the Sekiro: Shadows Die Twice Original Soundtrack - Yuka Kitamura)](https://www.youtube.com/watch?v=tOzQ_OqnY2s)
- [ChatGPT](https://chat.openai.com)
## How it Works
The code snippets cover the following aspects:
1. **Loading and Playing Audio**
- Load an audio file.
- Play the audio within a Jupyter Notebook.
2. **Waveform Visualization**
- Visualize the audio waveform, showcasing its amplitude and structure.
3. **Spectrogram Visualization**
- Generate and display a spectrogram.
- Visualize the log-scaled spectrogram for detailed frequency content.
4. **Creating a Sine Wave**
- Generate a synthetic sine wave signal for testing purposes.
5. **Feature Extraction**
- Display the waveform of an audio signal.
- Explore feature extraction techniques.
6. **Spectral Centroid Visualization**
- Compute and visualize the spectral centroid, representing the center of mass of frequencies in the sound.
7. **MFCC (Mel-Frequency Cepstral Coefficients) Visualization**
- Compute and display MFCCs, a representation of the short-term power spectrum of a sound.
## Packages Used
- `librosa`: Core library for audio processing.
- `numpy`: Used for numerical operations.
- `matplotlib`: Required for generating visualizations.
- `sklearn`: Used for normalization in some visualizations.
## Setup
Make sure to install the required packages using the following:
```bash
pip install librosa numpy matplotlib scikit-learn
近期下载者:
相关文件:
收藏者: