2,000 audio clips across 50 environmental sound categories (rain, sirens, etc.).
ESC-50 is a curated dataset of 2,000 environmental audio recordings organized into 50 sound classes, such as animal sounds, natural noises, and human activities. Each audio clip is five seconds long and manually labeled to ensure high-quality annotations. Developed by Karol J. Piczak, the dataset is designed to provide a standardized benchmark for environmental sound classification.
Esc-50 Is Widely Used For Training And Benchmarking Sound Classification Models. It Supports Research In Environmental Sound Recognition, Acoustic Scene Analysis, And Audio Event Detection. The Dataset’s Structured Taxonomy And Balanced Class Distribution Make It Valuable For Evaluating Machine Learning Models And Comparing Algorithmic Performance Across Studies.
Attribution-Non-Commercial 4.0 International (CC BY-NC 4.0)
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.