voice-dataset/
├── speaker1/
│   ├── 0001.wav
│   ├── 0002.wav
│   └── ...
└── metadata.csv