Voice Activity Detection
Transformers
PyTorch
TensorBoard
Safetensors
pyannet
speaker-diarization
speaker-segmentation
Generated from Trainer
pyannote
pyannote.audio
pyannote-audio-model
audio
voice
speech
speaker
speaker-change-detection
overlapped-speech-detection
resegmentation
Inference Endpoints
File size: 417 Bytes
2c619f0 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
model:
_target_: pyannote.audio.models.segmentation.PyanNet.PyanNet
linear:
hidden_size: 128
num_layers: 2
lstm:
batch_first: true
bidirectional: true
dropout: 0.0
hidden_size: 128
monolithic: true
num_layers: 4
num_channels: 1
sample_rate: 16000
sincnet:
sample_rate: 16000
stride: 10
task:
duration: 10.0
max_speakers_per_chunk: 3
max_speakers_per_frame: 2
|