README / README.md
mseid-spotify's picture
Update README.md
1716cfd
---
title: README
emoji: πŸš€
colorFrom: pink
colorTo: green
sdk: static
pinned: false
---
<p align="center">
<img src="https://huggingface.co/spaces/spotify/README/resolve/main/spotify_r-d_logo.png">
</p>
[Spotify Research](https://research.atspotify.com/) is part of Spotify R&D, the technology engine that drives everything you love about the Spotify app. Spotify Research is dedicated to extending the state of the art in audio. With over 15 years of experience, Spotify Research is working on the hardest problems using a broad range of AI methods to understand listeners, creators, the content in the Spotify catalog, and the streaming business. Research areas include matching content and listeners, extracting signals from the audio catalog using natural language understanding and multimedia information retrieval methods, evaluation and algorithmic responsibility.
## Project Showcase
### Basic Pitch
![Basic Pitch Logo](https://user-images.githubusercontent.com/213293/167478083-de988de2-9137-4325-8a5f-ceeb51233753.png)
[Basic Pitch](https://github.com/spotify/basic-pitch) is a Python library for Automatic Music Transcription (AMT), using lightweight neural network developed by [Spotify's Audio Intelligence Lab](https://research.atspotify.com/audio-intelligence/). It's small, easy-to-use, pip install-able and npm install-able via its [sibling repo](https://github.com/spotify/basic-pitch-ts) or can be accessed at [basicpitch.io](https://github.com/spotify/basic-pitch).
Basic Pitch may be simple, but it is far from "basic"! basic-pitch is efficient and its multipitch support, ability to generalize across instruments, and note accuracy competes with much larger and more resource-hungry AMT systems.
## Datasets
Datasets
Spotify has a few [datasets to explore](https://research.atspotify.com/datasets/). A highlight is the [Spotify Podcast Dataset](https://podcastsdataset.byspotify.com/) consisting of over 100,000 episodes each in English and Portuguese from different podcast shows on Spotify. The dataset is available for research purposes. We released the podcast dataset more widely to facilitate research on podcasts through the lens of speech and audio technology, natural language processing, information retrieval, and linguistics. The dataset contains over 100,000 hours of audio, and over 1 billion transcribed words.
## Jobs
We are hiring! Find roles open:
- [Spotify Research](https://research.atspotify.com/jobs/)
- [All of Spotify](https://www.lifeatspotify.com/jobs)