Preprint Fetcher just got an audio upgrade
June 08, 2026 | Gabriella Estevam
After several months of using Preprint Fetcher, I can say that it has transformed my daily reading. My preprint feed is curated to my exact research interests, both project-specific and broad, and fully automated. Since the papers are pulled from bioRxiv daily, there is not much that I miss, and if there is a title that doesn’t make it, I still have other RSS feeds and platforms that fill the gap.
I strive to begin each morning with reading, but on days where I am on-the-go, I’d still like to have a method to check my preprint feed. Therefore, I’ve now added a feature (again with the help of Sonnet 4.6) that converts the feed into audio, in addition to the local HTML bookmark update.
The full repository is live on my GitHub and bakes in the audio conversion using open source text-to-speech, which can be changed to suit specific audio preferences. Now, right when Preprint Fetcher runs for the day, it creates an MP3 which can be played locally through audio_feed.py or through an app like Apple Music.
Originally, I planned on building a full podcast-inspired experience where Preprint Fetcher would directly upload the audio feed into Spotify or Apple Music, complete with an intro, cover art, and other features I could use on a commute. That is still the vision, but will require more buildout, especially around the listening experience.
For now, here’s how to implement Preprint Fetcher audio:
Download scripts
- Found here in this Github repository
- I also recommend setting up a virtual environment - instructions in the README
Install Piper TTS
Piper provides open source, in-browser text-to-speech (TTS), and requires no cloud subscriptions.
pip install piper-tts
Download voice model
This downloads the libritts voice model locally, and to switch to a different model change VOICE_MODEL path in audio_feed.py from the available voices on Piper . Again, no data leaves your machine during playback.
mkdir -p ~/preprint-fetcher-audio/voices && cd ~/preprint-fetcher-audio/voices
curl -LO https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/libritts/high/en_US-libritts-high.onnx
curl -LO https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/libritts/high/en_US-libritts-high.onnx.json
Test setup
This checks that: Piper is installed, the voice model exists, and audio plays.
python test_audio.py
Run the audio feed
Preprint Fetcher will generate an MP3 of the feed, including title, authors, and key findings extracted from the abstract.
python audio_feed.py
To preview the script without generating audio:
python audio_feed.py --list
To generate the MP3 without playing it:
python audio_feed.py --no-play
What to expect
Each paper is introduced as:
“Paper 1 of 5. [title]. By [first author] from the [last author] group, and colleagues. Key findings. [2-3 sentences extracted from the abstract].”
The “key findings” are extracted from sentences containing phrases like “we show”, “we found”, “our results”, and “importantly” from the abstract.
The the MP3 lives at feed_output/feed.mp3 and can be shared to your phone or opened in any audio player.