Preprint Fetcher just got an audio upgrade

June 08, 2026 | Gabriella Estevam

After several months of using Preprint Fetcher, I can say that it has transformed my daily reading. My preprint feed is curated to my exact research interests, both project-specific and broad, and fully automated. Since the papers are pulled from bioRxiv daily, there is not much that I miss, and if there is a title that doesn’t make it, I still have other RSS feeds and platforms that fill the gap.

I strive to begin each morning with reading, but on days where I am on-the-go, I’d still like to have a method to check my preprint feed. Therefore, I’ve now added a feature (again with the help of Sonnet 4.6) that converts the feed into audio, in addition to the local HTML bookmark update.

The full repository is live on my GitHub and bakes in the audio conversion using open source text-to-speech, which can be changed to suit specific audio preferences. Now, right when Preprint Fetcher runs for the day, it creates an MP3 which can be played locally through audio_feed.py or through an app like Apple Music.

Originally, I planned on building a full podcast-inspired experience where Preprint Fetcher would directly upload the audio feed into Spotify or Apple Music, complete with an intro, cover art, and other features I could use on a commute. That is still the vision, but will require more buildout, especially around the listening experience.


For now, here’s how to implement Preprint Fetcher audio:


Download scripts

  • Found here in this Github repository
  • I also recommend setting up a virtual environment - instructions in the README


Install Piper TTS

Piper provides open source, in-browser text-to-speech (TTS), and requires no cloud subscriptions.

pip install piper-tts


Download voice model

This downloads the libritts voice model locally, and to switch to a different model change VOICE_MODEL path in audio_feed.py from the available voices on Piper . Again, no data leaves your machine during playback.

  mkdir -p ~/preprint-fetcher-audio/voices && cd ~/preprint-fetcher-audio/voices
  curl -LO https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/libritts/high/en_US-libritts-high.onnx
  curl -LO https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/libritts/high/en_US-libritts-high.onnx.json


Test setup

This checks that: Piper is installed, the voice model exists, and audio plays.

  python test_audio.py


Run the audio feed

Preprint Fetcher will generate an MP3 of the feed, including title, authors, and key findings extracted from the abstract.

  python audio_feed.py


To preview the script without generating audio:

  python audio_feed.py --list


To generate the MP3 without playing it:

  python audio_feed.py --no-play


What to expect

Each paper is introduced as:

“Paper 1 of 5. [title]. By [first author] from the [last author] group, and colleagues. Key findings. [2-3 sentences extracted from the abstract].”

The “key findings” are extracted from sentences containing phrases like “we show”, “we found”, “our results”, and “importantly” from the abstract.

The the MP3 lives at feed_output/feed.mp3 and can be shared to your phone or opened in any audio player.

← Back to writing
LinkedIn GitHub
©