🌅 Better Morning

Important

This repo is 98% vibe-coded, including this readme. But I use it, so it should kinda work.

🌅 Better Morning

Better Morning is an automated news digest system that fetches articles from user-defined RSS feeds, summarizes them using a Language Model (LLM) of your choice, and delivers a personalized news summary periodically (e.g. daily).

I use this for:

Summarize news
Stay updated about work and project calls
Receive summaries of external newsletters
Filtering noise, such as public acts, work announcements, newsletters

This tools works best when paired with automated RSS generators:

https://politepaul.com
https://morss.it
https://workers.cloudflare.com (for interacting with more complex pages)

You may also use https://kill-the-newsletter.com for transforming newsletters into RSS parsable by this tool.

And you may use this tool's SMTP functionality or https://blogtrottr.com/ to get the digest to your mailbox.

✨ Features

Configurable RSS Collections: Define multiple collections of RSS feeds using simple TOML files.
Smart Content Extraction: Optionally follow article links to extract full content, not just RSS summaries, using a web page parser.
LLM-Powered Summarization: Utilizes litellm to interface with various LLM providers (e.g., OpenAI, Anthropic, Cohere) for intelligent summarization.
Customizable Prompts: Tailor summarization prompts at a global or collection level.
Daily Automation: Runs automatically via a GitHub Action at a user-defined schedule.
Flexible Output: Publish your daily digest as a GitHub Release or send it directly to your email.
Token Management: Automatically truncates content if it exceeds a specified token limit to prevent excessive LLM costs.
Google Scholar Alerts: Automatically retrieve articles from google scholar alerts.
Article Age Filtering: Only include recent articles using max_age (per collection).
LLM-Based Filtering: Optionally include/exclude entries using a boolean LLM query at collection/feed level.

🚀 Getting Started

Follow these steps to set up and run your Better Morning news digest.

1. Clone the Repository

git clone https://github.com/your-username/better-morning.git
cd better-morning

2. Configure Global Settings (`config.toml`)

Create a config.toml file in the root of your repository (if it doesn't already exist) to define global settings. This file specifies defaults and sensitive environment variable names.

Example config.toml:

# filepath: config.toml
[llm_settings]
# Use any litellm-supported model
reasoner_model = "openai/gpt-4o"  # model used for selecting articles and for the final summaries
light_model = "openai/gpt-3.5-turbo" # model used for summarizing individual articles
# thinking_effort_reasoner/thinking_effort_light; conversion from string to integer is done by
# llmlite
thinking_effort_light = "medium"
thinking_effort_reasoner = "8192"

# optionals:
temperature = 0.7
output_language = "Italian"
prompt_template = """Summarize this article concisely in exactly {k_words_each_summary} words:

Title: {title}
Content: {content}

Summary:"""

[content_extraction_settings]
# you can set this to true to follow article links that are inside the content of the article,
# useful for including alerts and list of news
# you can also set this at the colelction and feed levels in collections/*.toml
# default is false
follow_article_links = false

[filter_settings]
# optional LLM-based boolean filtering after full content extraction
# if set, all entries are fetched and internal links are merged into the parent content
# the model must return JSON like {"include": true} to keep the entry
# when unsupported, the system retries and falls back to JSON extraction; if still invalid, it excludes
filter_query = "Include only articles about EU AI regulation updates"
filter_model = "openai/gpt-4o"

[output_settings]
output_type = "email"                              # Options: "github_release", "email", github_release is not working
smtp_server = "smtp.gmail.com"                     # Required if output_type is "email"
smtp_port = 587                                    # Required if output_type is "email"

3. Define RSS Feed Collections (`collections/*.toml`)

Create TOML files in the collections/ directory to define your news categories. Each file represents a collection and can override global settings. A default_news.toml is provided as an example.

Example collections/default_news.toml:

# filepath: collections/default_news.toml
name = "General Daily News"
max_age = "2d"  # Only include articles from the last 2 days (see below for details)

[[feeds]]
url = "https://rss.nytimes.com/services/xml/rss/nyt/HomePage.xml"
name = "New York Times"

[[feeds]]
url = "https://www.theguardian.com/uk/rss"
name = "The Guardian"

[[feeds]]
url = "https://www.bbc.com/news/rss/newsonline_world_edition/front_page/rss.xml"
name = "BBC World News"
follow_article_links = false # override parent setting for this feed only

filter_query = "Include only business or economy-related news"
filter_model = "openai/gpt-4o"

[llm_settings]
n_most_important_news = 5
k_words_each_summary = 100
prompt_template = "Summarize this news article for a general audience: {title}. Content: {content}"

[content_extraction_settings]
follow_article_links = true # if the article content contains links, this will follow them (you can enable this at the feed level only also)
link_filter_pattern = "https:\/\/scholar.google.com\/scholar_url\?url=.+" # this is a regex pattern to filter links to follow

[filter_settings]
# collection-level defaults (feed-level overrides take precedence)
filter_query = "Include only articles about higher-education policy or university funding"
filter_model = "openai/gpt-4o"

collection_prompt = "Provide a comprehensive but concise overview of the most significant global news from various sources."

LLM-Based Filtering (Collection/Feed)

You can define an LLM query that decides whether each entry should be included. When a filter query is set, every entry is fully fetched (no title-based pre-filtering), internal links are followed, and all linked content is merged into the parent article before evaluation. You may use LLM-based filtering on the content for each article, but it would be slow and costly. The standard way in this tool is to do LLM-based filtering using only the titles of the articles.

Precedence: feed-level settings override collection-level settings.

Example:

name = "Research"

[filter_settings]
filter_query = "Include only articles about EU AI regulation updates"
filter_model = "openai/gpt-4o"

[[feeds]]
url = "https://example.com/rss.xml"
name = "Example Feed"
filter_query = "Include only articles about AI safety policy"
filter_model = "openai/gpt-4o"

Filtering Articles by Age (`max_age`)

You can control how old articles are allowed to be for each collection using the max_age setting in your collection TOML files. This helps ensure your digest only includes recent or relevant news.

Supported formats:

Time span: e.g. 2d (2 days), 1h (1 hour), 30m (30 minutes), 7d (7 days)
Special value: last-digest — Only include articles published after the last successful digest for this collection (uses a cached timestamp).

Example:

# filepath: collections/recent_news.toml
name = "Recent News Only"
max_age = "2d"  # Only include articles from the last 2 days

[[feeds]]
url = "https://feeds.bbci.co.uk/news/rss.xml"
name = "BBC News"
max_articles = 10

How last-digest works:

When you set max_age = "last-digest", the system remembers the time of the last successful digest for each collection.
On the next run, only articles newer than that time are included.
The timestamp is cached in a file under history/ and persists across runs (including GitHub Actions and local runs).
If no previous digest exists, all articles are included on the first run.

4. Set Up GitHub Secrets

For the GitHub Action to function, you need to configure secrets in your repository settings.

Go to your GitHub repository.
Navigate to Settings -> Secrets and variables -> Actions.
Click New repository secret for each of the following:
- BETTER_MORNING_LLM_API_KEY: Your API key for the chosen LLM provider (e.g., OpenAI API Key). Required.
- BETTER_MORNING_SMTP_USERNAME: (Optional, if using email output) Your SMTP username/email address.
- BETTER_MORNING_SMTP_PASSWORD: (Optional, if using email output) Your SMTP password or app-specific password.
- BETTER_MORNING_RECIPIENT_EMAIL: (Optional, if using email output) The email address to send the digest to.
- Note: GITHUB_TOKEN is automatically provided by GitHub Actions for creating releases, so you don't need to set it manually. Ensure your repository's Settings > Actions > General > Workflow permissions are set to Read and write permissions.

5. Set Up GitHub Action

The .github/workflows/daily_digest.yml file defines the GitHub Action that runs your daily digest generation.

Trigger: It's set to run daily at 07:30 UTC and can also be triggered manually via workflow_dispatch.
Steps: It checks out your code, sets up Python with uv, installs dependencies, and executes src/main.py.

You need to set the cron schedule and the branch for running it (see line branch: personal) in the workflow file.

📝 Output Options

Based on your output_type in config.toml:

GitHub Release: A new GitHub Release will be created daily with the digest content. The release will be tagged with daily-digest-YYYY-MM-DD.
Email: The digest will be sent to the email address specified in the BETTER_MORNING_RECIPIENT_EMAIL environment variable.

Run Locally

Install Dependencies

This project uses uv as the package manager for speed and efficiency.

pip install uv
uv sync

To test the digest generation locally without deploying to GitHub Actions, you can use the run_local.py script. This script sets up dummy environment variables where needed and executes the main application logic. If GitHub or email output is configured but secrets are not fully set, the digest will be saved to a local Markdown file.

./.venv/bin/python run_local.py

Note: For local LLM summarization to work, you will need to set the BETTER_MORNING_LLM_API_KEY environment variable with a valid API key for your chosen LLM provider. For example:

export BETTER_MORNING_LLM_API_KEY="sk-your-llm-api-key"
./.venv/bin/python run_local.py

📄 License

This project is licensed under the GPL v3 License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 153 Commits
.github/workflows		.github/workflows
ai		ai
collections		collections
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.toml		config.toml
pyproject.toml		pyproject.toml
run.sh		run.sh
run_local.py		run_local.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🌅 Better Morning

✨ Features

🚀 Getting Started

1. Clone the Repository

2. Configure Global Settings (`config.toml`)

3. Define RSS Feed Collections (`collections/*.toml`)

LLM-Based Filtering (Collection/Feed)

Filtering Articles by Age (`max_age`)

4. Set Up GitHub Secrets

5. Set Up GitHub Action

📝 Output Options

Run Locally

Install Dependencies

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🌅 Better Morning

✨ Features

🚀 Getting Started

1. Clone the Repository

2. Configure Global Settings (config.toml)

3. Define RSS Feed Collections (collections/*.toml)

LLM-Based Filtering (Collection/Feed)

Filtering Articles by Age (max_age)

4. Set Up GitHub Secrets

5. Set Up GitHub Action

📝 Output Options

Run Locally

Install Dependencies

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2. Configure Global Settings (`config.toml`)

3. Define RSS Feed Collections (`collections/*.toml`)

Filtering Articles by Age (`max_age`)

Packages