From facets to prompts: Making RAG smarter in PostgreSQL

In a previous blog post, we built a simple retrieval pipeline where we chunked the documentation, embedded the text, and queried PostgreSQL with vector similarity to find relevant passages. Then in the most recent post, we shifted gears and focused on the ingestion side and looked at extraction of facets such as version, operating system, document type, and component, and stored them alongside...
Improving RAG in PostgreSQL: From basic retrieval to smarter context

In my previous blog post, we built the simplest possible Retrieval Augmented Generation (RAG) pipeline inside PostgreSQL. We embedded our manuals, stored those vectors in a table, ran a similarity search, and handed the top 5 results straight to a Large Language Model. The result was encouraging, we could already see the model drawing on our content rather than inventing information. But as with...
Detection and resolution of conflicts in PostgreSQL logical replication

At this year’s PGConf.dev, the premier gathering for PostgreSQL contributors, developers, and community leaders, Zhijie Hou and I had the opportunity talk about the challenges and solutions around conflict handling in logical replication — a topic increasingly relevant as PostgreSQL adoption continues to grow.
How to upgrade replication clusters without downtime

At PGConf India 2025, I shared strategies for upgrading PostgreSQL replication clusters with no disruption to operations—highlighting examples and the evolving capabilities of logical replication.
Data security and compliance challenges in today’s landscape

The global average cost of data breach in 2021 was over $4.2M†, surging 10% from last year. As data becomes ever more valuable, organizations must fend off cyber-attacks which keep increasing in number and sophistication.