Blog

Building in the AI-native era — where cloud meets coding assistants and agentic AI.

AI Mar 10, 2026

Vector Search vs Semantic Search: They're Not the Same Thing

Vector search, semantic search, keyword search, hybrid search — these terms get used interchangeably but they mean different things. This post breaks down what each actually does, when each matters, and why hybrid search wins for RAG.

12 min read
Cloud Mar 5, 2026

When Your Keys Get Locked In: Navigating AWS KMS Import Limitations

AWS KMS doesn't allow key material export by design. When an external PKI partner generates keys but doesn't retain them, you're stuck. Here are the four AWS alternatives — CloudHSM, XKS, Private CA, and fixing the process — with a decision framework to pick the right one.

13 min read
AI Mar 1, 2026

World Monitor: How Open-Source OSINT Is Democratizing Global Intelligence

A deep dive into World Monitor — an open-source intelligence dashboard that aggregates 150+ feeds, 40+ geospatial layers, and AI-powered analysis into a real-time situational awareness platform. What OSINT is, how these platforms work under the hood, and why it matters now more than ever.

9 min read
Security Feb 27, 2026

Your Security Team Wants to Privatize Your App — Here's What They Actually Need

When your security team says 'make it private', they usually mean 'make it secure.' This post compares four approaches — VPC privatization, WAF IP allowlisting, CloudFront + auth hardening, and AWS Verified Access — and explains why Zero Trust beats network perimeters for internal applications.

9 min read
AI Feb 26, 2026

LLM Architecture Explained Simply: 10 Questions From Prompt to Token

A beginner-friendly walkthrough of how an LLM actually works end-to-end: from typing a prompt to receiving a response — covering tokenization, embeddings, Transformer layers, KV cache, the training loop, embeddings for search, and why decoder-only models won.

16 min read
AI Feb 23, 2026

Transformer Anatomy: Attention + FFN Demystified

A deep dive into the Transformer architecture — how attention connects tokens and why the Feed-Forward Network is the real brain of the model. Plus the key to understanding Mixture of Experts (MoE).

15 min read
AI Feb 9, 2026

RAG on AWS: Which Vector Store Is Right for You?

AWS now offers 9 different ways to store and search vectors for RAG workloads. This guide compares every option through the Well-Architected Framework to help you pick the right one.

21 min read
Development Jan 26, 2026

AWS Backup Cost Analysis

EBS snapshot costs were growing month-over-month with no clear explanation or optimization strategy.

3 min read
Development Jan 26, 2026

SEO Enhancement - agiusalexandre.com

Blog posts weren't appearing correctly in search results or showing rich previews when shared on social media.

2 min read