Geek Logbook

Tech sea log book

Batch Means Two Different Things: Why the Term Became Confusing in Data Engineering

In data systems, some of the most common words are also the most overloaded. Few terms illustrate this better than batch. Historically, batch processing described a very specific operating model: work was accumulated, grouped, and executed later, usually without direct user interaction. In contrast, online systems handled requests interactively, responding as operations arrived. This distinction

Why apt upgrade Didn’t Update VS Code (and What Actually Happened)

Problem Statement But Visual Studio Code (Visual Studio Code) remains outdated. it updates successfully. This behavior is not accidental. It is a consequence of how APT resolves dependencies under different upgrade strategies. Key Concept: APT Upgrade Policies APT provides multiple upgrade strategies with different levels of aggressiveness: Command Behavior apt upgrade Upgrades packages without installing

Why Terraform Does Not Deploy Your Lambda Container Image

When teams start packaging AWS Lambda functions as container images, a common misunderstanding appears quickly: “I created the Lambda with Terraform, so why is AWS saying the image does not exist?” The answer is simple. Terraform can provision the infrastructure that references a container image, but it does not automatically build and publish that image

Can an AWS VPC Have Two Peering Connections? Yes. But Should It?

When teams begin structuring cloud networks in AWS, one of the first connectivity mechanisms they encounter is VPC Peering. It is simple, direct, and usually easy to implement for small environments. A common question appears quickly: Can a VPC have two peering connections? The answer is straightforward: yes, it can. A single VPC can maintain

Sending Events to Multiple PostHog Projects from the Same Website

In some architectures, a single website needs to send analytics events to multiple PostHog projects. This situation commonly appears in the following scenarios: PostHog supports this setup by allowing multiple instances of the JavaScript SDK to run simultaneously on the same website. How Multiple PostHog Instances Work PostHog allows initializing multiple instances of the SDK

Lambda vs n8n: A Simple Explanation for Data Workflows

Introduction When building data systems or integrating APIs, a common question appears: should we use AWS Lambda or n8n? Both tools can automate processes, call APIs, and move data between systems, but they are not the same thing and should not be used for the same purpose. The simplest way to understand the difference is

Should You Use AWS Lambda or AWS Glue to Update Records in HubSpot?

When integrating HubSpot with a data platform on AWS, a common architectural decision appears quickly: Should updates to HubSpot be executed from AWS Lambda or AWS Glue?The correct choice depends on workload characteristics, latency requirements, and system design principles. This article explains the decision from an architectural and data engineering perspective. The Nature of the

Understanding client_ingestion_warning in PostHog: Are You Losing Data?

When using PostHog with the default posthog-js configuration, you may encounter the following warning: In the UI, this appears as: This article explains what it means, why it happens, and whether you are losing data. What Is client_ingestion_warning? client_ingestion_warning is an internal warning emitted by the PostHog JavaScript SDK when the client-side rate limiter is