Rost Glukhov | Personal site and technical blog

vLLM Quickstart: High-Performance LLM Serving

vLLM is a high-throughput, memory-efficient inference and serving engine for Large Language Models (LLMs) developed by UC Berkeley’s Sky Computing Lab.

DGX Spark AU Pricing: $6,249-$7,999 at Major Retailers

The NVIDIA DGX Spark (GB10 Grace Blackwell) is now available in Australia at major PC retailers with local stock. Australian pricing ranges from $6,249 to $7,999 AUD depending on storage configuration and retailer.

DIY Printing Planner Inserts - 3 ways

Creating custom planner inserts combines the satisfaction of analog planning with the flexibility of digital design tools. Here are just the notes on printing them.

Extract Text from PDFs with PDFMiner in Python

PDFMiner.six is a powerful Python library for extracting text, metadata, and layout information from PDF documents.

Playwright: Web Scraping & Testing

Playwright is a modern browser automation framework for web scraping and end-to-end testing.

Snake BBQ method

This post is just to show a nice photo of the snake-arranged coals in my BBQ, ready to fire.

Detecting AI Slop: Techniques & Red Flags

The proliferation of AI-generated content has created a new challenge: distinguishing genuine human writing from “AI slop” - low-quality, mass-produced synthetic text.

Self-Hosting Cognee: Choosing LLM on Ollama

Cognee is a Python framework for building knowledge graphs from documents using LLMs. But does it work with self-hosted models?

BAML vs Instructor: Structured LLM Outputs

When working with Large Language Models in production, getting structured, type-safe outputs is critical. Two popular frameworks - BAML and Instructor - take different approaches to solving this problem.

Choosing the Right LLM for Cognee: Local Ollama Setup

Choosing the best LLM for Cognee means balancing graph-building quality, hallucination rates, and hardware constraints. Cognee works best with larger, low-hallucination models (32B+) via Ollama, but mid-size options work for lighter setups.

Install KVM on Ubuntu 24.04

To install KVM on Ubuntu 24.04, check CPU virtualization support, install the KVM/libvirt packages, enable the libvirtd service, and (optionally) install virt-manager for a GUI.
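The first step, checking CPU virtualization support, can be sketched in a few lines of Python. This is only an illustration, not the procedure from the post: the helper name `has_hw_virtualization` and the sample text are made up for this example, and it assumes an x86 machine where `/proc/cpuinfo` lists `vmx` (Intel VT-x) or `svm` (AMD-V) on its `flags` lines.

```python
def has_hw_virtualization(cpuinfo_text: str) -> bool:
    """Return True if any 'flags' line lists vmx (Intel VT-x) or svm (AMD-V).

    Hypothetical helper for illustration; it expects text in the format of
    /proc/cpuinfo on an x86 Linux system.
    """
    for line in cpuinfo_text.splitlines():
        # Each line looks like "key<tabs>: value"; split on the first colon.
        key, _, value = line.partition(":")
        if key.strip() == "flags" and {"vmx", "svm"} & set(value.split()):
            return True
    return False


# Made-up sample in /proc/cpuinfo format, so the sketch runs anywhere.
sample = "model name\t: Example CPU\nflags\t\t: fpu vme sse sse2 vmx\n"
print("virtualization flags present:", has_hw_virtualization(sample))
```

On a real Ubuntu host, the same function could be fed the contents of `/proc/cpuinfo` instead of the sample string. A `False` result may also mean that virtualization is disabled in the firmware settings rather than unsupported by the CPU.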

Go Workspace Structure: From GOPATH to go.work

Managing Go projects effectively requires understanding how workspaces organize code, dependencies, and build environments.

Show Git Branch & Status in Bash Prompt

A well-configured bash prompt displaying git repository information can dramatically improve your development workflow.

SEO Breadcrumbs: Schema Markup Implementation Guide

Breadcrumb navigation combined with proper schema markup is one of the most effective yet underutilized SEO techniques that can significantly improve your website’s search visibility and user experience.

Snap vs Flatpak: Ultimate Guide for 2025

Universal package managers have transformed Linux software distribution, making cross-distribution compatibility a reality. Snap and Flatpak emerged as the leading solutions, each bringing distinct philosophies to solving dependency hell and distribution fragmentation.

Go Project Structure: Practices & Patterns

Structuring a Go project effectively is fundamental to long-term maintainability, team collaboration, and scalability. Unlike frameworks that enforce rigid directory layouts, Go embraces flexibility—but with that freedom comes the responsibility to choose patterns that serve your project’s specific needs.
