
Selected GenAI Systems

A set of end-to-end builds demonstrating agent orchestration, retrieval pipelines, streaming UX, and production-grade AI backends.

Perplexity-Style AI Research Tool

AI-powered search with intelligent query routing, Tavily web search, and Gemini LLM summarization. Features streaming responses and persistent sessions.

Key implementation

LangGraph-based routing for search vs LLM
Tavily API for real-time web search
Gemini LLM with streaming responses
Supabase for persistent chat history

FastAPI · Tavily API · Gemini · LangGraph · Supabase


AI Video Ads Generator

Generates personalized AI video ads: LangChain writes the scripts, HeyGen renders the avatar videos, and Inngest runs the processing asynchronously. The system tracks jobs and manages rendering workflows.

Key implementation

Event-driven video generation with Inngest
HeyGen API for AI avatar videos
LangChain for script generation
Job persistence for long-running video tasks
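
The job-persistence idea above can be sketched as a small status table. This is a minimal illustration only: it uses Python's built-in sqlite3 rather than the project's actual storage, and the `JobStore` class, the `jobs` schema, and the status names are all assumptions made for this example.

```python
import sqlite3
import uuid
from typing import Optional, Tuple


class JobStore:
    """Tracks long-running render jobs in SQLite (hypothetical schema)."""

    def __init__(self, path: str = ":memory:") -> None:
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS jobs ("
            "id TEXT PRIMARY KEY, status TEXT NOT NULL, result_url TEXT)"
        )

    def create(self) -> str:
        """Insert a new job in the 'queued' state and return its id."""
        job_id = uuid.uuid4().hex
        self.db.execute(
            "INSERT INTO jobs (id, status) VALUES (?, 'queued')", (job_id,)
        )
        self.db.commit()
        return job_id

    def update(self, job_id: str, status: str,
               result_url: Optional[str] = None) -> None:
        """Record a status change, for example from a render-finished event."""
        self.db.execute(
            "UPDATE jobs SET status = ?, result_url = ? WHERE id = ?",
            (status, result_url, job_id),
        )
        self.db.commit()

    def get(self, job_id: str) -> Tuple[str, Optional[str]]:
        """Return (status, result_url); raise KeyError for unknown ids."""
        row = self.db.execute(
            "SELECT status, result_url FROM jobs WHERE id = ?", (job_id,)
        ).fetchone()
        if row is None:
            raise KeyError(job_id)
        return row[0], row[1]
```

Because the status lives in a table rather than in process memory, a client can poll `get()` while the render runs elsewhere, and a worker restart does not lose track of the job.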

Next.js · FastAPI · LangChain · Inngest · HeyGen


AI Short Video Generator

Converts a topic into a narrated short video by generating a script, TTS audio, AI images, and captions, then rendering asynchronously with Remotion.

Key implementation

Scene-based scripts with Gemini
TTS + AI image pipeline
Async video rendering with Remotion
Supabase queues for workflow orchestration

Next.js · FastAPI · Remotion · TTS · Supabase


Engineering Focus

Designing GenAI systems that address real-world challenges in LLM orchestration, retrieval pipelines, state management, and scalable AI workflows.

LLM System Architecture

Designing modular GenAI systems with clear separation between UI, API, and AI orchestration layers.

Agentic Workflows (LangGraph)

Building deterministic agent workflows with explicit control flow and tool routing.
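
To make "explicit control flow" concrete, here is a minimal sketch of a graph runner in plain Python. It does not use LangGraph itself; `run_graph`, the node functions, and the "latest" keyword check are hypothetical stand-ins chosen for illustration.

```python
from typing import Callable, Dict

State = Dict[str, object]
Node = Callable[[State], State]
Edge = Callable[[State], str]


def run_graph(nodes: Dict[str, Node], edges: Dict[str, Edge],
              start: str, state: State, max_steps: int = 20) -> State:
    """Run nodes in the order chosen by edge functions until one returns "END"."""
    current = start
    for _ in range(max_steps):
        state = nodes[current](state)
        current = edges[current](state)
        if current == "END":
            return state
    raise RuntimeError("graph did not terminate within max_steps")


# Hypothetical nodes: classify the query, optionally search, then answer.
def classify(s: State) -> State:
    return {**s, "needs_search": "latest" in str(s["query"])}


def search(s: State) -> State:
    return {**s, "context": "stub search results"}


def answer(s: State) -> State:
    source = s.get("context", "model knowledge")
    return {**s, "answer": f"answer using {source}"}


nodes = {"classify": classify, "search": search, "answer": answer}
edges = {
    "classify": lambda s: "search" if s["needs_search"] else "answer",
    "search": lambda s: "answer",
    "answer": lambda s: "END",
}
```

Every transition is an ordinary function of the state, so the same input always follows the same path, and the `max_steps` cap prevents an accidental loop from running forever.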

Retrieval-Augmented Generation (RAG)

Designing retrieval pipelines using vector search and grounded document retrieval.
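
As a rough illustration of the vector-search step, the sketch below ranks documents by cosine similarity in pure Python. The `cosine` and `top_k` helpers and the toy two-dimensional vectors are assumptions for this example; a real pipeline would use an embedding model and a vector index.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec: Sequence[float],
          docs: List[Tuple[str, Sequence[float]]],
          k: int = 2) -> List[str]:
    """Return the texts of the k documents most similar to the query vector."""
    ranked = sorted(docs, key=lambda d: cosine(query_vec, d[1]), reverse=True)
    return [text for text, _ in ranked[:k]]
```

The retrieved texts would then be placed in the prompt so the model's answer is grounded in them.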

Streaming AI Interfaces

Implementing real-time LLM responses using streaming inference and incremental output.
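
Incremental output can be sketched with async generators and Server-Sent Events framing. Everything here is illustrative: `fake_llm_tokens` stands in for a real streaming model client, and the `[DONE]` end marker is a convention assumed for this example rather than something the projects are known to use.

```python
import asyncio
from typing import AsyncIterator, List


async def fake_llm_tokens(text: str) -> AsyncIterator[str]:
    """Stand-in for a streaming model client; yields one word at a time."""
    for word in text.split():
        await asyncio.sleep(0)  # yield control, as a real network read would
        yield word


async def sse_events(tokens: AsyncIterator[str]) -> AsyncIterator[str]:
    """Wrap each token in SSE framing and finish with an end marker."""
    async for token in tokens:
        yield f"data: {token}\n\n"
    yield "data: [DONE]\n\n"


async def collect(text: str) -> List[str]:
    """Gather all events into a list (useful for tests; a server would stream)."""
    return [event async for event in sse_events(fake_llm_tokens(text))]
```

In a web backend, a generator like `sse_events` would be handed to the framework's streaming response type so each event reaches the browser as soon as it is produced.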

Stateful AI Systems

Designing AI workflows that maintain session memory, checkpoints, and conversation state.
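
A minimal sketch of session memory with checkpoints, assuming an in-memory store. The `SessionMemory` class and its method names are hypothetical; a deployed system would persist this state in a database.

```python
import copy
from typing import Dict, List

Message = Dict[str, str]


class SessionMemory:
    """In-memory conversation state with named checkpoints (illustrative only)."""

    def __init__(self) -> None:
        self.messages: List[Message] = []
        self._checkpoints: Dict[str, List[Message]] = {}

    def add(self, role: str, content: str) -> None:
        """Append one message to the conversation."""
        self.messages.append({"role": role, "content": content})

    def checkpoint(self, name: str) -> None:
        """Save a snapshot that later edits to the conversation cannot alter."""
        self._checkpoints[name] = copy.deepcopy(self.messages)

    def restore(self, name: str) -> None:
        """Roll the conversation back to a saved snapshot."""
        self.messages = copy.deepcopy(self._checkpoints[name])
```

Deep copies keep each snapshot independent, so restoring a checkpoint and continuing from it does not change the checkpoint itself.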

Tool-Using AI Systems

Building AI agents that interact with APIs, databases, and external tools.
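
Tool use usually comes down to dispatching a model-issued request to a registered function. The sketch below assumes a request shaped like {"name": ..., "arguments": {...}}; the `tool` decorator, the `TOOLS` registry, and the two example tools are hypothetical and not tied to any specific framework.

```python
from typing import Any, Callable, Dict

TOOLS: Dict[str, Callable[..., Any]] = {}


def tool(fn: Callable[..., Any]) -> Callable[..., Any]:
    """Register a function under its own name so the agent can call it."""
    TOOLS[fn.__name__] = fn
    return fn


@tool
def add(a: float, b: float) -> float:
    return a + b


@tool
def word_count(text: str) -> int:
    return len(text.split())


def call_tool(request: Dict[str, Any]) -> Dict[str, Any]:
    """Dispatch a request and return a result or an error the model can read."""
    name = request.get("name", "")
    fn = TOOLS.get(name)
    if fn is None:
        return {"ok": False, "error": f"unknown tool: {name}"}
    try:
        return {"ok": True, "result": fn(**request.get("arguments", {}))}
    except TypeError as exc:  # wrong or missing arguments
        return {"ok": False, "error": str(exc)}
```

Returning errors as data instead of raising lets the agent loop feed the message back to the model for a corrected call.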

Structured AI Outputs

Designing pipelines that produce validated JSON or schema-based outputs for downstream automation.
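
One way to validate model output before downstream automation is to parse it as JSON and check it against an expected shape. This sketch uses only the standard library; the `Scene` fields (`narration`, `duration_s`) are invented for illustration and are not the projects' actual schema.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class Scene:
    narration: str
    duration_s: float


def parse_scenes(raw: str) -> List[Scene]:
    """Parse model output as JSON and validate it against the Scene shape."""
    data = json.loads(raw)  # raises json.JSONDecodeError on malformed output
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of scenes")
    scenes: List[Scene] = []
    for i, item in enumerate(data):
        if not isinstance(item, dict):
            raise ValueError(f"scene {i}: expected an object")
        narration = item.get("narration")
        duration = item.get("duration_s")
        if not isinstance(narration, str) or not narration.strip():
            raise ValueError(f"scene {i}: narration must be a non-empty string")
        # bool is a subclass of int in Python, so reject it explicitly.
        if (isinstance(duration, bool)
                or not isinstance(duration, (int, float))
                or duration <= 0):
            raise ValueError(f"scene {i}: duration_s must be a positive number")
        scenes.append(Scene(narration, float(duration)))
    return scenes
```

A caller can catch `ValueError` (which also covers `json.JSONDecodeError`) and retry the model with the error message appended to the prompt.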

Prompt Pipelines & LLM Control

Designing multi-step prompt pipelines for reliable reasoning and task execution.


Dr. Partha Majumder

Independent GenAI Systems Engineer

I design and build production-ready GenAI systems end-to-end — from LLM orchestration architecture to deployed full-stack applications.

My work spans agentic workflows, retrieval systems, streaming pipelines, and scalable AI infrastructure, with strong defaults around state management, evaluation, observability, and cost/latency control.

With 15+ years in applied AI/ML systems — across optimization, simulation, deep learning, and modern LLM architectures — I focus on building AI systems that are robust, scalable, and production-ready.

Credibility

15+ years in applied AI/ML systems — from optimization and deep learning to modern GenAI architectures
Built multiple production-grade GenAI systems across research assistants, media generation, and AI tooling
Expertise in agentic workflows, async orchestration, and streaming AI systems

Background

Senior systems engineering experience
End-to-end system implementations
Deployable architectures with full source code

Tech Stack

Python · FastAPI · Async APIs

LangChain · LangGraph

Full-Stack GenAI Systems Engineering

Typical Development Workflow

Problem Definition & System Scope

System Architecture & Orchestration

Implementation with Guardrails

Deployment & Observability

AI Infrastructure & Tooling

Deployment & Infrastructure

Reliability & Operations

Containerized Deployment

FastAPI Service Architecture

Event-Driven Execution

Rate Limiting & API Protection

Environment & Configuration Management

Streaming AI Systems

Async AI Pipelines

CI/CD Pipelines

Observability & Evaluation

Guardrails & AI Safety
