Paravision Lab logoParavision Lab
GenAI Systems Lab

Full-Stack GenAI Systems Engineering

Paravision Lab is my independent engineering portfolio where I design and build GenAI systems focusing on LLM infrastructure, agent workflows, and reliable AI backends.

The portfolio highlights full-stack GenAI systems that not only implement these technologies but also demonstrate best practices in reliable architectures, asynchronous workflows, and modern AI infrastructure.

Architecture
Reference architecture
Idea → UI → API → orchestration
Idea
Next.js UI
FastAPI API
LangGraph Agent
RAG + Tools
Async Jobs (Inngest)
AI Output
Observability & Infrastructure
LangSmith
Supabase
Redis

Selected GenAI Systems

A set of end-to-end builds demonstrating agent orchestration, retrieval pipelines, streaming UX, and production-grade AI backends.

Perplexity-Style AI Research Tool

AI-powered search with intelligent query routing, Tavily web search, and Gemini LLM summarization. Features streaming responses and persistent sessions.

Key implementation

  • LangGraph-based routing for search vs LLM
  • Tavily API for real-time web search
  • Gemini LLM with streaming responses
  • Supabase for persistent chat history
FastAPITavily APIGeminiLangGraphSupabase

AI Video Ads Generator

Personalized AI video ads: LangChain scripts, HeyGen avatars, async processing via Inngest. Tracks jobs and manages rendering workflows.

Key implementation

  • Event-driven video generation with Inngest
  • HeyGen API for AI avatar videos
  • LangChain for script generation
  • Job persistence for long-running video tasks
Next.jsFastAPILangChainInngestHeyGen

AI Short Video Generator

Converts topics to narrated short videos: scripts, TTS audio, AI images, captions. Async rendering with Remotion.

Key implementation

  • Scene-based scripts with Gemini
  • TTS + AI image pipeline
  • Async video rendering with Remotion
  • Supabase queues for workflow orchestration
Next.jsFastAPIRemotionTTSSupabase

Typical Development Workflow

A structured approach used across product builds, emphasizing deterministic orchestration, system evaluation, and observable AI behavior.

  1. 1

    Problem Definition & System Scope

    Define the AI workflow, system constraints, user interaction patterns, and evaluation criteria before implementation.

  2. 2

    System Architecture & Orchestration

    Design orchestration flows, tool interactions, retrieval pipelines, and state management to ensure deterministic and debuggable AI workflows.

  3. 3

    Implementation with Guardrails

    Implement AI pipelines with structured outputs, evaluation hooks, retries, and cost/latency controls.

  4. 4

    Deployment & Observability

    Deploy systems with monitoring, logging, streaming support, and observability hooks for debugging and evaluation.

Engineering Focus

Designing GenAI systems that address real-world challenges in LLM orchestration, retrieval pipelines, state management, and scalable AI workflows.

LLM System Architecture

Designing modular GenAI systems with clear separation between UI, API, and AI orchestration layers.

Agentic Workflows (LangGraph)

Building deterministic agent workflows with explicit control flow and tool routing.

Retrieval-Augmented Generation (RAG)

Designing retrieval pipelines using vector search and grounded document retrieval.

Streaming AI Interfaces

Implementing real-time LLM responses using streaming inference and incremental output.

Stateful AI Systems

Designing AI workflows that maintain session memory, checkpoints, and conversation state.

Tool-Using AI Systems

Building AI agents that interact with APIs, databases, and external tools.

Structured AI Outputs

Designing pipelines that produce validated JSON or schema-based outputs for downstream automation.

Prompt Pipelines & LLM Control

Designing multi-step prompt pipelines for reliable reasoning and task execution.

AI Infrastructure & Tooling

Infrastructure and patterns demonstrated across portfolio builds

Docker · Redis · Inngest · GitHub Actions · LangSmith · Arcjet

Deployment & Infrastructure

Reliability & Operations

Containerized Deployment

Docker and Docker Compose for consistent local and deployed environments

FastAPI Service Architecture

FastAPI-based architecture supporting streaming inference and modular backend components

Event-Driven Execution

Inngest for background jobs, retries, and durable execution

Rate Limiting & API Protection

Redis and Arcjet for rate limiting and request throttling

Environment & Configuration Management

Environment secrets and runtime settings for reproducible deployments

Streaming AI Systems

Token-based LLM responses with incremental output delivery

Async AI Pipelines

Background workers and queue-based systems for long-running AI tasks

CI/CD Pipelines

GitHub Actions pipelines for automated testing, builds, and deployments

Observability & Evaluation

Comprehensive tracing, structured logging, and evaluation via LangSmith

Guardrails & AI Safety

Schema enforcement, input validation, and output verification systems

Dr. Partha Majumder

Independent GenAI Systems Engineer

I design and build production-ready GenAI systems end-to-end — from LLM orchestration architecture to deployed full-stack applications.

My work spans agentic workflows, retrieval systems, streaming pipelines, and scalable AI infrastructure, with strong defaults around state management, evaluation, observability, and cost/latency control.

With 15+ years in applied AI/ML systems — across optimization, simulation, deep learning, and modern LLM architectures — I focus on building AI systems that are robust, scalable, and production-ready.

Credibility
  • 15+ years in applied AI/ML systems — from optimization and deep learning to modern GenAI architectures
  • Built multiple production-grade GenAI systems across research assistants, media generation, and AI tooling
  • Expertise in agentic workflows, async orchestration, and streaming AI systems
Background
  • Senior systems engineering experience
  • End-to-end system implementations
  • Deployable architectures with full source code
Tech Stack
Python · FastAPI · Async APIs
LangChain · LangGraph