Senior Full Stack AI Engineer

Job Openings Senior Full Stack AI Engineer

About the job Senior Full Stack AI Engineer

About the Role

If you're the kind of engineer who gets excited about building AI systems that actually work in production — not just demos — this might be the role for you.

We're growing our team and looking for a Senior Full Stack AI Engineer who can move across the full stack with confidence: from crafting clean, responsive frontends to architecting backend systems that scale, to wiring together LLMs, vector databases, and inference pipelines that power real products used by real people.

This isn't a role where you'll be handed a neat spec and told to execute. You'll help shape the architecture, make meaningful technical decisions, and work with people who care deeply about doing things right.

Role Details

Experience: 5+ years in professional software engineering

Employment: Full-Time

Location: Remote-first

Focus: AI/LLM systems, Full Stack development, Cloud infrastructure

What You'll Be Working On

Day to day, you'll be involved in a mix of the following:

Designing and building scalable AI-powered web applications, end to end

Creating clean, fast frontends with React.js and Next.js

Writing solid backend services and APIs with FastAPI — built for performance and reliability

Building RAG pipelines, AI agents, and LLM orchestration systems that work at scale

Architecting semantic search and vector retrieval systems that return meaningful results

Setting up distributed, event-driven systems with proper queue and caching layers

Deploying and managing cloud infrastructure, including GPU workloads for AI inference

Profiling and optimizing systems for speed, cost, and resilience

Contributing to architectural decisions and helping the team level up technically

What We're Looking For

We care more about what you can do than how perfectly your resume matches a checklist. That said, here are the areas where you'll need to be genuinely strong:

Frontend

React.js

Next.js

TypeScript

Tailwind CSS

Redux / Zustand

Server-Side Rendering (SSR)

Static Site Generation (SSG)

WebSocket Integration

Performance Optimization

CDN & Asset Delivery

Backend & Systems

Python & FastAPI

RESTful APIs

Async Programming

Microservices

PostgreSQL / MongoDB

Redis

RabbitMQ / Kafka

Celery / BullMQ

Event-Driven Architecture

Distributed Systems

API Gateway Design

Auth & RBAC

AI & LLM Engineering

LangChain / LangGraph

LlamaIndex

RAG Pipelines

Prompt Engineering

AI Agent Frameworks

Embeddings & Semantic Search

Streaming LLM Integrations

Context & Memory Management

OpenAI / Claude / Llama

Mistral / Gemini / Open-Source LLMs

Model Fine-Tuning & Inference

Hugging Face Transformers

LoRA / QLoRA / PEFT

Model Quantization

GPU Inference Optimization

vLLM / Ollama / TGI

Batch Inference Pipelines

CUDA Fundamentals

Vector Databases

Pinecone

Weaviate

Qdrant

ChromaDB

FAISS

Milvus

Cloud, Infrastructure & DevOps

Docker & Kubernetes

AWS / GCP / Azure

CI/CD Pipelines

Nginx & Linux Admin

GPU Infrastructure

Cloudflare / CloudFront

S3 / GCS Object Storage

Prometheus / Grafana

ELK Stack

Terraform / Ansible

Nice to Have (But Not Required)

These aren't dealbreakers, but they'll definitely get our attention:

Experience with OCR, Computer Vision, or multimodal AI

Hands-on work with YOLO or real-time inference systems

Background in high-concurrency or real-time platforms

Familiarity with web scraping, document extraction, or data ingestion pipelines

Experience running AI infrastructure at enterprise scale

Or refer someone