Senior Full Stack AI Engineer

About the Role

If you're the kind of engineer who gets excited about building AI systems that actually work in production — not just demos — this might be the role for you.

We're growing our team and looking for a Senior Full Stack AI Engineer who can move across the full stack with confidence: crafting clean, responsive frontends, architecting backend systems that scale, and wiring together LLMs, vector databases, and inference pipelines that power real products used by real people.

This isn't a role where you'll be handed a neat spec and told to execute. You'll help shape the architecture, make meaningful technical decisions, and work with people who care deeply about doing things right.

Role Details

  • Experience: 5+ years in professional software engineering
  • Employment: Full-Time
  • Location: Remote-first
  • Focus: AI/LLM systems, Full Stack development, Cloud infrastructure

What You'll Be Working On

Day to day, you'll be involved in a mix of the following:

  • Designing and building scalable AI-powered web applications, end to end
  • Creating clean, fast frontends with React.js and Next.js
  • Writing fast, reliable backend services and APIs with FastAPI
  • Building RAG pipelines, AI agents, and LLM orchestration systems that work at scale
  • Architecting semantic search and vector retrieval systems that return meaningful results
  • Setting up distributed, event-driven systems with proper queue and caching layers
  • Deploying and managing cloud infrastructure, including GPU workloads for AI inference
  • Profiling and optimizing systems for speed, cost, and resilience
  • Contributing to architectural decisions and helping the team level up technically

What We're Looking For

We care more about what you can do than how perfectly your resume matches a checklist. That said, here are the areas where you'll need to be genuinely strong:

Frontend

  • React.js
  • Next.js
  • TypeScript
  • Tailwind CSS
  • Redux / Zustand
  • Server-Side Rendering (SSR)
  • Static Site Generation (SSG)
  • WebSocket Integration
  • Performance Optimization
  • CDN & Asset Delivery

Backend & Systems

  • Python & FastAPI
  • RESTful APIs
  • Async Programming
  • Microservices
  • PostgreSQL / MongoDB
  • Redis
  • RabbitMQ / Kafka
  • Celery / BullMQ
  • Event-Driven Architecture
  • Distributed Systems
  • API Gateway Design
  • Auth & RBAC

AI & LLM Engineering

  • LangChain / LangGraph
  • LlamaIndex
  • RAG Pipelines
  • Prompt Engineering
  • AI Agent Frameworks
  • Embeddings & Semantic Search
  • Streaming LLM Integrations
  • Context & Memory Management
  • OpenAI / Claude / Llama
  • Mistral / Gemini / Open-Source LLMs

Model Fine-Tuning & Inference

  • Hugging Face Transformers
  • LoRA / QLoRA / PEFT
  • Model Quantization
  • GPU Inference Optimization
  • vLLM / Ollama / TGI
  • Batch Inference Pipelines
  • CUDA Fundamentals

Vector Databases

  • Pinecone
  • Weaviate
  • Qdrant
  • ChromaDB
  • FAISS
  • Milvus

Cloud, Infrastructure & DevOps

  • Docker & Kubernetes
  • AWS / GCP / Azure
  • CI/CD Pipelines
  • Nginx & Linux Admin
  • GPU Infrastructure
  • Cloudflare / CloudFront
  • S3 / GCS Object Storage
  • Prometheus / Grafana
  • ELK Stack
  • Terraform / Ansible

Nice to Have (But Not Required)

These aren't dealbreakers, but they'll definitely get our attention:

  • Experience with OCR, Computer Vision, or multimodal AI
  • Hands-on work with YOLO or real-time inference systems
  • Background in high-concurrency or real-time platforms
  • Familiarity with web scraping, document extraction, or data ingestion pipelines
  • Experience running AI infrastructure at enterprise scale