Back to Projects

Nexus AI

Production-grade multi-agent AI research system built with FastAPI, LangGraph, Gemini, PostgreSQL, and Docker.

Overview

Nexus AI is a backend-focused multi-agent AI platform that simulates an autonomous research workflow. Instead of relying on a single prompt-response interaction, the system coordinates multiple AI agents through a LangGraph execution pipeline where each agent performs a specialized task such as:

  • Query analysis
  • Research planning
  • Web information gathering
  • Context aggregation
  • Report generation
  • Final summarization

The platform supports real-time streaming responses, tool calling, persistent report storage, and production-ready deployment using Docker.

Core Capabilities

Multi-Agent Orchestration

Built using LangGraph to coordinate multiple AI agents through a stateful execution graph. Agents operate independently while sharing structured state across the workflow.

Current Agent Pipeline:

  • Research Planner Agent
  • Web Search Agent
  • Context Collection Agent
  • Summarizer Agent

Tool Calling & Web Search

Integrated tool-calling architecture allows agents to dynamically invoke external tools during execution. Current tools include real-time web search and external context retrieval.

Streaming AI Responses

Implements asynchronous token streaming for real-time response generation. Supports ChatGPT-style incremental output delivery using FastAPI streaming responses.

Persistent Research Storage & Cloud Native

Research reports and session outputs are stored in PostgreSQL using SQLAlchemy ORM. The system is designed around cloud-first infrastructure principles, integrating the Gemini API for LLM inference and Neon PostgreSQL for managed database hosting.

System Architecture

                β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                β”‚     Frontend UI     β”‚
                β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                          β”‚
                          β–Ό
                β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                β”‚    FastAPI Server   β”‚
                β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                          β”‚
                          β–Ό
                β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                β”‚   LangGraph Engine  β”‚
                β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                          β”‚
      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
      β–Ό                   β–Ό                   β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ Planner Agentβ”‚  β”‚ Search Agent β”‚  β”‚ Summary Agent  β”‚
β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜
       β”‚                 β”‚                   β”‚
       β–Ό                 β–Ό                   β–Ό
 β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
 β”‚ Gemini APIβ”‚    β”‚ Web Search β”‚      β”‚ PostgreSQL  β”‚
 β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Tech Stack

Backend

  • Python
  • FastAPI
  • LangGraph
  • LangChain
  • SQLAlchemy

AI Services

  • Gemini 3.1 Flash Lite
  • Tool Calling
  • Streaming Response
  • Agentic Workflows

Infrastructure

  • PostgreSQL
  • Neon Database
  • Docker
  • Render

Roadmap & Future Plans

  • Retrieval-Augmented Generation (RAG) and PDF ingestion pipeline
  • Authentication & RBAC
  • Redis caching layer and background task workers
  • Vector database integration
  • Agent memory systems
  • Observability, tracing, and multi-user workspace support