Inafana – Building Digital Solutions That Work for You
An advanced Retrieval-Augmented Generation (RAG) platform with hybrid search, intelligent re-ranking, and multi-tenant support for building custom knowledge bases from documents and web content.
Juno RAG is a production-ready Retrieval-Augmented Generation platform that enables users to build intelligent knowledge bases from their own documents and web content. The system combines cutting-edge AI technologies with a modern, user-friendly interface to deliver accurate, context-aware responses based on custom knowledge sources.
The platform features a modern React frontend with a beautiful dark-themed UI, drag-and-drop file uploads, and a real-time chat interface. The FastAPI backend is optimized for production use with advanced prompt engineering, query expansion, and multi-provider LLM support (OpenAI, Anthropic, Groq).
The system uses ChromaDB as a persistent vector database, storing document embeddings locally for fast semantic search. Documents are intelligently chunked with overlap to maintain context, and each chunk is converted into high-dimensional vectors using SentenceTransformers models. When users ask questions, the system performs hybrid search to find relevant chunks, re-ranks them for accuracy, and sends the top results to the LLM for context-grounded answer generation.
Advanced features include query analysis and expansion for complex questions, source pre-filtering to eliminate irrelevant content, and support for multiple knowledge source modes (knowledgebase-only or generic+knowledgebase). The platform is designed with security in mind, using backend-controlled session management and user isolation at the database level.
ActionFlow leverages a modern tech stack with React and Tailwind CSS for the frontend, Django REST Framework for the API layer, and Playwright for headless browser automation. The system uses JWT authentication for secure access and supports both SQLite for development and PostgreSQL for production deployments. The Chrome extension utilizes Manifest V3 for modern browser compatibility and Chrome Storage API for state management.
Users can create complex multi-step workflows by combining multiple recordings, configure scheduling with timezone awareness, and monitor execution through detailed activity logs. The platform's variable system allows for parameterized workflows that can be reused with different data sets, making it ideal for testing, data entry automation, and repetitive task management.
The client needed a sophisticated RAG (Retrieval-Augmented Generation) system that could transform their document collections and web content into an intelligent, queryable knowledge base. The solution required the ability to answer questions accurately based solely on the provided documents, with full transparency about information sources.
The system needed to handle large documents efficiently, process queries quickly, and maintain good response times even with extensive knowledge bases. The architecture required lazy loading of models, optimized embedding generation, and efficient vector search operations.
Implement secure multi-tenancy where each user's data is completely isolated. Use backend-controlled session management to prevent client-side manipulation of user identities. Ensure all file uploads are validated for type and size, with proper error handling for malicious or corrupted files.