The Documentation is a Work in Progress

🚧🚧🚧🚧🚧 We are still writing the documentation prior to an official release 🚧🚧🚧🚧

Archivist is an easy-to-use desktop application that:

1. Brings local, offline, AI chat capabilities to your personal computer; and
2. Lets the AI safely reference your own personal, private, or proprietary files and documents

It is designed for people who want to leverage the power of AI without relying on cloud providers.

What makes Archivist special is its ability to draw on your own files and documents. Simply upload any files you want, such as PDFs, Word documents, or text files, and Archivist makes their content available during your AI conversations. You can ask questions about specific documents or across your entire collection and get answers grounded in your own information. No technical knowledge is required: upload your files, ask questions, and get responses that draw directly from your own curated library of documents.

About the Name

An archivist is an information professional who assesses, collects, organizes, preserves, maintains control over, and provides access to records and archives determined to have long-term value... Archivists keep records that have enduring value as reliable memories of the past, and they help people find and understand the information they need in those records. Source: Wikipedia

This application serves as a capability demonstration rather than a finished product. We are system integrators specializing in AI solutions for professional services firms; developing mass-market desktop applications isn't our core business.

We've made deliberate packaging decisions that prioritize demonstrability over startup speed. The application wraps what would typically be an always-on network service into a distributable format you can try on your own hardware. There is an inherent speed tradeoff with this approach.

Query Tab
Pre-Process Tab
Upload Tab
Browse Tab
Inspect Tab
AI Settings Tab
Help and Licensing Tab

Value Proposition

Local/private/secure [[Retrieval Augmented Generation|RAG]] application for individual users
Fully functional at no cost
No subscription fees
No service outages
No chat limits
Easy setup
Low-cost, one-time paid [[Licensing]] option that lets users upload custom LLMs
Supports [[Open Source]]
Powered by commercial-grade AI technology from [[IBM]]
Makes your data [[Data Portability|portable]] for other AI applications
Runs on a laptop
Swap models as technology advances
Cost advantage over time
- No ongoing subscription fees to ChatGPT Plus ($20/month), Claude Pro ($20/month), etc.
- One-time payment vs. potentially hundreds of dollars per year
- Clear ROI after just a few months compared to commercial AI subscriptions
Reliability and independence
- Works offline, including in areas with poor internet
- Not subject to API rate limits or service outages
- No need to worry about price increases
Resource efficiency
- Optimized for your specific use case
- Better performance on targeted tasks than general-purpose AI
- Faster responses for document-specific questions

Feature Overview

Your personal, purpose-built [[Retrieval Augmented Generation]] application
Uses metadata filtering to let you control precisely which information the AI considers: a specific individual document, a curated collection of related files ([[File Sets]]), or your entire knowledge base. This keeps your conversations focused on the information that matters to your current task.
Simple AI chat with a local LLM
Voice input so you can dictate your messages
Pre-processing text
Highly flexible text chunking strategies
Simple import/export to improve [[Data Portability]] and reduce vendor lock-in
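
To make the metadata filtering item above more concrete, here is a minimal Python sketch of how a query's scope could be narrowed to one document, one file set, or the whole library. It is an illustration only, not Archivist's actual implementation: the `Chunk` class, its fields, and the `filter_chunks` function are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """A piece of document text plus the metadata used for filtering (hypothetical)."""
    text: str
    document: str                                # name of the source file
    file_sets: set = field(default_factory=set)  # collections the file belongs to


def filter_chunks(chunks, document=None, file_set=None):
    """Return the chunks that are in scope for a query.

    - document given: only chunks from that one file
    - file_set given: only chunks whose file belongs to that set
    - neither given: the entire knowledge base
    """
    selected = []
    for chunk in chunks:
        if document is not None and chunk.document != document:
            continue
        if file_set is not None and file_set not in chunk.file_sets:
            continue
        selected.append(chunk)
    return selected


# Made-up sample library, for illustration only.
library = [
    Chunk("Payment is due within 30 days.", "contract.pdf", {"legal"}),
    Chunk("Either party may terminate with notice.", "contract.pdf", {"legal"}),
    Chunk("Q3 revenue grew year over year.", "report.docx", {"finance"}),
    Chunk("Privacy policy summary.", "policy.txt", {"legal"}),
]

single_document = filter_chunks(library, document="contract.pdf")
legal_set = filter_chunks(library, file_set="legal")
everything = filter_chunks(library)
```

In a sketch like this the filter runs before any similarity search, so only the chunks in the selected scope can reach the model as context.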

Explore the power of private data intelligence with our showcase application! This demonstration highlights our expertise in:

Seamless ingestion of your proprietary documents
Intelligent chunking and vector embedding
Optimized retrieval with contextual awareness
Natural language generation grounded in your data
Secure, local processing for sensitive information
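
The stages listed above (ingestion, chunking, embedding, retrieval, grounded generation) can be sketched end to end in a few lines of Python. This is a toy illustration under stated assumptions, not how Archivist is built: the embedding here is a simple bag-of-words count standing in for a real embedding model, the final call to a local LLM is left out, and every function name (`chunk_text`, `embed`, `cosine`, `retrieve`, `build_prompt`) is hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40, overlap=10):
    """Split text into overlapping word windows (one simple chunking strategy)."""
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be non-negative and smaller than max_words")
    words = text.split()
    chunks, start = [], 0
    while start < len(words):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break
        start += max_words - overlap
    return chunks


def embed(text):
    """Toy embedding: term counts. A real system would use an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, index, top_k=2):
    """Return the text of the top_k chunks most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:top_k]]


def build_prompt(question, context_chunks):
    """Assemble a prompt that grounds the answer in the retrieved chunks."""
    context = "\n".join(f"- {chunk}" for chunk in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Ingestion: made-up sample documents, for illustration only.
documents = {
    "lease.txt": "The lease term is twelve months. Rent is due on the first day of each month.",
    "manual.txt": "Press and hold the power button for five seconds to reset the device.",
}

# Chunking and embedding: build an in-memory index of (chunk text, vector) pairs.
index = []
for name, text in documents.items():
    for chunk in chunk_text(text, max_words=8, overlap=2):
        index.append((chunk, embed(chunk)))

# Retrieval and prompt construction; the prompt would then go to a local LLM.
question = "When is rent due?"
top_chunks = retrieve(question, index, top_k=1)
prompt = build_prompt(question, top_chunks)
```

Everything in this sketch runs in memory on the local machine, which mirrors the local-processing point in the list, although a real deployment would persist the index and use a proper vector store.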