Build an AI-Powered Multimodal MCP Chatbot

This project demonstrates how to build a modern chatbot capable of interacting with documents, images, and videos. By leveraging Jac's unique programming paradigms, you'll create a multimodal AI assistant with advanced capabilities.

Features

Multimodal Interaction: Chat with PDFs, text files, images, and videos.
Context-Aware Responses: Search documents and provide relevant answers.
Web Search Integration: Answer general questions using real-time web search.
AI Vision: Understand and discuss images and videos.
Intelligent Query Routing: Automatically route questions to specialized AI handlers.

Learning Objectives

Object Spatial Programming (OSP): Organize your application using Jac's node-walker architecture.
Mean Typed Programming (MTP): Enable AI to classify and route user queries automatically.
Model Context Protocol (MCP): Build modular, reusable AI tools.
Multimodal AI Development: Work with text, images, and videos in one application.

Technologies Used

Jac Language: Core application logic.
Jac Cloud: Backend server infrastructure.
Streamlit: User-friendly web interface.
ChromaDB: Document search and storage.
OpenAI GPT: AI chat and vision capabilities.
Serper API: Real-time web search.

Project Structure

client.jac: Web interface for chat and file uploads.
server.jac: Main application logic using Object Spatial Programming.
server.impl.jac: Implementation details for server.jac.
mcp_server.jac: Tool server for document and web search.
mcp_client.jac: Interface for tool communication.
tools.jac: Document processing and search logic.

Setup Instructions

Prerequisites

Python 3.12 or newer.
API keys for OpenAI and Serper.

Installation

Install required packages:

pip install jaclang jac-cloud jac-streamlit byllm langchain langchain-community langchain-openai langchain-chroma chromadb openai pypdf tiktoken requests mcp[cli] anyio

Set environment variables:

export OPENAI_API_KEY=<your-openai-key>
export SERPER_API_KEY=<your-serper-key>

Running the Application

Start the tool server:
```
jac run mcp_server.jac
```
Start the main application:
```
jac start server.jac
```
Launch the web interface:
```
jac streamlit client.jac
```

Access the web interface at http://localhost:8501.

Usage

Register and log in using the web interface.
Upload files: PDFs, text files, images, or videos.
Start chatting: Ask questions about your uploaded content or general topics.

Troubleshooting

Ensure all dependencies are installed and compatible.
Verify API keys are set correctly.
Check server logs for errors.
Ensure services are running on their respective ports.

Extending the Chatbot

Add support for new file types (e.g., audio, spreadsheets).
Integrate additional tools (e.g., weather APIs, database connections).
Enhance AI models for specialized tasks.
Implement hybrid search combining keyword and semantic search.
Create custom chat nodes for domain-specific queries.

API Endpoints

POST /user/register: Create a new user account.
POST /user/login: Login and get an access token.
POST /walker/upload_file: Upload files (requires authentication).
POST /walker/interact: Chat with the AI (requires authentication).

Visit http://localhost:8000/docs for full API documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build an AI-Powered Multimodal MCP Chatbot

Features

Learning Objectives

Technologies Used

Project Structure

Setup Instructions

Prerequisites

Installation

Running the Application

Usage

Troubleshooting

Extending the Chatbot

API Endpoints

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Build an AI-Powered Multimodal MCP Chatbot

Features

Learning Objectives

Technologies Used

Project Structure

Setup Instructions

Prerequisites

Installation

Running the Application

Usage

Troubleshooting

Extending the Chatbot

API Endpoints