Ollama AI Answers Plugin for SearXNG

Single file install
Does not block result loading time
Based on ai-answers-searxng by cra88y

A SearXNG plugin that generates local AI overviews powered by Ollama, using search results as RAG context.

Features:

Installation

Place ai_answers.py into the searx/plugins directory of your instance (or mount it in a container) and enable it in settings.yml:

plugins:
  searx.plugins.ai_answers.SXNGPlugin:  
    active: true

Configure via environment variables:

LLM_URL: Ollama chat completions endpoint. Default: http://ollama:11434/v1/chat/completions
LLM_MODEL: Model name as listed in Ollama. Default: llama3.2

LLM_SYSTEM_PROMPT: Overrides the system prompt. Default: You are a direct, citation-accurate search synthesis engine.
LLM_MAX_TOKENS: Default 200.
LLM_TEMPERATURE: Default 0.2.
LLM_CONTEXT_DEEP_COUNT: Results used as context with full snippets. Default 5.
LLM_CONTEXT_SHALLOW_COUNT: Results with headlines only (additional breadth). Default 15.
LLM_TABS: Tab whitelist, comma delimited. Default general,science,it,news.
LLM_INTERACTIVE: UI mode. Default true (interactive: copy, regenerate, follow up). Set to false for simple response only.
LLM_QUESTION_MARK_REQUIRED: Only trigger AI answers when the query contains ?. Default false.

environment:
  - LLM_URL=http://ollama:11434/v1/chat/completions
  - LLM_MODEL=llama3.2

LLM_URL=http://ollama:11434/v1/chat/completions
LLM_MODEL=llama3.2

pip install flask flask-babel
python tests/demo.py   # UI demo at localhost:5000