Go to the 📥 Ingest tab and drag your files onto the drop zone.
📄 PDF — research papers, reports, manuals
📝 DOCX — Word documents, contracts
📖 EPUB — eBooks, digital publications
📃 TXT — plain text, logs, notes
know3 automatically extracts text, removes junk, and splits it into optimized chunks for AI processing. You can review and filter chunks before generating.

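know3's chunker is internal to the app, but the idea behind it can be sketched as fixed-size splitting with overlap so context isn't lost at chunk boundaries. The chunk size and overlap values below are illustrative assumptions, not know3's actual settings:

```python
def chunk_text(text, chunk_size=800, overlap=100):
    """Split text into overlapping chunks; overlap preserves context across boundaries."""
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        piece = text[start:start + chunk_size]
        if piece.strip():
            chunks.append(piece)
    return chunks

doc = "word " * 500          # stand-in for text extracted from a document
chunks = chunk_text(doc)     # each chunk shares its first 100 chars with the previous one's tail
```

Reviewing and filtering the resulting chunks before generation matters because every instruction/output pair is produced from one of these pieces.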
Step 5: Generate & Query
know3 offers two powerful modes:
⚡ Training Data Generation
Select a domain (Generic, Coding, Scientific, Legal, Business, Literature, Research). Click Generate to create instruction/output pairs. Each pair teaches a different aspect of your content.
💬 RAG Conversation
In the Tools tab, chat with your documents. Ask questions and get AI-generated answers with source citations. Supports multi-turn follow-up conversations.
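Behind a RAG chat, the retrieval step ranks chunks by embedding similarity and hands the best matches to the LLM along with your question. A minimal sketch of that ranking step follows; the 3-dimensional vectors here are dummy values for illustration (in know3 the embeddings come from a local model such as nomic-embed-text):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def top_k(query_vec, chunk_vecs, k=2):
    """Indices of the k chunks most similar to the query."""
    order = sorted(range(len(chunk_vecs)),
                   key=lambda i: cosine(query_vec, chunk_vecs[i]),
                   reverse=True)
    return order[:k]

chunks = ["intro", "methods", "results"]
vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.9, 0.1, 0.0]]   # dummy embeddings
query = [1.0, 0.05, 0.0]
best = top_k(query, vecs)    # indices of the most relevant chunks
```

The retrieved chunk texts are what the app cites back to you as sources in its answers.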
Step 6: Export & Fine-Tune
Download your training data in the format your framework needs:
JSONL — HuggingFace, Ollama, LLaMA (recommended)
JSON — Custom Python pipelines
Alpaca — Stanford Alpaca format
ShareGPT — Chat models, DPO training
CSV — Spreadsheets, data analysis
🎯 Use these to fine-tune any LLM — turning a generic model into a domain expert on YOUR content.
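JSONL, the recommended format, is simply one JSON object per line. The pair contents below are hypothetical examples, but they show what an exported file looks like and how to read it back in Python:

```python
import json

pairs = [
    {"instruction": "Summarize the section on transformer cooling.",
     "output": "The section explains how oil-immersed transformers dissipate heat."},
    {"instruction": "Define load factor.",
     "output": "Load factor is the ratio of average load to peak load over a period."},
]

# write: one JSON object per line
with open("train.jsonl", "w", encoding="utf-8") as f:
    for pair in pairs:
        f.write(json.dumps(pair) + "\n")

# read back: parse each line independently
with open("train.jsonl", encoding="utf-8") as f:
    loaded = [json.loads(line) for line in f]
```

Because each line stands alone, JSONL files can be streamed, concatenated, or split without re-parsing the whole dataset, which is why most fine-tuning toolchains prefer it.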
Help & FAQ
❓ What is know3?
know3 is a local training data generation engine. Upload documents, and it creates instruction/output pairs perfect for fine-tuning language models.
❓ Do I need internet?
No! Everything runs locally. Your documents never leave your machine. No cloud APIs, no subscriptions.
❓ What file formats are supported?
PDF, DOCX, EPUB, and TXT files. Maximum 1GB per document.
❓ How long does generation take?
Depends on document size and your LLM model. A typical 100-page book takes 5-15 minutes.
❓ What can I do with the pairs?
Use them to fine-tune any language model (Llama, Mistral, Phi, GPT, etc.) on HuggingFace, Ollama, or your own infrastructure.
🔧 Cannot connect to Ollama
Make sure Ollama is running: ollama serve
Also check that CORS is enabled, e.g. by starting Ollama with the OLLAMA_ORIGINS environment variable set to allow the app's origin (see Setup Guide in the Ingest tab)
🔧 Models not showing
Pull the models first:
ollama pull llama3.2:3b
ollama pull nomic-embed-text
🔧 Generation is very slow
Try a smaller LLM (3B is faster than 7B). Or reduce "Pairs Per Chunk" to 1. Use the Review tab to limit chunks.
🔧 Low quality pairs
Make sure you selected the right domain for your content. Review and filter chunks first, since chunk quality directly affects output quality. Try 3 pairs per chunk for more depth.
Generic
Books, articles, general knowledge
Coding
Code docs, API references, tutorials
Scientific
Math, physics, chemistry textbooks
Literature
Novels, essays, humanities texts
Legal
Contracts, statutes, legal documents
Business
Case studies, financial reports
Research Papers
Academic papers, methodology-focused
📄 JSONL (Recommended)
One pair per line. Use with: HuggingFace, Ollama, LLaMA
📄 JSON
Array format. Use with: Custom pipelines, Python scripts
📄 Alpaca
Standard Alpaca format. Use with: Alpaca fine-tuning
💬 ShareGPT
Conversation format. Use with: Chat models, DPO training
📊 CSV
Spreadsheet format. Use with: Excel, data analysis, sheets
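For reference, here is the same pair rendered in the three JSON-based layouts. This is a sketch using the commonly published Alpaca and ShareGPT schemas; the exact field names are assumptions based on those public formats, not taken from know3's exporter:

```python
pair = {"instruction": "What is RAG?",
        "output": "Retrieval-augmented generation grounds LLM answers in retrieved documents."}

# JSONL / JSON record: the pair as-is (JSONL puts one such object per line)
jsonl_record = dict(pair)

# Alpaca: instruction / input / output triple ("input" left empty when unused)
alpaca_record = {"instruction": pair["instruction"],
                 "input": "",
                 "output": pair["output"]}

# ShareGPT: a conversation of alternating human/gpt turns
sharegpt_record = {"conversations": [
    {"from": "human", "value": pair["instruction"]},
    {"from": "gpt",   "value": pair["output"]},
]}
```

The ShareGPT layout is the natural fit for chat models because it can hold multi-turn exchanges, while Alpaca and JSONL are flat single-turn records.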
🧠
know3
Local RAG + Training Data Engine
v3.1 · Open Source · MIT License
Created by
Dr. Khaled Diab
PhD Electrical Power Engineering · CEM · NEBOSH IGC · Energy & Sustainability Expert · AI/ML Researcher · 14+ years in power systems, carbon management & digital transformation
What is know3?
know3 transforms your documents into production-ready training data for fine-tuning large language models. Everything runs 100% locally on your machine using Ollama — no cloud APIs, no subscriptions, no data ever leaves your device. Upload your PDF, DOCX, EPUB, or TXT files, and know3 generates high-quality instruction/output pairs optimized for domain-specific LLM fine-tuning.