NovaKit v1.0

Document Chat

Chat with your documents using AI. Upload PDFs, paste URLs, import YouTube videos, or paste text — then ask questions and get intelligent answers with citations.

Document Chat is available on Pro and higher plans. Access it in AI → Documents.

Overview

Document Chat uses Retrieval-Augmented Generation (RAG) to provide accurate, grounded answers from your content. Every response includes citations linking back to the exact source passages.

Supported Sources

| Source Type | Description | Formats |
|---|---|---|
| File Upload | Upload documents directly | PDF, DOCX, MD, TXT, CSV |
| URL Import | Extract content from web pages | Any public URL |
| YouTube | Import video transcripts | Any YouTube video with captions |
| Paste Text | Paste content directly | Plain text, Markdown |

Getting Started

Add a Source

Navigate to AI → Documents and click Add Source. Choose from:

  • Upload File: Drag & drop or select files
  • URL: Paste a web page URL
  • YouTube: Paste a YouTube video URL
  • Paste Text: Paste text content directly

Wait for Processing

Documents are automatically processed:

  1. Parsing: Content is extracted from the source
  2. Chunking: Content is split into semantic chunks
  3. Embedding: Chunks are converted to vector embeddings
  4. Indexing: Content is indexed for semantic search
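
The chunking step above can be sketched roughly as follows. The actual chunking strategy is not documented on this page, so this is only an illustration that assumes paragraph-based splitting; `chunk_text` and the `max_chars` budget are hypothetical names, not part of the product.

```python
def chunk_text(text: str, max_chars: int = 500) -> list[str]:
    """Group paragraphs into chunks of roughly max_chars characters.

    Assumption: paragraphs are separated by blank lines. A single paragraph
    longer than max_chars is kept whole rather than split mid-sentence.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars:
            current = candidate  # paragraph still fits in the open chunk
        else:
            if current:
                chunks.append(current)  # close the open chunk
            current = para  # start a new chunk with this paragraph
    if current:
        chunks.append(current)
    return chunks
```

Each resulting chunk would then be embedded and indexed; keeping paragraphs intact is one simple way to keep a chunk's meaning self-contained.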

Start Chatting

Click on any document to open the chat interface. Ask questions in natural language and get AI-powered answers with citations.

Source Types

File Upload

Upload documents directly from your computer. Supported formats:

| Format | Extension | Max Size | Notes |
|---|---|---|---|
| PDF | .pdf | 10MB | Scanned PDFs not supported |
| Word | .docx | 10MB | Modern Word format only |
| Markdown | .md | 10MB | GitHub-flavored Markdown |
| Text | .txt | 10MB | Plain text files |
| CSV | .csv | 10MB | Tabular data |

Scanned PDFs (image-based) are not currently supported. Documents must contain selectable text.
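
A client-side check before uploading can catch the extension and size limits listed above. This is a minimal sketch, not part of the product: `check_upload` is a hypothetical helper, and it assumes "10MB" means 10 × 1024 × 1024 bytes. It cannot detect scanned PDFs, which are only rejected during processing.

```python
from pathlib import Path

SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".md", ".txt", ".csv"}
MAX_FILE_BYTES = 10 * 1024 * 1024  # assumption: "10MB" is interpreted as 10 MiB


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems: list[str] = []
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"Unsupported file type: {ext or '(none)'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("File too large")
    return problems
```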

URL Import

Import content from any public web page:

# Supported URLs
https://example.com/article
https://docs.example.com/guide
https://blog.example.com/post

The system will:

  • Fetch the page content
  • Extract the main article/content
  • Remove navigation, ads, and boilerplate
  • Preserve headings, lists, and formatting

Pages behind authentication or paywalls cannot be imported. The URL must be publicly accessible.

YouTube Import

Import YouTube video transcripts:

# Supported URL formats
https://www.youtube.com/watch?v=VIDEO_ID
https://youtu.be/VIDEO_ID

Requirements:

  • Video must have captions (auto-generated or manual)
  • Videos without captions cannot be imported

The system imports the full transcript with timestamps, allowing you to ask questions about any part of the video.
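
If you build YouTube imports programmatically, it can help to confirm that a URL matches one of the two supported formats before submitting it. The sketch below is illustrative only; `extract_video_id` is a hypothetical helper, and it deliberately accepts just the two URL shapes listed above.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID for a supported YouTube URL, or None otherwise."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host == "www.youtube.com" and parsed.path == "/watch":
        # Format: https://www.youtube.com/watch?v=VIDEO_ID
        ids = parse_qs(parsed.query).get("v")
        return ids[0] if ids else None
    if host == "youtu.be":
        # Format: https://youtu.be/VIDEO_ID
        return parsed.path.lstrip("/") or None
    return None
```

A matching URL says nothing about captions; a video without captions will still fail at import time.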

Paste Text

Paste any text content directly:

  • Meeting notes
  • Email threads
  • Code snippets
  • Research notes

Chat Interface

Asking Questions

Ask questions in natural language:

  • "What are the key findings in this report?"
  • "Summarize the main arguments"
  • "What does it say about X?"
  • "Compare the approaches mentioned in sections 2 and 3"

Citations

Every AI response includes numbered citations:

The report indicates revenue grew 35% year-over-year [1],
driven primarily by enterprise expansion [2].

Click any citation to:

  • View the original passage
  • See the page number or location
  • Navigate to that section in the document

Chat Settings

Customize chat behavior:

| Setting | Options | Description |
|---|---|---|
| Model | GPT-4, Claude 3.5, etc. | AI model for responses |
| Temperature | 0.0 - 1.0 | Response creativity |
| Response Length | Concise, Balanced, Detailed | Output verbosity |
| Max Sources | 1-10 | Number of chunks to retrieve |
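
The ranges in the settings table can be captured in a small validation helper, for example when storing settings in your own tooling. This is a sketch under assumptions: the `ChatSettings` class, its field names, and its default values are hypothetical and are not taken from the product.

```python
from dataclasses import dataclass

RESPONSE_LENGTHS = ("Concise", "Balanced", "Detailed")


@dataclass
class ChatSettings:
    # All defaults below are illustrative assumptions, not documented defaults.
    model: str = "gpt-4o"
    temperature: float = 0.7
    response_length: str = "Balanced"
    max_sources: int = 5

    def validate(self) -> list[str]:
        """Return a list of violations of the documented option ranges."""
        errors: list[str] = []
        if not 0.0 <= self.temperature <= 1.0:
            errors.append("temperature must be between 0.0 and 1.0")
        if self.response_length not in RESPONSE_LENGTHS:
            errors.append("response_length must be Concise, Balanced, or Detailed")
        if not 1 <= self.max_sources <= 10:
            errors.append("max_sources must be between 1 and 10")
        return errors
```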

Collections

Organize documents into collections for:

  • Project Organization: Group related documents
  • Multi-Document Chat: Chat across an entire collection
  • Team Collaboration: Share collections with team members

Creating Collections

  1. Click New Collection in the sidebar
  2. Enter a name and optional description
  3. Choose a color for visual organization
  4. Click Create

Moving Documents

  • Drag documents into collections
  • Use the document menu → Move to Collection
  • A document can belong to only one collection at a time

Collection Chat

Chat with all documents in a collection at once:

  1. Select a collection in the sidebar
  2. Click Chat with Collection
  3. Ask questions that span multiple documents

API Access

Access Document Chat via the REST API.

Upload Document

curl -X POST https://api.novakit.ai/v1/documents/upload \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@document.pdf" \
  -F "collection_id=optional-collection-id"

Import from URL

curl -X POST https://api.novakit.ai/v1/documents/import \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "source_type": "url",
    "url": "https://example.com/article",
    "name": "Example Article"
  }'

Import YouTube

curl -X POST https://api.novakit.ai/v1/documents/import \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "source_type": "youtube",
    "url": "https://www.youtube.com/watch?v=VIDEO_ID"
  }'

Chat with Document

curl -X POST https://api.novakit.ai/v1/documents/chat \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "document_ids": ["doc_123", "doc_456"],
    "message": "What are the key findings?",
    "model": "gpt-4o"
  }'
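
The same chat request can be assembled in Python using only the standard library. This is a minimal sketch that mirrors the curl example above; `build_chat_request` is a hypothetical helper, and it only constructs the request without sending it.

```python
import json
import urllib.request

API_BASE = "https://api.novakit.ai/v1"


def build_chat_request(api_key: str, document_ids: list[str], message: str,
                       model: str = "gpt-4o") -> urllib.request.Request:
    """Build (but do not send) a POST request to the documents/chat endpoint."""
    payload = {"document_ids": document_ids, "message": message, "model": model}
    return urllib.request.Request(
        f"{API_BASE}/documents/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send it, pass the request to `urllib.request.urlopen` and decode the JSON body. Error handling and retries are left out of this sketch.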

Response Format

{
  "content": "The key findings include...[1]...[2]",
  "citations": [
    {
      "index": 1,
      "chunk_id": "chunk_abc",
      "document_id": "doc_123",
      "text": "Original passage...",
      "page": 5
    }
  ],
  "usage": {
    "input_tokens": 1234,
    "output_tokens": 567
  }
}
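
The `[n]` markers in `content` correspond to the `index` field of each entry in `citations`. A rough way to pair them up on the client side is sketched below; `resolve_citations` is a hypothetical helper, and it assumes markers always take the form of a bracketed integer as in the sample response.

```python
import re


def resolve_citations(response: dict) -> list[dict]:
    """Return the cited entries in order of first appearance in the content.

    Markers with no matching entry in "citations" are skipped.
    """
    by_index = {c["index"]: c for c in response.get("citations", [])}
    order: list[int] = []
    for match in re.finditer(r"\[(\d+)\]", response.get("content", "")):
        n = int(match.group(1))
        if n in by_index and n not in order:
            order.append(n)
    return [by_index[n] for n in order]
```

Each returned entry carries the `document_id`, `text`, and `page` needed to show the original passage next to the answer.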

Billing

Document Chat uses credits from your Chat Tokens quota:

| Operation | Credits |
|---|---|
| Document embedding | ~1 credit per 1000 characters |
| Chat query | Standard chat rates by model |

Check your quota usage in Dashboard → Usage.
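
The embedding rate above gives a quick way to estimate cost before uploading. The sketch below is an approximation only: the rate is stated as "~1 credit per 1000 characters", and rounding up to a whole credit is an assumption, as is the helper name `estimate_embedding_credits`.

```python
import math

CHARS_PER_CREDIT = 1000  # from the approximate documented rate


def estimate_embedding_credits(char_count: int) -> int:
    """Estimate embedding credits, rounding up (rounding rule is an assumption)."""
    return math.ceil(char_count / CHARS_PER_CREDIT)
```

For example, a 25,000-character document would come to roughly 25 credits under these assumptions. Chat queries are billed separately at the model's standard rate.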

Best Practices

Document Preparation

  • Clean formatting: Remove unnecessary headers/footers
  • Text-based PDFs: Ensure PDFs contain selectable text
  • Reasonable length: Very long documents may have lower retrieval accuracy

Effective Questions

  • Be specific: "What does section 3 say about pricing?" vs "Tell me about pricing"
  • Reference context: "Based on the Q4 report, what were the growth metrics?"
  • Ask follow-ups: Build on previous answers for deeper exploration

Collection Strategy

  • Group by project or topic
  • Keep collections focused (5-20 documents)
  • Use descriptive names

Troubleshooting

Document Processing Failed

| Error | Cause | Solution |
|---|---|---|
| "Unsupported file type" | Wrong format | Use PDF, DOCX, MD, TXT, or CSV |
| "File too large" | Exceeds 10MB | Split into smaller files |
| "No text content" | Scanned PDF | Use text-based documents |
| "Failed to fetch URL" | Inaccessible page | Check URL is public |
| "No transcript available" | No YouTube captions | Use video with captions |

Chat Issues

| Issue | Solution |
|---|---|
| Irrelevant answers | Rephrase question, be more specific |
| Missing citations | Increase "Max Sources" setting |
| Slow responses | Reduce response length setting |

Retry Failed Documents

For URL and YouTube imports that failed:

  1. Go to AI → Documents
  2. Find the failed document
  3. Click Retry to attempt processing again

File uploads cannot be retried. You'll need to upload the file again.

Limits

| Limit | Free | Pro | Team |
|---|---|---|---|
| Documents | 5 | Unlimited | Unlimited |
| File size | 10MB | 10MB | 10MB |
| Collections | 1 | Unlimited | Unlimited |
| API access | No | Yes | Yes |
