Add Docling Support, Pydantic JSON Validation, and Configurable LLM API URL by mohamed-em2m · Pull Request #101 · VectifyAI/PageIndex

mohamed-em2m · 2026-02-05T15:26:59Z

This pull request introduces several enhancements to improve flexibility, structure, and extensibility:

✅ Added Docling support for improved document processing and parsing.

✅ Integrated Pydantic models to ensure structured and validated JSON outputs.

✅ Added support for custom LLM API base URLs, allowing easier integration with self-hosted or alternative LLM providers.

♻️ Refactored parts of the codebase to improve clarity and maintainability.
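As a rough illustration of the configurable base URL item above, here is a minimal sketch of how a base URL might be resolved. It is not the PR's actual code: the `base_url` config key, the `LLM_API_BASE_URL` environment variable, and the `resolve_base_url` helper are hypothetical names chosen for this example, and the fallback assumes an OpenAI-compatible endpoint.

```python
import os
from typing import Mapping, Optional

# Assumed fallback: the public OpenAI-compatible endpoint.
DEFAULT_BASE_URL = "https://api.openai.com/v1"


def resolve_base_url(
    config: Mapping[str, str],
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Pick the LLM API base URL: explicit config first, then env, then default.

    The "base_url" key and "LLM_API_BASE_URL" variable are hypothetical names.
    """
    env = os.environ if env is None else env
    url = config.get("base_url") or env.get("LLM_API_BASE_URL") or DEFAULT_BASE_URL
    # Strip trailing slashes so that joining request paths stays predictable.
    return url.rstrip("/")
```

A self-hosted server would then be selected with something like `resolve_base_url({"base_url": "http://localhost:8000/v1/"})`, while an empty config falls back to the environment variable or the default.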

banghasan · 2026-02-09T02:48:28Z

👍

adityasasidhar · 2026-02-11T11:39:41Z

@mohamed-em2m

Hey, I saw your implementation and it looks fantastic!

…tructured output

Integrate new features from PR VectifyAI#101 into LiteLLM architecture:

- Add PDFReader class supporting docling, PyMuPDF, and PyPDF2 parsers
- Add response_format (Pydantic BaseModel) support to all LLM API functions
- Add read_pdf() utility function with page/full output formats
- Update extract_text_from_pdf, get_text_of_pages, get_page_tokens to use PDFReader
- Add docling and pydantic to requirements.txt
- Add PDF parser config options to config.yaml

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
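To make the `response_format` (Pydantic BaseModel) idea in the commit message above more concrete, here is a minimal sketch of validating an LLM's JSON reply against a model class. The `TocItem` schema and the `parse_reply` helper are hypothetical and only illustrate the pattern; the functions and models actually used in the PR are not shown on this page.

```python
import json
from typing import Optional, Type, TypeVar

from pydantic import BaseModel, ValidationError

T = TypeVar("T", bound=BaseModel)


class TocItem(BaseModel):
    """Hypothetical schema for one table-of-contents entry."""

    title: str
    start_page: int


def parse_reply(raw: str, response_format: Type[T]) -> Optional[T]:
    """Return a validated model instance, or None if the reply is malformed.

    `raw` is the text returned by the LLM; `response_format` is the
    Pydantic model class the JSON is expected to match.
    """
    try:
        # json.loads rejects non-JSON text; the model constructor
        # rejects missing or wrongly typed fields.
        return response_format(**json.loads(raw))
    except (json.JSONDecodeError, ValidationError, TypeError):
        # TypeError covers valid JSON that is not an object (e.g. a list).
        return None
```

Returning `None` on failure lets the caller decide whether to retry the request or skip the entry, instead of passing unvalidated dictionaries further down the pipeline.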

mohamed-em2m added 4 commits February 5, 2026 14:08

adding configure to support set base url of llms api

fd3e52c

add feature supporting docling and support pydantic

4b8c7da

adding pydantic to requirements file

76e0425

adding pydantic to requirements file

feaf75d

mohamed-em2m changed the title ~~adding configure to support set base url of llms api~~ Add Docling Support, Pydantic JSON Validation, and Configurable LLM API URL Feb 11, 2026

This was referenced Feb 11, 2026

Introduce Pydantic validation for LLM JSON outputs in page_index.py #99

Open

Support custom models #90

Open

mohamed-em2m wants to merge 4 commits into VectifyAI:main from mohamed-em2m:main

3 participants
