Skip to content

Conversation

@panwarnalini-hub
Copy link

Summary

Adds comprehensive docstrings to PdfPipelineOptions class with detailed parameter documentation and production usage examples.

Changes

  • Added detailed class-level docstring to PdfPipelineOptions
  • Documented all 9 parameters with explanations, defaults, and use cases
  • Included 4 complete usage examples covering common scenarios
  • Added production considerations (Databricks compatibility, timeout recommendations, OCR engine issues)
  • Preserved existing inline comments and type hints

Motivation

Addresses #2747 - improves developer experience by providing clear documentation for pipeline configuration options based on real production usage.

Examples included

  1. Basic digital PDF extraction (no OCR)
  2. Scanned PDF with OCR and high-resolution images
  3. Scientific papers with tables and formulas
  4. Databricks-compatible configuration

Based on production experience with clinical trial document processing using Docling v2.

- Add detailed class docstring with parameter descriptions
- Include 4 complete usage examples (digital PDF, scanned PDF, scientific papers, Databricks)
- Document production considerations (timeouts, OCR engines, distributed systems)
- Preserve existing inline comments and type hints
- Add cross-references to related classes

Addresses docling-project#2747
@github-actions
Copy link
Contributor

github-actions bot commented Dec 31, 2025

DCO Check Passed

Thanks @panwarnalini-hub, all your commits are properly signed off. 🎉

@dosubot
Copy link

dosubot bot commented Dec 31, 2025

Related Documentation

Checked 10 published document(s) in 1 knowledge base(s). No updates required.

How did I do? Any feedback?  Join Discord

@mergify
Copy link

mergify bot commented Dec 31, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

I, Nalini Panwar <[email protected]>, hereby add my Signed-off-by to this commit: 47ae0483c9a5eeba6d0cda8595d35b09b7

Signed-off-by: Nalini Panwar <[email protected]>
I, Nalini Panwar <[email protected]>, hereby add my Signed-off-by to this commit: 47ae048

Signed-off-by: Nalini Panwar <[email protected]>
I, Nalini Panwar <[email protected]>, hereby add my Signed-off-by to this commit: 47ae048

Signed-off-by: Nalini Panwar <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant