Skip to content

Commit

Permalink
Deployed f78cb2d with MkDocs version: 1.6.1
Browse files Browse the repository at this point in the history
  • Loading branch information
enoch3712 committed Nov 26, 2024
1 parent 5ca4095 commit 0d1467e
Show file tree
Hide file tree
Showing 11 changed files with 41 additions and 46 deletions.
1 change: 0 additions & 1 deletion core-concepts/classification/mom/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1767,7 +1767,6 @@ <h2 id="best-practices">Best Practices<a class="headerlink" href="#best-practice
<li>Consider using different model providers for better diversity</li>
<li>Monitor and log classification results for each model</li>
</ul>
<p>For more examples and advanced usage, check out the <a href="examples/">examples directory</a> in the repository. </p>



Expand Down
1 change: 0 additions & 1 deletion core-concepts/classification/vision/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1863,7 +1863,6 @@ <h2 id="example-with-multiple-models">Example with Multiple Models<a class="head
<a id="__codelineno-1-24" name="__codelineno-1-24" href="#__codelineno-1-24"></a> <span class="n">image</span><span class="o">=</span><span class="kc">True</span>
<a id="__codelineno-1-25" name="__codelineno-1-25" href="#__codelineno-1-25"></a><span class="p">)</span>
</code></pre></div>
<p>For more examples and advanced usage, check out the <a href="examples/">examples directory</a> in the repository. </p>



Expand Down
1 change: 0 additions & 1 deletion core-concepts/contracts/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1634,7 +1634,6 @@ <h2 id="basic-usage">Basic Usage<a class="headerlink" href="#basic-usage" title=
<a id="__codelineno-1-5" name="__codelineno-1-5" href="#__codelineno-1-5"></a> <span class="k">pass</span>
</code></pre></div>
</details>
<p>For more examples and advanced usage, check out the <a href="examples/">examples directory</a> in the repository.</p>



Expand Down
2 changes: 1 addition & 1 deletion core-concepts/document-loaders/aws-textract/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1765,7 +1765,7 @@ <h2 id="best-practices">Best Practices<a class="headerlink" href="#best-practice
<li>Process pages individually for large documents</li>
<li>Monitor API quotas and costs</li>
</ol>
<p>For more examples and implementation details, check out the <a href="examples/">examples directory</a> in the repository. </p>
<p>For more examples and implementation details, check out the <a href="../../examples/aws-textract">AWS Stack</a> in the repository. </p>



Expand Down
2 changes: 1 addition & 1 deletion core-concepts/document-loaders/azure-form/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1754,7 +1754,7 @@ <h2 id="best-practices">Best Practices<a class="headerlink" href="#best-practice
<li>Handle tables and paragraphs separately for better accuracy</li>
<li>Process documents page by page for large files</li>
</ul>
<p>For more examples and advanced usage, check out the <a href="examples/">examples directory</a> in the repository. </p>
<p>For more examples and implementation details, check out the <a href="../../examples/azure-form.md">Azure Stack</a> in the repository. </p>



Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -1791,7 +1791,7 @@ <h2 id="best-practices">Best Practices<a class="headerlink" href="#best-practice
<li>Monitor API quotas</li>
</ol>
<p>Document AI supports PDF, TIFF, GIF, JPEG, PNG with a maximum file size of 20MB or 2000 pages.</p>
<p>For more examples and implementation details, check out the <a href="examples/">examples directory</a> in the repository. </p>
<p>For more examples and implementation details, check out the <a href="../../examples/google-document-ai">Google Stack</a> in the repository. </p>



Expand Down
2 changes: 1 addition & 1 deletion core-concepts/document-loaders/tesseract/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1752,7 +1752,7 @@ <h2 id="best-practices">Best Practices<a class="headerlink" href="#best-practice
<li>Consider image preprocessing for better accuracy</li>
<li>Set appropriate page segmentation mode based on document layout</li>
</ul>
<p>For more examples and advanced usage, check out the <a href="examples/">examples directory</a> in the repository.</p>
<p>For more examples and advanced usage, check out the <a href="../../examples/local-processing">Local Stack</a> in the repository.</p>



Expand Down
10 changes: 4 additions & 6 deletions getting-started/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -1682,28 +1682,26 @@ <h2 id="native-features-that-you-want">Native Features that you want<a class="he
<li>
<p><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M14 2H6c-1.1 0-2 .9-2 2v16c0 1.1.9 2 2 2h12c1.1 0 2-.9 2-2V8l-6-6zM6 20V4h7v5h5v11H6z"></path></svg></span> <strong>Extraction with Pydantic</strong></p>
<p>Extract structured data from any document type using Pydantic models for validation, custom features, and prompt engineering capabilities.</p>
<p><a href="../examples/extraction"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p>
<!-- <p><a href="../examples/extraction"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p> -->
</li>
<li>
<p><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M12 2L4.5 20.29l.71.71L12 18l6.79 3 .71-.71z"></path></svg></span> <strong>Classification & Split</strong></p>
<p>Intelligent document classification and splitting with support for consensus strategies, eager/lazy splitting, and confidence thresholds.</p>
<p><a href="../examples/classification"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p>
<!-- <p><a href="../examples/classification"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p> -->
</li>
<li>
<p><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M12 1 3 5v6c0 5.55 3.84 10.74 9 12 5.16-1.26 9-6.45 9-12V5l-9-4zm0 10.99h7c-.53 4.12-3.28 7.79-7 8.94V12H5V6.3l7-3.11v8.8z"></path></svg></span> <strong>PII Detection</strong></p>
<p>Automatically detect and handle sensitive personal information in documents with privacy-first approach and advanced validation.</p>
<p><a href="../examples/pii"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p>
<!-- <p><a href="../examples/pii"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p> -->
</li>
<li>
<p><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M21.7 18.6-6.9-6.9q-.275-.275-.637-.425-.363-.15-.763-.15-.425 0-.787.15-.363.15-.638.425L8.1 15.5q-.275.275-.425.638-.15.362-.15.762 0 .425.15.788.15.362.425.637l6.9 6.9q.275.275.638.425.362.15.787.15.425 0 .788-.15.362-.15.637-.425l4.9-4.9q.275-.275.425-.637.15-.363.15-.788 0-.425-.15-.787-.15-.363-.425-.638Z"></path></svg></span> <strong>LLM and OCR Agnostic</strong></p>
<p>Freedom to choose and switch between different LLM providers and OCR engines based on your needs and cost requirements.</p>
<p><a href="../examples/integrations"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p>
<!-- <p><a href="../examples/integrations"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16"><path d="M8.22 2.97a.75.75 0 0 1 1.06 0l4.25 4.25a.75.75 0 0 1 0 1.06l-4.25 4.25a.75.75 0 0 1-1.042-.018.75.75 0 0 1-.018-1.042l2.97-2.97H3.75a.75.75 0 0 1 0-1.5h7.44L8.22 4.03a.75.75 0 0 1 0-1.06"></path></svg></span> Learn More</a></p> -->
</li>
</ul>
</div>

<p>Check out our <a href="./advanced-usage.md">advanced usage guide</a> for more complex scenarios like document classification, batch processing, and custom LLM integration.</p>




Expand Down
2 changes: 1 addition & 1 deletion search/search_index.json

Large diffs are not rendered by default.

64 changes: 32 additions & 32 deletions sitemap.xml
Original file line number Diff line number Diff line change
Expand Up @@ -2,130 +2,130 @@
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>https://enoch3712.github.io/ExtractThinker/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/classification/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/classification/basic/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/classification/mom/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/classification/tree/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/classification/vision/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/contracts/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/aws-textract/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/azure-form/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/google-document-ai/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/pdf-plumber/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/pypdf/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/spreadsheet/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/tesseract/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/document-loaders/web-loader/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/extractors/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/extractors/batch/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/extractors/image_charts/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/llm-integration/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/process/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/splitters/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/splitters/image/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/core-concepts/splitters/text/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/aws-textract/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/azure-document-intelligence/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/google-document-ai/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/image-chart-processing/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/local-processing/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/examples/resume-processing/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
<url>
<loc>https://enoch3712.github.io/ExtractThinker/getting-started/</loc>
<lastmod>2024-11-25</lastmod>
<lastmod>2024-11-26</lastmod>
</url>
</urlset>
Binary file modified sitemap.xml.gz
Binary file not shown.

0 comments on commit 0d1467e

Please sign in to comment.