pdf-processing

Here are 42 public repositories matching this topic...

dissorial / doc-chatbot

Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

chat typescript reactjs mongoose nextjs chatbot openai vectorization pinecone document-embedding tailwindcss pdf-processing gpt-3 openai-api gpt-4 langchain

Updated Jul 21, 2023
TypeScript

allenai / papermage

Library supporting NLP and CV research on scientific papers.

python machine-learning natural-language-processing computer-vision scientific-papers multimodal pdf-processing

Updated Nov 8, 2024
Python

ahmedkhemiri95 / PDFs-TextExtract

Text extraction from multiple and large PDF documents.

python pdf parser data-science pdf-document text-analytics pdfs pypdf2 extract-text pdfminer pdf-processing pdfs-textextract

Updated Feb 2, 2024
Python

aws-samples / document-processing-pipeline-for-regulated-industries

A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.

Updated Oct 25, 2021
Python

Govind-S-B / pdf-to-text-chroma-search

Python scripts that convert PDF files to text, split them into chunks, and store their vector representations in a Chroma DB using GPT4All embeddings. A separate script queries the Chroma DB for similarity search based on user input.

text-extraction similarity-search pdf-processing vector-embeddings chromadb

Updated Oct 23, 2023
Python

ManasMadan / pdf-actions

An NPM package built on top of pdf-lib that provides functionalities like merge, rotate, split, download PDF to disk, and many more...

react javascript pdf npm reactjs react-component pdf-merge pdf-split pdf-rotate pdf-merger pdf-downloader pdf-lib pdf-splitter pdf-processing pdf-download pdf-free pdf-online

Updated Oct 31, 2023
JavaScript

ManasMadan / PDFActions

Built with pdf-actions NPM package.

react pdf reactjs react-component react-components pdf-merge pdf-split pdf-rotate pdf-merger pdf-downloader pdf-lib pdf-splitter pdf-processing pdf-download

Updated May 27, 2024
JavaScript

Inc44 / MaTools

An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.

python rust productivity application gui qt ocr image-processing video-processing speech-recognition youtube-downloader file-management audio-processing pdf-processing code-formatting

Updated Nov 16, 2024
Python

ranguy9304 / LangGraphRAG

LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP researchers and developers working on advanced conversational AI and information retrieval systems.

python natural-language-processing information-retrieval chatbot web-scraping nlp-machine-learning rag terminal-application pdf-processing vector-database openai-api langgraph

Updated Jul 13, 2024
Python

Yardenrsk / PsychometryReceiverCV

A side project to easily collect and annotate questions and answers for the PsychometryBot project DB using computer vision and PDF parsing.

pandas opencv-python pdf-processing

Updated Sep 18, 2022
Python

thinhuos0913 / python_useful_mini_projects

These are some useful mini projects that I worked on while self-learning Python programming.

python opencv ocr image-processing pdf-processing

Updated May 20, 2024
Python

arsath-eng / RAG1-NVIDIA-GENAI

A powerful Retrieval Augmented Generation (RAG) application built with NVIDIA AI endpoints and Streamlit. This solution enables intelligent document analysis and question-answering using state-of-the-art language models, featuring multi-PDF processing, FAISS vector store integration, and advanced prompt engineering.

embeddings question-answering document-analysis faiss rag pdf-processing streamlit llm langchain vector-store nvidia-ai-faundry llama-models

Updated Oct 31, 2024
Python

dsckiet / covid-tracker-android-app

A statistical data display and notifier app for the Covid-19 pandemic.

statistics mvvm dagger2 pdf-processing

Updated May 15, 2022
Kotlin

akshatpunia26 / berrylit_pdf_chat

Berrylit is a simple chatbot interface that allows users to upload a PDF file and ask a question related to its contents. The chatbot uses the Berri API for processing.

python api natural-language-processing chatbot pdf-processing streamlit

Updated Jun 26, 2023
Python

ydvrahul19 / Invoice-Manager

A modern, intelligent invoice processing system with advanced multi-format data extraction capabilities. Process invoices from PDFs, Excel files, and images with smart data recognition.

react firebase material-ui data-extraction invoice-management pdf-processing framer-motion redux-toolkit invoice-processing

Updated Nov 23, 2024
JavaScript

setuc / pdf-annotation-with-azure-doc-intel

Azure Document Intelligence Result Processor: A toolset for annotating PDFs based on Azure Document Intelligence analysis results, featuring a React web application and a standalone Python script for processing and visualizing extracted data with confidence indicators.

react javascript python vite pdf-annotation pdf-processing confidence-scores form-recognizer azure-document-intelligence

Updated Nov 6, 2024
JavaScript

Mateusz2734 / pdf-cli

CLI tool to merge, compress, extract, or delete pages from PDFs.

python cli pdf pdf-processing pdf-tool

Updated Oct 28, 2023
Python

allanninal / document-summarizer

The Document Summarizer leverages Hugging Face’s facebook/bart-large-cnn model to transform lengthy documents into concise summaries. Built with ReactJS (Vite) for the frontend and Flask for the backend, it supports PDF and text files, offering real-time summarization for researchers, students, and professionals.

nlp flask reactjs text-summarization vite huggingface pdf-processing document-summarizer ai-tools open-source-cods

Updated Dec 7, 2024
JavaScript

Remisu / GajyunETL

The goal of this project is to eliminate the need for paper by digitizing the process of handling client passport information.

automation sql-server database csharp etl dotnet logging data-integration csv-processing pdf-processing guesthouse-management

Updated Dec 13, 2024
C#

Francesco-Sovrano / Swiss-G2C-User-Guide-Analysis

Extensive analysis of user guides in Swiss government-to-citizen software, correlating guide features with canton socio-economic factors.

natural-language-processing open-data web-scraping data-analysis government-data python-scripts public-sector user-documentation correlation-analysis pdf-processing content-classification swiss-digital-strategy

Updated Jan 30, 2024
Python
