Skip to content

Latest commit

 

History

History
19 lines (14 loc) · 602 Bytes

DVC.md

File metadata and controls

19 lines (14 loc) · 602 Bytes

Scalable PDF document processing with DataChain and Unstructured.io

Datasets + LLMs + Pydantic = DataChain ...now with @huggingface !💛

DataChain by @DVCorg just added @huggingface support ! Create, Load, Transform HF Datasets with LLMs easily.

  • Pydantic for dataset schema
  • Use your own or public HF Datasets
  • Run your own or public HF Models

Screenshot 2024-11-05 143213