diff --git a/docs/step-by-step-guide.md b/docs/step-by-step-guide.md index 3fa0bc0..2ed8927 100644 --- a/docs/step-by-step-guide.md +++ b/docs/step-by-step-guide.md @@ -48,6 +48,33 @@ Cleaner input data ensures that the model works with reliable and consistent inf - Ensures the document is clean and ready for the next step. +### ๐Ÿ” **API Example** + +```py +from document_to_podcast.preprocessing import DATA_CLEANERS, DATA_LOADERS + +input_file = "example_data/introducing-mozilla-ai-investing-in-trustworthy-ai.html" +data_loader = DATA_LOADERS[".html"] +data_cleaner = DATA_CLEANERS[".html"] + +raw_data = data_loader(input_file) +print(raw_data[:200]) +""" + + + + + + +