PDF document processing
Imports that will be used
from typing import Any, Dict, List
import logging
import pathlib
PATH = pathlib.Path(__file__).parent.resolve()
from utca.core import (
Evaluator,
ForEach,
SetMemory,
MemorySetInstruction,
GetMemory,
MemoryGetInstruction,
Log,
Flush,
AddData,
ExecuteFunction,
)
from utca.implementation.datasources.pdf import (
PDFRead, PDFExtractTexts, PDFExtractImages, PDFFindTables
)
from utca.implementation.tasks import (
TransformersTextSummarization,
TransformersDocumentQandA
)Utilities functions for custom logic
Pipelines
ExecutionSchema for processing visual data:
TransformersDocumentQandAExecutionSchema for processing images:
ExecutionSchema for processing tables:
ExecutionSchema for text summarization:
ExecutionSchema for main pipeline:
Run program
Inputs
Results
Last updated