Knowledgator UTCA
KnowledgatorGitHubDiscord
  • Welcome to UTCA documentation!
  • Quickstart
  • Concepts
    • Components
    • Types of components
    • ExecutionSchema
    • Context
    • Scopes
  • Development and Contribution
    • Contribution
    • Future relises
  • Framework structure
  • Core
    • Component
    • BaseExecutor
    • Action
    • Executable
    • Evaluator
    • Memory management
    • Schemas
    • Exceptions
  • Structural components
    • ExecutionSchema
    • Switch
    • ForEach
    • Filter
    • While
    • Condition
    • BREAK
    • Log
  • Base Actions
    • Flush
    • AddData
    • RenameAttribute
    • RenameAttributeQuery
    • SetValue
    • UnpackValue
    • NestToKey
    • ExecuteFunction
  • Predictors
    • Predictor
    • Transformers predictors
    • Transformers schemas
    • TokenSearcherPredictor
    • ComprehendItPredictor
    • GLiNERPredictor
    • OpenAIChatGPTPredictor
    • OpenAIWhisperPredictor
  • Tasks
    • Task
    • ComprehendIt
    • TokenSearcherTextCleaner
    • TokenSearcherNER
    • TokenSearcherQandA
    • TokenSearcherRelationExtraction
    • GLiNER
    • GLiNERRelationExtraction
    • GLiNERQandA
    • OpenAIChat
    • WhisperSpeechToText
    • TransformersTextToSpeech
    • TransformersChartsAndPlotsAnalysis
    • TransformersDocumentQandA
    • TransformersImageClassification
    • TransformersVisualQandA
    • TransformersObjectDetection
    • TransformersTextEmbedding
    • TransformersEntityLinking
    • TransformersTokenClassifier
    • TransformersTextSummarization
    • TransformersTextualQandA
    • TransformersTextClassification
    • TransformersChat
    • Objects
    • Chat tasks utilities
    • Relation extraction tasks utilities
  • Executable Schemas
    • SemanticSearchSchema
    • Web2Meaning
    • RequestsHTML
  • Datasources
    • Audio
    • DB
      • SQL
      • Neo4j
      • Chroma
      • Qdrant
    • Google Documents
    • Google Sheets
    • Image
    • Index
    • JSON
    • PDF
    • Plain text
    • Video
  • Conditions
    • RePattern
    • SemanticCondition
  • APIs
    • GoogleCloudClient
  • Integrations
    • Google Cloud
  • Examples
    • Basic image classification
    • Text to speech
    • PDF document processing
Powered by GitBook
On this page
  • Module: implementation.schemas
  • Methods and properties
  • __init__
  • RequestsHTMLInput
  • __init__
  • RequestsHTMLOutput
  • __init__
  1. Executable Schemas

RequestsHTML

Basic requests-html scraper

PreviousWeb2MeaningNextDatasources

Last updated 1 year ago

Subclass of . Type of [, ]

Module: .schemas

Methods and properties

Main methods and properties


__init__

Arguments:

  • js_rendering (bool, optional): Specifies whether the page should be rendered. Defaults to False.

  • input_class (Type[], optional): Class for input validation. Defaults to .

  • output_class (Type[], optional): Class for output validation. Defaults to .

  • name (Optional[str], optional): Name for identification. If equals to None, class name will be used. Defaults to None.




RequestsHTMLInput


__init__

Arguments:

  • url (str): The URL of the page to be processed.




RequestsHTMLOutput


__init__

Arguments:

  • text (str): Text from page.

  • links (List[str]): Links from page.

Subclass of .

Subclass of .

Executable
Executable
Web2MeaningInput
Web2MeaningOutput
RequestsHTMLInput
RequestsHTMLInput
RequestsHTMLOutput
RequestsHTMLOutput
IOModel
IOModel
implementation