KItinerary
engine/extractorengine.h
83 * Additionally, there's an interactive testing and inspection tool called @c kitinerary-workbench
88 * There are a few unit tests for extractors in the kitinerary repository (see autotests/extractordata),
90 * and copyright issues (e.g. PDFs containing copyrighted vendor logos and user credit card details).
93 * External test data is assumed to be in a folder named @c kitinerary-tests next to the @c kitinerary
94 * source folder. The test program searches this folder recursively for folders with the following content
120 * @param fileName Used as a hint to determine the type, optional and used for MIME type auto-detection if needed.
164 ExtractFullPageRasterImages = 1, ///< perform expensive image processing on (PDF) documents containing full page raster images
Abstract base class for data extractors.
Definition abstractextractor.h:20
Instantiates KItinerary::ExtractorDocumentNode instances using the type-specific document processor.
Definition extractordocumentnodefactory.h:27
A node in the extracted document object tree.
Definition extractordocumentnode.h:50
void setAdditionalExtractors(std::vector< const AbstractExtractor * > &&extractors)
Sets additional extractors to run on the given data.
Definition engine/extractorengine.cpp:162
Hint
Hints about the document to extract based on application knowledge that can help the extractor.
Definition engine/extractorengine.h:162
@ ExtractFullPageRasterImages
perform expensive image processing on (PDF) documents containing full page raster images
Definition engine/extractorengine.h:164
@ ExtractGenericIcalEvents
generate Event objects for generic ical events.
Definition engine/extractorengine.h:165
void setData(const QByteArray &data, QStringView fileName={}, QStringView mimeType={})
Set raw data to extract from.
Definition engine/extractorengine.cpp:120
void setContent(const QVariant &data, QStringView mimeType)
Already decoded data to extract from.
Definition engine/extractorengine.cpp:125
void clear()
Resets the internal state, call before processing new input data.
Definition engine/extractorengine.cpp:114
void setContextDate(const QDateTime &dt)
Set the date the extracted document has been issued at.
Definition engine/extractorengine.cpp:135
void setContext(const QVariant &data, QStringView mimeType)
Provide a document part that is only used to determine which extractor to use, but not for extraction...
Definition engine/extractorengine.cpp:130
void setUseSeparateProcess(bool separateProcess)
Perform extraction of "risky" content such as PDF files in a separate process.
Definition engine/extractorengine.cpp:157
Collection of all known data extractors.
Definition extractorrepository.h:34
Classes for reservation/travel data models, data extraction and data augmentation.
Definition berelement.h:17
This file is part of the KDE documentation.
Documentation copyright © 1996-2025 The KDE developers.
Generated on Fri Jan 24 2025 11:52:35 by doxygen 1.13.2 written by Dimitri van Heesch, © 1997-2006
Documentation copyright © 1996-2025 The KDE developers.
Generated on Fri Jan 24 2025 11:52:35 by doxygen 1.13.2 written by Dimitri van Heesch, © 1997-2006
KDE's Doxygen guidelines are available online.