indexing: start of Indexing Export impl. based on (X)HTML export

Desktop / LibreOffice - Tomaž Vajngerl [] - 3 June 2021 21:05 UTC

This adds a indexing output/export to HTML code, which will output a xml document that will be used for indexing of the document. It is based on HTML to reuse the traversal through the doc. model.

It is enabled by setting the "IndexingOutput" export parameter.

This commit only adds in the groundwork, but the output is still more or less the same as html and the follow up commits will add more indexing specific changes. The only change is to use "indexing" as the top level element and the document is a valid xml (has the xml header).

Also add basic test that the indexing output/export works and can be parsed as an xml with "indexing" top level element.

Change-Id: I153b1a70da7cbcf0d33b8610d962e6b7ae23ad23 Reviewed-on:

ed984f8e6899 indexing: start of Indexing Export impl. based on (X)HTML export
sw/ | 64 +++++++++++++++++++++
sw/ | 1 +
sw/qa/extras/htmlexport/IndexingExport.cxx | 44 ++++++++++++++
.../data/IndexingExport_VariousParagraphs.odt | Bin 0 -> 11111 bytes
sw/source/filter/html/wrthtml.cxx | 31 +++++++++-
sw/source/filter/html/wrthtml.hxx | 5 +-
6 files changed, 142 insertions(+), 3 deletions(-)


  • Share