This repository has been archived on 2026-04-06. You can view files and clone it. You cannot open issues or pull requests or push a commit.
2021-10-06 19:29:13 +03:00
2020-11-25 17:15:32 +03:00
2020-02-14 14:48:09 +03:00
2021-09-07 16:09:29 +03:00
2020-02-14 14:48:09 +03:00
2020-11-25 17:15:32 +03:00
2021-08-20 16:15:45 +03:00
2020-02-14 14:48:09 +03:00
2020-02-14 14:48:09 +03:00
2020-02-14 14:48:09 +03:00
2021-10-06 19:29:13 +03:00
2020-02-14 14:48:09 +03:00
2021-10-05 17:19:09 +03:00
2020-02-26 16:05:32 +03:00
2021-09-07 16:09:18 +03:00
2021-04-23 15:21:24 +03:00

About

This repository contains code related to docx/epub files conversion to livecarta inner format.

Livecarta book format is tree structure, where nodes are chapters. Livecarta chapter is title + html code. Livecarta html code follows some restrictions:

  • On structure. It should be not nested, the less nested tags, the better.
    • It is only allowed that the <p> contains <span>, <span> contains <span>.
  • The structure tags that are used: span, p, blockquote, ul, ol, li, table.
  • List of all valid tags: iframe, a, span, blockquote, i, s, u, ul, ol, li, strong, p, img, caption, thead, tbody, td, th, tr, table, sup.
  • Styles are added as inline, i.e. attribute style in html tag.
  • Each tag has its own restrictions on attributes and style. See doc/style_config

Top level project structure

  • consumer.py - code which is responsible for receiving messages from rabbitMQ
  • class Access - contains API code which is responsible for interaction with server.
  • class Solver - contains code responsible for pipeline of solving the task: receiving book file, conversion, status updating, sending result back to server.
  • livecarta_config.py - constants that depend on LiveCarta
Description
Languages
Python 99.6%
Dockerfile 0.2%
Shell 0.2%