This repository was archived by the owner on Dec 1, 2023. It is now read-only.
forked from holloway/docvert
Docvert 5.1
Converts word processor files (e.g. .doc files) to OpenDocument, DocBook, and structured HTML.
WEB SERVICE (by default http://localhost:8080)
python ./docvert-web.py [-p PORT] [-H host]
COMMAND LINE
python ./docvert-cli.py
usage: docvert-cli.py [-h] [--version] --pipeline PIPELINE
                      [--response {auto,path,stdout}]
                      [--autopipeline {Break up over Heading 1.default,Nothing one long page}]
                      [--url URL] [--list-pipelines]
                      [--pipelinetype {tests,auto_pipelines,pipelines}]
                      infile [infile ...]
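The usage summary above can be turned into a small helper when calling the command line tool from another script. This is only a sketch: it uses the documented --pipeline and --response options, and the pipeline name and input file name in the usage note are placeholders rather than names shipped with Docvert. The usage summary lists a --list-pipelines option, which presumably shows the real names.

```python
import subprocess


def build_cli_args(infiles, pipeline, response="stdout",
                   script="./docvert-cli.py"):
    """Build an argument list for docvert-cli.py from the documented options."""
    # --response accepts exactly these three values according to the usage text.
    if response not in ("auto", "path", "stdout"):
        raise ValueError("response must be auto, path, or stdout")
    if not infiles:
        raise ValueError("at least one input file is required")
    return (["python", script, "--pipeline", pipeline, "--response", response]
            + list(infiles))


def run_cli(infiles, pipeline, response="stdout"):
    """Run docvert-cli.py and return its exit status (not called on import)."""
    return subprocess.call(build_cli_args(infiles, pipeline, response))
```

For example, build_cli_args(["report.doc"], "example pipeline") returns an argument list ending in "report.doc". Passing a list to subprocess avoids shell quoting problems with pipeline names that contain spaces.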
COMMUNITY
http://lists.catalyst.net.nz/mailman/listinfo/docvert
REQUIREMENTS
Python 2.6 or 2.7 (we'll support Python 3 once PyUNO supports it)
libreoffice
python-uno
python-lxml
python-imaging
pdf2svg
librsvg2-2
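A quick way to see which of the requirements above are missing is a small check script, run with the same Python interpreter that will run Docvert. This is a sketch under assumptions: the module names (uno, lxml, PIL) and the executable names (soffice, pdf2svg) are guesses at what the listed packages install, and librsvg2-2 is not checked because it is a shared library.

```python
import os


def missing_modules(names):
    """Return the module names that fail to import in this interpreter."""
    missing = []
    for name in names:
        try:
            __import__(name)
        except ImportError:
            missing.append(name)
    return missing


def find_on_path(program):
    """Return the full path of an executable found on PATH, or None."""
    for directory in os.environ.get("PATH", "").split(os.pathsep):
        candidate = os.path.join(directory, program)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


def check_prerequisites():
    """Collect readable problems; an empty list means every check passed."""
    # Assumed names: adjust them if your distribution packages differ.
    problems = ["missing Python module: " + name
                for name in missing_modules(["uno", "lxml", "PIL"])]
    problems += ["missing executable: " + name
                 for name in ["soffice", "pdf2svg"]
                 if find_on_path(name) is None]
    return problems
```

Calling check_prerequisites() returns a list of strings that can be printed one per line. The helpers avoid newer standard library functions so that they should also run on the Python 2 versions listed above.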
QUICKSTART GUIDE
sudo apt-get install libreoffice python-uno python-lxml python-imaging pdf2svg librsvg2-2
/usr/bin/soffice -headless -norestore -nologo -nofirststartwizard -accept="socket,port=2002;urp;"
# then in another terminal
cd ~
git clone https://github.com/holloway/docvert.git
cd docvert
python ./docvert-web.py
# and browse to http://localhost:8080
THE LIBREOFFICE DAEMON
If you want to convert Microsoft Office files you'll need to start a daemon:
/usr/bin/soffice -headless -norestore -nologo -nofirststartwizard -accept="socket,port=2002;urp;"
This runs a single instance. If you want to run a pool of instances, try
something like http://oodaemon.sourceforge.net/
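Because conversions depend on the daemon, it can help to confirm that something is listening on port 2002 before starting Docvert. The sketch below assumes the daemon was started with the socket settings shown above. It only tests that the TCP port accepts connections, not that the listener actually speaks the UNO protocol, and the function names are illustrative rather than part of Docvert.

```python
import socket
import time


def port_is_open(host="localhost", port=2002, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        sock = socket.create_connection((host, port), timeout)
    except (socket.error, socket.timeout):
        return False
    sock.close()
    return True


def wait_for_port(host="localhost", port=2002, attempts=10, delay=0.5):
    """Poll the port up to `attempts` times, sleeping `delay` seconds between tries."""
    for _ in range(attempts):
        if port_is_open(host, port):
            return True
        time.sleep(delay)
    return False
```

A start-up script could call wait_for_port() after launching soffice and stop with an error message if it returns False.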
LICENCE
Released under the GPL3; see LICENCE.