Convert Word file to HTML
Oct. 11th, 2007 08:55 pm
While my aim with the Tooth and Claw RPG is to publish it primarily as a PDF, I also want to do an HTML release. But I'd really prefer not to go to the hassle of coding it by hand. Currently it's a Word document, and getting the PDF is really easy: I just print to Acrobat. Getting HTML appears to be a nightmare by comparison, at least if I use Word.
Is there an alternative that'll give me nice clean HTML without the huge amount of crud that Word puts in, e.g. micromanaging the position of every letter? But without mangling the page layout too badly?
Just tried Open Office - it is definitely NOT the answer to this one, layout was awful.
I have a very vague memory of a program called Stripper that did something like this, does anyone know if it's still around?
no subject
Date: 2007-10-11 08:20 pm (UTC)
There was still a bit of crud, but it was quite easy to strip out. Unfortunately I don't think AbiWord runs on Windows.
How many files are there? It might be possible to do it by hand or with custom Perl relatively easily.
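For what a custom clean-up script along those lines might look like, here is a rough sketch in Python rather than Perl. It is not any particular tool's method, just a guess at the kind of crud Word's "Save as HTML" tends to leave behind: conditional comments, namespaced Office tags such as `<o:p>`, `Mso...` class names, and inline `style` attributes. The function name `strip_word_crud` and the sample markup are made up for illustration, and regexes like these are fragile on real documents, so treat the result as a starting point for hand tidying.

```python
import re


def strip_word_crud(html: str) -> str:
    """Remove some common Word-specific markup from HTML saved by Word.

    Hypothetical helper: the patterns are assumptions about typical
    Word output, not a complete or guaranteed clean-up.
    """
    # Drop hidden conditional comments: <!--[if gte mso 9]> ... <![endif]-->
    html = re.sub(r"<!--\[if.*?<!\[endif\]-->", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Drop bare conditional markers such as <![if !supportLists]> and
    # <![endif]>, keeping whatever text sits between them
    html = re.sub(r"<!\[(?:if[^\]]*|endif)\]>", "", html, flags=re.IGNORECASE)
    # Drop namespaced Office tags (<o:p>, </o:p>, <v:shape ...>), keeping text
    html = re.sub(r"</?[a-z]+:[^>]*>", "", html, flags=re.IGNORECASE)
    # Drop class attributes that use Word's Mso... style names
    html = re.sub(r"""\s+class=("Mso[^"]*"|'Mso[^']*'|Mso\w+)""", "", html,
                  flags=re.IGNORECASE)
    # Drop quoted inline style attributes (this loses manual formatting too)
    html = re.sub(r"""\s+style=("[^"]*"|'[^']*')""", "", html,
                  flags=re.IGNORECASE)
    return html


if __name__ == "__main__":
    # Made-up fragment in the style of Word's HTML export
    sample = ("<p class=MsoNormal style='margin-left:1.0in'>Hello"
              "<o:p></o:p></p><!--[if gte mso 9]><xml>x</xml><![endif]-->")
    print(strip_word_crud(sample))
```

Removing every `style` attribute is the blunt option. If the page layout depends on some of them, that last substitution could be narrowed or dropped.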
no subject
Date: 2007-10-11 08:25 pm (UTC)
no subject
Date: 2007-10-11 11:12 pm (UTC)
no subject
Date: 2007-10-12 12:00 am (UTC)
Still leaves some crud, but all Office-specific tags should be removed, leaving fairly standard HTML.
no subject
Date: 2007-10-12 06:27 am (UTC)
no subject
Date: 2007-10-12 06:28 am (UTC)
no subject
Date: 2007-10-12 12:27 pm (UTC)
no subject
Date: 2007-10-12 02:15 pm (UTC)