-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any plans to implement converter to html from .doc version of spec? #10
Comments
Hi @slashTPA , not really any plans. I had a quick try with libreoffice ( https://libreofficehelp.com/batch-convert-writer-documents-pdf-libreoffice/ ):
That seems to produce a pdf that looks okay (except for the front page). |
Hi @CoRfr, ETSI PDF becomes available just (somewhen) after the plenary meetings. As of now, that limits the HTML usage to Rel-15 specs, while Rel-16 is available only at 3gpp site as DOC. I had a look as well into this: another option might be https://pandoc.org/ as that seems to convert straight into HTML. Will check that later on. |
It means that we would need to track the PDFs that have been converted from DOC, and periodically check if there is an ETSI version available. There is already a cron in the docker container to periodically (weekly) sync from 3gpp website so it might not be that difficult to add that ETSI check. I tried pandoc but I ran into:
One thing that I regret is that at the moment the file processing is taken care of in the frontend. I mean there should be some sort of pool of worker to process the files. And the files should probably be hosted on a S3 bucket or something like that. |
I agree, for the doc => html path, having only one path is the better approach. |
Not really any plans either - my contributions to this project are now very limited as I do not use 3GPP documents on a daily basis like I was at the time were I wrote this. For the frontend processing, it's true that it should be a one shot operation per document, so not a big deal overall. |
OK, cool. Currently I'm a bit limited in time, but I'll try to squeeze that in ;-). |
Hi,
while retrieving .doc file works well for latest release (e.g. rel-16) it looks like that no converter towards pdf or html is implemented. Hence html is only available for older versions (e.g. rel-15).
Do you have any plans to implement converter to html from .doc version of spec?
Do you have any more detailed docu that might enlighten me to contribute to this project?
Cheers/T
The text was updated successfully, but these errors were encountered: