You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TextAlignCenter.pdf
the result of parsing the pdf by using poppler directly on command line
WALDEN
BY
HENRY DAVID THOREAU
Here we have
some centered text lines
with background color
"fillc:#3277d3, bgcol:#beded9, rot:0"
/// truncated because too long
WALDEN
BY
HENRY DAVID THOREAU
Here we have
some centered text lines
with background color
"fillc:#3277d3, bgcol:#beded9, rot:0"
1854
94
// truncated because too long
````
### Expected Behaviour
expect the result to be:
WALDEN
BY
HENRY DAVID THOREAU
Here we have
some centered text lines
with background color
"fillc:#3277d3, bgcol:#beded9, rot:0"
/// truncated because too long
The text was updated successfully, but these errors were encountered:
Prerequisites
I have written a descriptive issue title
I have searched existing issues to ensure it has not already been reported
I agree to follow the Code of Conduct that this project adheres to
API/app/plugin version
7.2.2
Node.js version
20.14.0
Operating system
macOS
Operating system version (i.e. 20.04, 11.3, 10)
Sonoma(14.6.1)
Description
First of all thank you for this awesome library!
When I convert pdf to text while maintain the layout, I realise the first line of the page disrespect the layout. Seems because you trim the poppler output after receiving the response form poppler https://github.com/Fdawgs/node-poppler/blob/main/src/index.js#L1533
Steps to Reproduce
TextAlignCenter.pdf
the result of parsing the pdf by using poppler directly on command line
from
output
HENRY DAVID THOREAU
"fillc:#3277d3, bgcol:#beded9, rot:0"
/// truncated because too long
The text was updated successfully, but these errors were encountered: