Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Foreign language support #61

Open
JanStarman opened this issue Mar 18, 2024 · 1 comment
Open

Foreign language support #61

JanStarman opened this issue Mar 18, 2024 · 1 comment

Comments

@JanStarman
Copy link

Hello, I was doing POC on this package, but ran into problems when extracting PDFs with czech language.
All czech-specific characters, such as 'šěčřžýáíé' and so on, are extracted as '?'.
I didnt find any config setting for language or anything.

Am I missing something or is it not possible with this package?

Thank you for help,
John.

@wstumbo-mfj
Copy link
Contributor

It should be possible, but without seeing an example of what went wrong I can't provide any insight. If you can share an example PDF and your code, I'll look at it closer and try to help identify the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants