-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inconsistent parsing results US address. #19
Comments
Maybe try it using the command-line parser that comes with the C library (clone the C library https://github.com/openvenues/libpostal and build it with Sometimes Docker issues are related to the resource requirements being somewhat larger than the default specs (4GB of RAM usually works). There needs to be enough disk space to hold the models, so I would check the byte size of the files in the data dir between the working env and the Docker one and make sure everything's downloaded/decompressed properly. |
Thanks for the response. The
and an
And the following are outputted from my docker instance:
In my docker container i also tried |
Sorry realized i didnt include the entire response from Here is what i get from the docker instances
And from my local
Not entirely sure on how to decipher that but it does look confused around the city component. The build on the docker instance is confusing the city as |
yeah it looks like you have an old version of the C library and model (pre 1.0) on docker and the latest from-source version on your machine. Check how it's being installed. If it's through apt-get or something, might need to check that those packages are up-to-date or just install it from source. Takes a little longer on Ubuntu to compile the scanner but not excessively long and can always throw it into a base image if needed. |
Apologies if this is the wrong medium for this question but i'm at a wall. I'm getting inconsistent parsing results in my environments that is making it difficult to debug.
For example this address (it's a fake street address but real city, state, and zip) parses incorrectly in my docker instance(debian), but if i were to run it locally (m1 macos) it would parse correctly.
1111 main street, Chapel Hill, North Carolina 27516
It seems to confuse the state
North Carolina
and appendsNorth
to the city value:While in another instance,
They're consistent in their environment, i have not re-compiled my local (correctly parsing) instance but both the docker instance and my local instance are using the same forked version of Libpostal when compiling and configuring/setting up.
I imagine this is an open ended and hard to answer question, but im wondering if this has been seen before and would appreciate just any insight into why they're different results and why it's not recognizing the state. Thanks in advance .
The text was updated successfully, but these errors were encountered: