Add method `iterRecords_range` #309

lguez · 2024-10-15T06:18:37Z

I have tested this by reading about one million record:

#!/usr/bin/env python3

import time

import shapefile

r = shapefile.Reader("extremum")
t0 = time.perf_counter()

for i in range(10, 1000000):
    x = r.record(i)

t1 = time.perf_counter()
print(t1 - t0)
t0 = time.perf_counter()

for x in r.iterRecords_range(10, 1000000):
    pass

t1 = time.perf_counter()
print(t1 - t0)

I get:

$ test_iterRecords_range.py 
21.27043527108617
7.878919951850548

So iterRecords_range is somewhat faster.

Using iterRecords with a range option should be faster than calling record within a loop, since we avoid the multiple calls to seek.

This reverts commit e41b03c. JamesParrott pointed that I did not understand the way `__record` works: __record does not use oid to find the correct record, it just assumes it is the correct oid for the current position.

Using the method `iterRecords_range` should be somewhat faster than calling the method `record` within a loop, since we avoid the repeated calls to seek inside `record`.

JamesParrott · 2024-10-15T20:58:51Z

Great stuff - thanks Lionel :). This appears to be good from a quick look - I'll review it fully over the next few days and get back to you.

lguez · 2024-10-16T10:35:57Z

OK. Thanks. I have written a separate method because you said you prefered so, but I still think it would make simpler code and it would be clearer to the user to have additional optional arguments to the iterRecords method:

def iterRecords(self, fields=None, start=0, stop=None)

JamesParrott · 2024-10-17T19:35:17Z

Yeah, adjusting iterRecords is more sensible if the range iterator is not customisable, just start and stop (in future, step could be added too). It avoids duplication.

How's this look?
lguez/pyshp@master...GeospatialPython:pyshp:combine_iterRecords_range_into_iterRecords

[edit] I added one test, and the branch above passes it (and also passes all the previous ones).

I think "yields the same _Records, as Reader.record" covers the claim and the main contract with the user.

Can you, or anyone, think of anything else that should be tested?

lguez · 2024-10-18T06:52:24Z

No, it seems fine. Thanks.

JamesParrott · 2024-10-18T09:04:28Z

A branch has been made from this PR, which will be merged: https://github.com/GeospatialPython/pyshp/tree/combine_iterRecords_range_into_iterRecords

lguez added 3 commits October 11, 2024 20:19

Add option my_range to method iterRecords

e41b03c

Using iterRecords with a range option should be faster than calling record within a loop, since we avoid the multiple calls to seek.

Revert "Add option my_range to method iterRecords"

4efef9f

This reverts commit e41b03c. JamesParrott pointed that I did not understand the way `__record` works: __record does not use oid to find the correct record, it just assumes it is the correct oid for the current position.

Add method iterRecords_range

811d329

Using the method `iterRecords_range` should be somewhat faster than calling the method `record` within a loop, since we avoid the repeated calls to seek inside `record`.

JamesParrott closed this Oct 18, 2024

JamesParrott mentioned this pull request Oct 18, 2024

Combine iter records range into iter records #310

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add method `iterRecords_range` #309

Add method `iterRecords_range` #309

lguez commented Oct 15, 2024

JamesParrott commented Oct 15, 2024

lguez commented Oct 16, 2024

JamesParrott commented Oct 17, 2024 •

edited

Loading

lguez commented Oct 18, 2024

JamesParrott commented Oct 18, 2024

Add method iterRecords_range #309

Add method iterRecords_range #309

Conversation

lguez commented Oct 15, 2024

JamesParrott commented Oct 15, 2024

lguez commented Oct 16, 2024

JamesParrott commented Oct 17, 2024 • edited Loading

lguez commented Oct 18, 2024

JamesParrott commented Oct 18, 2024

Add method `iterRecords_range` #309

Add method `iterRecords_range` #309

JamesParrott commented Oct 17, 2024 •

edited

Loading