Data set build slowdown

In https://github.com/HostByBelle/ip-db-test-data#data-processing you say this:

> Unfortunately, this final step is proving to be quite slow due to it's time complexity which reduces the data size we can easily build. If you have ideas on how to optimize this, please share!

Have you considered using [interval trees](https://en.wikipedia.org/wiki/Interval_tree)? That data structure is purpose-built for this kind of use case. One caution: [Portion's `IntervalDict`](https://github.com/AlexandreDecan/portion/tree/master#map-intervals-to-data) does *not* implement an optimized data structure. It is backed by a sorted dict, but it does not take advantage of the fast ordered lookups a sorted dict could provide.
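To make the suggestion concrete, here is a minimal sketch of an augmented interval tree in plain Python. I have not looked closely at the build code, so this assumes the slow step amounts to finding which stored CIDR ranges overlap a given range. The names `Node`, `build`, `overlapping`, and `cidr_range` are hypothetical, and the sample networks are documentation-only ranges, not data from the repository.

```python
import ipaddress
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    start: int
    end: int
    data: object
    max_end: int  # largest end value anywhere in this subtree
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def cidr_range(cidr):
    """Convert a CIDR string to a closed (first, last) integer range."""
    net = ipaddress.ip_network(cidr)
    return int(net.network_address), int(net.broadcast_address)


def build(intervals):
    """Build a balanced tree from (start, end, data) tuples sorted by start."""
    if not intervals:
        return None
    mid = len(intervals) // 2
    start, end, data = intervals[mid]
    node = Node(start, end, data, end)
    node.left = build(intervals[:mid])
    node.right = build(intervals[mid + 1:])
    for child in (node.left, node.right):
        if child is not None:
            node.max_end = max(node.max_end, child.max_end)
    return node


def overlapping(node, start, end, out):
    """Collect the data of every stored range overlapping [start, end]."""
    if node is None or node.max_end < start:
        return out  # no range in this subtree reaches the query
    overlapping(node.left, start, end, out)
    if node.start <= end and start <= node.end:
        out.append(node.data)
    if node.start <= end:  # otherwise every range to the right starts too late
        overlapping(node.right, start, end, out)
    return out


# Hypothetical sample data (RFC 5737 documentation ranges).
cidrs = ["192.0.2.0/24", "198.51.100.0/24", "203.0.113.0/25", "203.0.113.0/24"]
tree = build(sorted(cidr_range(c) + (c,) for c in cidrs))

hits = overlapping(tree, *cidr_range("203.0.113.5/32"), [])
print(sorted(hits))
```

The `max_end` field is what lets a query skip whole subtrees instead of scanning every range, which is the property a plain sorted container does not give you by itself. For real use, an existing interval tree package would probably be a better fit than hand-rolled code like this.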

Data set build slowdown #33
