Skip to content

rpki: Sync cache up to three times in collaborative run#78

Open
fjahr wants to merge 1 commit intomasterfrom
precache
Open

rpki: Sync cache up to three times in collaborative run#78
fjahr wants to merge 1 commit intomasterfrom
precache

Conversation

@fjahr
Copy link
Collaborator

@fjahr fjahr commented Apr 24, 2025

This implements the idea suggested here: #69 (comment) (Number 2 at the bottom).

In the case of a collaborative run (using the wait feature), kartograf will use the following behavior:

  • There is a predefined timeframe for sync caching. I chose 10 minutes for this but it needs further testing if this needs to be expanded. It worked for me on slowed down connections well enough but I didn't test different CPU performance scenarios.
  • On start, sync the cache once right away
  • Then, check how much time is still left of the predefined timeframe. If there is still 30% of time left, just run again right away for a second time. If not, wait until the time is about to run out and run one more time at the end of the predefined period.
  • After the second run has finished just wait until the time is running out and then run a third time.

Rationale:

  • Even on a very fast computer with fast connection the sync will run 3 times at most, even if probably 5-6 time were possible. I have decided to do it this way because I am not sure syncing more than 3 times can give us much better results. If sending many requests to the smaller repositories is indeed a problem for them like we discussed, then this is a measure to not DDOS them too much. And it's also easier to reason about this since it's more constrained and we are still looking to learn from these results initially.
  • I am hoping that most modern laptops with a stable home connection should be able to sync 3 times within 10 minutes. If not I would want to expand the windo to 15min.
  • Slower hardware like a cheap VPS or a Raspberry PI should at least be able to run twice at least within the used timeframe.
  • The logic is nicely contained in context, but happy to debate if it should be moved somewhere else. It's only used by rpki, so maybe it should rather be moved there.

Open questions:

  • Should we expand the default timeframe as mentioned above (needs testing)
  • I am still undecided if I want to add an option to change the timeframe. We want all people to be use the same timeframe so this could be a footgun but on the other hand it may make testing easier in the beginning.

This is what the output currently looks like:

Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 180 seconds
RPKI sync #2
...took 191 seconds
(now sleeping 224 seconds)
RPKI sync #3
...took 142 seconds
Downloaded RPKI Data, hash sum: 821a68e84dc69ceb09c4753020c67d45567edee6b08533f50143e72605583bc9
...finished in 0:12:54.375893

(In this particular one the second took longer than the first, which is surprising. It was on a fast connection so download speed may just not be a blocker. But most tests took longer on the first sync.)

@jurraca
Copy link
Collaborator

jurraca commented Apr 24, 2025

ACK 4b79e70.

impl makes sense. worth testing in a coordinated run with a few people.

I don't think we should leave it to the user to set a timeframe. The timespan won't affect the ability to all get the same hash, so let's try to settle on one that works. 10 mins feels right but let's test.

I see some "Error: max connections" in my debug.log but i'm not sure who they were for, and whether they were successfully connected to later.

run logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1745504214 (2025-04-24 15:16:54 WEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1745504214 (2025-04-24 14:16:54 UTC, local: 2025-04-24 15:16:54 WEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /home/base/code/asmap/kartograf/data/1745504214/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /home/base/code/asmap/kartograf/data/1745504214/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /home/base/code/asmap/kartograf/data/1745504214/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /home/base/code/asmap/kartograf/data/1745504214/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /home/base/code/asmap/kartograf/data/1745504214/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 294 seconds
RPKI sync #2
...took 209 seconds
(now sleeping 93 seconds)
RPKI sync #3
...took 197 seconds
Downloaded RPKI Data, hash sum: 831613a27a971049db585bb3745f5bb151ee94090f9be7f999ec5a1db4d243cd
...finished in 0:13:25.608628

--- Validating RPKI ---

Validating RPKI ROAs
299929 raw RKPI ROA files found.
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1200/1200 [02:16<00:00, 8.80it/s]
299929 RKPI ROAs validated and saved to /home/base/code/asmap/kartograf/out/1745504214/rpki/rpki_raw.json, file hash: a5619417b25b42144e1efe35d48fa0be749faf4dfeaea39680adeb2697ac39db
...finished in 0:02:37.019999

--- Parsing RPKI ---

Parsing 299929 ROAs
Result entries written: 590814
Duplicates found: 82519
Invalids found: 8370
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:32.955638

--- Sorting results ---

...finished in 0:00:08.463013

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 925d24b58fe078bf30cd68d900b0efdda67f33b00e363bb3e365c0dd438c409c
Total runtime: 0:16:44.062188

@jurraca
Copy link
Collaborator

jurraca commented Apr 26, 2025

also would be nice to update the context test with these changes.

@fjahr fjahr force-pushed the precache branch 2 times, most recently from 8676065 to 59f9355 Compare April 27, 2025 21:26
@fjahr
Copy link
Collaborator Author

fjahr commented Apr 27, 2025

also would be nice to update the context test with these changes.

Done, also dealt with the remaining warmup naming.

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 27, 2025

@brunoerg would you be available for a test run with me and @jurraca in the next 2-3 days?

@brunoerg
Copy link
Contributor

@brunoerg would you be available for a test run with me and @jurraca in the next 2-3 days?

Yes!

@brunoerg
Copy link
Contributor

I'm getting a "Connection refused" error when testing this PR:

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
The epoch for this run is: 1745934109 (2025-04-29 13:41:49 UTC, local: 2025-04-29 10:41:49 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1745934109/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1745934109/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1745934109/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1745934109/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1745934109/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 594 seconds
Downloaded RPKI Data, hash sum: 8e740838bac13251321415b52c5b7c548503c4e5e68161e561b584db0e48695e
...finished in 0:10:56.212893

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 3fc8e0920190e7b6aac236c04227ed5b733ec33055f2c1ca34c8d7c020af2aa4
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: 389086ef85b708e4e3ae954a7e1bbf8bfbd14c377ae4c532090a2c1db9ef2382
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: e6c048303a472348abe9ac8edcec2de72c9f1510247bdda4c474aaaaaba7f918
Downloading arin.db.gz
Traceback (most recent call last):
  File "/Users/brunogarcia/projects/kartograf/./run", line 11, in <module>
    main()
  File "/Users/brunogarcia/projects/kartograf/kartograf/cli.py", line 104, in main
    Kartograf.map(args)
  File "/Users/brunogarcia/projects/kartograf/kartograf/kartograf.py", line 65, in map
    fetch_irr(context)
  File "/Users/brunogarcia/projects/kartograf/kartograf/timed.py", line 10, in wrapper
    result = func(*args, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^
  File "/Users/brunogarcia/projects/kartograf/kartograf/irr/fetch.py", line 41, in fetch_irr
    with FTP(host) as ftp:
         ^^^^^^^^^
  File "/Users/brunogarcia/.pyenv/versions/3.11.11/lib/python3.11/ftplib.py", line 121, in __init__
    self.connect(host)
  File "/Users/brunogarcia/.pyenv/versions/3.11.11/lib/python3.11/ftplib.py", line 158, in connect
    self.sock = socket.create_connection((self.host, self.port), self.timeout,
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/brunogarcia/.pyenv/versions/3.11.11/lib/python3.11/socket.py", line 863, in create_connection
    raise exceptions[0]
  File "/Users/brunogarcia/.pyenv/versions/3.11.11/lib/python3.11/socket.py", line 848, in create_connection
    sock.connect(sa)
ConnectionRefusedError: [Errno 61] Connection refused

@jurraca
Copy link
Collaborator

jurraca commented Apr 29, 2025

that's a new one, we should at least handle that correctly. It happened consistently on a few tries?
opened #79 for tracking

@brunoerg
Copy link
Contributor

that's a new one, we should at least handle that correctly. It happened consistently on a few tries? opened #79 for tracking

Yes. I just tried twice on this PR and both runs it happened, then I tried again but on master and it happened again :(

@jurraca
Copy link
Collaborator

jurraca commented Apr 29, 2025

I can reproduce this. The FTP address for ARIN is still https://ftp.arin.net/pub/rr/ and i'm able to download via the UI, so doesn't seem to be down. Not sure here.
Could you try this branch running just ./run map (without -irr) ?

@brunoerg
Copy link
Contributor

Could you try this branch running just ./run map (without -irr) ?

It worked fine.

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
The epoch for this run is: 1745947362 (2025-04-29 17:22:42 UTC, local: 2025-04-29 14:22:42 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1745947362/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1745947362/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1745947362/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1745947362/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1745947362/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 356 seconds
Downloaded RPKI Data, hash sum: 05e7d4723f43edff4bc27e74bb11d3e196811577fe283ce525efa82f3296e195
...finished in 0:06:52.498349

--- Validating RPKI ---

Validating RPKI ROAs
299797 raw RKPI ROA files found.
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1200/1200 [01:06<00:00, 17.94it/s]
299797 RKPI ROAs validated and saved to /Users/brunogarcia/projects/kartograf/out/1745947362/rpki/rpki_raw.json, file hash: 86bb15a994272199a011c1a836d3df04d2f660eb438e78a85c66c82b78334bed
...finished in 0:01:31.313057

--- Parsing RPKI ---

Parsing 299797 ROAs
Result entries written: 606995
Duplicates found: 86294
Invalids found: 2205
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:18.223953

--- Sorting results ---

...finished in 0:00:04.889057

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 08e954ad74f716ada921ffb833ac8136513d66c60721289346f7ac542b181f52
Total runtime: 0:08:46.937830

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 29, 2025

I'm getting a "Connection refused" error when testing this PR

Oh wow, interesting. Good we caught this here and didn't run into this in an actual run 😅 It appears ARIN has removed support for FTP very recently: https://www.arin.net/blog/2025/02/10/ftp-retirement/

It's kind of annoying but we'll have to switch over to https download I guess. It seems that we can just do that for all RIRs though and the change is rather minimal, I have opened a pull here: #80

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 29, 2025

@brunoerg would you be available for a test run with me and @jurraca in the next 2-3 days?

Yes!

Sorry for being a bit late here, how does tomorrow 4pm UTC for you both @brunoerg @jurraca , i.e.:

$ ./run map -rv -w 1746021600

(skipping IRR for now so we don't depend on the other PR here)

If that doesn't work for you or you see this too late please propose a new time, thanks! :)

@brunoerg
Copy link
Contributor

@brunoerg would you be available for a test run with me and @jurraca in the next 2-3 days?

Yes!

Sorry for being a bit late here, how does tomorrow 4pm UTC for you both @brunoerg @jurraca , i.e.:


$ ./run map -rv -w 1746021600

(skipping IRR for now so we don't depend on the other PR here)

If that doesn't work for you or you see this too late please propose a new time, thanks! :)

Works for me.

@jurraca
Copy link
Collaborator

jurraca commented Apr 30, 2025

how does tomorrow 4pm UTC for you

works for me

@brunoerg
Copy link
Contributor

Is 1746021600 4pm UTC?

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 30, 2025

Is 1746021600 4pm UTC?

Oh, sorry, I guess the tool I used didn't use UTC but my local timezone instead (UTC+2). Let's use the actual UTC 4pm: 1746028800.

$ ./run map -rv -w 1746028800

@jurraca
Copy link
Collaborator

jurraca commented Apr 30, 2025

success d722edb130bfa2135606223f12f3f146d37f16ca6ff1d51559c44f79a21e4938

logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1746028800 (2025-04-30 17:00:00 WEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1746028800 (2025-04-30 16:00:00 UTC, local: 2025-04-30 17:00:00 WEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /home/base/code/asmap/kartograf/data/1746028800/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /home/base/code/asmap/kartograf/data/1746028800/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /home/base/code/asmap/kartograf/data/1746028800/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /home/base/code/asmap/kartograf/data/1746028800/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /home/base/code/asmap/kartograf/data/1746028800/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 333 seconds
RPKI sync #2
...took 267 seconds
Downloaded RPKI Data, hash sum: 5be989f776050d7b0ec035ad08b6473d69225a25cc331c484d9bf1d10ec2c45b
...finished in 0:10:46.961442

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/04/routeviews-rv2-20250428-0200.pfx2as.gz
Downloaded /home/base/code/asmap/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: 48e42246812ecfc4b4861cdb54e7096336a7f34227cd8b3dcf5ee7368f5d5577
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/04/routeviews-rv6-20250429-1200.pfx2as.gz
Downloaded /home/base/code/asmap/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 5175ce6e5f1477124e72960212748391f95708c1adbf9a365075270d5d577f05
...finished in 0:00:17.881124

--- Validating RPKI ---
Validating RPKI ROAs
299962 raw RKPI ROA files found.
100%|█████████████████████████████████████████████████████████████████| 1200/1200 [02:07<00:00, 9.40it/s]
299962 RKPI ROAs validated and saved to /home/base/code/asmap/kartograf/out/1746028800/rpki/rpki_raw.json, file hash: dca267a3868d2638a2b4d80ee949e481a9bb93421246b0edb72fe8374a6db4a7
...finished in 0:02:28.966782

--- Parsing RPKI ---

Parsing 299962 ROAs
Result entries written: 560278
Duplicates found: 74680
Invalids found: 23338
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:31.674554

--- Parsing Routeviews pfx2as ---

Unzipping /home/base/code/asmap/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /home/base/code/asmap/kartograf/out/1746028800/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /home/base/code/asmap/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /home/base/code/asmap/kartograf/out/1746028800/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /home/base/code/asmap/kartograf/out/1746028800/collectors/pfx2asn.txt
Entries after cleanup: 1262104
...finished in 0:00:50.521844

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1262104 entries filtered down to 593067
Merging base file with filtered extra file
...finished in 0:08:20.536490

--- Sorting results ---

...finished in 0:00:15.606272

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: d722edb130bfa2135606223f12f3f146d37f16ca6ff1d51559c44f79a21e4938
Total runtime: 0:23:32.846494

@brunoerg
Copy link
Contributor

c3095a48081ba668f79cd3997fcbf43cba90a4c9bcbe752f26185587920ff8aa

logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1746028800 (2025-04-30 13:00:00 -03) to launch mapping process.
Countdown: 1 second(s)
Starting...
The epoch for this run is: 1746028800 (2025-04-30 16:00:00 UTC, local: 2025-04-30 13:00:00 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1746028800/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1746028800/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1746028800/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1746028800/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1746028800/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 2194 seconds
Downloaded RPKI Data, hash sum: f16604772b8a3d703ed81a38263d4761c2c8ed657750bc491dd943a4c2196146
...finished in 0:36:50.553079

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/04/routeviews-rv2-20250429-1200.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: 9fc578ec0440b360bcd80db0d4cbb5a6271d6e4ccb922d4607d5bf7f08a12d2b
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/04/routeviews-rv6-20250429-1200.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 5175ce6e5f1477124e72960212748391f95708c1adbf9a365075270d5d577f05
...finished in 0:00:39.512073

--- Validating RPKI ---

Validating RPKI ROAs
58873 raw RKPI ROA files found.
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 236/236 [00:08<00:00, 27.84it/s]
58873 RKPI ROAs validated and saved to /Users/brunogarcia/projects/kartograf/out/1746028800/rpki/rpki_raw.json, file hash: 9126c071f2e3861d936e7bbbb0c0f2c81dab60b0bd66b4aee19a81771565377b
...finished in 0:00:11.319602

--- Parsing RPKI ---

Parsing 58873 ROAs
Result entries written: 53075
Duplicates found: 7047
Invalids found: 16772
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:01.650471

--- Parsing Routeviews pfx2as ---

Unzipping /Users/brunogarcia/projects/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1746028800/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/brunogarcia/projects/kartograf/data/1746028800/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1746028800/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/brunogarcia/projects/kartograf/out/1746028800/collectors/pfx2asn.txt
Entries after cleanup: 1262001
...finished in 0:00:27.308731

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1262001 entries filtered down to 1141651
Merging base file with filtered extra file
...finished in 0:00:40.605594

--- Sorting results ---

...finished in 0:00:08.516794

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: c3095a48081ba668f79cd3997fcbf43cba90a4c9bcbe752f26185587920ff8aa
Total runtime: 0:38:59.945277

@jurraca
Copy link
Collaborator

jurraca commented Apr 30, 2025

so your first RPKI sync took 2194 seconds... which is about 36 minutes. so it didn't have time to do other syncs.
You ended up with 58873 RPKI ROAs, while i had about more than 5x that with 299962 ROAs. Could you drop your debug.log here so I can take a look? Probably connection errors or getting blocked for some reason.

@brunoerg
Copy link
Contributor

so your first RPKI sync took 2194 seconds... which is about 36 minutes. so it didn't have time to do other syncs. You ended up with 58873 RPKI ROAs, while i had about more than 5x that with 299962 ROAs. Could you drop your debug.log here so I can take a look? Probably connection errors or getting blocked for some reason.

debug.log

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 30, 2025

My result hash is c8bab8fdc10a9d0468b756ebe2c111cde078a685b8003829a3a20cafa4f289ec and just on the RPKI data it's dba4c8eaad7c02e175b6a80fd89e65a442ff17ef41c9f5e7abcf1a138d0eaf55, so no matches on either one.

That's a bit disappointing but it's also wild to me how big the differences are in download times. For me they were all as quick as I am used to (all <200sec) and so I also had three syncs. For @jurraca the sync were slower and that resulted in just two syncs. The second sync again took almost as long as the first which it throwing off my calculation quite a bit. I will need to lower the threshold of when we run the second sync right away and when we wait until the end of the warmup period. I'll set it at 50% instead of 70%. At least that's a learning I think.

But @brunoerg 's results are crazy to me, I have never seen a sync take this long and get that little data :D I only took a brief look at the debug log, there are a few things that stand out to me but I can't really make sense of it yet. In validation I see several failed to build authority chain with arin and ripe. In the download section I see these:

rpki-client: https://rrdp.arin.net/4a394319-7460-4141-a416-1addb69284ff/111416/snapshot.xml (2001:500:31::149): TLS read: read failed: error:0A000126:SSL routines::unexpected eof while reading
rpki-client: https://rrdp.arin.net/notification.xml: bad message digest
rpki-client: https://rrdp.arin.net/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rrdp.afrinic.net/8fe3109e-2561-4627-8850-83ab94b9bb91/39133/snapshot... (2606:4700:3033::ac43:ccc4): timeout, connection closed
rpki-client: https://rrdp.afrinic.net/notification.xml: bad message digest
rpki-client: https://rrdp.afrinic.net/notification.xml: load from network failed, fallback to rsync
rsync warning: some files vanished before they could be transferred (code 24) at main.c(1872) [generator=3.4.1]
rpki-client: rsync rsync://rpki.arin.net/repository failed
rpki-client: .rsync/rpki.arin.net/repository: load from network failed, fallback to cache
rpki-client: https://rrdp.apnic.net/notification.xml: synchronisation timeout
rpki-client: https://rrdp.apnic.net/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rrdp.ripe.net/notification.xml: synchronisation timeout
rpki-client: https://rrdp.ripe.net/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rrdp.apnic.net/1976999f-5bc5-4767-b567-e9d15ac74007/10452/snapshot.xml (2606:4700::6812:eb44): data write: Broken pipe
rpki-client: https://rrdp.ripe.net/172322cf-c642-4e6f-806c-bd2375d8001a/123549/snapshot-9... (2600:1419:4c00::c8ae:9432): data write: Broken pipe
rsync: [receiver] read error: Connection reset by peer (54)
rsync error: error in socket IO (code 10) at io.c(807) [receiver=3.4.1]
rsync: connection unexpectedly closed (953623 bytes received so far) [generator]
rsync error: error in rsync protocol data stream (code 12) at io.c(232) [generator=3.4.1]
rpki-client: rsync rsync://rpki.ripe.net/repository failed
rpki-client: .rsync/rpki.ripe.net/repository: load from network failed, fallback to cache
rpki-client: rpki.ripe.net/repository/ripe-ncc-ta.mft: no valid manifest available
rpki-client: https://rpki-repository.nic.ad.jp/rrdp/ap/notification.xml (2001:dc2:1000:8000::2): connect: Connection refused
rpki-client: https://rpki-repository.nic.ad.jp/rrdp/ap/notification.xml (192.41.192.213): connect: Connection refused
rpki-client: https://rpki-repository.nic.ad.jp/rrdp/ap/notification.xml: load from network failed, fallback to rsync
rpki-client: https://repo-rpki.idnic.net/rrdp/notification.xml (2406:6cc0:a300:4622:116:193:188:25): connect: Network is unreachable
rpki-client: rpki.cnnic.cn (218.241.105.61): TLS read: read failed: error:0A000126:SSL routines::unexpected eof while reading
rpki-client: https://rrdp.sub.apnic.net/0f651db6-6aaa-4a11-9429-5f60b4d07c0a/196/snapshot... (2606:4700::6812:ec44): TLS handshake: handshake failed: Connection reset by peer
rpki-client: https://rrdp.sub.apnic.net/notification.xml: bad message digest
rpki-client: https://rrdp.sub.apnic.net/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rrdp-rps.arin.net/222bc4dd-08cd-46c3-b744-f99835520fa4/67404/snapsho... (2001:500:13::148): short read, connection closed
rpki-client: https://rrdp-rps.arin.net/notification.xml: bad message digest
rpki-client: https://rrdp-rps.arin.net/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rpki.cnnic.cn/rrdp/d4427960-abb3-49e4-902c-d815a518b867/snapshot/292... (42.83.145.17): TLS handshake: handshake failed: Connection reset by peer
rpki-client: https://rpki.cnnic.cn/rrdp/notify.xml: bad message digest
rpki-client: https://rpki.cnnic.cn/rrdp/notify.xml: load from network failed, fallback to rsync
rpki-client: rsync.rpki.tianhai.link/repo/TianhaiRpki/3/459D2B834A3BA08C0AA67C6E578146A8B1FE2C2F.mft: CRL has expired
rpki-client: rpki.zsq.im/repo/cernet_child/1/A828E0147CD8DE9E7A474DCDF06AD96FB37738EE.mft: CRL has expired
rpki-client: https://rpki-rrdp.mnihyc.com/rrdp/notification.xml (2606:4700:3035::6815:1718): short read, connection closed
rpki-client: https://rpki-rrdp.mnihyc.com/rrdp/notification.xml: load from network failed, fallback to rsync
rpki-client: rpki.luys.cloud/repo/LY-RPKI/1/47717A8A2E301D872DAC129F2CBAC587F01A0813.mft: CRL has expired
rpki-client: https://rrdp.paas.rpki.ripe.net/1c33ba5d-4e16-448d-9a22-b12599ef1cba/216247/... (2600:1419:4c00::c8ae:9412): timeout, connection closed
rpki-client: https://rrdp.paas.rpki.ripe.net/notification.xml: bad message digest
rpki-client: https://rrdp.paas.rpki.ripe.net/notification.xml: load from network failed, fallback to rsync
rpki-client: rpki.sub.apnic.net/repository/A91905300000/0/30356EF46D594DA57C7697011183ACFFE04613E0.mft: CRL has expired
rpki-client: rpki.sub.apnic.net/repository/A91905300000/3/B18F989967D30F2509E3D29707C5F85B5F6BDB6F.mft: CRL has expired
rpki-client: rpki.sub.apnic.net/repository/A9150E4D0000/0/6228A3C6D637801692F7D7D48251DAD36A3CDB7F.mft: CRL has expired
rpki-client: rpki.sub.apnic.net/repository/A91905300000/4/B7351B05B1619E9DE395F0B2FFC9EE1129F300B4.mft: CRL has expired
rpki-client: rpki.sub.apnic.net/repository/A91E54810000/0/D988B2DA7F4305F0CDB93BF88B6F5A9DF98D8940.mft: CRL has expired
rpki-client: rpki.sub.apnic.net/repository/A91905300000/7/4B1987D5160068092602529871C93EA320045A75.mft: CRL has expired
rpki-client: dev.tw: nodename nor servname provided, or not known
rpki-client: https://dev.tw/rpki/notification.xml: load from network failed, fallback to rsync
rsync: getaddrinfo: dev.tw 873: nodename nor servname provided, or not known
rsync error: error in socket IO (code 10) at clientserver.c(139) [Receiver=3.4.1]
rpki-client: rsync rsync://dev.tw/rpki failed
rpki-client: .rsync/dev.tw/rpki: load from network failed, fallback to cache
rpki-client: dev.tw/rpki/August/14/EE962658B1226680140B4EA86F7D68CF498158D9.mft: no valid manifest available
rpki-client: https://rpki-repo.registro.br/rrdp/notification.xml: synchronisation timeout
rpki-client: https://rpki-repo.registro.br/rrdp/notification.xml: load from network failed, fallback to rsync
rpki-client: https://rpki-repo.registro.br/rrdp/b0982bb6-6f99-448a-9f71-91a86a455e39/3601... (2001:12ff:0:2::50): data write: Broken pipe
rpki-client: rsync://rpki.apnic.net/member_repository: synchronisation timeout
rpki-client: .rsync/rpki.apnic.net/member_repository: load from network failed, fallback to cache

Also a lot of these "file has vanished" like this:

file has vanished: "/arin-rpki-ta/5e4a23ea-e80a-403e-b08c-2171da2157d3/0357272c-a79a-45bf-9586-92dd49ef3223/3c4491b9-92bb-4e3e-a60c-c28bc100dfdc/a0a78142-4066-358e-b365-4fbc25a9841f.roa" (in repository)

And then at the end of the download section:

rsync error: received SIGINT, SIGTERM, or SIGHUP (code 20) at rsync.c(763) [generator=3.4.1]
rsync error: received SIGUSR1 (code 19) at main.c(1620) [receiver=3.4.1]
rpki-client: rsync rsync://rpki.apnic.net/member_repository failed
rsync error: received SIGINT, SIGTERM, or SIGHUP (code 20) at rsync.c(763) [generator=3.4.1]
rsync error: received SIGUSR1 (code 19) at main.c(1620) [receiver=3.4.1]
rpki-client: rsync rsync://rsync.paas.rpki.ripe.net/repository failed

So all we can say for sure is something has clearly prevented the usual sync of the big RIR repos via rrdp. Was there something different about the connection compared to your previous usage of kartograf @brunoerg ? Since it seems to happen across the board I would rather put my money on some local issue this time...

@brunoerg
Copy link
Contributor

It's the same connection as always, nothing different. Perhaps I could try to use a VPN to see if anything changes.

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 30, 2025

It's the same connection as always, nothing different. Perhaps I could try to use a VPN to see if anything changes.

Would definitely be interesting to know if using a VPN changes something...

@brunoerg
Copy link
Contributor

It's the same connection as always, nothing different. Perhaps I could try to use a VPN to see if anything changes.

Would definitely be interesting to know if using a VPN changes something...

Cool, I'll try it tomorrow. Do you think we can run it again at same time tomorrow?

@fjahr
Copy link
Collaborator Author

fjahr commented Apr 30, 2025

It's the same connection as always, nothing different. Perhaps I could try to use a VPN to see if anything changes.

Would definitely be interesting to know if using a VPN changes something...

Cool, I'll try it tomorrow. Do you think we can run it again at same time tomorrow?

Works for me, I should be online for the irc meeting anyway.

I hope I am getting this one correct on first try 😅

$ ./run map -rv -w 1746115200

@brunoerg
Copy link
Contributor

brunoerg commented May 1, 2025

I used VPN, set to Germany, result:

0f9b344e96f7615f9888bfc6cf4641392a1609043274194273cfa767de51f0c1

logs
--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1746115200 (2025-05-01 13:00:00 -03) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1746115200 (2025-05-01 16:00:00 UTC, local: 2025-05-01 13:00:00 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1746115200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1746115200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1746115200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1746115200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1746115200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 3 seconds
RPKI sync #2
...took 0 seconds
(now sleeping 584 seconds)
RPKI sync #3
...took 3 seconds
Downloaded RPKI Data, hash sum: e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
...finished in 0:10:02.684407

--- Fetching Routeviews pfx2as ---

The page at https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/05/ couldn't be fetched. Trying the previous month.
Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/04/routeviews-rv2-20250429-1200.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: 9fc578ec0440b360bcd80db0d4cbb5a6271d6e4ccb922d4607d5bf7f08a12d2b
The page at https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/05/ couldn't be fetched. Trying the previous month.
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/04/routeviews-rv6-20250430-0600.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: d4cce079b2de44afed0c21ead775d653fab008305f0e8f6ac49f5957ec2ebe13
...finished in 0:01:19.459845

--- Validating RPKI ---

Validating RPKI ROAs
0 raw RKPI ROA files found.
0it [00:00, ?it/s]
0 RKPI ROAs validated and saved to /Users/brunogarcia/projects/kartograf/out/1746115200/rpki/rpki_raw.json, file hash: 4f53cda18c2baa0c0354bb5f9a3ecbe5ed12ab4d8e11ba873c2f11161202b945
...finished in 0:00:00.042939

--- Parsing RPKI ---

Parsing 0 ROAs
Result entries written: 0
Duplicates found: 0
Invalids found: 0
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:00.000144

--- Parsing Routeviews pfx2as ---

Unzipping /Users/brunogarcia/projects/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1746115200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/brunogarcia/projects/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1746115200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/brunogarcia/projects/kartograf/out/1746115200/collectors/pfx2asn.txt
Entries after cleanup: 1262287
...finished in 0:00:28.826688

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1262287 entries filtered down to 1262287
Merging base file with filtered extra file
...finished in 0:00:19.874836

--- Sorting results ---

...finished in 0:00:09.496421

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 0f9b344e96f7615f9888bfc6cf4641392a1609043274194273cfa767de51f0c1
Total runtime: 0:12:20.904179

@fjahr
Copy link
Collaborator Author

fjahr commented May 1, 2025

Unfortunately I messed up this time and let my computer go to sleep at exactly the wrong time because I got distracted. I got a result that looks alright but it's not really good test.

logs
$ ./run map -rv -w 1746115200

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1746115200 (2025-05-01 18:00:00 CEST) to launch mapping process.
Countdown: 14 minute(s), 43 second(s)
Starting...
The epoch for this run is: 1746115200 (2025-05-01 16:00:00 UTC, local: 2025-05-01 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/FJ/projects/python/kartograf/data/1746115200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/FJ/projects/python/kartograf/data/1746115200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/FJ/projects/python/kartograf/data/1746115200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/FJ/projects/python/kartograf/data/1746115200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/FJ/projects/python/kartograf/data/1746115200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 1219 seconds
Downloaded RPKI Data, hash sum: 2b70d72133c3f1c04d3b539ad2ac71befb118bc7e873c212fca28e46a6453037
...finished in 0:03:31.759650

--- Fetching Routeviews pfx2as ---

The page at https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/05/ couldn't be fetched. Trying the previous month.
Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/04/routeviews-rv2-20250430-0800.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: 0407714f359d4c0dc269cf14c5dba9d4f830efc7c91432c1d47063a9544c7f6f
The page at https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/05/ couldn't be fetched. Trying the previous month.
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/04/routeviews-rv6-20250430-0600.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: d4cce079b2de44afed0c21ead775d653fab008305f0e8f6ac49f5957ec2ebe13
...finished in 0:00:08.537682

--- Validating RPKI ---

Validating RPKI ROAs
257189 raw RKPI ROA files found.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1029/1029 [00:15<00:00, 65.11it/s]
257189 RKPI ROAs validated and saved to /Users/FJ/projects/python/kartograf/out/1746115200/rpki/rpki_raw.json, file hash: ec7062e8d7e0619c6ca5b39c5d8110dc53f809bdb3f4f508156336d0f6cd4dec
...finished in 0:00:29.378026

--- Parsing RPKI ---

Parsing 257189 ROAs
Result entries written: 344077
Duplicates found: 52359
Invalids found: 17097
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:07.345714

--- Parsing Routeviews pfx2as ---

Unzipping /Users/FJ/projects/python/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1746115200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/FJ/projects/python/kartograf/data/1746115200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1746115200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/FJ/projects/python/kartograf/out/1746115200/collectors/pfx2asn.txt
Entries after cleanup: 1262375
...finished in 0:00:16.881668

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1262375 entries filtered down to 814585
Merging base file with filtered extra file
...finished in 0:01:28.943488

--- Sorting results ---

...finished in 0:00:05.052372

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: bb2704356e6ea0d25a81a9978c68c3fb47c6f4ea8003990557442127da15d8c2
Total runtime: 0:23:36.615678

@brunoerg looks like you were having issues again but different this time, instead of one long connection with limited results there were now three tries but all very short and no results at all. I guess using a VPN made the situation even worse :-/

@brunoerg
Copy link
Contributor

brunoerg commented May 2, 2025

@brunoerg looks like you were having issues again but different this time, instead of one long connection with limited results there were now three tries but all very short and no results at all. I guess using a VPN made the situation even worse :-/

Yes, noticed the same, it didn't help.

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 7, 2025

Hm, sorry for dropping the ball on this a bit here but I am still a bit unsure what to do. Our tests didn't work but @brunoerg 's problem doesn't seem to be related to this change and I am pretty hopeful that this change should improve things. We should do another run soon I think. Should we merge it before that or do you think we should do more testing first @jurraca @brunoerg ?

@jurraca
Copy link
Collaborator

jurraca commented Jun 8, 2025

yea unsure what to do as well. Probably good to do another run with us three first. im free next week.

@brunoerg
Copy link
Contributor

brunoerg commented Jun 9, 2025

Yes, better to do another run again. I'm free.

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 9, 2025

@jurraca @brunoerg cool, if you see this in time let's try tomorrow (Tuesday, 10th of June) at 6pm CET/4pm GMT. If we miss it let me know and we can repeat it on Thursday same time.

$ ./run map -rv -irr -w 1749571200

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1749571200 (2025-06-10 18:00:00 CEST) to launch mapping process.
Countdown: 17 hour(s), 27 minute(s), 55 second(s)

I have also pushed a rebase of this PR since there were some fixes and this also means we can use IRR again.

@brunoerg
Copy link
Contributor

5ccb49d4989989bfe84bdb053b563c4f5325bb91a04d0862bf35fcd72c0603c7

Logs ➜ kartograf git:(78) ./run map -rv -irr -w 1749571200

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1749571200 (2025-06-10 13:00:00 -03) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1749571200 (2025-06-10 16:00:00 UTC, local: 2025-06-10 13:00:00 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1749571200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1749571200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1749571200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1749571200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1749571200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 324 seconds
RPKI sync #2
...took 260 seconds
(now sleeping 10 seconds)
RPKI sync #3
...took 261 seconds
Downloaded RPKI Data, hash sum: 31861cd4802f60f21c63475cddcb93555f5288245e42af76c54c26ae69811489
...finished in 0:15:14.414078

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 9716b95722732bec8d07951881c4af42e4873d5c4d596c4f07508dd05386eb60
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: 65faa570180e8b8c3fb03094f6c561d09d04b8bf83ba0f5c2c1b192996f43882
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: be0a9492c5d091400f09afc9d0d0a224cae8eba0dac6dcccde2243689df9b8b5
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: 04482352a0b64ba1fec4f2b65544fa61cd16fe6a457d6fd51a4be9c7da5c6b16
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: eae183c74546a061807a5991313dd7a679b32cb102782b7f99022a42472dc0f7
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 771a25dba5e036644d8244dfa3d765fb2eb0f8c6cb508e39f589469555794810
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: 757fa0282a1f5bb10691628034c9e85557c2ea9ed5fd20b78a250a561acd4605
...finished in 0:00:31.678750

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250608-1200.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: c918d792f17317bbba4bb902ce5d5ad1edb2e1ba8bbcdb1e751cd64cdf3b873b
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250609-0600.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: a48cbd0683cf1a1b7f883583d69171482b77aa88f475f939a0cd1799b3ed1741
...finished in 0:00:07.399022

--- Validating RPKI ---

Validating RPKI ROAs
308165 raw RKPI ROA files found.
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1233/1233 [02:15<00:00, 9.09it/s]
308165 RKPI ROAs validated and saved to /Users/brunogarcia/projects/kartograf/out/1749571200/rpki/rpki_raw.json, file hash: dc0847e7c61ca25ae6ea47e4e1aa818bf4ad844a87b6d8342d210479a76c9ac7
...finished in 0:02:41.678831

--- Parsing RPKI ---

Parsing 308165 ROAs
Result entries written: 552553
Duplicates found: 83130
Invalids found: 37110
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:17.722766

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/lacnic.db
Found in this file: 15053
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/ripe.db.route
Found in this file: 386459
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/apnic.db.route6
Found in this file: 841922
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/apnic.db.route
Found in this file: 610357
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/arin.db
Found in this file: 105738
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/afrinic.db
Found in this file: 115265
Parsing /Users/brunogarcia/projects/kartograf/out/1749571200/irr/ripe.db.route6
Found in this file: 146483
Found valid, unique entries: 2221277
...finished in 0:01:25.944052

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2221277 entries filtered down to 505716
Merging base file with filtered extra file
...finished in 0:06:07.163119

--- Parsing Routeviews pfx2as ---

Unzipping /Users/brunogarcia/projects/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1749571200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/brunogarcia/projects/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1749571200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/brunogarcia/projects/kartograf/out/1749571200/collectors/pfx2asn.txt
Entries after cleanup: 1266387
...finished in 0:00:28.832596

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1266387 entries filtered down to 389277
Merging base file with filtered extra file
...finished in 0:04:36.050480

--- Sorting results ---

...finished in 0:00:12.188039

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 5ccb49d4989989bfe84bdb053b563c4f5325bb91a04d0862bf35fcd72c0603c7
Total runtime: 0:31:46.690820

@jurraca
Copy link
Collaborator

jurraca commented Jun 10, 2025

I got stuck in transport and missed it, sorry guys. my bad :/

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 10, 2025

I got 395707bc5227810e0ee436b9ac95fd91bfac72bb59abca42d0eb65093e0f0715

Mine and @brunoerg numbers seem to match on IRR and RV but I got about 500 more entries from RPKI...

Logs
$ ./run map -rv -irr -w 1749571200

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1749571200 (2025-06-10 18:00:00 CEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1749571200 (2025-06-10 16:00:00 UTC, local: 2025-06-10 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/FJ/projects/python/kartograf/data/1749571200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/FJ/projects/python/kartograf/data/1749571200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/FJ/projects/python/kartograf/data/1749571200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/FJ/projects/python/kartograf/data/1749571200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/FJ/projects/python/kartograf/data/1749571200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 241 seconds
RPKI sync #2
...took 220 seconds
(now sleeping 134 seconds)
RPKI sync #3
...took 156 seconds
Downloaded RPKI Data, hash sum: 40da74ed65172f4d4ef77701b782fdef3629259699279ce9a62276b76174b93c
...finished in 0:13:07.386197

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 9716b95722732bec8d07951881c4af42e4873d5c4d596c4f07508dd05386eb60
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: 65faa570180e8b8c3fb03094f6c561d09d04b8bf83ba0f5c2c1b192996f43882
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: be0a9492c5d091400f09afc9d0d0a224cae8eba0dac6dcccde2243689df9b8b5
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: 04482352a0b64ba1fec4f2b65544fa61cd16fe6a457d6fd51a4be9c7da5c6b16
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: eae183c74546a061807a5991313dd7a679b32cb102782b7f99022a42472dc0f7
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 771a25dba5e036644d8244dfa3d765fb2eb0f8c6cb508e39f589469555794810
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: 757fa0282a1f5bb10691628034c9e85557c2ea9ed5fd20b78a250a561acd4605
...finished in 0:00:59.455241

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250608-1200.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: c918d792f17317bbba4bb902ce5d5ad1edb2e1ba8bbcdb1e751cd64cdf3b873b
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250609-0600.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: a48cbd0683cf1a1b7f883583d69171482b77aa88f475f939a0cd1799b3ed1741
...finished in 0:00:07.546997

--- Validating RPKI ---

Validating RPKI ROAs
308164 raw RKPI ROA files found.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1233/1233 [01:04<00:00, 19.10it/s]
308164 RKPI ROAs validated and saved to /Users/FJ/projects/python/kartograf/out/1749571200/rpki/rpki_raw.json, file hash: 85ec7f9ec5af5f4f9d5bc7a0c47286cf88868faf58fbe88c2228cda938351997
...finished in 0:01:21.746717

--- Parsing RPKI ---

Parsing 308164 ROAs
Result entries written: 553063
Duplicates found: 83184
Invalids found: 36632
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:12.248062

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/lacnic.db
Found in this file: 15053
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/ripe.db.route
Found in this file: 386459
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/apnic.db.route6
Found in this file: 841922
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/apnic.db.route
Found in this file: 610357
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/arin.db
Found in this file: 105738
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/afrinic.db
Found in this file: 115265
Parsing /Users/FJ/projects/python/kartograf/out/1749571200/irr/ripe.db.route6
Found in this file: 146483
Found valid, unique entries: 2221277
...finished in 0:00:56.136903

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2221277 entries filtered down to 504677
Merging base file with filtered extra file
...finished in 0:03:41.034136

--- Parsing Routeviews pfx2as ---

Unzipping /Users/FJ/projects/python/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1749571200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/FJ/projects/python/kartograf/data/1749571200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1749571200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/FJ/projects/python/kartograf/out/1749571200/collectors/pfx2asn.txt
Entries after cleanup: 1266387
...finished in 0:00:18.480307

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1266387 entries filtered down to 389275
Merging base file with filtered extra file
...finished in 0:02:55.221517

--- Sorting results ---

...finished in 0:00:07.360055

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 395707bc5227810e0ee436b9ac95fd91bfac72bb59abca42d0eb65093e0f0715
Total runtime: 0:23:47.479335

I would say let's give it one more shot on Thursday and then discuss:

$ ./run map -rv -irr -w 1749744000

I guess it's a win that there was no massive difference in the numbers between @brunoerg and me like we have seen in the official runs previously. But if we get no matches at all across several runs between the three of us that might be evidence that this lowers the probability of exact matches and we might have to rethink the approach.

@jurraca
Copy link
Collaborator

jurraca commented Jun 12, 2025

my run failed on the IRR fetching step. I've never seen it crash from a connection refused, will have to fix.

logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1749744000 (2025-06-12 18:00:00 CEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1749744000 (2025-06-12 16:00:00 UTC, local: 2025-06-12 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /home/base/code/asmap/kartograf/data/1749744000/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /home/base/code/asmap/kartograf/data/1749744000/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /home/base/code/asmap/kartograf/data/1749744000/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /home/base/code/asmap/kartograf/data/1749744000/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /home/base/code/asmap/kartograf/data/1749744000/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 280 seconds
RPKI sync #2
...took 274 seconds
(now sleeping 41 seconds)
RPKI sync #3
...took 267 seconds
Downloaded RPKI Data, hash sum: 193d6373c0f0e01bd49ca49cf1cc2765e9d3b328d1efec253b9193d677e3093c
...finished in 0:14:35.907997

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 975b4a9874d40958b311d19d2f6f93bc39e9e6461353fefe5a18a64f9cde8bff
Downloading apnic.db.route.gz
Downloaded apnic.db.route6.gz, file hash: 2d4738dac8f4b340df5e78a05e3f7c125be6ce48032b55cbe77308d0ddce7fd3
Downloading arin.db.gz
Traceback (most recent call last):
File "/home/base/code/asmap/kartograf/./run", line 11, in
main()
File "/home/base/code/asmap/kartograf/kartograf/cli.py", line 104, in main
if args.wait and (int(args.wait) < time.time()):
^^^^^^^^^^^^^^^^^^^
File "/home/base/code/asmap/kartograf/kartograf/kartograf.py", line 65, in map
fetch_irr(context)
File "/home/base/code/asmap/kartograf/kartograf/timed.py", line 10, in wrapper
result = func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/home/base/code/asmap/kartograf/kartograf/irr/fetch.py", line 41, in fetch_irr
with open(local_file_path, 'wb') as f:
^^^^^^^^^
File "/nix/store/igyzv5c2v672mhmx1ha1i7062y6rggqs-python3-3.11.10/lib/python3.11/ftplib.py", line 121, in init
self.connect(host)
File "/nix/store/igyzv5c2v672mhmx1ha1i7062y6rggqs-python3-3.11.10/lib/python3.11/ftplib.py", line 158, in connect
self.sock = socket.create_connection((self.host, self.port), self.timeout,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/nix/store/igyzv5c2v672mhmx1ha1i7062y6rggqs-python3-3.11.10/lib/python3.11/socket.py", line 863, in create_connection
raise exceptions[0]
File "/nix/store/igyzv5c2v672mhmx1ha1i7062y6rggqs-python3-3.11.10/lib/python3.11/socket.py", line 848, in create_connection
sock.connect(sa)
ConnectionRefusedError: [Errno 111] Connection refused

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 12, 2025

I got 21b6746978e4c1941fa2fb221c00a7f12e0199fa65d2d1424ec4c3ca8fd181f9, did you run @brunoerg ?

Logs
$ ./run map -rv -irr -w 1749744000

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1749744000 (2025-06-12 18:00:00 CEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1749744000 (2025-06-12 16:00:00 UTC, local: 2025-06-12 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/FJ/projects/python/kartograf/data/1749744000/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/FJ/projects/python/kartograf/data/1749744000/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/FJ/projects/python/kartograf/data/1749744000/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/FJ/projects/python/kartograf/data/1749744000/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/FJ/projects/python/kartograf/data/1749744000/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 231 seconds
RPKI sync #2
...took 161 seconds
(now sleeping 200 seconds)
RPKI sync #3
...took 173 seconds
Downloaded RPKI Data, hash sum: 395999e1226a4d6e42f51cc1f269031afb499973afcb3f5ff5e02eee11f46bf6
...finished in 0:13:35.801315

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 975b4a9874d40958b311d19d2f6f93bc39e9e6461353fefe5a18a64f9cde8bff
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: 9daf476ba73bf1b8b8da3af3f08d8c7148d0e3a09af29c32142a261621eac55c
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: 2d4738dac8f4b340df5e78a05e3f7c125be6ce48032b55cbe77308d0ddce7fd3
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: 0950044fac5eb81d6bb93b8af65835235724a8e8fb8d658ad0016bef6170a03a
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: bfe9b470bfa7aea054cc12c3382108666beb023f5ec6a137d6d49dc675ff4b06
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 4cc87947c88d521abeb00f52a73add981c1c50115c45e50c573fc59f4a62dc6f
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: e7908870303f62500c960750c35236b6d507767a36f0dc6d3b0fcfa9700efe57
...finished in 0:01:01.788961

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250610-1200.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1749744000/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: cd316a2f4c86483494b5d342ad911ad2c6c504b0f9e7eeb748da6b7ac8f7fcd2
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250611-1200.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1749744000/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 411a4815a5d3fef2d0ce541fce5a534f77b21294b96c17f13be3b66bc2098f18
...finished in 0:00:07.473918

--- Validating RPKI ---

Validating RPKI ROAs
308721 raw RKPI ROA files found.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1235/1235 [00:38<00:00, 32.06it/s]
308721 RKPI ROAs validated and saved to /Users/FJ/projects/python/kartograf/out/1749744000/rpki/rpki_raw.json, file hash: 10a49a0b83548779e0659f77f348fb947c7826f0a52f68b63108785eafa2b5a8
...finished in 0:00:56.802349

--- Parsing RPKI ---

Parsing 308721 ROAs
Result entries written: 582986
Duplicates found: 83159
Invalids found: 20121
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:12.818387

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/lacnic.db
Found in this file: 15064
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/ripe.db.route
Found in this file: 386458
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/apnic.db.route6
Found in this file: 842134
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/apnic.db.route
Found in this file: 610469
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/arin.db
Found in this file: 105876
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/afrinic.db
Found in this file: 115207
Parsing /Users/FJ/projects/python/kartograf/out/1749744000/irr/ripe.db.route6
Found in this file: 146559
Found valid, unique entries: 2221767
...finished in 0:00:57.857569

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2221767 entries filtered down to 580597
Merging base file with filtered extra file
...finished in 0:03:48.280810

--- Parsing Routeviews pfx2as ---

Unzipping /Users/FJ/projects/python/kartograf/data/1749744000/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1749744000/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/FJ/projects/python/kartograf/data/1749744000/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1749744000/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/FJ/projects/python/kartograf/out/1749744000/collectors/pfx2asn.txt
Entries after cleanup: 1268693
...finished in 0:00:18.652960

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1268693 entries filtered down to 339915
Merging base file with filtered extra file
...finished in 0:03:11.115850

--- Sorting results ---

...finished in 0:00:07.582217

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 21b6746978e4c1941fa2fb221c00a7f12e0199fa65d2d1424ec4c3ca8fd181f9
Total runtime: 0:24:19.049265

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 12, 2025

my run failed on the IRR fetching step. I've never seen it crash from a connection refused, will have to fix.

Looks like this is the issue that #80 fixed. Did you pull the latest rebase that I pushed 3 days ago? If not the fix is not included in your local branch and that cause the crash.

@brunoerg
Copy link
Contributor

I got 21b6746978e4c1941fa2fb221c00a7f12e0199fa65d2d1424ec4c3ca8fd181f9, did you run @brunoerg ?

Logs

Sorry, I missed it.

@jurraca
Copy link
Collaborator

jurraca commented Jun 23, 2025

shall we try for another run this week? @fjahr @brunoerg

@brunoerg
Copy link
Contributor

shall we try for another run this week? @fjahr @brunoerg

Yes, let's do it.

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 24, 2025

shall we try for another run this week? @fjahr @brunoerg

Yes, let's do it.

Alright, let's try tomorrow 6p CET/4p GMT:

$ ./run map -rv -irr -w 1750867200

@jurraca
Copy link
Collaborator

jurraca commented Jun 25, 2025

I got 4729f40d910aa2da80e9acc2ae87b96768b82e5a4fc1e89e9271d21f226e12ce

RPKI sync #1
...took 327 seconds
RPKI sync #2
...took 262 seconds
(now sleeping 3 seconds)
RPKI sync #3
...took 271 seconds
logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1750867200 (2025-06-25 18:00:00 CEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1750867200 (2025-06-25 16:00:00 UTC, local: 2025-06-25 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /home/base/code/asmap/kartograf/data/1750867200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858
338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /home/base/code/asmap/kartograf/data/1750867200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff
62aeb28d4940ade94
Downloaded TAL for ARIN to /home/base/code/asmap/kartograf/data/1750867200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9
c81b139cb412639
Downloaded TAL for LACNIC to /home/base/code/asmap/kartograf/data/1750867200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe
00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /home/base/code/asmap/kartograf/data/1750867200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3
996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 327 seconds
RPKI sync #2
...took 262 seconds
(now sleeping 3 seconds)
RPKI sync #3
...took 271 seconds

Downloaded RPKI Data, hash sum: b4868c33a88eac74bbe40d9749413909cb2c3e112a628294954dd36c09269cde
...finished in 0:15:08.726060

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 431ae419c7f4961c4d652443b3b1a92230d842d17dd90eadf48f57669e5b2afc
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: afdcfa0f497c9f5629a4df7f1af0c3bffcf440f5f3433b1cf44990e18c039968
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: c9df7c6535021d30c579e7c6f60102c9faf4f8dc940cd09547fdeddabc3c48b5
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: a8b974ba6c10618499410656aa8f3baf1c7749ff6de598c8c79d7f6807ef6d70
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: e0b723765e4555372f3cf679691a4d4bcb8c18be1d38d1291dfddab8fc29e7a6
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 84713b756f548a23534b19187f7e0f867dcd35c3557dc1c6d470631e05d045d0
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: 1d4e15119e2b8ab136593da3ecbc3d3448de8be631e8961a37bc771cfc357b5c
...finished in 0:01:01.564458

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250623-1800.pfx2as.gz
Downloaded /home/base/code/asmap/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: e9edb8fd1994949f664082739252906e61062eddba4aa6eb14187474daf0c29d
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250624-1200.pfx2as.gz
Downloaded /home/base/code/asmap/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 9e2691444d568baf8ff4b6d41e1e3d80ae394d39ab6e92bb272addfd68bef730
...finished in 0:00:15.717225

--- Validating RPKI ---

Validating RPKI ROAs
315305 raw RKPI ROA files found.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1262/1262 [02:17<00:00, 9.16it/s]
315305 RKPI ROAs validated and saved to /home/base/code/asmap/kartograf/out/1750867200/rpki/rpki_raw.json, file hash: f2d5827c3a62b868aea129959fe0deb46c23bb0c0de56d5680294f1ce4837629
...finished in 0:02:40.816524

--- Parsing RPKI ---

Parsing 315305 ROAs
Result entries written: 592452
Duplicates found: 85220
Invalids found: 21840
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:33.098165

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/afrinic.db
Found in this file: 115376
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/arin.db
Found in this file: 107151
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/ripe.db.route6
Found in this file: 146471
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/apnic.db.route6
Found in this file: 715534
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/lacnic.db
Found in this file: 15211
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/ripe.db.route
Found in this file: 386630
Parsing /home/base/code/asmap/kartograf/out/1750867200/irr/apnic.db.route
Found in this file: 610718
Found valid, unique entries: 2097091
...finished in 0:02:38.310353

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2097091 entries filtered down to 475838
Merging base file with filtered extra file
...finished in 0:12:26.852326

--- Parsing Routeviews pfx2as ---

Unzipping /home/base/code/asmap/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /home/base/code/asmap/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /home/base/code/asmap/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /home/base/code/asmap/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /home/base/code/asmap/kartograf/out/1750867200/collectors/pfx2asn.txt
Entries after cleanup: 1271102
...finished in 0:00:50.751283

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1271102 entries filtered down to 344006
Merging base file with filtered extra file
...finished in 0:11:02.513607

--- Sorting results ---

...finished in 0:00:20.293948

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 4729f40d910aa2da80e9acc2ae87b96768b82e5a4fc1e89e9271d21f226e12ce
Total runtime: 0:47:00.820404

@brunoerg
Copy link
Contributor

I got 31efa1b06aff97be78cacc1ff6d489b68d79ac89a68d1463bc0a2aca7a3869d5

logs --- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1750867200 (2025-06-25 13:00:00 -03) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1750867200 (2025-06-25 16:00:00 UTC, local: 2025-06-25 13:00:00 -03)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/brunogarcia/projects/kartograf/data/1750867200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/brunogarcia/projects/kartograf/data/1750867200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/brunogarcia/projects/kartograf/data/1750867200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/brunogarcia/projects/kartograf/data/1750867200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/brunogarcia/projects/kartograf/data/1750867200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 663 seconds
Downloaded RPKI Data, hash sum: e50384e458b8b258af51c0dab5f40c0b7c57a76bcf3b4a5b9aa0277bd4b0ad42
...finished in 0:12:03.378835

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 431ae419c7f4961c4d652443b3b1a92230d842d17dd90eadf48f57669e5b2afc
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: afdcfa0f497c9f5629a4df7f1af0c3bffcf440f5f3433b1cf44990e18c039968
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: c9df7c6535021d30c579e7c6f60102c9faf4f8dc940cd09547fdeddabc3c48b5
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: a8b974ba6c10618499410656aa8f3baf1c7749ff6de598c8c79d7f6807ef6d70
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: e0b723765e4555372f3cf679691a4d4bcb8c18be1d38d1291dfddab8fc29e7a6
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 84713b756f548a23534b19187f7e0f867dcd35c3557dc1c6d470631e05d045d0
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: 1d4e15119e2b8ab136593da3ecbc3d3448de8be631e8961a37bc771cfc357b5c
...finished in 0:01:05.936085

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250623-1800.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: e9edb8fd1994949f664082739252906e61062eddba4aa6eb14187474daf0c29d
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250624-1200.pfx2as.gz
Downloaded /Users/brunogarcia/projects/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 9e2691444d568baf8ff4b6d41e1e3d80ae394d39ab6e92bb272addfd68bef730
...finished in 0:00:07.954969

--- Validating RPKI ---

Validating RPKI ROAs
316834 raw RKPI ROA files found.
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1268/1268 [01:09<00:00, 18.15it/s]
316834 RKPI ROAs validated and saved to /Users/brunogarcia/projects/kartograf/out/1750867200/rpki/rpki_raw.json, file hash: fba2834ff611d8673347df37aecfe0aa5332284df6d11651e09b542edc804dfe
...finished in 0:01:34.336224

--- Parsing RPKI ---

Parsing 316834 ROAs
Result entries written: 633126
Duplicates found: 92024
Invalids found: 2357
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:19.801036

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/lacnic.db
Found in this file: 15211
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/ripe.db.route
Found in this file: 386630
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/apnic.db.route6
Found in this file: 715534
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/apnic.db.route
Found in this file: 610718
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/arin.db
Found in this file: 107151
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/afrinic.db
Found in this file: 115376
Parsing /Users/brunogarcia/projects/kartograf/out/1750867200/irr/ripe.db.route6
Found in this file: 146471
Found valid, unique entries: 2097091
...finished in 0:01:18.254607

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2097091 entries filtered down to 428966
Merging base file with filtered extra file
...finished in 0:05:08.706648

--- Parsing Routeviews pfx2as ---

Unzipping /Users/brunogarcia/projects/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/brunogarcia/projects/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/brunogarcia/projects/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/brunogarcia/projects/kartograf/out/1750867200/collectors/pfx2asn.txt
Entries after cleanup: 1271102
...finished in 0:00:27.909631

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1271102 entries filtered down to 335576
Merging base file with filtered extra file
...finished in 0:04:11.243630

--- Sorting results ---

...finished in 0:00:11.203262

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 31efa1b06aff97be78cacc1ff6d489b68d79ac89a68d1463bc0a2aca7a3869d5
Total runtime: 0:26:29.855386

@fjahr
Copy link
Collaborator Author

fjahr commented Jun 25, 2025

No match 4894e80dbec46a1136eefde876ea3760f5156a450d1003d4b793213a42ba67b9 😞

@jurraca and my run seem to be similar (3 syncs) and we are close in the numbers but still diverge. @brunoerg had just one long sync and he also had more results than @jurraca and me, which I am not sure how to interpret....

logs
$ ./run map -rv -irr -w 1750867200

--- Start Kartograf ---

Kartograf version: 0.4.9
Using rpki-client version 9.5 (recommended).
Coordinated launch mode: Waiting until 1750867200 (2025-06-25 18:00:00 CEST) to launch mapping process.
Countdown: 0 second(s)
Starting...
The epoch for this run is: 1750867200 (2025-06-25 16:00:00 UTC, local: 2025-06-25 18:00:00 CEST)

--- Fetching RPKI ---

Downloaded TAL for AFRINIC to /Users/FJ/projects/python/kartograf/data/1750867200/rpki/tals/afrinic.tal, file hash: 2838ef30ea27ce5705abf5f5adb131d8c35b1f50858338a2f3c84bb207c2fa35
Downloaded TAL for APNIC to /Users/FJ/projects/python/kartograf/data/1750867200/rpki/tals/apnic.tal, file hash: 472e551f7c551c2e999e582b7c9437d3bee4900fe53afff62aeb28d4940ade94
Downloaded TAL for ARIN to /Users/FJ/projects/python/kartograf/data/1750867200/rpki/tals/arin.tal, file hash: 1f8bdb03bcc30a3b8e11fd9a87102fba250c22137a3c8baa9c81b139cb412639
Downloaded TAL for LACNIC to /Users/FJ/projects/python/kartograf/data/1750867200/rpki/tals/lacnic.tal, file hash: d44bb9394ab009c8b53e5efebf2a1c9450bab61a27efe00de5a3e4587a3a2f6a
Downloaded TAL for RIPE to /Users/FJ/projects/python/kartograf/data/1750867200/rpki/tals/ripe.tal, file hash: 59ca27ef93f23682749fcefe7c6d70fbc723343549ff9e4d3996acaff79817fb
Downloading RPKI Data, this may take a while.
RPKI sync #1
...took 230 seconds
RPKI sync #2
...took 213 seconds
(now sleeping 152 seconds)
RPKI sync #3
...took 183 seconds
Downloaded RPKI Data, hash sum: 6d43e461135d54174581dbabcd57f7e2935e1a49e0ed251f2aec3dfd7c943068
...finished in 0:13:43.599997

--- Fetching IRR ---

Downloading afrinic.db.gz
Downloaded afrinic.db.gz, file hash: 431ae419c7f4961c4d652443b3b1a92230d842d17dd90eadf48f57669e5b2afc
Downloading apnic.db.route.gz
Downloaded apnic.db.route.gz, file hash: afdcfa0f497c9f5629a4df7f1af0c3bffcf440f5f3433b1cf44990e18c039968
Downloading apnic.db.route6.gz
Downloaded apnic.db.route6.gz, file hash: c9df7c6535021d30c579e7c6f60102c9faf4f8dc940cd09547fdeddabc3c48b5
Downloading arin.db.gz
Downloaded arin.db.gz, file hash: a8b974ba6c10618499410656aa8f3baf1c7749ff6de598c8c79d7f6807ef6d70
Downloading lacnic.db.gz
Downloaded lacnic.db.gz, file hash: e0b723765e4555372f3cf679691a4d4bcb8c18be1d38d1291dfddab8fc29e7a6
Downloading ripe.db.route.gz
Downloaded ripe.db.route.gz, file hash: 84713b756f548a23534b19187f7e0f867dcd35c3557dc1c6d470631e05d045d0
Downloading ripe.db.route6.gz
Downloaded ripe.db.route6.gz, file hash: 1d4e15119e2b8ab136593da3ecbc3d3448de8be631e8961a37bc771cfc357b5c
...finished in 0:00:38.210179

--- Fetching Routeviews pfx2as ---

Downloading from https://publicdata.caida.org/datasets/routing/routeviews-prefix2as/2025/06/routeviews-rv2-20250623-1800.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz, file hash: e9edb8fd1994949f664082739252906e61062eddba4aa6eb14187474daf0c29d
Downloading from https://publicdata.caida.org/datasets/routing/routeviews6-prefix2as/2025/06/routeviews-rv6-20250624-1200.pfx2as.gz
Downloaded /Users/FJ/projects/python/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz, file hash: 9e2691444d568baf8ff4b6d41e1e3d80ae394d39ab6e92bb272addfd68bef730
...finished in 0:00:07.284364

--- Validating RPKI ---

Validating RPKI ROAs
316743 raw RKPI ROA files found.
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1267/1267 [00:39<00:00, 31.68it/s]
316743 RKPI ROAs validated and saved to /Users/FJ/projects/python/kartograf/out/1750867200/rpki/rpki_raw.json, file hash: f8bd209d5f0e69338c3f2a5bd3e68c12881fc425633ecc4344ce5f67260e5e92
...finished in 0:00:57.602387

--- Parsing RPKI ---

Parsing 316743 ROAs
Result entries written: 593692
Duplicates found: 85611
Invalids found: 21852
Incompletes: 0
Non-ROA files: 0
...finished in 0:00:12.981157

--- Parsing IRR ---

Extracting afrinic.db.gz
Extracting apnic.db.route.gz
Extracting apnic.db.route6.gz
Extracting arin.db.gz
Extracting lacnic.db.gz
Extracting ripe.db.route.gz
Extracting ripe.db.route6.gz
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/lacnic.db
Found in this file: 15211
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/ripe.db.route
Found in this file: 386630
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/apnic.db.route6
Found in this file: 715534
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/apnic.db.route
Found in this file: 610718
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/arin.db
Found in this file: 107151
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/afrinic.db
Found in this file: 115376
Parsing /Users/FJ/projects/python/kartograf/out/1750867200/irr/ripe.db.route6
Found in this file: 146471
Found valid, unique entries: 2097091
...finished in 0:00:53.641537

--- Merging RPKI and IRR data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 2097091 entries filtered down to 471659
Merging base file with filtered extra file
...finished in 0:03:32.705006

--- Parsing Routeviews pfx2as ---

Unzipping /Users/FJ/projects/python/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip4.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip4.txt
Unzipping /Users/FJ/projects/python/kartograf/data/1750867200/collectors/routeviews_pfx2asn_ip6.txt.gz
Formatting /Users/FJ/projects/python/kartograf/out/1750867200/collectors/routeviews_pfx2asn_ip6.txt
Cleaning /Users/FJ/projects/python/kartograf/out/1750867200/collectors/pfx2asn.txt
Entries after cleanup: 1271102
...finished in 0:00:18.989520

--- Merging Routeviews and base data ---

Parse base file to dictionary
Parse extra file to Pandas DataFrame
Merging extra prefixes that were not included in the base file.
Finished merging extra prefixes.
Finished filtering! Originally 1271102 entries filtered down to 343123
Merging base file with filtered extra file
...finished in 0:03:01.692598

--- Sorting results ---

...finished in 0:00:07.275137

--- Finishing Kartograf ---

The SHA-256 hash of the result file is: 4894e80dbec46a1136eefde876ea3760f5156a450d1003d4b793213a42ba67b9
Total runtime: 0:23:34.870195

Honestly, I think as of now it seems we should focus on something else for now. We should do some test rounds of #82 but I would also like to do a collaborative run in the meantime without these improvements because it has already been a couple of months since the last one. I will prepare a release for that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants