the dataset i used for my triangle project last year was generated piecewise over ~2 months, but by the time it was done, the tools had evolved to the point that i can produce a fresh dataset of the same size in 8 hours. that's pretty cool.

Follow

annnnd now i can do it in 17 minutes

well sort of :-P i added a toggle that disables the final exact verification. this can give a small number of false positives (< 0.1% in practice), so i can't use it for formal / guaranteed bounds, but for generating large amounts of exploratory data it is awesome

i'm running it on 125000 data points right now, and it should be done by 8am tomorrow

this is neat

Sign in to participate in the conversation
Awoo Space

Awoo.space is a Mastodon instance where members can rely on a team of moderators to help resolve conflict, and limits federation with other instances using a specific access list to minimize abuse.

While mature content is allowed here, we strongly believe in being able to choose to engage with content on your own terms, so please make sure to put mature and potentially sensitive content behind the CW feature with enough description that people know what it's about.

Before signing up, please read our community guidelines. While it's a very broad swath of topics it covers, please do your best! We believe that as long as you're putting forth genuine effort to limit harm you might cause – even if you haven't read the document – you'll be okay!