Furry data
Fiiinally managed to make my way through processing the e621 metadata dump they sent me. Pulled it from a corrupted and gross JSON file into a sqlite db.
sqlite> select count(*) from submissions;
700479
sqlite> select count(*) from tags;
1903
sqlite> select count(*) from artists;
186
sqlite> select count(*) from sources;
388
That's a lot of submissions @.@
re: Furry data
@makyo so I guess we can empirically say that the e621 community is pretty subby?
re: Furry data
@makyo I'm almost certain that can't possibly be right
re: Furry data
@makyo Some of those weren't duplicates, perhaps?
re: Furry data
@orrery Will do a bit more digging, but I filtered on both ID and image checksum.
re: Furry data
@makyo ... that's a lot of duplicates.
re: Furry data
:c
:C
I cleaned the data of duplicates and...
sqlite> select count(*) from submissions;
399
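For anyone curious what "filtered on both ID and image checksum" could look like in practice, here is a rough sketch, not the actual cleanup that was run. The column names `id` and `md5` and the helper `dedupe_submissions` are made up for illustration; the real schema isn't shown in this thread.

```python
import sqlite3


def dedupe_submissions(conn):
    """Keep the first row seen for each id, then the first for each checksum.

    Assumes a hypothetical table submissions(id, md5) that still has its
    implicit rowid. Returns the number of rows left afterwards.
    """
    # Pass 1: drop rows that repeat an id already seen at a lower rowid.
    conn.execute(
        """
        DELETE FROM submissions
        WHERE rowid NOT IN (SELECT MIN(rowid) FROM submissions GROUP BY id)
        """
    )
    # Pass 2: drop rows whose image checksum repeats one already kept.
    # Note: GROUP BY puts all NULL checksums in one group, so rows with a
    # missing checksum would collapse to a single row here.
    conn.execute(
        """
        DELETE FROM submissions
        WHERE rowid NOT IN (SELECT MIN(rowid) FROM submissions GROUP BY md5)
        """
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM submissions").fetchone()[0]


# Tiny in-memory demo with invented rows, not real data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE submissions (id INTEGER, md5 TEXT)")
conn.executemany(
    "INSERT INTO submissions VALUES (?, ?)",
    [(1, "aaa"), (1, "aaa"), (2, "bbb"), (3, "bbb"), (4, "ccc")],
)
remaining = dedupe_submissions(conn)
```

In the demo, the second `(1, "aaa")` row goes in the first pass and `(3, "bbb")` goes in the second, since it is a different ID pointing at the same image.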