EDIT 2022-01-10: There's now an official dump that can be found here. The database is danbooru1.danbooru_public.
My dump is no longer maintained.
Edit by @nonamethanks - 2020-11-21
Since the BigQuery old-style links are dead, the database can now be found at this link.
Queries now need the full table name, so use
danbooru-data.danbooru.posts
instead of the old [danbooru.posts].
See forum #178212 for an example of an updated query. Danbooru now also has its own official BigQuery dump of most data, not just posts; see this commit for the URLs.
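As a rough sketch of the updated form, the first example query further down could be written in standard SQL with the full table name roughly like this (this assumes the posts table still exposes tags as a repeated record with a name field, so check the current schema before relying on it):
SELECT t.name, COUNT(p.id) AS num, SUM(p.file_size)/1000000000 AS GB FROM `danbooru-data.danbooru.posts` AS p, UNNEST(p.tags) AS t GROUP BY t.name ORDER BY GB DESC LIMIT 20;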
With some assistance from albert, I've set up a Google BigQuery data dump of the Danbooru posts table so anyone who cares to can run queries. You can access the dump here. You may see Konachan and Yandere tables there too, but those aren't complete, so I don't recommend bothering with them.
It's updated nightly and should contain basically anything you can get through the API. If you really want to, you can download the whole thing as a dump using these instructions. That should save you from trying to scrape the API or something silly like that.
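If you'd rather export it yourself from within BigQuery, an EXPORT DATA statement along these lines writes the table out to a Cloud Storage bucket you own. This is only a sketch, not the linked instructions: the bucket path is a placeholder, and the table name is the current one from the 2020 edit above.
EXPORT DATA OPTIONS(uri='gs://your-bucket/danbooru-posts-*.json.gz', format='JSON', compression='GZIP') AS SELECT * FROM `danbooru-data.danbooru.posts`;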
The query syntax is very similar to SQL, with a few differences due to fancier datatypes like repeated/nested fields.
An example query:
SELECT tags.name, COUNT(id) AS num, SUM(file_size)/1000000000 AS GB FROM [danbooru.posts] GROUP BY tags.name ORDER BY GB DESC LIMIT 20;
This returns the tags with the largest total file size associated with them. For example, this query will show you that 1girl has just over a terabyte of image data spread across more than 1.6 million images.
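If you only care about a single tag, a minimal variant in the updated standard-SQL form (with the same schema assumption as the sketch above, i.e. tags being a repeated record with a name field) would look something like:
SELECT COUNT(p.id) AS num, SUM(p.file_size)/1000000000 AS GB FROM `danbooru-data.danbooru.posts` AS p, UNNEST(p.tags) AS t WHERE t.name = '1girl';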
SELECT favs, COUNT(id) AS num, SUM(file_size)/1000000000 AS GB FROM [danbooru.posts] GROUP BY favs ORDER BY GB DESC LIMIT 20;
This groups by user favorites instead of tags. You can see that user 19831 has 414,000 favourites, which come to around 380 GB.