Results of the "Can you tell which images are AI generated?" survey

popcar2@programming.dev · edit-2 1 year ago

Results of the "Can you tell which images are AI generated?" survey

popcar2@programming.dev · 1 year ago

I have. Disappointingly there isn’t much difference, the people working in CS have a 9.59 avg while the people that aren’t have a 9.61 avg.

There is a difference in people that have used AI gen before. People that have got a 9.70 avg, while people that haven’t have a 9.39 avg score. I’ll update the post to add this.

lol@discuss.tchncs.de · 1 year ago

deleted by creator

WalrusDragonOnABike@kbin.social · edit-2 1 year ago

mean SD
No 9.40 2.27
Yes 9.74 2.30

Definitely not statistically significant.

popcar2@programming.dev · 1 year ago

I would say so, but the sample size isn’t big enough to be sure of it.

xkforce@lemmy.world · edit-2 1 year ago

So no. For a result to be “statistically significant” the calculated probability that it is the result of noise/randomness has to be below a given threshold. Few if any things will ever be “100% sure.”

Funderpants @lemmy.ca · 1 year ago

Can we get the raw data set? / could you make it open? I have academic use for it.

popcar2@programming.dev · edit-2 1 year ago

Sure, but keep in mind this is a casual survey. Don’t take the results too seriously. Have fun: https://docs.google.com/spreadsheets/d/1MkuZG2MiGj-77PGkuCAM3Btb1_Lb4TFEx8tTZKiOoYI

Do give some credit if you can.

Funderpants @lemmy.ca · 1 year ago

Of course! I’m going to find a way to integrate this dataset into a class I teach.

Funderpants @lemmy.ca · edit-2 1 year ago

If I can be a bother, would you mind adding a tab that details which images were AI and which were not? It would make it more usable, people could recreate the values you have on Sheet1 J1;K20

popcar2@programming.dev · 1 year ago

Done, column B in the second sheet contains the answers (Yes are AI generated, No aren’t)

Funderpants @lemmy.ca · 1 year ago

Awesome! Thanks very much.

Mic_Check_One_Two@reddthat.com · 1 year ago

I’d be curious to see the results broken down by image generator. For instance, how many of the Midjourney images were correctly flagged as AI generated? How does that compare to DALL-E? Are there any statistically significant differences between the different generators?

popcar2@programming.dev · 1 year ago

Are there any statistically significant differences between the different generators?

Every image was created by DALL-E 3 except for one. I honestly got lazy so there isn’t much data there. I would say DALL-E is much better in creating stylistic art but Midjourney is better at realism.

MooseBoys@lemmy.world · 1 year ago

Sampling from Lemmy is going to severely skew the respondent population towards more technical people, even if their official profession is not technical.

MysticKetchup@lemmy.world · 1 year ago

If you do another one of these, I would like to see artist vs non-artist. If anything I feel like they would have the most experience with regular art, and thus most able to spot incongruency in AI art.