#fandom data analysis
Text
Day 2 Fanfic Bachelor's Thesis
Got sidetracked by the sidequest "I already started my master's degree". Cannot recommend.
But got the first participants for my research \o/ (Still need a few more. Feel free to look at my pinned post if you write fanfics about BBC's Merlin.)
Also lost some sleep because somehow the middle of the night seemed like the best time to start the necessary data analysis for my BA. No better time to get some numbers from AO3 than midnight :)
#fandom meta#ao3#fanfics#ao3 fanfic#ao3 author#merlin bbc#media science#fan studies#data analysis#fandom data analysis#merlin bbc meta#bachelor's degree#personal#fanfiction
1 note
Text
One more day to contribute to fandom science, and if you've already submitted your response here's some fun facts about Good Omens fic on ao3 for you:
Prior to the release of the first season, there were 3,574 fics on ao3 under the Good Omens book fandom tag. Especially compared to current numbers, they are almost overwhelmingly general and teen. Popular tags included fluff, humor, crossover, established relationship, romance, and drabble.
The "anal sex" tag did not make its debut on the top tags list until January of 2024. The ratio of explicit fics is also much higher than any month since the s2 release. Y'all nasty (I love you).
The longest fic under the GO TV tag is 1,041,533 words with over 200 chapters and is published in Spanish. The second place is at 500k, also in Spanish. The third, and the longest English fic, is 479,886 words and 56 chapters, and it's a rarepair Crowley/Gabriel with Aziraphale as the villain. Interesting choices were made here, major respect for the author. Takes guts.
There are 150 pages of fic, or about 3,000 fics, with fewer than 50 words (my cutoff for calculating average wordcount). That's 3k archived works consisting of podfics, artwork, and short poetry. Very cool!
Y'all are all simps and suckers. With the singular exception of August 2023 (Neil you know what you did), the top tag across all dates I pulled data from was always fluff.
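The 50-word cutoff mentioned above can be sketched roughly like this. It is a minimal illustration, not the exact calculation behind these numbers: the function name and the sample word counts are made up, and only the cutoff value comes from the post.

```python
from statistics import mean

MIN_WORDS = 50  # cutoff named in the post; works below it are left out


def average_wordcount(wordcounts, min_words=MIN_WORDS):
    """Mean word count over works with at least `min_words` words."""
    kept = [w for w in wordcounts if w >= min_words]
    if not kept:
        raise ValueError("no works at or above the cutoff")
    return mean(kept)


# Hypothetical sample: a podfic (0 words), a short poem, and three prose fics.
sample = [0, 32, 1_200, 4_800, 9_000]
print(average_wordcount(sample))
```

Leaving out the near-zero entries keeps podfics and art posts from dragging the average down.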
As I said, if you haven't already PLEASE PLEASE PLEASE GO VOTE to contribute to the biggest survey of Good Omens fanfic statistics made to date, and maybe give this or the poll a reblog to get it in front of more writers.
Bonus fun fact: In the time it took me to type this post, 4 more fics were posted.
#will reblog in the morning#since it's literally one am#but it's fine everything is fine#this has been so fun and interesting and I can't wait to share all my math with you#good omens#good omens fandom#gomens#good omens 2#ao3#fandom meta#for science#good omens fanfiction#gomens fanfic data analysis#I need a snazzier tag for all this shit but that will work for now#also don't y'all come bitching at me that archival data is technically closer to a humanity or social science I don't wanna hear it
114 notes
Text
Netflix released, well, not a ton but some data about viewership recently and I couldn't help but look up the performance of my fandoms. They released the top 18,214 titles based on hours watched from Jan to June 2023.
Ordered in terms of rank.
ATLA: Avatar the Last Airbender (Not available globally - what countries are missing? IDK)
Woot woot! Avatar in the house. 45.5M hours for book 1, giving it a rank of 342.
Book 3 has more views than book 2. I'm gonna credit that to Zuko.
Teen Wolf (Not available globally - what countries are missing? IDK)
The top season is season 3, oddly enough, with a rank of 361! It got 43.9M hours streamed. What are you all rewatching?
Season 1 is the third most watched.
Jurassic World (Not available globally - what countries are missing? IDK)
A healthy rank of 536 with 34M views.
Yu-Gi-Oh (Not available in the US)
Season one is ranked 1,843, with 11.4M hours watched in the first half of the year. Wow! That's impressive for such an old show.
Only 54% of people moved on to the second season. Sad😢
Season 5 had more views than Season 4 or 3. Stop torturing yourselves with rewatching the ceremonial duel, y'all. It's not healthy.
Carmen Sandiego
Season one saw 5.9M hours watched, giving it a rank of 3,133! Not bad for a show from 2019. (How does YGO outrank this? And how is this show more popular than She-Ra? She-Ra is better imo)
I'm impressed the Steal or Not to Steal interactive story did as well as it did. (1.3M hours). But I also know there were some good JuliaxCarmen scenes that 100% inspired fics.
She-Ra (available globally)
Season 1 is ranked 3,213 with 5.7M hours viewed.
People then went on to watch Seasons 5 and 4. We're all rewatching the angst bits, aren't we? Or maybe it's the redemption arc.
Least watched season? 3.
Voltron
Season one is ranked 4,140 with 3.9M hours watched.
Why is season 2 the only one not available globally? Odd. 62% of people go on to watch it after season 1 anyway.
Season 8 is not the least watched season, season 5 is. However, season 5 is totally the one I rewatched twice when it first came out.
Fullmetal Alchemist (movies are global, anime are not)
It is an absolute crime nothing here is ranked high. The highest is the movie, The Revenge of Scar, with a rank of 5,666. 2.2M hours watched.
Only season 1 for both the 03 anime and Brotherhood is available, but the 03 anime is winning. It had an extra 200K hours viewed.
All of the movies had a higher hour watch count than any of the anime seasons.
BBC Merlin | Doctor Who (no data)
Want Data? Variety Article with download links.
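For anyone who wants to redo this kind of comparison, here is a small sketch. The hours are the figures quoted in this post; the `carryover` helper and its example inputs are hypothetical, and it assumes the "% of people moved on" numbers are simply next-season hours divided by first-season hours, which the report itself does not confirm.

```python
# Hours watched (millions), Jan to Jun 2023, as quoted in the post above.
hours_m = {
    "ATLA: Book 1": 45.5,
    "Teen Wolf: Season 3": 43.9,
    "Yu-Gi-Oh: Season 1": 11.4,
    "Carmen Sandiego: Season 1": 5.9,
    "She-Ra: Season 1": 5.7,
    "Voltron: Season 1": 3.9,
    "FMA: The Revenge of Scar": 2.2,
}


def carryover(first_season_hours, next_season_hours):
    """Share of first-season hours that the next season reached (assumed metric)."""
    return next_season_hours / first_season_hours


# Titles ordered from most to least hours watched.
ordered = sorted(hours_m, key=hours_m.get, reverse=True)
for title in ordered:
    print(f"{hours_m[title]:>5.1f}M  {title}")

# Hypothetical round numbers, only to show the ratio.
print(f"{carryover(10.0, 5.4):.0%}")
```

Hours are a rough proxy for viewers, since seasons differ in length, so the ratio is best read as a trend rather than a headcount.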
#fandom#netflix#fandom analysis#engagement data#fma#bbc merlin#carmen sandiego#yugioh#jurassic world#teen wolf#atla
31 notes
Text
for any marvel heads that have survived my blocking purge and any fellow ex-marvel heads:
most shipped marvel characters (as in who made it into the most separate ships in the ao3 rankings between 2013 and 2023):
with 7 ships: Tony Stark
with 5 ships: Natasha Romanoff (4 of these are femslash, and then Clint is also there)
with 4 ships: Bucky Barnes tied with Loki (I cannot describe to you the level of validation my Lucky crack-shipper brain feels at these two being tied ((I need to stress Lucky is not one of them x'D)))
with 3 ships: Peter Parker tied with Steve Rogers tied with Y/N
(out of 33 total marvel ships)
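A tally like the one above can be sketched as follows. This is only an illustration: the ship list is a small made-up sample rather than the actual AO3 ranking data, and the function name is hypothetical.

```python
from collections import Counter


def ships_per_character(ships):
    """Count how many distinct ships each character appears in."""
    counts = Counter()
    for ship in ships:
        # AO3 writes romantic ships as "Name A/Name B"; the set guards
        # against counting a name twice within the same ship.
        for name in {part.strip() for part in ship.split("/")}:
            counts[name] += 1
    return counts


# Illustrative sample, not the real ranking entries.
ranked_ships = [
    "Tony Stark/Steve Rogers",
    "Tony Stark/Peter Parker",
    "Steve Rogers/Bucky Barnes",
    "Loki/Tony Stark",
]
counts = ships_per_character(ranked_ships)
print(counts.most_common(2))
```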
#ship stats#data analysis#33 is a lot btw#like other fandoms will have like 1-2 ships#and then some fandoms just have like a bajillion#I'm not tagging this as marvel cause I don't want the current marvel heads to find me
8 notes
Text
A lot of Batfamily fans like fan content which dives into the trauma of some of them being siblings with people that have tried to kill them. While they aren't wrong, I would posit that they need more fan content which addresses the real issue here: the Batkids should absolutely make fun of those of them that failed at murdering one (or more) of the others.
I'm just saying, if my sibling failed at killing me or another relative, I would mock them about that until one of us died for real. I would give them birthday cards talking about how they truly put the 'attempted' in 'attempted fratricide'. I would ask them if they remembered their cringefail murder skills. I would bring that up at the slightest provocation. Anytime they got annoyed at me for something, I would be like, "What are you going to do, kill me? It's not like you're any good at it!"
#not my usual content#but it needs to be said#I have determined this after an in-depth survey of batfam fandom trends which is definitely for the love of genre and not for data analysis#[Scrambles to close suspiciously graph-looking windows on my computer.]#batfam#batman#dc universe#comics#superhero genre#siblings
14 notes
Text
Adrien Agreste…and Data from Star Trek. The parallels kept leaping out at me. Then I started talking about it with my husband and a friend and it spiralled. Bear in mind, this whole post is full of Star Trek spoilers, including the films, especially Nemesis.
Unnatural Creations
Adrien’s a sentibeing and Data’s an android. They’re both created by ‘unnatural’ means – created by a human rather than God/nature/whatever you believe. Consequently, they both begin their lives as blank slates, with no childhood memories, and they long to fulfil their programming. With Data, this is more literal, while in Wishmaker Adrien tells us all he ever wanted was to be what his parents wanted him to be.
This lends them a similar sense of naivety, leaving them open to being fooled or even used. In a way, they are both Pinocchio, wishing to be ‘real boys’. Pinocchio even wears a feather in his cap, which is what led me to do this drawing a few years ago.
Data and Adrien’s ‘fathers’ (creators) have made other children, too – other androids and sentibeings – not necessarily to public knowledge.
Dreams
Over time, both Adrien and Data come to desire individuality. An integral part of this is having dreams. Adrien’s dreams come in the form of personal ambitions, beginning with his love for Ladybug. Data has ambitions, too, but his dreams also come in a more literal form, i.e. as he evolves, he starts dreaming like a real human.
'Evil' Twins
Another crucial part of being human is self-reflection, highlighted by Data and Adrien’s ‘evil twins’. For Data, this is Lore, made according to the same design but with his own personality. He obtains the emotion chip intended for Data. Where Data is all logic, Lore is all feeling without boundaries. He’s the emotional potential exploding unchecked.
Adrien has a few ‘shadow’ sides, which I’ve explored before. An obvious one is Felix, wielder of the peacock miraculous, i.e. the power of emotion. Then there’s Cat Blanc, who is all of Adrien’s darker thoughts and feelings bursting forth uncontrolled.
What Lore, Felix and Cat Blanc have in common is that they are abused sons, bitter and resentful that their fathers never gave them the love they longed for. They lash out from grief and trauma. Also, Lore killed his/Data’s father, while Cat Blanc killed Gabriel in an alternate timestream.
Data needs to face and deal with this ‘shadow’ side before he’s ready to receive the emotion chip intended for him and find the happy medium between the two extremes of the emotional spectrum. We see this when he is forced to deactivate Lore, thus inheriting the emotion chip, learning from the mistakes his brother made and trying to do better.
Likewise, Adrien needs to come to terms with his ‘shadow’ before he’s ready to face the final battle with Monarch. In Conformation, this played out as Adrien having the emotional maturity to recognise when he was out of control and make the decision to remove his miraculous, stepping away from the battle before he could do damage.
This is without even getting into the mirror universes in both shows, where Data has a much more direct ‘evil’ alter, and Cat Noir comes head-to-head with Claw Noir. (Sadly, we never saw Data hug Evil Data the way we saw our two Noirs accept each other.) There was also the very brief Copycat moment. Adrien’s had a lot of doubles.
It’s interesting to note that Lore believed he was superior to humans and had a genocidal plan, much like Felix snaps all the non-sentibeings out of existence in Emotion. Lore also dresses like Data and poses like him to fool their father and others on the ship, like Felix does with Adrien, attempting to fool Gabriel and others in Adrien’s life. A key difference is that Lore never really gets the chance for redemption, whereas Felix is only 14 and has his whole life ahead of him, with the opportunity for positive evolution. He discovers the love he’s lacked as a child and chooses to change.
At one point, Lore also implants the emotion chip in Data and uses it to control him remotely, inspiring anger and hate in him, thus weaponising him. Felix makes it very clear that he would never try to control Adrien. Instead, it’s Gabriel who controls Adrien via the twin rings, the Alliance, and magical dust causing him to live out his worst nightmare. In Cat Blanc and Ephemeral, we also see Gabriel akumatise Adrien, using his darkest feelings to transform him into a weapon.
Perfection
When Data first meets his brother, Lore convinces Data that he was created as a 'less perfect' version of him. When they are later summoned back to their father's secret home, Data learns this is untrue and is fascinated to realise that he is 'not less perfect than Lore'. Adrien, too, is repeatedly described as 'perfect'.
Cats
Data has a beloved pet cat. Adrien, of course, spends all his private time with Plagg. I also can't help but notice that Data and Cat Noir's eyes are similar in colour.
Artistry / Education
Data is a keen painter and violinist, and well-versed in just about everything, bearing in mind he was programmed with encyclopaedic knowledge. Adrien is a pianist and, thanks to his father’s programming, is also highly skilled in multiple disciplines and languages…and seems to know a lot of very random information, e.g. Morse code.
Undead Mothers and the Demented Fathers Who Can’t Let Them Go
Adrien’s father can’t get over the death of his wife and keeps her cryogenically sustained in the basement. Similarly, Data’s ‘father’ was married to a woman Data deems his ‘mother’. She, too, died long ago, but his father transferred her personality and memories into an android body. She’s the only android in the galaxy who can pass as completely human, even in medical scans.
Self-Sacrifice
Cat Noir is always sacrificing himself for Ladybug. We don’t know if he remembers those moments of death or not. He seems to throw himself in front of her on instinct, as if it were the natural order of things. He believes he’s not as valuable as she is.
In Star Trek: Nemesis Data sacrificed himself for Picard – although he did see himself as of equal value. In fact, he argued strenuously throughout the series for equal status with humans. And to all of us watching Nemesis…I can’t begin to tell you how hard his ‘death’ hit me.
On paper, it makes sense for an android – a robot – to sacrifice himself so a human may live. But we spent seven seasons plus several films getting to know Data. Android or not, he was a beloved character, he was a friend, he formed relationships. That’s why the crew gave him a full funeral and eulogies.
Likewise, Cat’s sacrifices hit Ladybug hard, something many of us fanfiction writers have explored in depth.
Programming
Perhaps the biggest difference is that Data knew he was programmed. He knew he was at the mercy of his microchip. He knew when his emotion chip was installed. He was wholly conscious of his developmental journey.
Yet, even after five seasons, Adrien has no idea. He doesn’t know he’s been programmed because he doesn’t know he’s a sentibeing. Like something out of Blade Runner – and like Data’s mother – Adrien would pass the Empathy Test and be deemed human yet still have no idea of his origins.
The fact is, you can’t break your programming until you know you’ve been programmed.
Data was given all the chances he needed to live a full life and make a fully informed decision about what he wanted to do with it at the end when he made that final sacrifice. Adrien has none of that. He was edging towards revelation, realising his father was controlling him somehow, but all of that was taken from him at the end of S5. It’s like if Data were given a hard reset.
Marinette’s choice to keep the truth from Adrien mirrors Data’s choice to keep the truth from his android mother. She believes she’s a human, the original woman she was made to replicate, and he chooses to let her carry out her life under that pretence.
'I want to be a real boy!'
As I said, Data and Adrien are both like Pinocchio, trying to be ‘real boys’.
At the end of Data’s life, it didn’t matter whether he was made by a scientist or by organic means. He touched people’s lives, and that made him real in all the ways that matter.
Similarly, if Adrien got snapped away, people would remember him. They would mourn him. It doesn’t matter if there’s a body left behind or not. He too has touched so many lives, and that makes him just as real and important as any organic human.
In these ways, I believe they both succeed at being 'real' regardless of how they came into being.
I bet I could think of more parallels if I tried, but this has already got long enough :)
#ml meta#ml analysis#data#star trek data#commander data#star trek tng#star trek nemesis#star trek spoilers#ml adrien#adrien agreste#miraculous ladybug#ml fandom#chat noir#cat noir#ml s5#ml s5 finale#ml conformation#pinocchio#character analysis#ml felix#ml gabriel#sentibeing#sentimonster adrien#ml sentimonster
17 notes
Text
i'm sorry if you genuinely think bozzi and leclerc "copied the other driver/engineer's strategy" i canttttttt take you seriously
#do any of you understand how this team shit works. how this pre-race strategy meetings team shit works.#or calling this win 'lucky' be for reallllllll#i dont generally go for the block button but that should be an immediate block#its just fascinating the thought processes required to avoid admitting some of these guys are just good at their jobs#possibly better than others.#there's thoughts in me about the ways fandom 'character analysis' trends intersect with the way people talk about f1 on tumblr/twitter#while just completely forgetting or ignoring not just the competitive sports of it all but the very real ways the teams operate#did you guys know ferrari has a whole 'remote garage' of engineers in italy that tune in every race just to analyse data in real time#and feed back possible strategies to the pit wall that then get discussed and acted on based on drivers feedback?#do you GENUINELY think its just bryan bozzi leaning over fred's shoulder to copy adami's homework#you know ferrari has their very own hannah schmitz? maybe not as good as her but there's a dude in there whose job is 'tell us what to do'#maybe you could learn his name it might be helpful#sorry AND ONE MORE THING#how do you call yourself a leclerc fan and then turn around to call this a lucky win#it required outqualifying his teammate#it required taking advantage of the situation around him to jump lando at la roggia#it required sticking close to both mclarens in dirty air and taking a gamble on the early pit stop#it required 37 LAPS ON HARDS THAT NEVER WENT BELOW OR ABOVE 1:23:000 EXCEPT ONCE#and yes it required teamwork. as most wins do unless you have a rocket under your ass (and/or don't know how to use it)#the only lucky part was lando once again fumbling the first lap and george taking himself out at turn 1#but you understand he still had to drive the rest of the 52 laps himself right. god#its too early for me to be this mad
3 notes
Text
the wilds ao3 and tumblr stats - April 2024
last two graphs are the first graph separated
i ran out of things to do for these so here's the updated info for March and April 2024
notable observations regarding April 2024:
the wilds tag on tumblr had over 100 posts in April 2024 - something that hasn't occurred in literally a year
leatin had more fics posted than shoni in April 2024. this has only happened like twice before [x, Findings/ Observations - Ships].
The post count is only for new content. That is, new fics on ao3 and new posts on tumblr. The numbers do not account for reblogs, chapters updated, etc.
Data for March and April 2024 collected on May 16, 2024 - 10pm EDT.
See this post (tumblr) and this post (ao3) for relevant notes and other observations for previous months.
Other Fandom Stats
The Wilds AO3 March 2024
My Fandom Stats List
#my stats#leatin#shoni#the wilds#btch talk#fandom stats#shout out to fandom stats#for helping me during my data analysis test on tuesday
6 notes
Text
Data from my fanfiction and mental illness survey is now publicly available!
It is stored safely and eternally on the Harvard Dataverse. In accordance with the request of my ethics board (University of Kent CREAG), the demographic data has been disaggregated (separated) from the long-form responses on fanfiction practices in the public files. However, anyone can request the disaggregated version through the link above.
The data has been randomised on all three files for safety, and redacted of the thankfully small amount of identifying information that was given by participants, thus rendering it as anonymous as possible.
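As a rough illustration of what disaggregating and randomising a survey export can look like, here is a minimal sketch. It is not the procedure actually used for this dataset: the field names, the two-way split (the post mentions three files), and the function are all assumptions.

```python
import random


def disaggregate(rows, demographic_fields, seed=None):
    """Split rows into a demographics table and a responses table,
    then shuffle each independently so row positions no longer match."""
    rng = random.Random(seed)
    demographics = [{k: row[k] for k in demographic_fields} for row in rows]
    responses = [
        {k: v for k, v in row.items() if k not in demographic_fields}
        for row in rows
    ]
    rng.shuffle(demographics)
    rng.shuffle(responses)
    return demographics, responses


# Hypothetical survey rows; the field names are invented for the example.
rows = [
    {"age": "18-24", "country": "UK", "response": "I write to process things."},
    {"age": "25-34", "country": "DE", "response": "Mostly hurt/comfort."},
    {"age": "35-44", "country": "US", "response": "Fic helped me feel seen."},
]
demographics, responses = disaggregate(rows, ["age", "country"], seed=1)
print(demographics)
print(responses)
```

Shuffling each table separately means a given row in one file can no longer be matched to a row in the other by position.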
It is under a Creative Commons license that allows anyone to use it for their research, but please do cite the original dataset if you do.
Thank you to everyone who took part - I know already that there are some incredible insights here and really thoughtful reflections on how we approach mental illness in fanfiction.
#fan studies#fandom studies#fanfic#mad studies#fandom research#fanfiction#phdresearch#fandom#fandom stats#mental illness#data#data analysis#academic#research#phd
12 notes
Text
staff saw “multiverse” trending with the rise of eeaao and spider-verse so they decided to add a silly graph that will engage w your sensory needs
#tumblr update#reblog graph#i've been staring at different flowery shapes since i discovered this feature#but also if you have brain for data collection/analysis or study sociology#you could make a somewhat 'full' picture of certain fandom's ecosystems?#and probably compare results maybe?#like im sure there could be results awfully similar to each other
7 notes
Text
Professor Membrane's carefully reinforced glass castle is hilarious. I watched the Invader Zim movie and the way he preserves his sanity whilst Dib desperately seeks approval and finally kind of processes that his denial has nothing to do with him was both relatable and hilarious. The kind of science the professor does involves breaking new ground; and when you do that you really, really need to keep your sanity in check. More than most. His denial of aliens and magic seems to stem from past disappointment, which is also a mood.
#Does he let Dib fuck with shit alone partially because that's how he gained his own intelligence? Largely alone?#Or is it entirely unintentional?#Even when he denies reality he can't stand by when his children need him.#Invader Zim#Sorry for joining fandom analysis late. I didn't watch the show as a child.#PERSONAL DATA
11 notes
Note
wow, the statistical analysis you do with Good Omens is SO COOL! do you think you could show how you compile the data and display it? surely you don’t do it by hand, scrolling thru ao3 haha… surely…? (also sorry if you already posted a little instruction guide btw and i just didn’t see it)
Hi! Yes I'm quite happy to share, I love talking about it.
No, I do not scroll through ao3 by hand lol that would take soooooo long. I mean, what I do takes a good amount of time as well but it's manageable.
It starts with a spreadsheet that looks like this:
And then I determine exactly what the filter criteria look like for each cell. For example, the cell that's highlighted involves setting the date range to 2019-09-01 to 2019-11-30 and then looking at the rating values in the sidebar like so:
and copying them over to the spreadsheet (I usually have a split screen between the two for ease of copying).
For the tags there's two methods, and different datasets I pull differently. The easy way is to pull from that same sidebar for the popular tags, which also display a quantity value. But if some of the tags are not consistently top ten, or if you're looking for more comprehensive data like the one I'm working on right now, you have to enter the tag itself as a search criterion and use this number:
The number in the sidebar differs from the total search number because of ao3's incredible tag wrangling system, which includes associated tags under the umbrella in your searches. Thus, you have to either use the sidebar or the search number consistently.
That way, you can track any number of factors across time periods to get this kind of data (it's also super useful for finding extremely specific kinds of fics!) and then I just use the built-in graph maker in google sheets most days. Some of the more brightly colored ones do come from other sites, if I feel like customizing them visually, but most of them come from sheets. Additionally, most of my blog posts with data analysis have a "Read more" section with details about how the specific kinds of data were pulled if you're interested in specific details. I've also gotten better at identifying different mechanisms, so some of my first sets may not be as sophisticated lol.
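If typing out the date ranges for every spreadsheet row gets tedious, they can be generated. The sketch below assumes three-month windows like the 2019-09-01 to 2019-11-30 example above; the function name is made up, and it only produces the dates to enter into the filter by hand, since it does not talk to ao3 in any way.

```python
import calendar
from datetime import date


def date_windows(start_year, start_month, count, months_per_window=3):
    """Return `count` consecutive (first_day, last_day) windows."""
    windows = []
    year, month = start_year, start_month
    for _ in range(count):
        # Zero-based month index of the last month in this window.
        end_index = (month - 1) + (months_per_window - 1)
        end_year, end_month = year + end_index // 12, end_index % 12 + 1
        last_day = calendar.monthrange(end_year, end_month)[1]
        windows.append((date(year, month, 1), date(end_year, end_month, last_day)))
        # Step to the first month after this window.
        next_index = (month - 1) + months_per_window
        year, month = year + next_index // 12, next_index % 12 + 1
    return windows


windows = date_windows(2019, 9, 4)
for first, last in windows:
    print(first.isoformat(), "to", last.isoformat())
```

Using `calendar.monthrange` for the last day means leap years and 30-day months come out right without a lookup table.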
Feel free to follow up with any other questions, I love answering them this is one of my favorite things!
9 notes
Text
.
#idk man#I published a chapter of my long fic which was the first explicitly queer chapter and like#I find it disheartening that bookmarks went down#like I was very clear that that ship was going to be included in the tags but idk man#I know there’s been so much written about the misogyny in fandom#but like to see readership go down after including a wlw ship#just like doesn’t feel good#of course maybe the reader noped out for another reason#I can’t know as much as I would like to#but my data analysis little ass wants answers#I also don’t really know what I’m getting at here#just like venting I guess#I know this isn’t a big deal#and I am incredibly grateful for everyone who’s read my work#like to the ends of the earth and back#just#musing on stuff
4 notes
Text
Having a Nerd Moment: What Should Go into an Analysis of AO3 comments:
Been mulling over putting together a dataset of AO3 comments to compare over time... 🤔
I'm mostly interested in whether commenting behaviors have changed since I started writing lots of fic 8 years ago.
Potentially include:
A unique number representing each commenter (so I can see if they comment on multiple things):
Fandom name:
Date comment received:
Date Fic Updated:
WIP v. Complete:
Length of the chapter:
Substantive: Y/N
Critical: Y/N
Editorial: Y/N
Flame: Y/N
Guest or active user:
Rating of the fic
Is the fic Ship-centric or Gen/plot-centric
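One possible way to pin those fields down before collecting anything is a small record type. This is just a sketch of the list above: the field names and types are guesses, and the `days_after_update` helper is an extra derived value that is not on the original list.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class CommentRecord:
    commenter_id: int      # unique number per commenter, not their username
    fandom: str
    comment_date: date     # date comment received
    fic_updated: date      # date fic updated
    fic_complete: bool     # False means WIP
    chapter_words: int     # length of the chapter
    substantive: bool
    critical: bool
    editorial: bool
    flame: bool
    is_guest: bool         # guest vs. active user
    rating: str
    ship_centric: bool     # False means gen/plot-centric

    def days_after_update(self) -> int:
        """Days between the fic update and the comment (derived value)."""
        return (self.comment_date - self.fic_updated).days


# Made-up example row.
example = CommentRecord(
    commenter_id=1, fandom="Example Fandom",
    comment_date=date(2024, 5, 4), fic_updated=date(2024, 5, 1),
    fic_complete=False, chapter_words=3_200,
    substantive=True, critical=False, editorial=False, flame=False,
    is_guest=True, rating="Teen And Up Audiences", ship_centric=True,
)
print(example.days_after_update())
```

Storing a commenter number instead of a username keeps repeat commenters trackable without keeping names in the dataset.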
What kind of other metrics do yall think would be interesting to track for comments?
#fanfiction#archive of our own#ao3#fanfiction analysis#fanfiction data#fandom studies#comment analysis#ao3 comments
3 notes
Text
Centreoftheselights just shared the new 2024 AO3 stats.
In less than two days their post has already received 16 MILLION views and 80 THOUSAND retweets/quote tweets, with every comment I've seen taking the data at face value and using it to draw conclusions, much to my horror.
While OP did change the title of the "new works" column to "works gained" so that they're at least not blatantly lying now (the bare minimum), the wording is still very misleading. More importantly though they continue to use the same extremely flawed methodology and continue to bury and obfuscate those flaws and what the data actually represents. Nowhere on the chart or the details provided on the main page does it even say that only publicly available works are counted... and that's not even the biggest problem!
This data is, yet again, garbage and absolutely should not be used to determine the current size and popularity of a fandom (inarguably the main reason for it existing).
AO3 Ship Stats: Year In Bad Data
You may have seen this AO3 Year In Review.
It hasn’t crossed my tumblr dash but it sure is circulating on twitter with 3.5M views, 10K likes, 17K retweets and counting. Normally this would be great! I love data and charts and comparisons!
Except this data is GARBAGE and belongs in the TRASH.
I first noticed something fishy when I realized that Steve/Bucky – the 5th largest ship on AO3 by total fic count – wasn’t on this Top 100 list anywhere. I know Marvel’s popularity has fallen in recent years, but not that much. Especially considering some of the other ships that made it on the list. You mean to tell me a femslash HP ship (Mary MacDonald/Lily Potter) in which one half of the pairing was so minor I had to look up her name because she was only mentioned once in a single flashback scene beat fandom juggernaut Stucky? I call bullshit.
Now obviously jumping to conclusions based on gut instinct alone is horrible practice... but it is a good place to start. So let’s look at the actual numbers and discover why this entire dataset sits on a throne of lies.
Here are the results of filtering the Steve/Bucky tag for all works created between Jan 1, 2023 and Dec 31, 2023:
Not only would that place Steve/Bucky at #23 on this list, if the other counts are correct (hint: they're not), it’s also well above the 1520-new-work cutoff of the #100 spot. So how the fuck is it not on the list? Let’s check out the author’s FAQ to see if there’s some important factor we’re missing.
The first thing you’ll probably notice in the FAQ is that the data is being scraped from publicly available works. That means anything privated and only accessible to logged-in users isn’t counted. This is Sin #1. Already the data is inaccurate because we’re not actually counting all of the published fics, but the bots needed to do data collection on this scale can't easily scrape privated fics so I kinda get it. We’ll roll with this for now and see if it at least makes the numbers make more sense:
Nope. Logging out only reduced the total by a couple hundred. Even if one were to choose the most restrictive possible definition of "new works" and filter out all crossovers and incomplete fics, Steve/Bucky would still have a yearly total of 2,305. Yet the list claims their total is somewhere below 1,500? What the fuck is going on here?
Let’s look at another ship for comparison. This time one that’s very recent and popular enough to make it on the list so we have an actual reference value for comparison: Nick/Charlie (Heartstopper). According to the list, this ship sits at #34 this year with a total of 2630 new works. But what’s AO3 say?
Off by a hundred or so but the values are much closer at least!
If we dig further into the FAQ though we discover Sin #2 (and the most egregious): the counting method. The yearly fic counts are NOT determined by filtering for a certain time period, they’re determined by simply taking a snapshot of the total number of fics in a ship tag at the end of the year and subtracting the previous end-of-year total. For example, if you check a ship tag on Jan 1, 2023 and it has 10,000 fics and check it again on Jan 1, 2024 and it now has 12,000 fics, the difference (2,000) would be the number of "new works" on this chart.
At first glance this subtraction method might seem like a perfectly valid way to count fics, and it’s certainly the easiest way, but it can and did have major consequences to the point of making the entire dataset functionally meaningless. Why? If any older works are deleted or privated, every single one of those will be subtracted from the current year fic count. And to make the problem even worse, beginning at the end of last year there was a big scare about AI scraping fics from AO3, which caused hundreds, if not thousands, of users to lock down their fics or delete them.
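The difference between the two counting methods can be shown on a toy set of works. This is a sketch with invented records and function names, not the chart author's code: one function counts works by posting date, the other reproduces the end-of-year subtraction.

```python
from datetime import date


def posted_in_year(works, year):
    """Count works whose posting date falls in `year`."""
    return sum(1 for w in works if w["posted"].year == year)


def snapshot_difference(works, year):
    """Visible total at the end of `year` minus visible total at its start."""
    def visible_on(day):
        return sum(
            1 for w in works
            if w["posted"] < day and (w["removed"] is None or w["removed"] >= day)
        )
    return visible_on(date(year + 1, 1, 1)) - visible_on(date(year, 1, 1))


# Invented records: "removed" is the date a work was deleted or privated.
works = [
    {"posted": date(2020, 5, 1), "removed": date(2023, 3, 10)},
    {"posted": date(2021, 7, 15), "removed": None},
    {"posted": date(2023, 2, 2), "removed": None},
    {"posted": date(2023, 8, 20), "removed": None},
]
print(posted_in_year(works, 2023))       # 2
print(snapshot_difference(works, 2023))  # 1
```

Two works were posted in 2023, but the snapshot method reports one because an older work disappeared during the same year.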
The magnitude of this fuck up may not be immediately obvious so let’s look at an example to see how this works in practice.
Say we have two ships. Ship A is more than a decade old with a large fanbase. Ship B is only a couple years old but gaining traction. On Jan 1, 2023, Ship A had a catalog of 50,000 fics and ship B had 5,000. Both ships have 3,000 new works published in 2023. However, 4% of the older works in each fandom were either privated or deleted during that same time (this percentage was just chosen to make the math easy but it’s close to reality).
Ship A: 50,000 x 4% = 2,000 removed works Ship B: 5,000 x 4% = 200 removed works
Ship A: 3,000 - 2,000 = 1,000 "new" works Ship B: 3,000 - 200 = 2,800 "new" works
This gives Ship A a net gain of 1,000 and Ship B a net gain of 2,800 despite both fandoms producing the exact same number of new works that year. And neither one of these reported counts is the actual new works count (3,000). THIS explains the drastic difference in ranking between a ship like Steve/Bucky and Nick/Charlie.
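The same arithmetic as a few lines of code, using the numbers from the example above. The function name is made up, and the 4% removal rate is the illustrative figure from this post, not a measured one.

```python
def net_gain(start_total, new_works, removal_percent):
    """Reported 'gain' under the subtraction method: new works minus removals."""
    removed = start_total * removal_percent // 100
    return new_works - removed


ship_a = net_gain(start_total=50_000, new_works=3_000, removal_percent=4)
ship_b = net_gain(start_total=5_000, new_works=3_000, removal_percent=4)
print(ship_a)  # 1000
print(ship_b)  # 2800
```

With identical output of 3,000 new works, the only input that differs is the size of the back catalog, and that alone produces the gap.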
How is this a useful measure of anything? You can't draw any conclusions about the current size and popularity of a fandom based on this data.
With this system, not only is the reported "new works" count incorrect, the older, larger fandom will always be punished and its count disproportionately reduced simply for the sin of being an older, larger fandom. This example doesn’t even take into account that people are going to be way more likely to delete an old fic they're no longer proud of in a fandom they no longer care about than a fic that was just written, so the deletion percentage for the older fandom should theoretically be even larger in comparison.
And if that wasn't bad enough, the author of this "study" KNEW the data was tainted and chose to present it as meaningful anyway. You will only find this if you click through to the FAQ and read about the author’s methodology, something 99.99% of people will NOT do (and even those who do may not understand the true significance of this problem):
The author may try to argue that their post states the chart shows the tags “which had the greatest gain in total public fanworks,” which makes it not a lie but an error on the viewer’s part in not interpreting their data correctly. This is bullshit. Their chart CLEARLY titles the fic count column “New Works” which it explicitly is NOT, by their own admission! It should be titled “Net Gain in Works” or something similar.
Even if it were correctly titled though, the general public would not understand the difference, would interpret the numbers as new works anyway (because net gain is functionally meaningless as we've just discovered), and would base conclusions on their incorrect assumptions. There’s no getting around that… other than doing the counts correctly in the first place. This would be a much larger task but I strongly believe you shouldn’t take on a project like this if you can’t do it right.
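For contrast, counting by publication date looks roughly like this. It's only a sketch under stated assumptions: the dates below are invented sample data, the function name is hypothetical, and how you would actually collect publication dates from AO3 is not shown here.

```python
from datetime import date


def count_published_in_year(publish_dates: list[date], year: int) -> int:
    """Count works whose publication date falls in the given year."""
    return sum(1 for d in publish_dates if d.year == year)


# Invented sample data: one date per work currently visible in a ship tag
sample_dates = [
    date(2019, 5, 1),
    date(2023, 1, 15),
    date(2023, 11, 30),
    date(2024, 1, 2),
]

print(count_published_in_year(sample_dates, 2023))  # 2
```

Counted this way, older works being deleted or privated doesn't drag down the current year's number. A fic that was posted and then removed before you looked would still be missed, but that only affects the year it was actually written in.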
To sum up, just because someone put a lot of work into gathering data and making a nice color-coded chart doesn’t mean the data is GOOD or VALUABLE.
#please keep spreading this post#and for the love of god please do not spread these 'studies'#every time someone trusts their data and uses it in any kind of fandom analysis I die a little bit inside#16 MILLION VIEWS!#it should be illegal to spread misinformation to that many people#I will not rest until OP learns how to collect and present data correctly#ao3#ao3 stats#fandom
2K notes
Text
tfw you keep trying to write up a concise Introduction of a Complex and Interesting Concept You Think About a Lot, getting distracted by an infodumping derail about The Breadth of the Subject, and running out of steam and having to start over ashdndmfb
#whosebaby talks#me waving a sign over my head: DISLIKING CHARACTERS IS A HIGHLY NUANCED AND PERSONAL THING#AND EXPLORING THAT AND LEARNING WHAT YOU'RE SENSITIVE TO AND COMPARING NOTES LEADS TO RICH ANALYSIS#disliking a character can be a geiger counter for certain themes and tropes and narrative devices; shitty or otherwise#and it's a highly personalized one between people and that's okay#and your ability to notice and analyze things doesn't end with what personally presses your buttons#in fact it's highly important to learn to recognize that you *won't* always have a visceral reaction to shitty things worth talking about!#and you can learn so so so many things from 'my dislike of something in fiction is not necessarily petty or irrational'#'and being colored by my personal feelings and experiences does not make it useless data; nor mean it should be treated as unimportant'#'and knee-jerk personal emotion not being objective or universal =/= *any* opinion i might have about fiction is subjective'#'especially if it's even slightly informed *by* an emotional reaction'#'my being personally triggered by a rape scene when someone else isn't does not mean it's up for debate whether it's a depiction of rape'#because fuck that shit running into hell#'but the emotional reaction itself *isn't* objective or universal; and is not synonymous with having an opinion'#'and that makes for both a rich tool of storytelling and analysis; and a check on my own potential assholery as well as other people's'#and i think this approach and its process are *critically important*#for addressing and deconstructing misogynistic/racist/ableist/fatphobic/anti-survivor/etc trends#in who fandoms Just So Happen to Dislike En Masse compared to everyone else; and why#i could go on and on and on it's so interesting and imo such an important principle to go by#gnaws on a table edge about it
0 notes