×
  • remind me tomorrow
  • remind me next week
  • never remind me
Subscribe to the ANN Newsletter • Wake up every Sunday to a curated list of ANN's most interesting posts of the week. read more

Forum - View topic
Bar graph visual ratios


Goto page Previous  1, 2, 3  Next

Anime News Network Forum Index -> Site-related -> Bugs & Technical Questions
View previous topic :: View next topic  
Author Message
Dargonxtc



Joined: 13 Apr 2006
Posts: 4463
Location: Nc5xd7+ スターダストの海洋
PostPosted: Wed Dec 19, 2007 7:59 pm Reply with quote
abunai:

Out of those two options, as a user, I would rather the obvious spammers be deleted, than for the numbers to be taken down. It should help out the visibility of the graph as well, but still take into account the borderline spammers. This way the number of votes would actually mean something.

My that really bothers worries me about all of this, still, is not knowing where the line is that you must cross, in order to bring your rating down. I understand why you don't tell anyone about it, because then spammers would just find a way to work around it. But I still don't like it. For instance take a look at my manga ratings. I simply have not read very much manga. And I think as a consequence that my ratings will naturally drift toward the higher end of the spectrum. Would such ratings be flagged as spam? I am sure as I read more I will change ratings around(as I often do with anime), but should I artificially start changing them now to make sure I don't get get flagged. That is my worry.
Back to top
View user's profile Send private message Visit poster's website My Anime My Manga
abunai
Old Regular


Joined: 05 Mar 2004
Posts: 5463
Location: 露命
PostPosted: Wed Dec 19, 2007 8:43 pm Reply with quote
Don't worry about it. I know, that's easy to say, but seriously, don't.

As I understand the methodology involved, nobody has a "perfect 100%" rating (even Dan). The system is set up to compensate for skewed ratings, for whatever reason.

Let's take the example you give: lack of experience with manga causes you to give more high-end ratings. Why, by the way? Are you only reading the good stuff? If so, how do you recognize the good from the bad? An argument could be made that the heuristics should lower your rating, if you've only got high-end ratings (even if made in good faith), because you might lack experience enough with the bad stuff to form a balanced opinion.

As an admittedly inexperienced user, you will naturally have a limited spectrum of votes for the heuristics to judge you. As time goes by (and assuming you give out votes in a balanced fashion), your rating ought to stabilize.

But in the final analysis... what does it matter? All you can do, all any of us can do, is submit our fair, honest opinions to the database. Keep doing that, and there won't be a problem. As for those who don't... well... that's what the heuristics are for.

By the way, I don't know whether manga and anime are evaluated separately in the database. If they're not, then your limited sample in the manga section is going to be far outweighed by your contributions to the anime section, and the problem solves itself.

I suspect it's all one big database, with manga and anime merely being classes of records. But that's Dan's playground, you'll have to hope he reveals more on the subject.

- abunai
Back to top
View user's profile Send private message Visit poster's website My Anime My Manga
Dan42
Chief Encyclopedist


Joined: 02 Jan 2002
Posts: 3782
Location: Montreal
PostPosted: Wed Dec 19, 2007 10:08 pm Reply with quote
The problem is: when do you decide if a certain user is "undoubtedly" or "obviously" spamming? Maybe he just has peculiar tastes? Or just tastes that go counter to whoever is doing the judging? I am not willing to step on the slippery slope of letting humans decide if a particular person has "valid" ratings or opinions.

abunai has a good point about misleading numbers. I could change the graph to exclude users who are below a certain trustability, say, 5% (most spammers appear to be in the 0-5% range)

The filter doesn't look for certain characteristics to then flag people as spammer. There is no "line to cross"; it's a trustability continuum, from 0% to 100%.

abunai wrote:
Let's take the example you give: lack of experience with manga causes you to give more high-end ratings. Why, by the way? Are you only reading the good stuff? If so, how do you recognize the good from the bad? An argument could be made that the heuristics should lower your rating, if you've only got high-end ratings (even if made in good faith), because you might lack experience enough with the bad stuff to form a balanced opinion.
EXACTLY

BTW, manga and anime are not considered separately when calculating a user's trustability.

Ultimately, you have to remember that the purpose of the bar graph is to be fair to the people who view the page, not to those who vote.
Back to top
View user's profile Send private message Visit poster's website AIM Address My Anime My Manga
Tempest
I Run this place.
ANN Publisher


Joined: 29 Dec 2001
Posts: 10420
Location: Do not message me for support.
PostPosted: Wed Dec 19, 2007 11:02 pm Reply with quote
Dan42 wrote:
The problem is: when do you decide if a certain user is "undoubtedly" or "obviously" spamming?


When 5+ different user accounts vote identically on an anime within seconds of each other from the same IP, and then do so again on another show, and again on another and... Smile

-t
Back to top
View user's profile Send private message Send e-mail My Anime My Manga
Dargonxtc



Joined: 13 Apr 2006
Posts: 4463
Location: Nc5xd7+ スターダストの海洋
PostPosted: Wed Dec 19, 2007 11:05 pm Reply with quote
abunai wrote:
Let's take the example you give: lack of experience with manga causes you to give more high-end ratings. Why, by the way? Are you only reading the good stuff? If so, how do you recognize the good from the bad? An argument could be made that the heuristics should lower your rating, if you've only got high-end ratings (even if made in good faith), because you might lack experience enough with the bad stuff to form a balanced opinion.


Hmmm. Good point. I wasn't really thinking of it that way. But it does makes sense to me. And the more I think about it, it is actually equally fair to the voters as well. Where as veteran "fair" voters naturally receive slightly more pull for their vote than do the novice "fair" voters. Even if they both rate the ones they have both seen with the exact same ratings.

Dan42 wrote:
The filter doesn't look for certain characteristics to then flag people as spammer. There is no "line to cross"; it's a trustability continuum, from 0% to 100%.

This is good to hear. Could I also infer from this statement that just like it is possible for ones trustability to go down, that it is possible for ones trustability to also go up?

And one more silly question to anyone who can answer it. What would happen if a person who has seen say 200 shows or books, but rated every single one of them a 5?
Back to top
View user's profile Send private message Visit poster's website My Anime My Manga
Vortextk



Joined: 10 Jan 2006
Posts: 892
Location: Orlando, Fl
PostPosted: Thu Dec 20, 2007 12:23 am Reply with quote
Isn't it just the same thing that's being discussed? The only reason that 1 and 10 might be very common is for simple reasons. A person wants an anime to go up on the top lists, or just loves it and is immature about voting. Of course the same applies for the negative side of that, giving it a 1 to drop it or because you hate it and want to spam it. You can also say popular shows like Haruhi get spammed with low scores(as well as high ones I'm sure too) just because it IS popular. People are free to have their opinions, but I really doubt the worth of votes when you see, for example, 3 bad, 5 awful and 54 worst ever.

I would imagine their worth would be just as low as someone who voted 200 1's or 10's, it's just probably more uncommon to spam 5.
Back to top
View user's profile Send private message AIM Address My Anime
rti9



Joined: 08 Jul 2007
Posts: 1241
PostPosted: Thu Dec 20, 2007 2:18 am Reply with quote
Dan42 wrote:
The problem is: when do you decide if a certain user is "undoubtedly" or "obviously" spamming? Maybe he just has peculiar tastes? Or just tastes that go counter to whoever is doing the judging? I am not willing to step on the slippery slope of letting humans decide if a particular person has "valid" ratings or opinions.


I give my total support to Dan42, manual moderation is nothing but a pain in the ass.

tempest wrote:
When 5+ different user accounts vote identically on an anime within seconds of each other from the same IP, and then do so again on another show, and again on another and...Very Happy


If the system could identify these independently then it would be great. The thing is to not let human judgement interfere.

The rating system is fine. The team that developed it made an excellent job in my humble opinion. Despite all spamming, the top rated lists all seem to be "healthy" (meaning nothing absurd). I would be worried if industry-related users were using it to try to boost sales by interfering in the rating system, but I really doubt that is happening. And even if they were, then it would just prove that the algorithm works.

A question to add to Dargonxtc's that might help explain the bar graphs: does a vote given by someone who selects "seen all" and a vote given by someone who selects "seen some" have the same value?
Back to top
View user's profile Send private message My Anime My Manga
abunai
Old Regular


Joined: 05 Mar 2004
Posts: 5463
Location: 露命
PostPosted: Thu Dec 20, 2007 7:13 am Reply with quote
rti9 wrote:
I give my total support to Dan42, manual moderation is nothing but a pain in the ass.

As a forum moderator, I give you my unconditional ageement on that.

rti9 wrote:
A question to add to Dargonxtc's that might help explain the bar graphs: does a vote given by someone who selects "seen all" and a vote given by someone who selects "seen some" have the same value?

Good question. I don't know how Dan has implemented it, but I'd guess that they do have the same value. The idea of the rating system, after all, is not to produce fixed-in-stone evaluations ("Sorry, you already rated Medabots Excellent, and we're not going to let you change that to Bad, neener-neener!") but rather to reflect the fluidity of viewer opinions. Over time, "hit" shows tend to stabilize at a slightly different level, for instance, as people change their votes (usually downgrading from the Masterpiece ranking they'd originally given, so far as I can tell).

An example: many anime that are new in the Encyclopedia, will gather a slew of initial high or low votes, often before they've even aired in Japan. In the first place, these reflect the viewer expectation level, so they are not really "wrong" as such. In the second place, they tend to alter as the people who made the votes (usually the ones concerned enough to actually see the anime) alter them to reflect the actual impression they got from seeing it. And again, these votes are spammish in character, and tend to lower the trustability index of the users in question while they persist, so the heuristics catch this and compensate for it.

Someone asked, I think, whether the trustability index of a user can change -- the answer to this, as I know from asking Dan myself, is yes. Apparently the index of a user is recalculated (I'm guessing once a day, as part of regular database maintenance procedures, but it could be less often).

I don't know the algorithm whereby Dan calculates trustability, but here's how I would do it: I'd create a bell-curve with a peak centered slightly above average (this is a site for anime lovers, and a reasonable median vote would probably be a vote of Good, so let's be nice an say that that is the norm). Then I'd periodically match each user's actual vote distribution to this bell-curve and calculate how much he deviates from it. This would give me a trustability index that would correct for his inherent bias (which everybody, including myself, has, so there's no onus to the word here).

I'm sure Dan's system is somewhat more complex, but that's a stab in the dark at how it might work.

- abunai


Last edited by abunai on Thu Dec 20, 2007 9:42 am; edited 1 time in total
Back to top
View user's profile Send private message Visit poster's website My Anime My Manga
Tempest
I Run this place.
ANN Publisher


Joined: 29 Dec 2001
Posts: 10420
Location: Do not message me for support.
PostPosted: Thu Dec 20, 2007 8:28 am Reply with quote
rti9 wrote:
I give my total support to Dan42, manual moderation is nothing but a pain in the ass.

abunai wrote:
As a forum moderator, I give you my unconditional ageement on that.


Agreed.

-t
Back to top
View user's profile Send private message Send e-mail My Anime My Manga
woelfie
Encyclopedia Editor


Joined: 02 Jan 2005
Posts: 380
Location: Belgium
PostPosted: Sun Dec 23, 2007 5:30 pm Reply with quote
rti9 wrote:
A question to add to Dargonxtc's that might help explain the bar graphs: does a vote given by someone who selects "seen all" and a vote given by someone who selects "seen some" have the same value?

Suppose I've seen only part (let's say about 50) of Pokemon's 276 episodes, and I've come to the conclusion that it's really not worth watching one more of them.
Obviously, I can't say "seen all". Would that make my opinion less valid that that from the few dozens of people who had the courage to wrestle through the whole bunch, well knowing that they will not alter their opinion (be it good or bad) of the series ?
(please don't answer this rhetoric question)
Back to top
View user's profile Send private message Visit poster's website MSN Messenger ICQ Number My Anime My Manga
selenta
Subscriber



Joined: 19 Apr 2006
Posts: 1774
Location: Seattle, WA
PostPosted: Sun Dec 23, 2007 8:51 pm Reply with quote
woelfie wrote:
Obviously, I can't say "seen all". Would that make my opinion less valid that that from the few dozens of people who had the courage to wrestle through the whole bunch, well knowing that they will not alter their opinion (be it good or bad) of the series ?


I'm pretty sure Dan has said that 'seen all' and 'seen some' are valued the same, as they should be. The belief is that the rating reflects the user's current opinion of the show, and as they watch more or finish it, they will keep it updated. It's too much work with little to no meaningful importance to keep track of how much of a series each user has seen to influence the value of their vote, absolutely unnecessary. So yes, if you've seen enough of a show that you are comfortable rating it, then the 'value' of your vote is just the same as anyone else's.
Back to top
View user's profile Send private message Visit poster's website My Anime My Manga
Redbeard 101
Oscar the Grouch
Forums Superstar


Joined: 14 Aug 2006
Posts: 16935
PostPosted: Mon Dec 24, 2007 1:29 am Reply with quote
Here's a question. It may be totally retarded but as they say there's never such a thing as a dumb question.....right? Anime smile + sweatdrop

My question (if Dan can even answer this without giving insider info) is once the encyclopedia is done with all the genre upgrades and search options etc will that effect how a person's ratings are judged? What I'm thinking is some people are more favorable or critical towards different genres. A person may love comedies for example and not care so much for science fiction. So their ratings may be a lot more favorable for comedy titles as opposed to science fiction titles. So does this system that determines a spam rater as opposed to an honest rating consider this option of the genre of the title being rated?
Back to top
View user's profile Send private message My Anime My Manga
Dan42
Chief Encyclopedist


Joined: 02 Jan 2002
Posts: 3782
Location: Montreal
PostPosted: Mon Dec 24, 2007 11:56 am Reply with quote
rti9 wrote:
A question to add to Dargonxtc's that might help explain the bar graphs: does a vote given by someone who selects "seen all" and a vote given by someone who selects "seen some" have the same value?

They are worth the same but I would like to change that. Or at least reflect this datum in the graph. In the most complex case, imagine that the graph would become 3-D, with a "number of episodes watched" axis. Then you could see how the opinion distribution changes as people watch more episodes. e.g. people tend to rate "masterpiece" after episode 1, "good" at the halfway point and "awful" after finishing the series. But a 3-D graph would be too complex so I'd have to find a simpler representation for this data.
Back to top
View user's profile Send private message Visit poster's website AIM Address My Anime My Manga
dormcat
Encyclopedia Editor


Joined: 08 Dec 2003
Posts: 9902
Location: New Taipei City, Taiwan, ROC
PostPosted: Mon Dec 24, 2007 12:07 pm Reply with quote
Dan42 wrote:
In the most complex case, imagine that the graph would become 3-D, with a "number of episodes watched" axis. Then you could see how the opinion distribution changes as people watch more episodes. e.g. people tend to rate "masterpiece" after episode 1, "good" at the halfway point and "awful" after finishing the series. But a 3-D graph would be too complex so I'd have to find a simpler representation for this data.

3-D graphs are indeed too complex for both programmers and readers. How about having two separate bar graphs, one for "seen some" and one for "seen all?" A "will not finish" bar graph is optional, but that one is expected to be very lowly rated and does not really contributes to the overall rating. However, if a title has a significant portion of "will not finish" then it's likely to be crappy.
There are easier things to do, such as additional languages...
Back to top
View user's profile Send private message Visit poster's website AIM Address Yahoo Messenger MSN Messenger ICQ Number My Anime My Manga
Mylene



Joined: 07 Feb 2006
Posts: 2792
Location: Indiana
PostPosted: Mon Dec 24, 2007 12:56 pm Reply with quote
dormcat wrote:
A "will not finish" bar graph is optional, but that one is expected to be very lowly rated and does not really contributes to the overall rating. However, if a title has a significant portion of "will not finish" then it's likely to be crappy.


However, "Will not finish" does not necessarily indicate utter dislike of a series by any stretch of the imagination. Some NA Kodocha fans might shift the series into "Will not finish" simply because they will (may?) not have the opportunity to ever finish the series. I put several series such as Voltron into my "Will not finish," not because I hated them too much to continue, but because while I liked them in the past and gave them an average rating, I don't see myself going out of the way to watch every single episode.
Back to top
View user's profile Send private message My Anime My Manga
Display posts from previous:   
Reply to topic    Anime News Network Forum Index -> Site-related -> Bugs & Technical Questions All times are GMT - 5 Hours
Goto page Previous  1, 2, 3  Next
Page 2 of 3

 


Powered by phpBB © 2001, 2005 phpBB Group