FAForever Forums
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Login
    The current pre-release of the client ("pioneer" in the version) is only compatible to itself. So you can only play with other testers. Please be aware!

    Lets Talk about Rating

    Scheduled Pinned Locked Moved Suggestions
    27 Posts 15 Posters 634 Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • ZLOZ Offline
      ZLO
      last edited by

      I always thought that bumping up uncertainty slightly before every game is the FAF invention / implementation of TrueSkill. (Afaik that is why very high rated player can lose a tiny bit of rating after winning against low rating player)

      Why? because there were tons of complains about rating moving too slowly and getting very stale over time 😄

      TA4Life: "At the very least we are not slaves to the UI" | http://www.youtube.com/user/dimatularus | http://www.twitch.tv/zlo_rd

      1 Reply Last reply Reply Quote 1
      • C Offline
        Caliber
        last edited by Caliber

        @Brutus5000 I dont beleive I mentioned any desire to change true skill.

        Rather slowing it down a bit

        this would mean that in my situation my chart would look more like these

        dsjy.png

        Without such a large change in rating so quickly

        IndexLibrorumI 1 Reply Last reply Reply Quote -1
        • StrydxrS Offline
          Strydxr FAF Association Board
          last edited by

          I've heard enough, one gorbilgillion rating reduction for non Faf-Elite.

          Mods, can we ban this guy?

          1 Reply Last reply Reply Quote 1
          • IndexLibrorumI Offline
            IndexLibrorum Moderator @Caliber
            last edited by

            @Caliber said in Lets Talk about Rating:

            @Brutus5000 I dont beleive I mentioned any desire to change true skill.

            Rather slowing it down a bit

            Huh?

            "Design is an iterative process. The required number of iterations is one more than the number you have currently done. This is true at any point in time."

            See all my projects:

            C 1 Reply Last reply Reply Quote 0
            • FtXCommandoF Offline
              FtXCommando
              last edited by

              FAF’s system is the result of fiddling with the results of professional statisticians. Our tau is like triple the value those professional statisticians recommended which is what this guy is complaining about. Still a good change though, otherwise people would be gaining or losing 3 points a game. Supposed to win 30 games in a row to climb 100 rating?

              C 1 Reply Last reply Reply Quote 0
              • C Offline
                Caliber @FtXCommando
                last edited by Caliber

                @FtXCommando said in Lets Talk about Rating:

                FAF’s system is the result of fiddling with the results of professional statisticians. Our tau is like triple the value those professional statisticians recommended which is what this guy is complaining about. Still a good change though, otherwise people would be gaining or losing 3 points a game. Supposed to win 30 games in a row to climb 100 rating?

                Yes pretty much bang on, seeing as the average rating increase/decrease for a standard team game of 10 players results in a 10 points increase or decrease and you describe it as being a 3 point change then perhaps a middle ground may be a suitable alternative.

                also seeing as the outcome of the game becomes vastly more difficult to influance the players exist in the game maybe create a larger change based off of player numbers in game, such as I see points in a 4v4 change being at 11 and larger team games can still be 10 a smaller change based of off the number of players maybe a worthwile thing to took into, perhaps more like 2 points per player in game decrease so like this,

                1v1 15 points
                2v2 13 points
                3v3 11 points
                4v4 9 points
                5v5 7 points
                6v6 5 points

                obvisously this excludes player uncertainty calculations in this example but you get the idea

                as an example, seeing as the lower the player count the more you impact the game and the rating change would be more inline with individual performance.

                @Nomander in the link you gave the guy that created the "optimimum" parameters (axle) even states that is is very volitile, which is exactly what im trying to say

                N 1 Reply Last reply Reply Quote 0
                • C Offline
                  Caliber @IndexLibrorum
                  last edited by

                  This post is deleted!
                  1 Reply Last reply Reply Quote 0
                  • FtXCommandoF Offline
                    FtXCommando
                    last edited by

                    You can’t do what you’re talking about because you’re using shown rating for these values and trueskill as a system doesn’t care about the shown rating value. Every trueskill adjustment is a tinkering of your mu and your sigma, saying to “just” adjust it by X amount based on the size of the game is impossible because trueskill doesn’t internally communicate like that.

                    If you wanted a lower tau than argue for it to be 2% of sigma rather than 3%.

                    1 Reply Last reply Reply Quote 0
                    • SkratS Offline
                      Skrat
                      last edited by

                      Hi!
                      If you want to reduce rating variations, play several global 1x1 rating games. For the next few months, your rating variations will be like this. xD

                      image.png

                      Sorry for my English. I use translator

                      1 Reply Last reply Reply Quote 2
                      • N Offline
                        Nomander Balance Team @Caliber
                        last edited by

                        @Caliber said in Lets Talk about Rating:

                        @Nomander in the link you gave the guy that created the "optimimum" parameters (axle) even states that is is very volitile, which is exactly what im trying to say

                        The purpose of trueskill is to correctly predict games, which includes getting players up to their skill level quickly and having the rating value be accurate enough after reaching that level. If you can prove that some parameter adjustment improves its predictions, that would be convincing. Axle linked a github for his work, so it should be possible to adapt for someone savvy.

                        1 Reply Last reply Reply Quote 0
                        • R Offline
                          rampeer
                          last edited by rampeer

                          I wanted to pick it up, as I have some expertise (once I made a program that computes Elo rating for chess puzzles), and wanted to contribute to FAF somehow.

                          But realized that it's hard to devise any sort of controlled experiment. Some players get better over time, some players get worse, expecting the rating to be stable is wrong, and it's impossible to differentiate between rating drift due to misconfiguration and . Maybe let different AIs battle each other?..

                          But my hunch is that current change-per-game is too high. These zigzags are not normal, just look how rating graph looks for chess:
                          image.png
                          People stay in FAF for many thousands of games, so I do not buy "it's for people to get to their real rating quicker" argument.

                          Also, 4.5% draw probability is just wrong. Who has that many draws?

                          S 2 Replies Last reply Reply Quote 1
                          • waffelzNoobW Online
                            waffelzNoob
                            last edited by

                            that's because chess doesn't have wildcard teammates and opponents with a rating of 1800 that can perform anywhere between 1200 and 2000. global rating varies so much because the performance of your opponents and teammates is almost completely random. if you play 1v1 ladder, like in chess, your rating won't go up and down more than 100 rating

                            frick snoops!

                            1 Reply Last reply Reply Quote 1
                            • R Offline
                              rampeer
                              last edited by rampeer

                              I do not believe you. 1v1 rating graph looks just as jagged as global:

                              image.png

                              I am certain it's possible to sacrifice a bit of ramp-up time (note spike on the left) for overall smoother graph and more stable rating.

                              Also, what about draw probability? It feels like all the coefficients are off; will look into it later.

                              waffelzNoobW KnownSniperK 2 Replies Last reply Reply Quote 1
                              • maudlin27M Offline
                                maudlin27
                                last edited by

                                One problem is when rating covers a wide range of games as one number

                                Most obviously an issue with global - you can play like a 1700 on one popular map and like an 800 on another
                                Making rating take far longer to adjust makes it in turn far harder for people to try different games/maps, and could hurt retention. It also takes longer for a returning player’s rating to realise they’re not as good after a multi-year gap and only playing infrequently vs when they were playing constantly.

                                So it feels better to err on the side of faster rating adjustments than slower (outside new players) to me. I also don’t see having a smoother graph as being all that big a benefit compared to the downsides.

                                M27AI and M28AI developer:
                                https://forum.faforever.com/topic/2373/ai-development-guide-and-m27ai-v81-devlog
                                https://forum.faforever.com/topic/5331/m28ai-devlog-v294
                                M28 trophy holders: Radde, Yew (Radde trophy, v285) and Zwaffel (Sladow trophy, v284)

                                1 Reply Last reply Reply Quote 0
                                • waffelzNoobW Online
                                  waffelzNoob @rampeer
                                  last edited by waffelzNoob

                                  @rampeer said in Lets Talk about Rating:

                                  I do not believe you. 1v1 rating graph looks just as jagged as global:

                                  image.png

                                  I am certain it's possible to sacrifice a bit of ramp-up time (note spike on the left) for overall smoother graph and more stable rating.

                                  Also, what about draw probability? It feels like all the coefficients are off; will look into it later.

                                  that is a graph that deviates no more than 100 rating up and down, as i said. the rating is fairly solid around 1500-1700 and that is normal because humans are inconsistent. one day we play well, one day we don't. one day we run into an opponent whos having a good day, one day we don't. same thing happens in chess, where you also lose/gain 10 rating per game btw. and there is no problem with this system because going on a 10-game winstreak and only getting 50 rating sucks

                                  and this 100 rating deviation is an entirely different scenario than what caliber described with his global rating - he lost 400, not 200 (i now see this 400 was actually exaggerated - he was 1850 and dropped to 1550, not 1900 to 1500).

                                  frick snoops!

                                  1 Reply Last reply Reply Quote 2
                                  • S Offline
                                    Sainse Balance Team @rampeer
                                    last edited by

                                    @rampeer said in Lets Talk about Rating:

                                    Also, 4.5% draw probability is just wrong. Who has that many draws?

                                    The draw probability is estimated by FAF to be 10%. The math behind it is already mentioned above. Lower draw expectation would actually increase the jumps.

                                    1 Reply Last reply Reply Quote 0
                                    • C Offline
                                      Caliber
                                      last edited by

                                      One other point i would like to raise that i forgot about in the opening statement, was that at least through my experiance is that most games seem to be won/lost quite heavily, indecating that although most games are rated at 90% + balance they often go so very heavily one sided, I would say that only around 1 in 10 games actually seem to be a good balance that last a while at least.

                                      1 Reply Last reply Reply Quote 0
                                      • FtXCommandoF Offline
                                        FtXCommando
                                        last edited by FtXCommando

                                        Every RPS game is 100% balanced yet they can go from a theoretical length of 3 seconds to the heat death of the universe.

                                        All a balanced game means is that both sides have a good shot at winning the game, neither a singular game result nor its length is evidence of it being wrong.

                                        1 Reply Last reply Reply Quote 1
                                        • S Offline
                                          SzakiLLL
                                          last edited by

                                          This my first post.
                                          If you allow me, I'd like to share a few thoughts about [<100] rate players, like myself & about my brother [<-400] ... and probably all the 99% of players (according to the statistics I've seen) who are not here, because new players tried this game, and ran away quickly.

                                          You guys are all pros. Really.
                                          And the rating system seems to be a real thing, because it's true: I can not beat a 300+ player in 1v1.

                                          The problems are:
                                          1.
                                          I can mostly beat the AI NPC (the computer), which is rated 450.
                                          -> That means in theory I'm stronger than 500...

                                          Usually there are no games for <200 players.

                                          Players with 5-800+ rating create / join games called "noob..." and take away all the fun of beginners or players who are not "on the top".
                                          So they quit for good.

                                          If I dye in a team, or do not provide enough air, etc, -> I get very strong negative chat sentences, why I'm such a noob?

                                          Yes, I know I can create a game myself, limit the max rate to 200, and wait for new players to join, but as I've already explaned:

                                          • you guys have made all of them quit already.

                                          So we are only a few left, who still try to play, but our rating is low.

                                          Conclusion / Solution ?:


                                          I do not know exactly, how to solve these problems, but these are the facts. So I'm not playing this game more than a few times a year,
                                          and mostly against the AI opponent, which is not so good, because it's either too strong, or too weak randomly, or depending on the map I choose.

                                          Maybe:

                                          • Pushing the point system by adding +10000 to everyone would allow to feel the beginners not "being a zero" or even less?
                                          • Adding extra points even if you loose, (but provided enough eco, showing you have mastered the game on a minimal level, so these players can play against each other next time.
                                          • And prohibiting somehow top players to feel "great" by smashing beginners.
                                          • Auto creating a few games which beginners can join.
                                          • Creating and showing a secondary point system, independent from the existing one. (starting from 10K or 100K, always incrementing, even if you play non-rated games.)

                                          A little background:


                                          I'm 50 year old, my brother is 44, so we are not 17 any more, our reflexes are slower, we forget more things, etc.
                                          We play this game since 25-30? years. I don't even remember, but we have always LOVED it!
                                          I have played only with 5 different games during the last 5 years. My brother even less, because we are very selective.

                                          Yes, I've seen all the videos and blogs how to play "better", but I do not want to be a top player, race against You all, kill everyone else, simply want to enjoy this.

                                          Would you think about to make this possible?

                                          1 Reply Last reply Reply Quote 1
                                          • maudlin27M Offline
                                            maudlin27
                                            last edited by

                                            I'd caution against assuming that the default AI is the same challenge as a 450 rating player; a separate points system that increases based on how many games you've played is also already implemented (in custom games at least - the number of games played shows when you join a lobby)

                                            M27AI and M28AI developer:
                                            https://forum.faforever.com/topic/2373/ai-development-guide-and-m27ai-v81-devlog
                                            https://forum.faforever.com/topic/5331/m28ai-devlog-v294
                                            M28 trophy holders: Radde, Yew (Radde trophy, v285) and Zwaffel (Sladow trophy, v284)

                                            1 Reply Last reply Reply Quote 0

                                            Hello! It looks like you're interested in this conversation, but you don't have an account yet.

                                            Getting fed up of having to scroll through the same posts each visit? When you register for an account, you'll always come back to exactly where you were before, and choose to be notified of new replies (either via email, or push notification). You'll also be able to save bookmarks and upvote posts to show your appreciation to other community members.

                                            With your input, this post could be even better 💗

                                            Register Login
                                            • First post
                                              Last post