leaderbot.models.BradleyTerry.leaderboard
- BradleyTerry.leaderboard(max_rank=None)
Print leaderboard of the agent matches.
- Parameters:
- max_rank : int, default=None
The maximum number of agents to be displayed. If None, all agents in the input dataset will be ranked and shown.
- Raises:
- RuntimeError
If the model is not trained before calling this method.
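The contract described above (rank by score, truncate to `max_rank` unless it is None, and raise `RuntimeError` when the model is untrained) can be sketched with a small stand-in class. This is a minimal illustration only: `ToyModel` is a hypothetical class, not part of leaderbot, and unlike the documented method, which prints its report, the toy also returns its rows so the behavior can be checked. The scores are copied from three rows of the example output on this page.

```python
from typing import Dict, List, Optional, Tuple


class ToyModel:
    """Hypothetical stand-in for illustration; not a leaderbot class."""

    def __init__(self, scores: Dict[str, float]) -> None:
        self._scores = scores
        self._trained = False

    def train(self) -> None:
        # A real model would fit its parameters here; the toy only flips a flag.
        self._trained = True

    def leaderboard(
        self, max_rank: Optional[int] = None
    ) -> List[Tuple[int, str, float]]:
        # Mirror the documented behavior: refuse to report before training.
        if not self._trained:
            raise RuntimeError("Train the model before calling leaderboard().")

        # Rank agents by descending score.
        ranked = sorted(self._scores.items(), key=lambda kv: kv[1], reverse=True)

        # max_rank=None keeps every agent; otherwise keep the top max_rank.
        if max_rank is not None:
            ranked = ranked[:max_rank]

        rows = [
            (rank, agent, score)
            for rank, (agent, score) in enumerate(ranked, start=1)
        ]
        for rank, agent, score in rows:
            print(f"{rank:>3}. {agent:<20} {score:+.3f}")
        return rows


# Scores taken from the example output on this page (assumed illustrative subset).
model = ToyModel({
    "gpt-4o-2024-05-13": 0.130,
    "chatgpt-4o-latest": 0.172,
    "athene-70b-0725": 0.100,
})

# Calling before training raises RuntimeError, as the Raises section states.
untrained_error = None
try:
    model.leaderboard()
except RuntimeError as err:
    untrained_error = str(err)

model.train()
top_two = model.leaderboard(max_rank=2)   # only the two highest-scoring agents
everyone = model.leaderboard()            # max_rank=None shows all agents
```

With a real leaderbot model the same calling pattern applies: call `train()` first, then `leaderboard()`, optionally wrapping the call in `try`/`except RuntimeError` if training may have been skipped.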
Examples
>>> from leaderbot.data import load
>>> from leaderbot.models import Davidson
>>> # Create a model
>>> data = load()
>>> model = Davidson(data)
>>> # Train the model
>>> model.train()
>>> # Leaderboard report and plot
>>> model.leaderboard(max_rank=20)
The above code prints the text output below and also displays a plot.
+---------------------------+--------+--------+---------------+---------------+
|                           |        |    num |   observed    |   predicted   |
| rnk  agent                |  score |  match | win loss  tie | win loss  tie |
+---------------------------+--------+--------+---------------+---------------+
|   1. chatgpt-4o-latest    | +0.172 |  11798 | 53%  23%  24% | 53%  23%  24% |
|   2. gemini-1.5-pro-ex... | +0.149 |  16700 | 51%  26%  23% | 51%  26%  23% |
|   3. gpt-4o-2024-05-13    | +0.130 |  66560 | 51%  26%  23% | 51%  26%  23% |
|   4. gpt-4o-mini-2024-... | +0.121 |  15929 | 46%  29%  25% | 47%  29%  24% |
|   5. claude-3-5-sonnet... | +0.119 |  40587 | 47%  31%  22% | 47%  31%  22% |
|   6. gemini-advanced-0514 | +0.116 |  44319 | 49%  29%  22% | 49%  29%  22% |
|   7. llama-3.1-405b-in... | +0.111 |  15680 | 44%  32%  24% | 44%  32%  23% |
|   8. gpt-4o-2024-08-06    | +0.110 |   7796 | 43%  32%  25% | 43%  32%  25% |
|   9. gemini-1.5-pro-ap... | +0.109 |  57941 | 47%  31%  22% | 47%  31%  22% |
|  10. gemini-1.5-pro-ap... | +0.106 |  48381 | 52%  28%  20% | 52%  28%  20% |
|  11. athene-70b-0725      | +0.100 |   9125 | 43%  35%  22% | 43%  35%  22% |
|  12. mistral-large-2407   | +0.099 |   9309 | 41%  35%  25% | 41%  34%  25% |
|  13. gpt-4-turbo-2024-... | +0.099 |  73106 | 47%  29%  24% | 47%  29%  24% |
|  14. llama-3.1-70b-ins... | +0.096 |  10946 | 41%  36%  22% | 41%  37%  22% |
|  15. claude-3-opus-202... | +0.094 | 134831 | 49%  29%  21% | 49%  29%  21% |
|  16. gpt-4-1106-preview   | +0.093 |  81545 | 53%  25%  22% | 53%  25%  22% |
|  17. yi-large-preview     | +0.088 |  42947 | 46%  32%  22% | 45%  31%  23% |
|  18. gpt-4-0125-preview   | +0.087 |  74890 | 49%  28%  23% | 49%  28%  22% |
|  19. reka-core-20240722   | +0.080 |   5518 | 39%  39%  22% | 39%  39%  22% |
|  20. gemini-1.5-flash-... | +0.080 |  45312 | 43%  35%  22% | 43%  35%  22% |
+---------------------------+--------+--------+---------------+---------------+