HIPE-2026 Evaluation Results (Binary at)
This file is generated from results-binary.d/system-rankings/*.tsv.
Teams
| team | name | affiliation |
|---|---|---|
| baseline | Ministral-3-3B-Instruct GGUF baseline 0.2.2 random seed 42 | HIPE-2026 organizers |
| random | Random Decision Baseline | HIPE-2026 organizers |
| team1 | Awakened | National University of Science and Technology Politehnica Bucharest |
| team10 | BIU_NLP | Bar-Ilan University |
| team11 | gipplab | University of Göttingen |
| team12 | whereami | Alexandria University |
| team13 | Spinfo | Universität zu Köln |
| team14 | MILRIT | University of Toulouse & La Rochelle University |
| team15 | FI-CODE | University of the Bundeswehr Munich |
| team16 | Rittik&Souvik | Jadavpur University, Kolkata |
| team17 | INSA Lyon | INSA Lyon - University of Lyon |
| team2 | DS@GT_HIPE | Georgia Institute of Technology |
| team3 | VerbaNexAI II | Universidad Tecnológica de Bolívar |
| team4 | FourBytes | Sri Sivasubramaniya Nadar College of Engineering |
| team5 | UMUTEAM | Universidad de Murcia |
| team6 | VerbaNexAI I | Universidad Tecnológica de Bolívar |
| team7 | ROSTI | Université Lumière Lyon |
| team8 | MaxFo-Ajie | Foshan University |
| team9 | Hansel&Gretel | IIT Roorkee |
Table of Contents
- Accuracy Profile Ranking Overall
- Accuracy Profile Ranking German
- Accuracy Profile Ranking English
- Accuracy Profile Ranking French
- Generalization Profile Ranking
- Generalization Profile Ranking French
- Efficiency Profile Ranking Overall
- Balanced Efficiency Profile Ranking Overall
- Efficiency Profile Ranking German
- Efficiency Profile Ranking English
- Efficiency Profile Ranking French
Profile Score Definitions
- Accuracy Profile Ranking uses the
impressotest files. - Generalization Profile Ranking uses the
surprisetest files. - For a label
l,recall_l = true_positives_l / gold_instances_l. - This binary report maps
PROBABLEtoTRUEforatin both reference and system labels. at_macro_recall = mean(recall_TRUE, recall_FALSE)for the binarizedatlabels.isAt_macro_recall = mean(recall_TRUE, recall_FALSE)for theisAtlabels.impresso_profile_score: score for oneimpressolanguage file, computed as the mean ofat_macro_recallandisAt_macro_recall.mean_impresso_profile_score: mean ofimpresso_profile_scoreover the submittedimpressolanguage files.surprise_profile_score: score on asurprisefile, computed asat_macro_recall;isAtis not evaluated forsurprise.- Accuracy columns are included as contextual diagnostics; ranking is still determined by the macro-recall profile score.
mean_efficiency_profile_rank: mean ofrank_impresso_profile_score,rank_hipe_parameter_count, andrank_hipe_model_size; lower is better.balanced_efficiency_profile_rank:0.5 * rank_impresso_profile_score + 0.25 * rank_hipe_parameter_count + 0.25 * rank_hipe_model_size; lower is better.- If
team_efficiency_opt_out=truein a run’s*-info.json, that run is excluded from efficiency ranking tables. - If organizer fields
hipe_parameter_countorhipe_model_sizearenull, they are internally treated as maxint for efficiency rank computation (worst resource rank), while remaining empty in table outputs.
Accuracy Profile Ranking Overall
| rank | team | run | mean impresso profile score | languages | num language files |
|---|---|---|---|---|---|
| 1 | team13 | run1 | 0.8419 | de,en,fr | 3 |
| 2 | team13 | run3 | 0.8369 | de,en,fr | 3 |
| 3 | team12 | run2 | 0.8156 | de,en,fr | 3 |
| 4 | team12 | run1 | 0.8148 | de,en,fr | 3 |
| 5 | team13 | run2 | 0.7998 | de,en,fr | 3 |
| 6 | team1 | run3 | 0.7925 | de,en,fr | 3 |
| 7 | team8 | run1 | 0.788 | de,en,fr | 3 |
| 8 | team1 | run1 | 0.7862 | de,en,fr | 3 |
| 9 | team8 | run2 | 0.7701 | de,en,fr | 3 |
| 10 | team17 | run1 | 0.7629 | de,en,fr | 3 |
| 11 | team8 | run3 | 0.76 | de,en,fr | 3 |
| 12 | team11 | run2 | 0.739 | de,en,fr | 3 |
| 13 | team9 | run3 | 0.7284 | de,en,fr | 3 |
| 14 | team11 | run1 | 0.7198 | de,en,fr | 3 |
| 15 | team5 | run2 | 0.7052 | de,en,fr | 3 |
| 16 | team3 | run3 | 0.6864 | de,en,fr | 3 |
| 17 | baseline | run1 | 0.6818 | de,en,fr | 3 |
| 18 | team14 | run3 | 0.6782 | de,en,fr | 3 |
| 19 | team9 | run2 | 0.6765 | de,en,fr | 3 |
| 20 | team10 | run2 | 0.6721 | de,en,fr | 3 |
| 21 | team14 | run1 | 0.6653 | de,en,fr | 3 |
| 22 | team1 | run2 | 0.6558 | de,en,fr | 3 |
| 23 | team9 | run1 | 0.6539 | de,en,fr | 3 |
| 24 | team10 | run3 | 0.6274 | de,en,fr | 3 |
| 25 | team3 | run2 | 0.6259 | de,en,fr | 3 |
| 26 | team11 | run3 | 0.6145 | de,en,fr | 3 |
| 27 | team2 | run1 | 0.6065 | de,en,fr | 3 |
| 28 | team3 | run1 | 0.6005 | de,en,fr | 3 |
| 29 | team17 | run2 | 0.5853 | de,en,fr | 3 |
| 30 | team2 | run2 | 0.5819 | de,en,fr | 3 |
| 31 | team17 | run3 | 0.576 | de,en,fr | 3 |
| 32 | team15 | run2 | 0.5664 | de,en,fr | 3 |
| 33 | team6 | run2 | 0.5633 | de,en,fr | 3 |
| 34 | team2 | run3 | 0.5562 | de,en,fr | 3 |
| 35 | team6 | run1 | 0.5544 | de,en,fr | 3 |
| 36 | team15 | run3 | 0.5493 | de,en,fr | 3 |
| 37 | team7 | run3 | 0.5484 | de,en,fr | 3 |
| 38 | team5 | run3 | 0.5426 | de,en,fr | 3 |
| 39 | team7 | run2 | 0.5374 | de,en,fr | 3 |
| 40 | team7 | run1 | 0.5338 | de,en,fr | 3 |
| 41 | team10 | run1 | 0.5252 | de,en,fr | 3 |
| 42 | team5 | run1 | 0.5245 | de,en,fr | 3 |
| 43 | team15 | run1 | 0.5215 | de,en,fr | 3 |
| 44 | team14 | run2 | 0.5155 | de,en,fr | 3 |
| 45 | team4 | run1 | 0.4988 | de,en,fr | 3 |
| 46 | random | run1 | 0.4913 | de,en,fr | 3 |
Top 3 teams by best run:
- Spinfo (team13), run1, table rank 1, mean impresso profile score 0.8419
- whereami (team12), run2, table rank 3, mean impresso profile score 0.8156
- Awakened (team1), run3, table rank 6, mean impresso profile score 0.7925
Only team runs that submitted all impresso language files are included in this overall ranking. Team runs with partial submissions are shown only in the dataset-specific ranking tables.
Accuracy Profile Ranking German
| rank | team | run | submission | impresso profile score | at macro recall | at accuracy | isAt macro recall | isAt accuracy | diagnostics |
|---|---|---|---|---|---|---|---|---|---|
| 1 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.8721 | 0.9048 | 0.9202 | 0.8394 | 0.8866 | Comparison / Metrics |
| 2 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.8599 | 0.8999 | 0.9118 | 0.8199 | 0.8782 | Comparison / Metrics |
| 3 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.8508 | 0.8862 | 0.895 | 0.8154 | 0.8782 | Comparison / Metrics |
| 4 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.8472 | 0.8911 | 0.9034 | 0.8034 | 0.8739 | Comparison / Metrics |
| 5 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.8209 | 0.7922 | 0.8067 | 0.8497 | 0.8361 | Comparison / Metrics |
| 6 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.8088 | 0.8556 | 0.8277 | 0.7621 | 0.8277 | Comparison / Metrics |
| 7 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.8071 | 0.8644 | 0.8361 | 0.7498 | 0.8361 | Comparison / Metrics |
| 8 | team8 | run1 | team8_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.7885 | 0.8379 | 0.8529 | 0.7391 | 0.8403 | Comparison / Metrics |
| 9 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.7866 | 0.8671 | 0.8739 | 0.706 | 0.8319 | Comparison / Metrics |
| 10 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.7562 | 0.7744 | 0.7479 | 0.738 | 0.7605 | Comparison / Metrics |
| 11 | team8 | run2 | team8_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.7395 | 0.7967 | 0.8319 | 0.6823 | 0.8109 | Comparison / Metrics |
| 12 | team8 | run3 | team8_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.7387 | 0.7967 | 0.8319 | 0.6807 | 0.8151 | Comparison / Metrics |
| 13 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.7345 | 0.8868 | 0.8908 | 0.5821 | 0.7647 | Comparison / Metrics |
| 14 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.7216 | 0.8387 | 0.8613 | 0.6045 | 0.7773 | Comparison / Metrics |
| 15 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.7166 | 0.7219 | 0.7353 | 0.7112 | 0.8067 | Comparison / Metrics |
| 16 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.692 | 0.7408 | 0.7017 | 0.6433 | 0.7353 | Comparison / Metrics |
| 17 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.6885 | 0.6918 | 0.7353 | 0.6852 | 0.7563 | Comparison / Metrics |
| 18 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.6621 | 0.7149 | 0.7563 | 0.6093 | 0.7647 | Comparison / Metrics |
| 19 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.6578 | 0.7028 | 0.7143 | 0.6129 | 0.7437 | Comparison / Metrics |
| 20 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.655 | 0.7831 | 0.7857 | 0.5269 | 0.7311 | Comparison / Metrics |
| 21 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.646 | 0.7575 | 0.7395 | 0.5344 | 0.7353 | Comparison / Metrics |
| 22 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.6455 | 0.7027 | 0.6723 | 0.5882 | 0.7605 | Comparison / Metrics |
| 23 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.6381 | 0.6312 | 0.6807 | 0.645 | 0.7899 | Comparison / Metrics |
| 24 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.6231 | 0.7102 | 0.7185 | 0.536 | 0.7311 | Comparison / Metrics |
| 25 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.6192 | 0.7115 | 0.7521 | 0.5269 | 0.7311 | Comparison / Metrics |
| 26 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.6158 | 0.6175 | 0.6639 | 0.6141 | 0.6933 | Comparison / Metrics |
| 27 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.6139 | 0.6235 | 0.6639 | 0.6043 | 0.6597 | Comparison / Metrics |
| 28 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.6102 | 0.7038 | 0.7353 | 0.5165 | 0.7227 | Comparison / Metrics |
| 29 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.6094 | 0.6106 | 0.6555 | 0.6082 | 0.6849 | Comparison / Metrics |
| 30 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.6031 | 0.6038 | 0.6471 | 0.6024 | 0.6765 | Comparison / Metrics |
| 31 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.592 | 0.5861 | 0.5588 | 0.5978 | 0.6765 | Comparison / Metrics |
| 32 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.5885 | 0.5847 | 0.5546 | 0.5923 | 0.6555 | Comparison / Metrics |
| 33 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.5795 | 0.6358 | 0.6765 | 0.5233 | 0.6933 | Comparison / Metrics |
| 34 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.5782 | 0.5661 | 0.6008 | 0.5904 | 0.6723 | Comparison / Metrics |
| 35 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.5707 | 0.6336 | 0.6639 | 0.5078 | 0.7101 | Comparison / Metrics |
| 36 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.5598 | 0.5746 | 0.5126 | 0.545 | 0.4244 | Comparison / Metrics |
| 37 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.553 | 0.5542 | 0.458 | 0.5517 | 0.395 | Comparison / Metrics |
| 38 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.5503 | 0.5517 | 0.5042 | 0.5488 | 0.6387 | Comparison / Metrics |
| 39 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.549 | 0.572 | 0.6303 | 0.5261 | 0.5798 | Comparison / Metrics |
| 40 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.5329 | 0.5435 | 0.6471 | 0.5224 | 0.7311 | Comparison / Metrics |
| 41 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.532 | 0.564 | 0.5588 | 0.5 | 0.7185 | Comparison / Metrics |
| 42 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 0.5187 | 0.5237 | 0.6303 | 0.5136 | 0.7185 | Comparison / Metrics |
| 43 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 0.5167 | 0.511 | 0.5924 | 0.5224 | 0.7311 | Comparison / Metrics |
| 44 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.5 | 0.5 | 0.6134 | 0.5 | 0.7185 | Comparison / Metrics |
| 45 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.4906 | 0.4818 | 0.416 | 0.4995 | 0.6134 | Comparison / Metrics |
| 46 | random | run1 | random_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 0.4783 | 0.4661 | 0.4412 | 0.4906 | 0.4832 | Comparison / Metrics |
Top 3 teams by best run:
- Spinfo (team13), run3, table rank 1, impresso profile score 0.8721
- whereami (team12), run1, table rank 3, impresso profile score 0.8508
- INSA Lyon (team17), run1, table rank 5, impresso profile score 0.8209
Accuracy Profile Ranking English
| rank | team | run | submission | impresso profile score | at macro recall | at accuracy | isAt macro recall | isAt accuracy | diagnostics |
|---|---|---|---|---|---|---|---|---|---|
| 1 | team8 | run1 | team8_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.8102 | 0.8642 | 0.8642 | 0.7562 | 0.7901 | Comparison / Metrics |
| 2 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.8058 | 0.8194 | 0.821 | 0.7921 | 0.821 | Comparison / Metrics |
| 3 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.803 | 0.8061 | 0.8086 | 0.7998 | 0.8272 | Comparison / Metrics |
| 4 | team8 | run2 | team8_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.8024 | 0.8897 | 0.8827 | 0.7151 | 0.7531 | Comparison / Metrics |
| 5 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.8016 | 0.8291 | 0.8333 | 0.7741 | 0.8025 | Comparison / Metrics |
| 6 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.7982 | 0.8145 | 0.8148 | 0.7819 | 0.8148 | Comparison / Metrics |
| 7 | team8 | run3 | team8_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.7875 | 0.8521 | 0.858 | 0.7229 | 0.7654 | Comparison / Metrics |
| 8 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.7605 | 0.8569 | 0.8457 | 0.664 | 0.7222 | Comparison / Metrics |
| 9 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.7471 | 0.8122 | 0.8395 | 0.682 | 0.7407 | Comparison / Metrics |
| 10 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.7279 | 0.7406 | 0.7346 | 0.7151 | 0.7531 | Comparison / Metrics |
| 11 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.7138 | 0.7868 | 0.821 | 0.6408 | 0.6914 | Comparison / Metrics |
| 12 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.7119 | 0.7674 | 0.7963 | 0.6564 | 0.7222 | Comparison / Metrics |
| 13 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.7031 | 0.6837 | 0.7037 | 0.7225 | 0.7346 | Comparison / Metrics |
| 14 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.696 | 0.7176 | 0.7284 | 0.6743 | 0.7346 | Comparison / Metrics |
| 15 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.6841 | 0.7453 | 0.7037 | 0.623 | 0.6914 | Comparison / Metrics |
| 16 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.6803 | 0.6887 | 0.7469 | 0.6718 | 0.7346 | Comparison / Metrics |
| 17 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.6786 | 0.7068 | 0.7469 | 0.6504 | 0.6543 | Comparison / Metrics |
| 18 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.6645 | 0.6983 | 0.7222 | 0.6308 | 0.7037 | Comparison / Metrics |
| 19 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.6638 | 0.6945 | 0.6852 | 0.6331 | 0.6852 | Comparison / Metrics |
| 20 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.6566 | 0.7261 | 0.7346 | 0.5871 | 0.6605 | Comparison / Metrics |
| 21 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.6524 | 0.633 | 0.6852 | 0.6718 | 0.7346 | Comparison / Metrics |
| 22 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.6351 | 0.604 | 0.6852 | 0.6663 | 0.7037 | Comparison / Metrics |
| 23 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.6315 | 0.6282 | 0.6975 | 0.6349 | 0.6296 | Comparison / Metrics |
| 24 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.6247 | 0.6752 | 0.679 | 0.5741 | 0.642 | Comparison / Metrics |
| 25 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.6209 | 0.6316 | 0.642 | 0.6102 | 0.6852 | Comparison / Metrics |
| 26 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.613 | 0.612 | 0.5802 | 0.6141 | 0.5926 | Comparison / Metrics |
| 27 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.6044 | 0.5867 | 0.5988 | 0.6221 | 0.6235 | Comparison / Metrics |
| 28 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.5904 | 0.6193 | 0.5988 | 0.5615 | 0.6481 | Comparison / Metrics |
| 29 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.5889 | 0.5193 | 0.6235 | 0.6586 | 0.6975 | Comparison / Metrics |
| 30 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.5756 | 0.5782 | 0.5926 | 0.573 | 0.5556 | Comparison / Metrics |
| 31 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.5673 | 0.531 | 0.5556 | 0.6036 | 0.5679 | Comparison / Metrics |
| 32 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.5529 | 0.5592 | 0.642 | 0.5467 | 0.4815 | Comparison / Metrics |
| 33 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.5511 | 0.4803 | 0.537 | 0.622 | 0.6173 | Comparison / Metrics |
| 34 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.5451 | 0.4985 | 0.5741 | 0.5916 | 0.6173 | Comparison / Metrics |
| 35 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.5445 | 0.5191 | 0.5864 | 0.5699 | 0.5123 | Comparison / Metrics |
| 36 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.5382 | 0.5559 | 0.4444 | 0.5205 | 0.6111 | Comparison / Metrics |
| 37 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.5317 | 0.554 | 0.5617 | 0.5095 | 0.5494 | Comparison / Metrics |
| 38 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.5307 | 0.5415 | 0.463 | 0.52 | 0.5741 | Comparison / Metrics |
| 39 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.5267 | 0.5329 | 0.4383 | 0.5205 | 0.6111 | Comparison / Metrics |
| 40 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 0.518 | 0.5366 | 0.4568 | 0.4994 | 0.5556 | Comparison / Metrics |
| 41 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.518 | 0.536 | 0.5988 | 0.5 | 0.5988 | Comparison / Metrics |
| 42 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.5143 | 0.4951 | 0.6296 | 0.5335 | 0.4444 | Comparison / Metrics |
| 43 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 0.5126 | 0.4571 | 0.4938 | 0.568 | 0.5617 | Comparison / Metrics |
| 44 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.5042 | 0.5245 | 0.4506 | 0.484 | 0.537 | Comparison / Metrics |
| 45 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.497 | 0.4951 | 0.6296 | 0.4989 | 0.5123 | Comparison / Metrics |
| 46 | team16 | run1 | team16_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.4939 | 0.4628 | 0.4136 | 0.5249 | 0.5617 | Comparison / Metrics |
| 47 | random | run1 | random_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 0.462 | 0.4305 | 0.4691 | 0.4935 | 0.4877 | Comparison / Metrics |
Top 3 teams by best run:
- MaxFo-Ajie (team8), run1, table rank 1, impresso profile score 0.8102
- whereami (team12), run2, table rank 2, impresso profile score 0.8058
- Spinfo (team13), run1, table rank 3, impresso profile score 0.803
Accuracy Profile Ranking French
| rank | team | run | submission | impresso profile score | at macro recall | at accuracy | isAt macro recall | isAt accuracy | diagnostics |
|---|---|---|---|---|---|---|---|---|---|
| 1 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.8628 | 0.8938 | 0.895 | 0.8318 | 0.8529 | Comparison / Metrics |
| 2 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.8523 | 0.9056 | 0.9034 | 0.7991 | 0.8571 | Comparison / Metrics |
| 3 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.837 | 0.8607 | 0.8613 | 0.8132 | 0.8403 | Comparison / Metrics |
| 4 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.836 | 0.9237 | 0.9202 | 0.7483 | 0.7941 | Comparison / Metrics |
| 5 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.8232 | 0.9285 | 0.9244 | 0.7179 | 0.7815 | Comparison / Metrics |
| 6 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7955 | 0.851 | 0.8613 | 0.74 | 0.8067 | Comparison / Metrics |
| 7 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7939 | 0.851 | 0.8613 | 0.7368 | 0.8025 | Comparison / Metrics |
| 8 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7706 | 0.8965 | 0.8992 | 0.6448 | 0.7521 | Comparison / Metrics |
| 9 | team8 | run2 | team8_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7684 | 0.827 | 0.8403 | 0.7099 | 0.8025 | Comparison / Metrics |
| 10 | team8 | run1 | team8_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7654 | 0.8271 | 0.8319 | 0.7037 | 0.7983 | Comparison / Metrics |
| 11 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7648 | 0.7149 | 0.7311 | 0.8147 | 0.8067 | Comparison / Metrics |
| 12 | team8 | run3 | team8_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.7539 | 0.835 | 0.8529 | 0.6728 | 0.7773 | Comparison / Metrics |
| 13 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7418 | 0.8078 | 0.8235 | 0.6758 | 0.7773 | Comparison / Metrics |
| 14 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7409 | 0.7462 | 0.7227 | 0.7356 | 0.7773 | Comparison / Metrics |
| 15 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.7409 | 0.748 | 0.7647 | 0.7338 | 0.8025 | Comparison / Metrics |
| 16 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7329 | 0.7904 | 0.7773 | 0.6754 | 0.7689 | Comparison / Metrics |
| 17 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7318 | 0.7448 | 0.7563 | 0.7187 | 0.7353 | Comparison / Metrics |
| 18 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7278 | 0.7254 | 0.7017 | 0.7302 | 0.7269 | Comparison / Metrics |
| 19 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.7239 | 0.7674 | 0.7647 | 0.6804 | 0.7479 | Comparison / Metrics |
| 20 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.713 | 0.7944 | 0.8109 | 0.6316 | 0.7269 | Comparison / Metrics |
| 21 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.7074 | 0.7402 | 0.7353 | 0.6746 | 0.7521 | Comparison / Metrics |
| 22 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.7033 | 0.793 | 0.7899 | 0.6137 | 0.7269 | Comparison / Metrics |
| 23 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.6538 | 0.7584 | 0.7437 | 0.5492 | 0.6891 | Comparison / Metrics |
| 24 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.642 | 0.7133 | 0.7269 | 0.5707 | 0.7017 | Comparison / Metrics |
| 25 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.6219 | 0.6249 | 0.6261 | 0.6189 | 0.6471 | Comparison / Metrics |
| 26 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.6165 | 0.6261 | 0.6092 | 0.607 | 0.584 | Comparison / Metrics |
| 27 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.611 | 0.71 | 0.7353 | 0.512 | 0.6597 | Comparison / Metrics |
| 28 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.6033 | 0.6513 | 0.6849 | 0.5554 | 0.6933 | Comparison / Metrics |
| 29 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.6012 | 0.5669 | 0.5462 | 0.6354 | 0.6807 | Comparison / Metrics |
| 30 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.5865 | 0.5927 | 0.5462 | 0.5802 | 0.458 | Comparison / Metrics |
| 31 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.5658 | 0.5311 | 0.5714 | 0.6004 | 0.6345 | Comparison / Metrics |
| 32 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5619 | 0.5555 | 0.5504 | 0.5683 | 0.6513 | Comparison / Metrics |
| 33 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.5606 | 0.6134 | 0.6387 | 0.5078 | 0.6345 | Comparison / Metrics |
| 34 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.5605 | 0.5442 | 0.4916 | 0.5768 | 0.4496 | Comparison / Metrics |
| 35 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.5564 | 0.5356 | 0.4916 | 0.5773 | 0.6513 | Comparison / Metrics |
| 36 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5433 | 0.5697 | 0.5966 | 0.5169 | 0.6387 | Comparison / Metrics |
| 37 | random | run1 | random_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5334 | 0.5407 | 0.5252 | 0.5262 | 0.5168 | Comparison / Metrics |
| 38 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.529 | 0.5613 | 0.5714 | 0.4967 | 0.5882 | Comparison / Metrics |
| 39 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.5231 | 0.5753 | 0.563 | 0.4708 | 0.4832 | Comparison / Metrics |
| 40 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5144 | 0.5288 | 0.5252 | 0.5 | 0.6597 | Comparison / Metrics |
| 41 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5088 | 0.512 | 0.4832 | 0.5056 | 0.5882 | Comparison / Metrics |
| 42 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.5044 | 0.5059 | 0.5672 | 0.503 | 0.6597 | Comparison / Metrics |
| 43 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.5031 | 0.5194 | 0.563 | 0.4869 | 0.6345 | Comparison / Metrics |
| 44 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 0.4987 | 0.487 | 0.5084 | 0.5104 | 0.563 | Comparison / Metrics |
| 45 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 0.491 | 0.4742 | 0.5 | 0.5078 | 0.5714 | Comparison / Metrics |
| 46 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 0.4877 | 0.4774 | 0.5 | 0.4981 | 0.5546 | Comparison / Metrics |
Top 3 teams by best run:
- Spinfo (team13), run1, table rank 1, impresso profile score 0.8628
- Awakened (team1), run1, table rank 4, impresso profile score 0.836
- whereami (team12), run1, table rank 6, impresso profile score 0.7955
Generalization Profile Ranking
| rank | team | run | submission | surprise profile score | diagnostics |
|---|---|---|---|---|---|
| 1 | team8 | run1 | team8_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.9182 | Comparison / Metrics |
| 1 | team8 | run3 | team8_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.9182 | Comparison / Metrics |
| 2 | team8 | run2 | team8_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.9163 | Comparison / Metrics |
| 3 | team10 | run1 | team10_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8804 | Comparison / Metrics |
| 4 | team13 | run1 | team13_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8764 | Comparison / Metrics |
| 5 | team13 | run2 | team13_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8688 | Comparison / Metrics |
| 6 | team13 | run3 | team13_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.8667 | Comparison / Metrics |
| 7 | team12 | run2 | team12_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8557 | Comparison / Metrics |
| 8 | team11 | run2 | team11_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8546 | Comparison / Metrics |
| 9 | team1 | run3 | team1_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.852 | Comparison / Metrics |
| 10 | team1 | run1 | team1_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8485 | Comparison / Metrics |
| 11 | team12 | run1 | team12_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8461 | Comparison / Metrics |
| 12 | team11 | run1 | team11_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8357 | Comparison / Metrics |
| 13 | team9 | run3 | team9_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.832 | Comparison / Metrics |
| 14 | team3 | run3 | team3_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.8034 | Comparison / Metrics |
| 15 | team9 | run1 | team9_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.7972 | Comparison / Metrics |
| 16 | team9 | run2 | team9_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.794 | Comparison / Metrics |
| 17 | team11 | run3 | team11_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.7444 | Comparison / Metrics |
| 18 | team3 | run2 | team3_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7425 | Comparison / Metrics |
| 19 | team14 | run3 | team14_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.727 | Comparison / Metrics |
| 20 | team10 | run3 | team10_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.7262 | Comparison / Metrics |
| 21 | team1 | run2 | team1_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7237 | Comparison / Metrics |
| 22 | team5 | run2 | team5_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7207 | Comparison / Metrics |
| 23 | team17 | run1 | team17_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.712 | Comparison / Metrics |
| 24 | team14 | run1 | team14_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6909 | Comparison / Metrics |
| 25 | baseline | run1 | baseline_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6678 | Comparison / Metrics |
| 26 | team10 | run2 | team10_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.6473 | Comparison / Metrics |
| 27 | team3 | run1 | team3_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6368 | Comparison / Metrics |
| 28 | team17 | run2 | team17_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.6158 | Comparison / Metrics |
| 29 | team2 | run3 | team2_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5777 | Comparison / Metrics |
| 30 | team14 | run2 | team14_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5743 | Comparison / Metrics |
| 31 | team17 | run3 | team17_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5715 | Comparison / Metrics |
| 32 | team6 | run2 | team6_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5608 | Comparison / Metrics |
| 33 | team15 | run2 | team15_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5596 | Comparison / Metrics |
| 34 | team7 | run1 | team7_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5593 | Comparison / Metrics |
| 35 | team15 | run1 | team15_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5583 | Comparison / Metrics |
| 36 | team7 | run3 | team7_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5557 | Comparison / Metrics |
| 37 | team7 | run2 | team7_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5533 | Comparison / Metrics |
| 38 | team2 | run2 | team2_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5492 | Comparison / Metrics |
| 39 | random | run1 | random_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5344 | Comparison / Metrics |
| 40 | team15 | run3 | team15_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5265 | Comparison / Metrics |
| 41 | team2 | run1 | team2_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.523 | Comparison / Metrics |
| 42 | team4 | run1 | team4_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5118 | Comparison / Metrics |
| 42 | team6 | run1 | team6_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5118 | Comparison / Metrics |
| 43 | team5 | run3 | team5_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5026 | Comparison / Metrics |
| 44 | team5 | run1 | team5_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5 | Comparison / Metrics |
Top 3 teams by best run:
- MaxFo-Ajie (team8), run1, table rank 1, surprise profile score 0.9182
- BIU_NLP (team10), run1, table rank 3, surprise profile score 0.8804
- Spinfo (team13), run1, table rank 4, surprise profile score 0.8764
Generalization Profile Ranking French
| rank | team | run | submission | surprise profile score | at macro recall | at accuracy | diagnostics |
|---|---|---|---|---|---|---|---|
| 1 | team8 | run1 | team8_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.9182 | 0.9182 | 0.9187 | Comparison / Metrics |
| 1 | team8 | run3 | team8_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.9182 | 0.9182 | 0.9187 | Comparison / Metrics |
| 2 | team8 | run2 | team8_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.9163 | 0.9163 | 0.9208 | Comparison / Metrics |
| 3 | team10 | run1 | team10_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8804 | 0.8804 | 0.8917 | Comparison / Metrics |
| 4 | team13 | run1 | team13_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8764 | 0.8764 | 0.8792 | Comparison / Metrics |
| 5 | team13 | run2 | team13_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8688 | 0.8688 | 0.8667 | Comparison / Metrics |
| 6 | team13 | run3 | team13_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.8667 | 0.8667 | 0.8729 | Comparison / Metrics |
| 7 | team12 | run2 | team12_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8557 | 0.8557 | 0.875 | Comparison / Metrics |
| 8 | team11 | run2 | team11_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.8546 | 0.8546 | 0.8583 | Comparison / Metrics |
| 9 | team1 | run3 | team1_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.852 | 0.852 | 0.8354 | Comparison / Metrics |
| 10 | team1 | run1 | team1_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8485 | 0.8485 | 0.8333 | Comparison / Metrics |
| 11 | team12 | run1 | team12_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8461 | 0.8461 | 0.8667 | Comparison / Metrics |
| 12 | team11 | run1 | team11_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.8357 | 0.8357 | 0.8562 | Comparison / Metrics |
| 13 | team9 | run3 | team9_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.832 | 0.832 | 0.8354 | Comparison / Metrics |
| 14 | team3 | run3 | team3_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.8034 | 0.8034 | 0.8271 | Comparison / Metrics |
| 15 | team9 | run1 | team9_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.7972 | 0.7972 | 0.8021 | Comparison / Metrics |
| 16 | team9 | run2 | team9_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.794 | 0.794 | 0.7708 | Comparison / Metrics |
| 17 | team11 | run3 | team11_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.7444 | 0.7444 | 0.7646 | Comparison / Metrics |
| 18 | team3 | run2 | team3_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7425 | 0.7425 | 0.7667 | Comparison / Metrics |
| 19 | team14 | run3 | team14_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.727 | 0.727 | 0.7063 | Comparison / Metrics |
| 20 | team10 | run3 | team10_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.7262 | 0.7262 | 0.6813 | Comparison / Metrics |
| 21 | team1 | run2 | team1_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7237 | 0.7237 | 0.6979 | Comparison / Metrics |
| 22 | team5 | run2 | team5_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.7207 | 0.7207 | 0.6833 | Comparison / Metrics |
| 23 | team17 | run1 | team17_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.712 | 0.712 | 0.7375 | Comparison / Metrics |
| 24 | team14 | run1 | team14_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6909 | 0.6909 | 0.7208 | Comparison / Metrics |
| 25 | baseline | run1 | baseline_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6678 | 0.6678 | 0.6479 | Comparison / Metrics |
| 26 | team10 | run2 | team10_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.6473 | 0.6473 | 0.5771 | Comparison / Metrics |
| 27 | team3 | run1 | team3_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.6368 | 0.6368 | 0.6729 | Comparison / Metrics |
| 28 | team17 | run2 | team17_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.6158 | 0.6158 | 0.675 | Comparison / Metrics |
| 29 | team2 | run3 | team2_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5777 | 0.5777 | 0.5708 | Comparison / Metrics |
| 30 | team14 | run2 | team14_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5743 | 0.5743 | 0.6271 | Comparison / Metrics |
| 31 | team17 | run3 | team17_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5715 | 0.5715 | 0.65 | Comparison / Metrics |
| 32 | team6 | run2 | team6_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5608 | 0.5608 | 0.5417 | Comparison / Metrics |
| 33 | team15 | run2 | team15_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5596 | 0.5596 | 0.4854 | Comparison / Metrics |
| 34 | team7 | run1 | team7_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5593 | 0.5593 | 0.5979 | Comparison / Metrics |
| 35 | team15 | run1 | team15_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5583 | 0.5583 | 0.5375 | Comparison / Metrics |
| 36 | team7 | run3 | team7_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5557 | 0.5557 | 0.5958 | Comparison / Metrics |
| 37 | team7 | run2 | team7_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5533 | 0.5533 | 0.5896 | Comparison / Metrics |
| 38 | team2 | run2 | team2_HIPE-2026-v1.0-surprise-test-fr_run2.jsonl | 0.5492 | 0.5492 | 0.5583 | Comparison / Metrics |
| 39 | random | run1 | random_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5344 | 0.5344 | 0.5021 | Comparison / Metrics |
| 40 | team15 | run3 | team15_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5265 | 0.5265 | 0.5375 | Comparison / Metrics |
| 41 | team2 | run1 | team2_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.523 | 0.523 | 0.4708 | Comparison / Metrics |
| 42 | team4 | run1 | team4_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5118 | 0.5118 | 0.4167 | Comparison / Metrics |
| 42 | team6 | run1 | team6_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5118 | 0.5118 | 0.5833 | Comparison / Metrics |
| 43 | team5 | run3 | team5_HIPE-2026-v1.0-surprise-test-fr_run3.jsonl | 0.5026 | 0.5026 | 0.4188 | Comparison / Metrics |
| 44 | team5 | run1 | team5_HIPE-2026-v1.0-surprise-test-fr_run1.jsonl | 0.5 | 0.5 | 0.6042 | Comparison / Metrics |
Top 3 teams by best run:
- MaxFo-Ajie (team8), run1, table rank 1, surprise profile score 0.9182
- BIU_NLP (team10), run1, table rank 3, surprise profile score 0.8804
- Spinfo (team13), run1, table rank 4, surprise profile score 0.8764
Efficiency Profile Ranking Overall
| rank | team | run | mean efficiency profile rank | rank impresso profile score | rank hipe parameter count | rank hipe model size | mean impresso profile score | hipe parameter count | hipe model size mb |
|---|---|---|---|---|---|---|---|---|---|
| 1 | team15 | run2 | 10.3333 | 29 | 1 | 1 | 0.5664 | 0 | 0 |
| 2 | team14 | run3 | 10.6667 | 15 | 8 | 9 | 0.6782 | 277730309 | 1111 |
| 3 | team2 | run1 | 11 | 24 | 5 | 4 | 0.6065 | 2087375 | 87 |
| 4 | team2 | run2 | 12 | 27 | 5 | 4 | 0.5819 | 2087375 | 87 |
| 5 | team2 | run3 | 13.3333 | 31 | 5 | 4 | 0.5562 | 2087375 | 87 |
| 6 | team14 | run1 | 13.6667 | 18 | 12 | 11 | 0.6653 | 466577920 | 1780 |
| 6 | team7 | run1 | 13.6667 | 37 | 2 | 2 | 0.5338 | 12279 | 0.8 |
| 6 | team7 | run3 | 13.6667 | 34 | 4 | 3 | 0.5484 | 12399 | 0.81 |
| 7 | team7 | run2 | 14 | 36 | 3 | 3 | 0.5374 | 12365 | 0.81 |
| 8 | team17 | run2 | 14.3333 | 26 | 9 | 8 | 0.5853 | 278043651 | 1061 |
| 9 | random | run1 | 15 | 43 | 1 | 1 | 0.4913 | 0 | 0 |
| 10 | team1 | run2 | 15.3333 | 19 | 14 | 13 | 0.6558 | 560965127 | 2140 |
| 11 | baseline | run1 | 15.6667 | 14 | 19 | 14 | 0.6818 | 3000000000 | 2147.023 |
| 12 | team12 | run2 | 16.3333 | 3 | 22 | 24 | 0.8156 | 5123178979 | 9600 |
| 12 | team3 | run3 | 16.3333 | 13 | 20 | 16 | 0.6864 | 4000000000 | 2840 |
| 13 | team12 | run1 | 16.6667 | 4 | 22 | 24 | 0.8148 | 5123178979 | 9600 |
| 14 | team15 | run1 | 17 | 40 | 6 | 5 | 0.5215 | 208935168 | 816 |
| 14 | team6 | run2 | 17 | 30 | 11 | 10 | 0.5633 | 355000000 | 1424 |
| 15 | team5 | run1 | 17.3333 | 39 | 7 | 6 | 0.5245 | 270000000 | 1030 |
| 16 | team11 | run2 | 17.6667 | 9 | 21 | 23 | 0.739 | 4465470464 | 9012 |
| 16 | team6 | run1 | 17.6667 | 32 | 11 | 10 | 0.5544 | 355000000 | 1424 |
| 17 | team5 | run2 | 18 | 12 | 20 | 22 | 0.7052 | 4000000000 | 7600 |
| 18 | team3 | run1 | 18.6667 | 25 | 16 | 15 | 0.6005 | 1500000000 | 2340 |
| 19 | team11 | run3 | 19.3333 | 23 | 17 | 18 | 0.6145 | 1949101888 | 3845 |
| 20 | team4 | run1 | 19.6667 | 42 | 10 | 7 | 0.4988 | 278054405 | 1060 |
| 21 | team17 | run3 | 20 | 28 | 15 | 17 | 0.576 | 838778678 | 3217 |
| 21 | team9 | run1 | 20 | 20 | 19 | 21 | 0.6539 | 3000000000 | 6248 |
| 22 | team13 | run1 | 20.3333 | 1 | 30 | 30 | 0.8419 | 116830000000 | 65238 |
| 23 | team11 | run1 | 20.6667 | 11 | 25 | 26 | 0.7198 | 9300029952 | 18398 |
| 23 | team13 | run3 | 20.6667 | 2 | 30 | 30 | 0.8369 | 116830000000 | 65238 |
| 24 | team13 | run2 | 21.6667 | 5 | 30 | 30 | 0.7998 | 116830000000 | 65238 |
| 24 | team3 | run2 | 21.6667 | 22 | 23 | 20 | 0.6259 | 5900000000 | 5980 |
| 24 | team9 | run2 | 21.6667 | 16 | 24 | 25 | 0.6765 | 7000000000 | 15300 |
| 25 | team14 | run2 | 22 | 41 | 13 | 12 | 0.5155 | 466585989 | 1866 |
| 26 | team17 | run1 | 22.6667 | 8 | 29 | 31 | 0.7629 | 101927226758 | 195716 |
| 27 | team15 | run3 | 23.3333 | 33 | 18 | 19 | 0.5493 | 2274069824 | 4442 |
| 28 | team1 | run3 | 23.6667 | 6 | 32 | 33 | 0.7925 | 999999999999 | 999999 |
| 29 | team1 | run1 | 24 | 7 | 32 | 33 | 0.7862 | 999999999999 | 999999 |
| 30 | team9 | run3 | 24.3333 | 10 | 31 | 32 | 0.7284 | 120000000000 | 240000 |
| 31 | team10 | run2 | 24.6667 | 17 | 28 | 29 | 0.6721 | 27000000000 | 54000 |
| 31 | team10 | run3 | 24.6667 | 21 | 26 | 27 | 0.6274 | 24000000000 | 48000 |
| 32 | team5 | run3 | 25.6667 | 35 | 20 | 22 | 0.5426 | 4000000000 | 7600 |
| 33 | team10 | run1 | 31 | 38 | 27 | 28 | 0.5252 | 26000000000 | 52000 |
Top 3 teams by best run:
- FI-CODE (team15), run2, table rank 1, mean efficiency profile rank 10.3333
- MILRIT (team14), run3, table rank 2, mean efficiency profile rank 10.6667
- DS@GT_HIPE (team2), run1, table rank 3, mean efficiency profile rank 11
Balanced Efficiency Profile Ranking Overall
| rank | team | run | balanced efficiency profile rank | rank impresso profile score | rank hipe parameter count | rank hipe model size | mean impresso profile score | hipe parameter count | hipe model size mb |
|---|---|---|---|---|---|---|---|---|---|
| 1 | team14 | run3 | 11.75 | 15 | 8 | 9 | 0.6782 | 277730309 | 1111 |
| 2 | team12 | run2 | 13 | 3 | 22 | 24 | 0.8156 | 5123178979 | 9600 |
| 3 | team12 | run1 | 13.5 | 4 | 22 | 24 | 0.8148 | 5123178979 | 9600 |
| 4 | team2 | run1 | 14.25 | 24 | 5 | 4 | 0.6065 | 2087375 | 87 |
| 5 | team14 | run1 | 14.75 | 18 | 12 | 11 | 0.6653 | 466577920 | 1780 |
| 6 | team15 | run2 | 15 | 29 | 1 | 1 | 0.5664 | 0 | 0 |
| 7 | baseline | run1 | 15.25 | 14 | 19 | 14 | 0.6818 | 3000000000 | 2147.023 |
| 8 | team11 | run2 | 15.5 | 9 | 21 | 23 | 0.739 | 4465470464 | 9012 |
| 8 | team13 | run1 | 15.5 | 1 | 30 | 30 | 0.8419 | 116830000000 | 65238 |
| 8 | team3 | run3 | 15.5 | 13 | 20 | 16 | 0.6864 | 4000000000 | 2840 |
| 9 | team2 | run2 | 15.75 | 27 | 5 | 4 | 0.5819 | 2087375 | 87 |
| 10 | team13 | run3 | 16 | 2 | 30 | 30 | 0.8369 | 116830000000 | 65238 |
| 11 | team1 | run2 | 16.25 | 19 | 14 | 13 | 0.6558 | 560965127 | 2140 |
| 12 | team5 | run2 | 16.5 | 12 | 20 | 22 | 0.7052 | 4000000000 | 7600 |
| 13 | team17 | run2 | 17.25 | 26 | 9 | 8 | 0.5853 | 278043651 | 1061 |
| 14 | team13 | run2 | 17.5 | 5 | 30 | 30 | 0.7998 | 116830000000 | 65238 |
| 15 | team2 | run3 | 17.75 | 31 | 5 | 4 | 0.5562 | 2087375 | 87 |
| 16 | team11 | run1 | 18.25 | 11 | 25 | 26 | 0.7198 | 9300029952 | 18398 |
| 17 | team7 | run3 | 18.75 | 34 | 4 | 3 | 0.5484 | 12399 | 0.81 |
| 18 | team17 | run1 | 19 | 8 | 29 | 31 | 0.7629 | 101927226758 | 195716 |
| 19 | team1 | run3 | 19.25 | 6 | 32 | 33 | 0.7925 | 999999999999 | 999999 |
| 20 | team7 | run1 | 19.5 | 37 | 2 | 2 | 0.5338 | 12279 | 0.8 |
| 20 | team7 | run2 | 19.5 | 36 | 3 | 3 | 0.5374 | 12365 | 0.81 |
| 21 | team1 | run1 | 19.75 | 7 | 32 | 33 | 0.7862 | 999999999999 | 999999 |
| 22 | team9 | run1 | 20 | 20 | 19 | 21 | 0.6539 | 3000000000 | 6248 |
| 23 | team11 | run3 | 20.25 | 23 | 17 | 18 | 0.6145 | 1949101888 | 3845 |
| 23 | team3 | run1 | 20.25 | 25 | 16 | 15 | 0.6005 | 1500000000 | 2340 |
| 23 | team6 | run2 | 20.25 | 30 | 11 | 10 | 0.5633 | 355000000 | 1424 |
| 23 | team9 | run2 | 20.25 | 16 | 24 | 25 | 0.6765 | 7000000000 | 15300 |
| 24 | team9 | run3 | 20.75 | 10 | 31 | 32 | 0.7284 | 120000000000 | 240000 |
| 25 | team6 | run1 | 21.25 | 32 | 11 | 10 | 0.5544 | 355000000 | 1424 |
| 26 | team3 | run2 | 21.75 | 22 | 23 | 20 | 0.6259 | 5900000000 | 5980 |
| 27 | random | run1 | 22 | 43 | 1 | 1 | 0.4913 | 0 | 0 |
| 27 | team17 | run3 | 22 | 28 | 15 | 17 | 0.576 | 838778678 | 3217 |
| 28 | team10 | run2 | 22.75 | 17 | 28 | 29 | 0.6721 | 27000000000 | 54000 |
| 28 | team15 | run1 | 22.75 | 40 | 6 | 5 | 0.5215 | 208935168 | 816 |
| 28 | team5 | run1 | 22.75 | 39 | 7 | 6 | 0.5245 | 270000000 | 1030 |
| 29 | team10 | run3 | 23.75 | 21 | 26 | 27 | 0.6274 | 24000000000 | 48000 |
| 30 | team4 | run1 | 25.25 | 42 | 10 | 7 | 0.4988 | 278054405 | 1060 |
| 31 | team15 | run3 | 25.75 | 33 | 18 | 19 | 0.5493 | 2274069824 | 4442 |
| 32 | team14 | run2 | 26.75 | 41 | 13 | 12 | 0.5155 | 466585989 | 1866 |
| 33 | team5 | run3 | 28 | 35 | 20 | 22 | 0.5426 | 4000000000 | 7600 |
| 34 | team10 | run1 | 32.75 | 38 | 27 | 28 | 0.5252 | 26000000000 | 52000 |
Top 3 teams by best run:
- MILRIT (team14), run3, table rank 1, balanced efficiency profile rank 11.75
- whereami (team12), run2, table rank 2, balanced efficiency profile rank 13
- DS@GT_HIPE (team2), run1, table rank 4, balanced efficiency profile rank 14.25
This is an additional analysis ranking. It is not the guideline-defined Efficiency Profile Ranking; it gives equal total weight to accuracy and to the combined resource ranks.
Efficiency Profile Ranking German
| rank | team | run | submission | mean efficiency profile rank | rank impresso profile score | rank hipe parameter count | rank hipe model size | impresso profile score | hipe parameter count | hipe model size mb | diagnostics |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 10 | 13 | 8 | 9 | 0.692 | 277730309 | 1111 | Comparison / Metrics |
| 1 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 10 | 26 | 2 | 2 | 0.6094 | 12279 | 0.8 | Comparison / Metrics |
| 1 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 10 | 23 | 4 | 3 | 0.6158 | 12399 | 0.81 | Comparison / Metrics |
| 2 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 11 | 24 | 5 | 4 | 0.6139 | 2087375 | 87 | Comparison / Metrics |
| 2 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 11 | 27 | 3 | 3 | 0.6031 | 12365 | 0.81 | Comparison / Metrics |
| 3 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 11.6667 | 33 | 1 | 1 | 0.5598 | 0 | 0 | Comparison / Metrics |
| 4 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 12.3333 | 14 | 12 | 11 | 0.6885 | 466577920 | 1780 | Comparison / Metrics |
| 4 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 12.3333 | 28 | 5 | 4 | 0.592 | 2087375 | 87 | Comparison / Metrics |
| 5 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 13.3333 | 31 | 5 | 4 | 0.5782 | 2087375 | 87 | Comparison / Metrics |
| 6 | random | run1 | random_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 15 | 43 | 1 | 1 | 0.4783 | 0 | 0 | Comparison / Metrics |
| 7 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 15.3333 | 19 | 14 | 13 | 0.6455 | 560965127 | 2140 | Comparison / Metrics |
| 8 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 16.3333 | 16 | 19 | 14 | 0.6578 | 3000000000 | 2147.023 | Comparison / Metrics |
| 8 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 16.3333 | 3 | 22 | 24 | 0.8508 | 5123178979 | 9600 | Comparison / Metrics |
| 8 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 16.3333 | 38 | 6 | 5 | 0.532 | 208935168 | 816 | Comparison / Metrics |
| 8 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 16.3333 | 32 | 9 | 8 | 0.5707 | 278043651 | 1061 | Comparison / Metrics |
| 9 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 16.6667 | 4 | 22 | 24 | 0.8472 | 5123178979 | 9600 | Comparison / Metrics |
| 9 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 16.6667 | 29 | 11 | 10 | 0.5885 | 355000000 | 1424 | Comparison / Metrics |
| 10 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 17 | 15 | 20 | 16 | 0.6621 | 4000000000 | 2840 | Comparison / Metrics |
| 10 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 17 | 9 | 20 | 22 | 0.7562 | 4000000000 | 7600 | Comparison / Metrics |
| 11 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 18 | 10 | 21 | 23 | 0.7345 | 4465470464 | 9012 | Comparison / Metrics |
| 11 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 18 | 41 | 7 | 6 | 0.5 | 270000000 | 1030 | Comparison / Metrics |
| 12 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 18.6667 | 35 | 11 | 10 | 0.5503 | 355000000 | 1424 | Comparison / Metrics |
| 13 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 19 | 22 | 17 | 18 | 0.6192 | 1949101888 | 3845 | Comparison / Metrics |
| 13 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 19 | 17 | 19 | 21 | 0.655 | 3000000000 | 6248 | Comparison / Metrics |
| 14 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 19.6667 | 42 | 10 | 7 | 0.4906 | 278054405 | 1060 | Comparison / Metrics |
| 15 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 20.3333 | 1 | 30 | 30 | 0.8721 | 116830000000 | 65238 | Comparison / Metrics |
| 15 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 20.3333 | 30 | 16 | 15 | 0.5795 | 1500000000 | 2340 | Comparison / Metrics |
| 16 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 20.6667 | 11 | 25 | 26 | 0.7216 | 9300029952 | 18398 | Comparison / Metrics |
| 16 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 20.6667 | 2 | 30 | 30 | 0.8599 | 116830000000 | 65238 | Comparison / Metrics |
| 17 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 21.6667 | 40 | 13 | 12 | 0.5167 | 466585989 | 1866 | Comparison / Metrics |
| 17 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 21.6667 | 5 | 29 | 31 | 0.8209 | 101927226758 | 195716 | Comparison / Metrics |
| 18 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 22.3333 | 18 | 24 | 25 | 0.646 | 7000000000 | 15300 | Comparison / Metrics |
| 19 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 22.6667 | 8 | 30 | 30 | 0.7866 | 116830000000 | 65238 | Comparison / Metrics |
| 19 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 22.6667 | 36 | 15 | 17 | 0.549 | 838778678 | 3217 | Comparison / Metrics |
| 19 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 22.6667 | 25 | 23 | 20 | 0.6102 | 5900000000 | 5980 | Comparison / Metrics |
| 20 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 23.6667 | 6 | 32 | 33 | 0.8088 | 999999999999 | 999999 | Comparison / Metrics |
| 21 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 24 | 7 | 32 | 33 | 0.8071 | 999999999999 | 999999 | Comparison / Metrics |
| 22 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 24.3333 | 20 | 26 | 27 | 0.6381 | 24000000000 | 48000 | Comparison / Metrics |
| 23 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 25 | 12 | 31 | 32 | 0.7166 | 120000000000 | 240000 | Comparison / Metrics |
| 24 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 25.3333 | 39 | 18 | 19 | 0.5187 | 2274069824 | 4442 | Comparison / Metrics |
| 24 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-de_run3.jsonl | 25.3333 | 34 | 20 | 22 | 0.553 | 4000000000 | 7600 | Comparison / Metrics |
| 25 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-de_run2.jsonl | 26 | 21 | 28 | 29 | 0.6231 | 27000000000 | 54000 | Comparison / Metrics |
| 26 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-de_run1.jsonl | 30.6667 | 37 | 27 | 28 | 0.5329 | 26000000000 | 52000 | Comparison / Metrics |
Top 3 teams by best run:
- MILRIT (team14), run3, table rank 1, mean efficiency profile rank 10
- ROSTI (team7), run1, table rank 1, mean efficiency profile rank 10
- DS@GT_HIPE (team2), run1, table rank 2, mean efficiency profile rank 11
Efficiency Profile Ranking English
| rank | team | run | submission | mean efficiency profile rank | rank impresso profile score | rank hipe parameter count | rank hipe model size | impresso profile score | hipe parameter count | hipe model size mb | diagnostics |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 10.3333 | 29 | 1 | 1 | 0.5529 | 0 | 0 | Comparison / Metrics |
| 2 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 11.3333 | 24 | 6 | 4 | 0.6044 | 2087375 | 87 | Comparison / Metrics |
| 3 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 12.6667 | 19 | 9 | 10 | 0.6351 | 277730309 | 1111 | Comparison / Metrics |
| 3 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 12.6667 | 28 | 6 | 4 | 0.5673 | 2087375 | 87 | Comparison / Metrics |
| 4 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 13.3333 | 21 | 10 | 9 | 0.6247 | 278043651 | 1061 | Comparison / Metrics |
| 5 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 14.3333 | 35 | 5 | 3 | 0.5307 | 12399 | 0.81 | Comparison / Metrics |
| 6 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 14.6667 | 34 | 6 | 4 | 0.5317 | 2087375 | 87 | Comparison / Metrics |
| 6 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 14.6667 | 37 | 4 | 3 | 0.518 | 12365 | 0.81 | Comparison / Metrics |
| 7 | random | run1 | random_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 15.3333 | 44 | 1 | 1 | 0.462 | 0 | 0 | Comparison / Metrics |
| 7 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 15.3333 | 41 | 3 | 2 | 0.5042 | 12279 | 0.8 | Comparison / Metrics |
| 8 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 15.6667 | 14 | 17 | 16 | 0.6786 | 1500000000 | 2340 | Comparison / Metrics |
| 8 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 15.6667 | 32 | 8 | 7 | 0.5445 | 270000000 | 1030 | Comparison / Metrics |
| 9 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 16.3333 | 1 | 23 | 25 | 0.8058 | 5123178979 | 9600 | Comparison / Metrics |
| 10 | team16 | run1 | team16_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 16.6667 | 43 | 2 | 5 | 0.4939 | 110 | 433 | Comparison / Metrics |
| 10 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 16.6667 | 12 | 21 | 17 | 0.6841 | 4000000000 | 2840 | Comparison / Metrics |
| 11 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 17 | 16 | 20 | 15 | 0.6638 | 3000000000 | 2147.023 | Comparison / Metrics |
| 11 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 17 | 38 | 7 | 6 | 0.518 | 208935168 | 816 | Comparison / Metrics |
| 12 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 17.3333 | 4 | 23 | 25 | 0.7982 | 5123178979 | 9600 | Comparison / Metrics |
| 12 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 17.3333 | 27 | 13 | 12 | 0.5756 | 466577920 | 1780 | Comparison / Metrics |
| 13 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 17.6667 | 30 | 12 | 11 | 0.5511 | 355000000 | 1424 | Comparison / Metrics |
| 14 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 18 | 31 | 12 | 11 | 0.5451 | 355000000 | 1424 | Comparison / Metrics |
| 15 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 18.3333 | 26 | 15 | 14 | 0.5889 | 560965127 | 2140 | Comparison / Metrics |
| 15 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 18.3333 | 9 | 22 | 24 | 0.7119 | 4465470464 | 9012 | Comparison / Metrics |
| 16 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 19 | 23 | 16 | 18 | 0.613 | 838778678 | 3217 | Comparison / Metrics |
| 16 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 19 | 15 | 20 | 22 | 0.6645 | 3000000000 | 6248 | Comparison / Metrics |
| 17 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 19.6667 | 22 | 18 | 19 | 0.6209 | 1949101888 | 3845 | Comparison / Metrics |
| 18 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 20.3333 | 42 | 11 | 8 | 0.497 | 278054405 | 1060 | Comparison / Metrics |
| 19 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 20.6667 | 17 | 24 | 21 | 0.6566 | 5900000000 | 5980 | Comparison / Metrics |
| 20 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 21 | 36 | 14 | 13 | 0.5267 | 466585989 | 1866 | Comparison / Metrics |
| 21 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 21.3333 | 11 | 26 | 27 | 0.696 | 9300029952 | 18398 | Comparison / Metrics |
| 21 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 21.3333 | 2 | 31 | 31 | 0.803 | 116830000000 | 65238 | Comparison / Metrics |
| 21 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 21.3333 | 20 | 21 | 23 | 0.6315 | 4000000000 | 7600 | Comparison / Metrics |
| 21 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 21.3333 | 13 | 25 | 26 | 0.6803 | 7000000000 | 15300 | Comparison / Metrics |
| 22 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 21.6667 | 3 | 31 | 31 | 0.8016 | 116830000000 | 65238 | Comparison / Metrics |
| 23 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 22.3333 | 5 | 31 | 31 | 0.7605 | 116830000000 | 65238 | Comparison / Metrics |
| 24 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 24 | 10 | 30 | 32 | 0.7031 | 101927226758 | 195716 | Comparison / Metrics |
| 24 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 24 | 7 | 32 | 33 | 0.7279 | 120000000000 | 240000 | Comparison / Metrics |
| 25 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 24.3333 | 6 | 33 | 34 | 0.7471 | 999999999999 | 999999 | Comparison / Metrics |
| 26 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 25 | 8 | 33 | 34 | 0.7138 | 999999999999 | 999999 | Comparison / Metrics |
| 27 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-en_run2.jsonl | 25.6667 | 18 | 29 | 30 | 0.6524 | 27000000000 | 54000 | Comparison / Metrics |
| 28 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 26.3333 | 40 | 19 | 20 | 0.5126 | 2274069824 | 4442 | Comparison / Metrics |
| 29 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 26.6667 | 25 | 27 | 28 | 0.5904 | 24000000000 | 48000 | Comparison / Metrics |
| 30 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-en_run3.jsonl | 27.6667 | 39 | 21 | 23 | 0.5143 | 4000000000 | 7600 | Comparison / Metrics |
| 31 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-en_run1.jsonl | 30 | 33 | 28 | 29 | 0.5382 | 26000000000 | 52000 | Comparison / Metrics |
Top 3 teams by best run:
- FI-CODE (team15), run2, table rank 1, mean efficiency profile rank 10.3333
- DS@GT_HIPE (team2), run1, table rank 2, mean efficiency profile rank 11.3333
- MILRIT (team14), run3, table rank 3, mean efficiency profile rank 12.6667
Efficiency Profile Ranking French
| rank | team | run | submission | mean efficiency profile rank | rank impresso profile score | rank hipe parameter count | rank hipe model size | impresso profile score | hipe parameter count | hipe model size mb | diagnostics |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | team15 | run2 | team15_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 9.6667 | 27 | 1 | 1 | 0.5865 | 0 | 0 | Comparison / Metrics |
| 2 | team2 | run2 | team2_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 10.3333 | 22 | 5 | 4 | 0.6219 | 2087375 | 87 | Comparison / Metrics |
| 3 | team14 | run3 | team14_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 11.6667 | 18 | 8 | 9 | 0.7074 | 277730309 | 1111 | Comparison / Metrics |
| 3 | team2 | run1 | team2_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 11.6667 | 26 | 5 | 4 | 0.6012 | 2087375 | 87 | Comparison / Metrics |
| 4 | random | run1 | random_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 12 | 34 | 1 | 1 | 0.5334 | 0 | 0 | Comparison / Metrics |
| 5 | team14 | run1 | team14_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 12.3333 | 14 | 12 | 11 | 0.7318 | 466577920 | 1780 | Comparison / Metrics |
| 6 | team1 | run2 | team1_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 13.3333 | 13 | 14 | 13 | 0.7329 | 560965127 | 2140 | Comparison / Metrics |
| 7 | team2 | run3 | team2_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 15 | 36 | 5 | 4 | 0.5231 | 2087375 | 87 | Comparison / Metrics |
| 8 | team17 | run2 | team17_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 15.6667 | 30 | 9 | 8 | 0.5606 | 278043651 | 1061 | Comparison / Metrics |
| 8 | team7 | run1 | team7_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 15.6667 | 43 | 2 | 2 | 0.4877 | 12279 | 0.8 | Comparison / Metrics |
| 9 | team15 | run1 | team15_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 16 | 37 | 6 | 5 | 0.5144 | 208935168 | 816 | Comparison / Metrics |
| 9 | team5 | run1 | team5_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 16 | 35 | 7 | 6 | 0.529 | 270000000 | 1030 | Comparison / Metrics |
| 9 | team7 | run2 | team7_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 16 | 42 | 3 | 3 | 0.491 | 12365 | 0.81 | Comparison / Metrics |
| 9 | team7 | run3 | team7_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 16 | 41 | 4 | 3 | 0.4987 | 12399 | 0.81 | Comparison / Metrics |
| 10 | baseline | run1 | baseline_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 16.3333 | 16 | 19 | 14 | 0.7239 | 3000000000 | 2147.023 | Comparison / Metrics |
| 11 | team6 | run1 | team6_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 16.6667 | 29 | 11 | 10 | 0.5619 | 355000000 | 1424 | Comparison / Metrics |
| 12 | team11 | run2 | team11_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 17.3333 | 8 | 21 | 23 | 0.7706 | 4465470464 | 9012 | Comparison / Metrics |
| 12 | team12 | run1 | team12_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 17.3333 | 6 | 22 | 24 | 0.7955 | 5123178979 | 9600 | Comparison / Metrics |
| 13 | team12 | run2 | team12_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 17.6667 | 7 | 22 | 24 | 0.7939 | 5123178979 | 9600 | Comparison / Metrics |
| 13 | team3 | run3 | team3_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 17.6667 | 17 | 20 | 16 | 0.713 | 4000000000 | 2840 | Comparison / Metrics |
| 13 | team6 | run2 | team6_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 17.6667 | 32 | 11 | 10 | 0.5564 | 355000000 | 1424 | Comparison / Metrics |
| 14 | team4 | run1 | team4_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 18.3333 | 38 | 10 | 7 | 0.5088 | 278054405 | 1060 | Comparison / Metrics |
| 15 | team5 | run2 | team5_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 19 | 15 | 20 | 22 | 0.7278 | 4000000000 | 7600 | Comparison / Metrics |
| 16 | team11 | run3 | team11_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 20 | 25 | 17 | 18 | 0.6033 | 1949101888 | 3845 | Comparison / Metrics |
| 16 | team15 | run3 | team15_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 20 | 23 | 18 | 19 | 0.6165 | 2274069824 | 4442 | Comparison / Metrics |
| 16 | team17 | run3 | team17_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 20 | 28 | 15 | 17 | 0.5658 | 838778678 | 3217 | Comparison / Metrics |
| 17 | team11 | run1 | team11_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 20.3333 | 10 | 25 | 26 | 0.7418 | 9300029952 | 18398 | Comparison / Metrics |
| 17 | team13 | run1 | team13_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 20.3333 | 1 | 30 | 30 | 0.8628 | 116830000000 | 65238 | Comparison / Metrics |
| 17 | team9 | run1 | team9_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 20.3333 | 21 | 19 | 21 | 0.642 | 3000000000 | 6248 | Comparison / Metrics |
| 18 | team13 | run2 | team13_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 20.6667 | 2 | 30 | 30 | 0.8523 | 116830000000 | 65238 | Comparison / Metrics |
| 19 | team13 | run3 | team13_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 21 | 3 | 30 | 30 | 0.837 | 116830000000 | 65238 | Comparison / Metrics |
| 20 | team3 | run1 | team3_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 21.3333 | 33 | 16 | 15 | 0.5433 | 1500000000 | 2340 | Comparison / Metrics |
| 21 | team14 | run2 | team14_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 21.6667 | 40 | 13 | 12 | 0.5031 | 466585989 | 1866 | Comparison / Metrics |
| 22 | team3 | run2 | team3_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 22.3333 | 24 | 23 | 20 | 0.611 | 5900000000 | 5980 | Comparison / Metrics |
| 23 | team10 | run2 | team10_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 22.6667 | 11 | 28 | 29 | 0.7409 | 27000000000 | 54000 | Comparison / Metrics |
| 23 | team9 | run2 | team9_HIPE-2026-v1.0-impresso-test-fr_run2.jsonl | 22.6667 | 19 | 24 | 25 | 0.7033 | 7000000000 | 15300 | Comparison / Metrics |
| 24 | team1 | run1 | team1_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 23 | 4 | 32 | 33 | 0.836 | 999999999999 | 999999 | Comparison / Metrics |
| 24 | team17 | run1 | team17_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 23 | 9 | 29 | 31 | 0.7648 | 101927226758 | 195716 | Comparison / Metrics |
| 25 | team1 | run3 | team1_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 23.3333 | 5 | 32 | 33 | 0.8232 | 999999999999 | 999999 | Comparison / Metrics |
| 26 | team10 | run3 | team10_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 24.3333 | 20 | 26 | 27 | 0.6538 | 24000000000 | 48000 | Comparison / Metrics |
| 26 | team5 | run3 | team5_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 24.3333 | 31 | 20 | 22 | 0.5605 | 4000000000 | 7600 | Comparison / Metrics |
| 27 | team9 | run3 | team9_HIPE-2026-v1.0-impresso-test-fr_run3.jsonl | 25 | 12 | 31 | 32 | 0.7409 | 120000000000 | 240000 | Comparison / Metrics |
| 28 | team10 | run1 | team10_HIPE-2026-v1.0-impresso-test-fr_run1.jsonl | 31.3333 | 39 | 27 | 28 | 0.5044 | 26000000000 | 52000 | Comparison / Metrics |
Top 3 teams by best run:
- FI-CODE (team15), run2, table rank 1, mean efficiency profile rank 9.6667
- DS@GT_HIPE (team2), run2, table rank 2, mean efficiency profile rank 10.3333
- MILRIT (team14), run3, table rank 3, mean efficiency profile rank 11.6667