Commit 724486e

rerun notebook, added baseline, models, prompt engineering
1 parent d6ef351 commit 724486e

3 files changed: +391 -369 lines changed

docs/80_benchmarking_llms/20_vision_models.ipynb

Lines changed: 339 additions & 343 deletions
Large diffs are not rendered by default.
Lines changed: 26 additions & 26 deletions
@@ -1,26 +1,26 @@
-,gpt-4-turbo-2024-04-09,gpt-4o-2024-05-13,gpt-4o-2024-08-06,gpt-4o-2024-11-20
-0,93,56,55,67
-1,61,52,55,73
-2,54,62,53,50
-3,49,53,70,72
-4,68,73,57,74
-5,53,58,50,53
-6,54,52,56,67
-7,64,56,68,68
-8,61,47,50,52
-9,51,54,53,61
-10,46,65,63,92
-11,53,58,54,57
-12,63,53,46,58
-13,58,47,52,54
-14,76,66,49,64
-15,62,53,68,72
-16,47,48,54,97
-17,83,57,53,78
-18,54,51,49,64
-19,56,61,54,67
-20,73,62,49,53
-21,49,58,58,80
-22,60,58,57,62
-23,54,48,46,70
-24,64,49,65,68
+,gpt-4-turbo-2024-04-09,gpt-4o-2024-05-13,gpt-4o-2024-08-06,gpt-4o-2024-11-20,gpt-4.1-mini-2025-04-14,gpt-4.1-nano-2025-04-14
+0,52,58,47,58,52,54
+1,49,57,47,49,48,64
+2,53,49,52,54,72,53
+3,68,49,53,57,61,75
+4,63,72,49,67,49,65
+5,56,47,52,54,54,77
+6,63,54,53,52,62,34
+7,68,93,56,56,54,53
+8,49,70,57,64,54,41
+9,53,56,58,63,67,35
+10,61,78,53,68,49,37
+11,57,53,54,51,76,48
+12,46,68,52,66,49,51
+13,67,79,62,72,59,60
+14,51,55,54,53,67,58
+15,53,87,50,78,56,47
+16,93,68,63,62,44,41
+17,52,58,54,72,45,54
+18,52,68,53,74,52,66
+19,49,55,62,64,54,56
+20,52,63,62,58,49,44
+21,57,63,54,57,52,41
+22,51,60,52,65,52,57
+23,72,59,70,48,44,53
+24,42,48,55,63,59,41
Lines changed: 26 additions & 0 deletions
@@ -0,0 +1,26 @@
+,unspecific,w/o border,with border
+0,61.0,30.0,48
+1,57.0,47.0,56
+2,52.0,50.0,57
+3,65.0,43.0,58
+4,59.0,31.0,58
+5,52.0,,68
+6,67.0,27.0,61
+7,58.0,26.0,66
+8,60.0,41.0,47
+9,57.0,38.0,54
+10,56.0,31.0,61
+11,54.0,43.0,58
+12,52.0,44.0,49
+13,59.0,,62
+14,57.0,55.0,52
+15,67.0,42.0,57
+16,54.0,38.0,81
+17,55.0,30.0,55
+18,47.0,36.0,46
+19,63.0,39.0,48
+20,41.0,18.0,52
+21,,37.0,58
+22,51.0,43.0,63
+23,52.0,35.0,47
+24,56.0,44.0,48
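
Note on reading this file: three rows have an empty cell (rows 5 and 13 in "w/o border", row 21 in "unspecific"), so any per-column summary has to skip missing values rather than treat them as zero. The sketch below is one possible way to do that using only the Python standard library. It is not taken from the notebook; the helper name `column_means` is hypothetical, the sample is just the first six data rows copied from the diff, and the diff does not say what the numbers measure.

```python
import csv
import io
from statistics import mean

# Header plus the first six data rows of the added file, copied from the diff.
SAMPLE = """\
,unspecific,w/o border,with border
0,61.0,30.0,48
1,57.0,47.0,56
2,52.0,50.0,57
3,65.0,43.0,58
4,59.0,31.0,58
5,52.0,,68
"""


def column_means(text):
    """Return {column name: mean of its non-empty cells} (hypothetical helper)."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)[1:]  # drop the unnamed index column
    columns = {name: [] for name in header}
    for row in reader:
        for name, cell in zip(header, row[1:]):
            if cell.strip():  # skip empty cells, i.e. missing values
                columns[name].append(float(cell))
    # Assumes every column has at least one value; mean() raises otherwise.
    return {name: mean(values) for name, values in columns.items()}


means = column_means(SAMPLE)
for name, value in means.items():
    print(f"{name}: {value:.1f}")
```

On this six-row sample the "w/o border" mean is taken over five values, because row 5 has no entry in that column.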
