Model | Mean | Min | 1st Quartile | Median | 3rd Quartile | Max | Num Preferred | Prop Preferred
---|---|---|---|---|---|---|---|---
None | 0 | 0 | 0 | 0 | 0 | 0 | 29 | 9.70%
Replicate Clip_Prefix_Caption | 0.80 | 0.69 | 0.71 | 0.72 | 0.74 | 5.10 | 0 | 0.00%
Replicate Image-Captioning-With-Visual-Attention | 1.14 | 0.70 | 0.72 | 0.73 | 0.80 | 71.11 | 0 | 0.00%
Google VertexAI | 1.69 | 1.25 | 1.52 | 1.65 | 1.80 | 2.85 | 54 | 18.06%
Replicate Blip | 1.73 | 0.69 | 0.71 | 0.73 | 1.27 | 59.19 | 5 | 1.67%
Replicate Blip-2 | 2.50 | 1.25 | 1.80 | 1.82 | 2.38 | 13.86 | 56 | 18.73%
Replicate Llava-13b | 3.90 | 1.39 | 2.06 | 2.71 | 3.48 | 41.05 | 117 | 39.13%
Replicate Minigpt-4 | 4.13 | 2.90 | 3.45 | 3.50 | 4.09 | 17.77 | 24 | 8.03%
Replicate Clip-Caption-Reward | 6.55 | 5.08 | 5.66 | 5.70 | 6.22 | 173.86 | 0 | 0.00%
Blip Local | 17.93 | 15.64 | 17.30 | 17.88 | 18.63 | 20.41 | 13 | 4.35%
Replicate Img2Prompt | 32.84 | 21.66 | 23.80 | 24.99 | 46.13 | 103.35 | 0 | 0.00%
Replicate Clip-Interrogator | 63.83 | 29.35 | 47.75 | 58.01 | 72.71 | 215.98 | 1 | 0.33%
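
The Prop Preferred column appears to be each model's Num Preferred count divided by the sum of all counts (299). The sketch below reproduces that arithmetic under this assumption. The names `num_preferred` and `preference_shares` are hypothetical and introduced only for illustration; the counts are copied from the table.

```python
# Num Preferred counts copied from the table; the dict name is illustrative.
num_preferred = {
    "None": 29,
    "Replicate Clip_Prefix_Caption": 0,
    "Replicate Image-Captioning-With-Visual-Attention": 0,
    "Google VertexAI": 54,
    "Replicate Blip": 5,
    "Replicate Blip-2": 56,
    "Replicate Llava-13b": 117,
    "Replicate Minigpt-4": 24,
    "Replicate Clip-Caption-Reward": 0,
    "Blip Local": 13,
    "Replicate Img2Prompt": 0,
    "Replicate Clip-Interrogator": 1,
}


def preference_shares(counts):
    """Return each model's share of all preference votes, in percent.

    Assumes the proportion is count / total across all rows.
    """
    total = sum(counts.values())
    return {model: 100 * n / total for model, n in counts.items()}


shares = preference_shares(num_preferred)

# Print models from most to least preferred.
for model, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{model:<50} {share:6.2f}%")
```

Rounded to two decimals, the computed shares match the Prop Preferred values in the table, which supports reading that column as a share of all 299 preference votes.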