| Main Focus | Confidence | Explanation |
|---|---|---|
| Main Object | 4 | Focus is on the bowl with soup. |
| Separate Object | 3 | Focus is on the soup, not the bowl. |
| Separate Object | 4 | The red region mainly focuses on the soup, rather than the predicted object. |
| Main Object | 3 | The red region in the highlighted visual attributes focuses on the food in a soup bowl. |
| Main Object | 4 | Focus is on the contents of the soup bowl. |