Examples of triplets from our dataset.
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Humans
DreamSim
CLIP
DINO
LPIPS
Nearest neighbor searches in ImageNet-R and COCO
Ours = DreamSim.
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
LPIPS | DISTS | OpenCLIP | DINO | Ours | |
---|---|---|---|---|---|
Input | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|
Nearest Neighbors | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Inversion visualization using optimization, deep image prior, and guided diffusion.
Ours = DreamSim.
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |
Target | DINO | OpenCLIP | Ensemble | Ours | |
---|---|---|---|---|---|
Run 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 4 | ![]() |
![]() |
![]() |
![]() |
![]() |
Run 5 | ![]() |
![]() |
![]() |
![]() |
![]() |