I was doing some experiments with CLIP's visual transformer encoder output (clip-ViT-B-32). Given the same scene or image, it should output nearly the same image feature vector, since it is a semantic model. But it turns out to be very sensitive to illumination and lighting conditions, which surprises me: the similarity between the images below is much lower than I expected (it reports only 89.45% similarity).
Why is that? Are there any methods/models that are less sensitive to illumination changes and more semantics-based?
from sentence_transformers import SentenceTransformer, util
#......
model = SentenceTransformer('clip-ViT-B-32')
# `image` is a list of PIL.Image objects loaded elsewhere
encoded_image = model.encode(image, batch_size=128, convert_to_tensor=True, show_progress_bar=True)
# Now we run the mining step. This function compares each image against
# all other images and returns a list of pairs with the highest
# cosine similarity scores.
processed_images = util.paraphrase_mining_embeddings(encoded_image)
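For reference, the "percentage" figure is just the cosine similarity of the two embedding vectors scaled to 100. A minimal sketch with dummy 512-dimensional vectors standing in for `model.encode` output (clip-ViT-B-32 embeddings are 512-dimensional; the perturbation here is only a stand-in for the effect of a lighting change, not a model of it):

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Dummy embeddings in place of real model.encode(image) output.
rng = np.random.default_rng(0)
emb_a = rng.standard_normal(512)
# A perturbed copy of emb_a, loosely standing in for the same scene re-encoded
# under different conditions.
emb_b = emb_a + 0.5 * rng.standard_normal(512)

print(f"identical:  {100 * cosine_similarity(emb_a, emb_a):.2f}%")
print(f"perturbed:  {100 * cosine_similarity(emb_a, emb_b):.2f}%")
```

Even a modest perturbation of the vector pulls the score noticeably below 100%, which is why two photos of the same scene can land near 89% rather than ~100%.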