Pythonでビット単位の排他的または2つの文字列を行う方法は？

Question

私はPythonでビットごとの排他的または2つの文字列を実行したいのですが、Pythonでは文字列のxorは許可されていません。どうすればいいですか？

Mark Byers · Accepted Answer

文字を整数に変換し、代わりにxorを使用できます：

l = [ord(a) ^ ord(b) for a,b in Zip(s1,s2)]

XORの結果として文字列が必要な場合の更新された関数は次のとおりです。

def sxor(s1,s2): # convert strings to a list of character pair tuples # go through each Tuple, converting them to ASCII code (ord) # perform exclusive or on the ASCII code # then convert the result back to ASCII (chr) # merge the resulting array of characters as a string return ''.join(chr(ord(a) ^ ord(b)) for a,b in Zip(s1,s2))

オンラインで動作することを確認してください： ideone

Duncan · Answer

バイトや単語を操作したい場合は、文字列ではなくPythonの配列型を使用した方が良いでしょう。固定長ブロックで作業している場合、バイトではなくワードでHまたはL形式を使用できる場合がありますが、この例では「B」を使用しました。

>>> import array >>> a1 = array.array('B', 'Hello, World!') >>> a1 array('B', [72, 101, 108, 108, 111, 44, 32, 87, 111, 114, 108, 100, 33]) >>> a2 = array.array('B', ('secret'*3)) >>> for i in range(len(a1)): a1[i] ^= a2[i] >>> a1.tostring() ';\x00\x0f\x1e
XS2\x0c\x00	\x10R'

doep · Answer

バイト配列の場合、XORを直接使用できます。

>>> b1 = bytearray("test123") >>> b2 = bytearray("321test") >>> b = bytearray(len(b1)) >>> for i in range(len(b1)): ... b[i] = b1[i] ^ b2[i] >>> b bytearray(b'GWB\x00TAG')

PaulMcG · Answer

これは、おそらくいくつかの穏やかな暗号化のための文字列XOR'erです。

>>> src = "Hello, World!" >>> code = "secret" >>> xorWord = lambda ss,cc: ''.join(chr(ord(s)^ord(c)) for s,c in Zip(ss,cc*100)) >>> encrypt = xorWord(src, code) >>> encrypt ';\x00\x0f\x1e
XS2\x0c\x00	\x10R' >>> decrypt = xorWord(encrypt,code) >>> print decrypt Hello, World!

これは、極端に弱い形式の暗号化であることに注意してください。空白文字列を指定してエンコードするとどうなるかを見てください：

>>> codebreak = xorWord(" ", code) >>> print codebreak SECRET

yota · Answer

python3の1つのライナーは次のとおりです。

_def bytes_xor(a, b) : return bytes(x ^ y for x, y in Zip(a, b)) _

ここで、a、bおよび戻り値は、もちろんbytes()ではなくstr()です

簡単にすることはできません、私はpython3が大好きです:)

user81779 · Answer

def strxor (s0, s1): l = [ chr ( ord (a) ^ ord (b) ) for a,b in Zip (s0, s1) ] return ''.join (l)

（Mark Byersの回答に基づいています。）

Ashray Malhotra · Answer

文字列の長さが等しくない場合は、これを使用できます

def strxor(a, b): # xor two strings of different lengths if len(a) > len(b): return "".join([chr(ord(x) ^ ord(y)) for (x, y) in Zip(a[:len(b)], b)]) else: return "".join([chr(ord(x) ^ ord(y)) for (x, y) in Zip(a, b[:len(a)])])

satoru · Answer

次のような意味ですか？

s1 = '00000001' s2 = '11111110' int(s1,2) ^ int(s2,2)

Mark Tolonen · Answer

以下は、文字列sとmのXOR演算を示しており、プロセスを逆にしています。

>>> s='hello, world' >>> m='markmarkmark' >>> s=''.join(chr(ord(a)^ord(b)) for a,b in Zip(s,m)) >>> s '\x05\x04\x1e\x07\x02MR\x1c\x02\x13\x1e\x0f' >>> s=''.join(chr(ord(a)^ord(b)) for a,b in Zip(s,m)) >>> s 'hello, world' >>>

mckoss · Answer

def xor_strings(s1, s2): max_len = max(len(s1), len(s2)) s1 += chr(0) * (max_len - len(s1)) s2 += chr(0) * (max_len - len(s2)) return ''.join([chr(ord(c1) ^ ord(c2)) for c1, c2 in Zip(s1, s2)])

William McBrine · Answer

Zip（s、m）のa、bの '' .join（chr（ord（a）^ ord（b）））メソッドはかなり遅いことがわかりました。代わりに、私はこれをやっています：

fmt = '%dB' % len(source) s = struct.unpack(fmt, source) m = struct.unpack(fmt, xor_data) final = struct.pack(fmt, *(a ^ b for a, b in izip(s, m)))

pts · Answer

William McBrineの回答に基づいて、固定長文字列の解決策を以下に示します。これは、私のユースケースで9％高速です。

import itertools import struct def make_strxor(size): def strxor(a, b, izip=itertools.izip, pack=struct.pack, unpack=struct.unpack, fmt='%dB' % size): return pack(fmt, *(a ^ b for a, b in izip(unpack(fmt, a), unpack(fmt, b)))) return strxor strxor_3 = make_strxor(3) print repr(strxor_3('foo', 'bar'))