画像上で優勢な色を見つける

Question

画像のドミナントカラーを見つけたい。このために、私は画像ヒストグラムを使用する必要があることを知っています。しかし、画像の形式はわかりません。 rgb、hsv、またはグレー画像のどれを使用する必要がありますか？

ヒストグラムが計算された後、ヒストグラムで最大値を見つける必要があります。このために、hsv画像の最大binVal値を下回る必要がありますか？結果の画像に黒い色しか含まれていないのはなぜですか？

float binVal = hist.at<float>(h, s);

編集：

以下のコードを試してみました。 h-sヒストグラムを描きます。そして私の結果画像はここにあります。バイナリしきい値の後に何も見つかりません。たぶん、最大ヒストグラム値が間違って見つかります。

enter image description here

cvtColor(src, hsv, CV_BGR2HSV); // Quantize the hue to 30 levels // and the saturation to 32 levels int hbins = 20, sbins = 22; int histSize[] = {hbins, sbins}; // hue varies from 0 to 179, see cvtColor float hranges[] = { 0, 180 }; // saturation varies from 0 (black-gray-white) to // 255 (pure spectrum color) float sranges[] = { 0, 256 }; const float* ranges[] = { hranges, sranges }; MatND hist; // we compute the histogram from the 0-th and 1-st channels int channels[] = {0, 1}; calcHist( &hsv, 1, channels, Mat(), // do not use mask hist, 2, histSize, ranges, true, // the histogram is uniform false ); double maxVal=0; minMaxLoc(hist, 0, &maxVal, 0, 0); int scale = 10; Mat histImg = Mat::zeros(sbins*scale, hbins*10, CV_8UC3); int maxIntensity = -100; for( int h = 0; h < hbins; h++ ) { for( int s = 0; s < sbins; s++ ) { float binVal = hist.at<float>(h, s); int intensity = cvRound(binVal*255/maxVal); rectangle( histImg, Point(h*scale, s*scale), Point( (h+1)*scale - 1, (s+1)*scale - 1), Scalar::all(intensity), CV_FILLED ); if(intensity > maxIntensity) maxIntensity = intensity; } } std::cout << "max Intensity " << maxVal << std::endl; Mat dst; cv::threshold(src, dst, maxIntensity, 255, cv::THRESH_BINARY); namedWindow( "Dest", 1 ); imshow( "Dest", dst ); namedWindow( "Source", 1 ); imshow( "Source", src ); namedWindow( "H-S Histogram", 1 ); imshow( "H-S Histogram", histImg );

zakjma · Accepted Answer

ソリューション

H-Sヒストグラムを見つける
ピークH値を見つける（minmaxLoc関数を使用）
分割画像3チャンネル（h、s、v）
しきい値に適用します。
3チャンネルをマージして画像を作成

mbschenkel · Answer

または、 k-means アプローチを試すこともできます。 kクラスターを計算 with k ~ 2..5そして最大のグループの重心を支配的な色として取ります。

OpenCvのpythonドキュメントには図解された例があり、支配的な色をかなりうまく取得します：

beaker · Answer

ここにあなたが始めるためのいくつかの提案があります。

RGBの3つのチャネルすべてが色に寄与するため、3つの異なるヒストグラムがすべて最大である場所を何らかの方法で把握する必要があります。（またはそれらの合計が最大であるか、または何でも。）
HSVは1つのチャネルにすべての色（色相）情報を持っているので、1つのヒストグラムを考慮するだけで済みます。
グレースケールはすべての色情報を破棄するため、色を見つけるのにほとんど役に立ちません。

HSVに変換してから、Hチャネルのヒストグラムを計算してください。

あなたが言うように、あなたはヒストグラムで最大値を見つけたいと思います。だが：

値の範囲を1つだけではなく、たとえば20-40ではなく30から検討することをお勧めします。さまざまな範囲サイズを試してください。
色相は円形であるため、H=0とH=360は同じであることに注意してください。
次のようにヒストグラムをプロットしてみてください。
http://docs.opencv.org/doc/tutorials/imgproc/histograms/histogram_calculation/histogram_calculation.html
結果に意味があるかどうかを確認します。
色相の範囲を使用していて、最大の範囲を見つけた場合は、その範囲の中央をドミナントカラーとして使用するか、その範囲内の色の平均を見つけてそれを使用することができます。

nathancy · Answer

これがPythonアプローチで K-Means Clustering を使用して、 sklearn.cluster.KMeans() で画像の主要な色を決定します。

入力画像

結果

n_clusters=5を使用すると、最も支配的な色とパーセンテージ分布がここにあります

[14.69488554 34.23074345 41.48107857] 13.67% [141.44980073 207.52576948 236.30722987] 15.69% [ 31.75790423 77.52713644 114.33328324] 18.77% [ 48.41205713 118.34814452 176.43411287] 25.19% [ 84.04820266 161.6848298 217.14045211] 26.69%

各カラークラスターの視覚化

n_clusters=10との類似性、

[ 55.09073171 113.28271003 74.97528455] 3.25% [ 85.36889668 145.80759374 174.59846237] 5.24% [164.17201088 223.34258123 241.81929254] 6.60% [ 9.97315932 22.79468111 22.01822211] 7.16% [19.96940211 47.8375841 72.83728002] 9.27% [ 26.73510467 70.5847759 124.79314278] 10.52% [118.44741779 190.98204701 230.66728334] 13.55% [ 51.61750364 130.59930047 198.76335878] 13.82% [ 41.10232129 104.89923271 160.54431333] 14.53% [ 81.70930412 161.823664 221.10258949] 16.04%

import cv2, numpy as np from sklearn.cluster import KMeans def visualize_colors(cluster, centroids): # Get the number of different clusters, create histogram, and normalize labels = np.arange(0, len(np.unique(cluster.labels_)) + 1) (hist, _) = np.histogram(cluster.labels_, bins = labels) hist = hist.astype("float") hist /= hist.sum() # Create frequency rect and iterate through each cluster's color and percentage rect = np.zeros((50, 300, 3), dtype=np.uint8) colors = sorted([(percent, color) for (percent, color) in Zip(hist, centroids)]) start = 0 for (percent, color) in colors: print(color, "{:0.2f}%".format(percent * 100)) end = start + (percent * 300) cv2.rectangle(rect, (int(start), 0), (int(end), 50), \ color.astype("uint8").tolist(), -1) start = end return rect # Load image and convert to a list of pixels image = cv2.imread('1.jpg') image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB) reshape = image.reshape((image.shape[0] * image.shape[1], 3)) # Find and display most dominant colors cluster = KMeans(n_clusters=5).fit(reshape) visualize = visualize_colors(cluster, cluster.cluster_centers_) visualize = cv2.cvtColor(visualize, cv2.COLOR_RGB2BGR) cv2.imshow('visualize', visualize) cv2.waitKey()