logstash jdbcの複数の入力

Question

私はmysqlとelasticsearchの間の同期を維持するためにlogstash jdbcを使用しています。 1つのテーブルで問題なく動作します。しかし、今は複数のテーブルに対してそれを実行したいと思います。端末で複数を開く必要がありますか

logstash agent -f /Users/logstash/logstash-jdbc.conf

それぞれに選択クエリがありますか、それとも複数のテーブルを更新できるようにするためのより良い方法がありますか？.

私の設定ファイル

input { jdbc { jdbc_driver_library => "/Users/logstash/mysql-connector-Java-5.1.39-bin.jar" jdbc_driver_class => "com.mysql.jdbc.Driver" jdbc_connection_string => "jdbc:mysql://localhost:3306/database_name" jdbc_user => "root" jdbc_password => "password" schedule => "* * * * *" statement => "select * from table1" } } output { elasticsearch { index => "testdb" document_type => "table1" document_id => "%{table_id}" hosts => "localhost:9200" } }

Val · Accepted Answer

複数のjdbc入力を含む単一の構成を確実に作成し、index出力のelasticsearchとdocument_typeを、イベントの送信元のテーブルに応じてパラメーター化することができます。。

input { jdbc { jdbc_driver_library => "/Users/logstash/mysql-connector-Java-5.1.39-bin.jar" jdbc_driver_class => "com.mysql.jdbc.Driver" jdbc_connection_string => "jdbc:mysql://localhost:3306/database_name" jdbc_user => "root" jdbc_password => "password" schedule => "* * * * *" statement => "select * from table1" type => "table1" } jdbc { jdbc_driver_library => "/Users/logstash/mysql-connector-Java-5.1.39-bin.jar" jdbc_driver_class => "com.mysql.jdbc.Driver" jdbc_connection_string => "jdbc:mysql://localhost:3306/database_name" jdbc_user => "root" jdbc_password => "password" schedule => "* * * * *" statement => "select * from table2" type => "table2" } # add more jdbc inputs to suit your needs } output { elasticsearch { index => "testdb" document_type => "%{type}" # <- use the type from each input hosts => "localhost:9200" } }

iNandi · Answer

これは重複データを作成しません。および互換性のあるlogstash 6x。

# YOUR_DATABASE_NAME : test # FIRST_TABLE : place # SECOND_TABLE : things # SET_DATA_INDEX : test_index_1, test_index_2 input { jdbc { # The path to our downloaded jdbc driver jdbc_driver_library => "/mysql-connector-Java-5.1.44-bin.jar" jdbc_driver_class => "com.mysql.jdbc.Driver" # Postgres jdbc connection string to our database, YOUR_DATABASE_NAME jdbc_connection_string => "jdbc:mysql://localhost:3306/test" # The user we wish to execute our statement as jdbc_user => "root" jdbc_password => "" schedule => "* * * * *" statement => "SELECT @slno:=@slno+1 aut_es_1, es_qry_tbl.* FROM (SELECT * FROM `place`) es_qry_tbl, (SELECT @slno:=0) es_tbl" type => "place" add_field => { "queryFunctionName" => "getAllDataFromFirstTable" } use_column_value => true tracking_column => "aut_es_1" } jdbc { # The path to our downloaded jdbc driver jdbc_driver_library => "/mysql-connector-Java-5.1.44-bin.jar" jdbc_driver_class => "com.mysql.jdbc.Driver" # Postgres jdbc connection string to our database, YOUR_DATABASE_NAME jdbc_connection_string => "jdbc:mysql://localhost:3306/test" # The user we wish to execute our statement as jdbc_user => "root" jdbc_password => "" schedule => "* * * * *" statement => "SELECT @slno:=@slno+1 aut_es_2, es_qry_tbl.* FROM (SELECT * FROM `things`) es_qry_tbl, (SELECT @slno:=0) es_tbl" type => "things" add_field => { "queryFunctionName" => "getAllDataFromSecondTable" } use_column_value => true tracking_column => "aut_es_2" } } # install uuid plugin 'bin/logstash-plugin install logstash-filter-uuid' # The uuid filter allows you to generate a UUID and add it as a field to each processed event. filter { mutate { add_field => { "[@metadata][document_id]" => "%{aut_es_1}%{aut_es_2}" } } uuid { target => "uuid" overwrite => true } } output { stdout {codec => rubydebug} if [type] == "place" { elasticsearch { hosts => "localhost:9200" index => "test_index_1_12" #document_id => "%{aut_es_1}" document_id => "%{[@metadata][document_id]}" } } if [type] == "things" { elasticsearch { hosts => "localhost:9200" index => "test_index_2_13" document_id => "%{[@metadata][document_id]}" # document_id => "%{aut_es_2}" # you can set document_id . otherwise ES will genrate unique id. } } }

zabusa · Answer

同じプロセスで複数のパイプラインを実行する必要がある場合、Logstashは、pipelines.ymlと呼ばれる構成ファイルを通じてこれを行う方法を提供し、複数のパイプライン

複数のパイプライン

複数のパイプラインを使用することは、現在の構成に同じ入力/フィルターと出力を共有せず、タグと条件を使用して互いに分離されているイベントフローがある場合に特に便利です。

より役立つリソース