This article introduces and benchmarks PostgreSQL's data insert speed on a mid-to-high-end x86 server (the target tables include an index on a timestamp column), to help enterprise users understand how PostgreSQL performs in this kind of scenario. Typical use cases include telecom gateway data and financial data: generated in huge volumes and required to be inserted quickly into a large database for persistence. In addition, users who need streaming real-time processing can refer to the PostgreSQL-based streaming solution: How is real-time stream processing of 1 trillion records per day achieved? https://yq.aliyun.com/articles/166
The test is designed as follows:
1. Average record length of 360 bytes (a fairly common size).
2. An index is created on the time column.
3. Each test round inserts 12TB of data; after 12TB is inserted, the data is cleared and insertion continues, in a loop.
4. The test stops after running for a full 24 hours.
5. The number of records inserted within the 24 hours is counted.
12 rounds of testing were completed in 24 hours, averaging 7071 seconds per round: 5.06 million rows/s, 1.78 GB/s, for a full-day total of 437.2 billion rows and 154TB of data.
Hardware environment
1. x86 server.
2. 3? cores.
3. 5??GB RAM.
4. Several SSDs, 15TB capacity.
Software environment
1. CentOS 6.x x64.
2. xfs.
3. PostgreSQL 9.5.
System configuration reference:
https://github.com/digoal/pgsql_admin_script/blob/master/pgsql_perf_tuning.md
Database configuration
./configure --prefix=/home/digoal/pgsql9.5.1 --with-blocksize=32 --with-segsize=128 --with-wal-blocksize=32 --with-wal-segsize=64
make && make install

For how to enable hugepages for PostgreSQL, see: https://yq.aliyun.com/articles/8482
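As a minimal sketch (my own addition, based on the 256GB shared_buffers configured below; see the linked article for the full procedure), reserving huge pages on Linux looks roughly like this:

# assumption: 2MB huge pages; 256GB of shared_buffers needs 131072 pages, reserve some headroom
sysctl -w vm.nr_hugepages=135000
echo "vm.nr_hugepages=135000" >> /etc/sysctl.conf   # persist across reboots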
Parameters

listen_addresses = '0.0.0.0'            # what IP address(es) to listen on;
fsync = on
port = 1921                             # (change requires restart)
max_connections = 600                   # (change requires restart)
superuser_reserved_connections = 13     # (change requires restart)
unix_socket_directories = '.'           # comma-separated list of directories
unix_socket_permissions = 0700          # begin with 0 to use octal notation
tcp_keepalives_idle = 60                # TCP_KEEPIDLE, in seconds;
tcp_keepalives_interval = 10            # TCP_KEEPINTVL, in seconds;
tcp_keepalives_count = 10               # TCP_KEEPCNT;
shared_buffers = 256GB                  # min 128kB
huge_pages = on                         # on, off, or try
work_mem = 512MB                        # min 64kB
maintenance_work_mem = 1GB              # min 1MB
autovacuum_work_mem = 1GB               # min 1MB, or -1 to use maintenance_work_mem
dynamic_shared_memory_type = posix      # the default is the first option
bgwriter_delay = 10ms                   # 10-10000ms between rounds
bgwriter_lru_maxpages = 1000            # 0-1000 max buffers written/round
bgwriter_lru_multiplier = 2.0
synchronous_commit = off                # synchronization level;
full_page_writes = on                   # recover from partial page writes
wal_buffers = 2047MB                    # min 32kB, -1 sets based on shared_buffers
wal_writer_delay = 10ms                 # 1-10000 milliseconds
checkpoint_timeout = 55min              # range 30s-1h
max_wal_size = 512GB
checkpoint_completion_target = 0.9      # checkpoint target duration, 0.0 - 1.0
effective_cache_size = 40GB
log_destination = 'csvlog'              # Valid values are combinations of
logging_collector = on                  # Enable capturing of stderr and csvlog
log_directory = 'pg_log'                # directory where log files are written,
log_filename = 'postgresql-%Y-%m-%d_%H%M%S.log'  # log file name pattern,
log_file_mode = 0600                    # creation mode for log files,
log_truncate_on_rotation = on           # If on, an existing log file with the
log_checkpoints = off
log_connections = off
log_disconnections = off
log_error_verbosity = verbose           # terse, default, or verbose messages
log_timezone = 'PRC'
log_autovacuum_min_duration = 0         # -1 disables, 0 logs all actions and
datestyle = 'iso, mdy'
timezone = 'PRC'
lc_messages = 'C'                       # locale for system error message
lc_monetary = 'C'                       # locale for monetary formatting
lc_numeric = 'C'                        # locale for number formatting
lc_time = 'C'                           # locale for time formatting
default_text_search_config = 'pg_catalog.english'
autovacuum = off
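Note (my remark, not in the original text): this parameter set, together with the unlogged tables used below, deliberately trades durability and background maintenance for raw insert speed (synchronous_commit = off, autovacuum = off, no WAL for unlogged tables); a production deployment that needs crash safety would choose differently.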
Create the test table: each 32KB block stores 89 records, each record 360 bytes.

postgres=# select string_agg(i,'') from (select md5(random()::text) i from generate_series(1,10) t(i)) t(i);
 string_agg
----------------------------------------------------------------------
 53d3ec7adbeacc912a45bdd8557b435be848e4b1050dc0f5e46b75703d4745833541b5dabc177db460b6b1493961fc72c478daaaac74bcc89aec4f946a496028d9cff1cc4144f738e01ea36436455c216aa697d87fe1f87ceb49134a687dc69cba34c9951d0c9ce9ca82bba229d56874af40498dca5fd8dfb9c877546db76c35a3362d6bdba6472d3919289b6eaeeab58feb4f6e79592fc1dd8253fd4c588a29
(1 row)

postgres=# create unlogged table test(crt_time timestamp, info text default '53d3ec7adbeacc912a45bdd8557b435be848e4b1050dc0f5e46b75703d4745833541b5dabc177db460b6b1493961fc72c478daaaac74bcc89aec4f946a496028d9cff1cc4144f738e01ea36436455c216aa697d87fe1f87ceb49134a687dc69cba34c9951d0c9ce9ca82bba229d56874af40498dca5fd8dfb9c877546db76c35a3362d6bdba6472d3919289b6eaeeab58feb4f6e79592fc1dd8253fd4c588a29');
postgres=# alter table test alter column info set storage plain;
postgres=# insert into test select now() from generate_series(1,1000);
postgres=# select ctid from test limit 1000;
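The ctid output above can be used to verify the rows-per-block figure; a small sketch of my own (not from the original post) that counts rows per heap block:

-- each 32KB block should hold 89 of these 360-byte rows
select (ctid::text::point)[0]::int as blkno, count(*)
from test
group by 1 order by 1;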
Create three tablespace directories on three separate physical block devices, and create the corresponding tablespaces in the database: tbs1, tbs2, tbs3. Then create multiple partition tables to reduce block extend contention.
do language plpgsql $$
declare
  i int;
  sql text;
begin
  for i in 1..42 loop
    sql := 'create unlogged table test'||i||' (like test including all) tablespace tbs1';
    execute sql;
    sql := 'create index idx_test'||i||' on test'||i||' using brin (crt_time) with (pages_per_range=512) tablespace tbs1';
    execute sql;
  end loop;
  for i in 43..84 loop
    sql := 'create unlogged table test'||i||' (like test including all) tablespace tbs2';
    execute sql;
    sql := 'create index idx_test'||i||' on test'||i||' using brin (crt_time) with (pages_per_range=512) tablespace tbs2';
    execute sql;
  end loop;
  for i in 85..128 loop
    sql := 'create unlogged table test'||i||' (like test including all) tablespace tbs3';
    execute sql;
    sql := 'create index idx_test'||i||' on test'||i||' using brin (crt_time) with (pages_per_range=512) tablespace tbs3';
    execute sql;
  end loop;
end;
$$;

"Black technology" again: the BRIN index access method
The index used here is BRIN, a block-range index: PostgreSQL's "black technology" for IoT-style streaming data.
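A BRIN index keeps just one summary tuple per range of heap pages (here pages_per_range=512), so it stays tiny compared with a B-tree on the same column. A quick way to see this after loading some data (a hypothetical check of mine, not from the original benchmark):

select pg_size_pretty(pg_relation_size('idx_test1')) as brin_index_size,
       pg_size_pretty(pg_relation_size('test1'))     as heap_size;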
postgres=# \di
                 List of relations
 Schema |    Name     | Type  |  Owner   |  Table
--------+-------------+-------+----------+---------
 public | idx_test1   | index | postgres | test1
 public | idx_test10  | index | postgres | test10
 public | idx_test100 | index | postgres | test100
 public | idx_test101 | index | postgres | test101
 public | idx_test102 | index | postgres | test102
 public | idx_test103 | index | postgres | test103
 public | idx_test104 | index | postgres | test104
 public | idx_test105 | index | postgres | test105
 public | idx_test106 | index | postgres | test106
............
 public | idx_test90  | index | postgres | test90
 public | idx_test91  | index | postgres | test91
 public | idx_test92  | index | postgres | test92
 public | idx_test93  | index | postgres | test93
 public | idx_test94  | index | postgres | test94
 public | idx_test95  | index | postgres | test95
 public | idx_test96  | index | postgres | test96
 public | idx_test97  | index | postgres | test97
 public | idx_test98  | index | postgres | test98
 public | idx_test99  | index | postgres | test99
(128 rows)

Generate the test scripts; each connection inserts 178 records per transaction, occupying two 32KB blocks:
vi test.sql

insert into test(crt_time) values
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now()),
(now()),(now()),(now()),(now()),(now()),(now()),(now()),(now());

for ((i=1;i<=128;i++)) do sed "s/test/test$i/" test.sql > ./test$i.sql; done

Before starting the test, clear the data:
do language plpgsql $$
declare
  i int;
  sql text;
begin
  for i in 1..128 loop
    sql := 'truncate test'||i;
    execute sql;
  end loop;
end;
$$;

Test method: each round inserts 12TB of data, controlled as follows:
1. 128 parallel connections are used, each executing 1572864 transactions.
2. In total 201326592 transactions are executed (each transaction inserts 178 records).
3. In total 35836133376 records (35.836 billion) are inserted, about 12TB of data (index space counted separately).
Before the next round starts, the log is written out and all data is TRUNCATEd; the test above is then repeated until a full 24 hours have elapsed, at which point the statistics are output. The test script is as follows:
vi test.sh

#!/bin/bash

if [ $# -ne 5 ]; then
  echo "please use: $0 ip port dbname user pwd"
  exit 1
fi

IP=$1
PORT=$2
DBNAME=$3
USER=$4
PASSWORD=$5

export PGPASSWORD=$PASSWORD

DEP_CMD="psql"
which $DEP_CMD
if [ $? -ne 0 ]; then
  echo -e "dep commands: $DEP_CMD not exist."
  exit 1
fi

truncate() {
psql -h $IP -p $PORT -U $USER $DBNAME <<EOF
do language plpgsql \$\$
declare
  i int;
  sql text;
begin
  for i in 1..128 loop
    sql := 'truncate test'||i;
    execute sql;
  end loop;
end;
\$\$;
checkpoint;
\q
EOF
}

# truncate data first
truncate

START=`date +%s`
echo "`date +%F%T` $START"

for ((x=1;x>0;x++))
do
  # ------------------------------------------------------
  echo "Round $x test start: `date +%F%T` `date +%s`"
  for ((i=1;i<=128;i++))
  do
    pgbench -M prepared -n -r -f ./test$i.sql -h $IP -p $PORT -U $USER $DBNAME -c 1 -j 1 -t 1572864 >>./$i.log 2>&1 &
  done
  wait
  echo "Round $x test end: `date +%F%T` `date +%s`"
  # ------------------------------------------------------
  if [ $((`date +%s`-$START)) -gt 86400 ]; then
    echo "end `date +%F%T` `date +%s`"
    echo "duration second: $((`date +%s`-$START))"
    exit 0
  fi
  echo "Round $x test end, start truncate `date +%F%T` `date +%s`"
  truncate
  echo "Round $x test end, end truncate `date +%F%T` `date +%s`"
done
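As a sanity check of the per-round volume (my own arithmetic, not output produced by the test), the configured transaction counts multiply out to the figures quoted above:

-- 128 connections x 1572864 transactions x 178 rows per transaction
select 128 * 1572864                                      as total_transactions,  -- 201326592
       128 * 1572864 * 178::bigint                        as total_rows,          -- 35836133376
       pg_size_pretty(128 * 1572864 * 178::bigint * 360)  as raw_volume;          -- ~12TB at 360 bytes/row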
Test

nohup ./test.sh xxx.xxx.xxx.xxx 1921 postgres postgres postgres >./test.log 2>&1 &
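While it runs, progress can be followed from the log files the script writes (a trivial usage note of mine; file names as defined in test.sh above):

tail -f test.log   # round start/end timestamps printed by test.sh
tail -f 1.log      # pgbench output for connection 1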
Test results

12 rounds of testing were completed in 24 hours, averaging 7071 seconds per round: 5.06 million rows/s (360 bytes per row), 1.78 GB/s, for a full-day total of 437.2 billion rows and 154TB of data.
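These headline numbers are mutually consistent; a rough back-of-the-envelope check (my own arithmetic):

-- 35836133376 rows per round in ~7071s ≈ 5.07 million rows/s,
-- matching the reported 5.06 million rows/s; at 360 bytes/row that is ~1.8 GB/s
select 35836133376::bigint / 7071 as rows_per_second;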
Query performance
postgres=# select min(crt_time),max(crt_time) from test1;
            min             |            max
----------------------------+----------------------------
 2016-04-08 00:32:26.842728 | 2016-04-08 02:29:41.583367
(1 row)

postgres=# explain select count(*) from test1 where crt_time between '2016-04-08 00:32:00' and '2016-04-08 00:33:00';
                                                                             QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------------------
 Aggregate  (cost=1183919.81..1183919.82 rows=1 width=0)
   ->  Bitmap Heap Scan on test1  (cost=14351.45..1180420.19 rows=1399849 width=0)
         Recheck Cond: ((crt_time >= '2016-04-08 00:32:00'::timestamp without time zone) AND (crt_time <= '2016-04-08 00:33:00'::timestamp without time zone))
         ->  Bitmap Index Scan on idx_test1  (cost=0.00..14001.49 rows=1399849 width=0)
               Index Cond: ((crt_time >= '2016-04-08 00:32:00'::timestamp without time zone) AND (crt_time <= '2016-04-08 00:33:00'::timestamp without time zone))
(5 rows)
Time: 0.382 ms

postgres=# select count(*) from test1 where crt_time between '2016-04-08 00:32:00' and '2016-04-08 00:33:00';
  count
---------
 2857968
(1 row)
Time: 554.474 ms
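The queries above target a single partition. To search all 128 partitions in one statement, one option (my suggestion, not part of the original test) is to attach them to the empty parent table via inheritance and query the parent:

do language plpgsql $$
declare
  i int;
begin
  for i in 1..128 loop
    execute 'alter table test'||i||' inherit test';
  end loop;
end;
$$;
-- a query on the parent now scans every partition, each through its own BRIN index
select count(*) from test where crt_time between '2016-04-08 00:32:00' and '2016-04-08 00:33:00';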
Summary

1. The main application scenario for this case is real-time loading of big data: for example IoT applications, where huge numbers of sensors generate enormous data volumes, or traditional telecom gateways, which likewise produce very large traffic and business data streams that must be loaded in real time. On the indexing side, PostgreSQL's BRIN "black technology" is used.
2. Besides real-time loading, users who need streaming real-time processing can refer to the PostgreSQL-based streaming solution,
How is real-time stream processing of 1 trillion records per day achieved?
https://yq.aliyun.com/articles/166
3. The bottleneck is still I/O, with several symptoms: top shows a large number of processes in the D state (uninterruptible sleep, typically waiting on I/O).
w: S -- Process Status
   The status of the task which can be one of:
     'D' = uninterruptible sleep
     'R' = running
     'S' = sleeping
     'T' = traced or stopped
     'Z' = zombie

Utilization of all block devices reached 100%.

While clearing data:
Device:  rrqm/s  wrqm/s      r/s        w/s      rsec/s      wsec/s  avgrq-sz  avgqu-sz  await  svctm   %util
dfa        0.00    0.00  5807.39  167576.65  1464080.93  1340613.23     16.18    535.69   3.02   0.01  116.77
dfb        0.00    0.00  5975.10  185132.68  1506714.40  1481061.48     15.63    459.46   2.32   0.01  110.62
dfc        0.00    0.00  5715.56  182584.05  1440771.98  1460672.37     15.41    568.02   2.93   0.01  112.37

While inserting data:
Device:  rrqm/s  wrqm/s   r/s        w/s  rsec/s      wsec/s  avgrq-sz  avgqu-sz  await  svctm   %util
dfa        0.00    0.00  0.00  235936.00    0.00  1887488.00      8.00   2676.34  11.17   0.00   99.10
dfb        0.00    0.00  0.00  237621.00    0.00  1900968.00      8.00     66.02   0.10   0.00   99.10
dfc        0.00    0.00  0.00  239830.00    0.00  1918632.00      8.00     10.66   0.04   0.00  101.30

I/O-level performance problems can be mitigated by optimizing the code (for example, having the PostgreSQL bgwriter write data out sequentially as far as possible), so that the OS layer can merge I/O requests and relieve I/O pressure. Judging from these numbers, the size of a single write I/O could still be increased.
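For reference, per-device statistics like the tables above can be sampled with iostat (my assumption about how they were collected). Note that rsec/s and wsec/s count 512-byte sectors, so ~1.9 million wsec/s is roughly 0.9 GB/s of writes per device:

iostat -x 1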
There are a few tools you may find useful: perf, systemtap, gprof. For a more thorough analysis, it is recommended to build PostgreSQL with --enable-profiling for diagnosis.
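A minimal sketch of that profiling build (install path assumed from the earlier configure line); with --enable-profiling, each backend writes a gprof gmon.out profile on exit:

./configure --prefix=/home/digoal/pgsql9.5.1 --enable-profiling
make && make install
# on-CPU sampling of a running backend (hypothetical pid):
perf top -p 12345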