Before /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_19_04_53_00.csv After /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_20_04_52_54.csv --------------------------- outcome --------------------------- Answers changed: -> Before A 98.463560 UNK 1.225398 D 0.311042 Name: outcome, dtype: float64 -> After A 99.679815 D 0.320185 Name: outcome, dtype: float64 --------------------------- health_worker --------------------------- Answers changed: -> Before NO 93.943877 YES 5.316478 UNK 0.672405 9 0.058275 0 0.008965 Name: health_worker, dtype: float64 -> After NO 94.589468 YES 5.333755 9 0.058712 UNK 0.009033 0 0.009033 Name: health_worker, dtype: float64 --------------------------- epilink --------------------------- self other diff NO 70.368529 71.57226 1.203731 DataComPy Comparison -------------------- DataFrame Summary ----------------- DataFrame Columns Rows 0 df1 34 60432 1 df2 34 60432 Column Summary -------------- Number of columns in common: 34 Number of columns in df1 but not in df2: 0 Number of columns in df2 but not in df1: 0 Row Summary ----------- Matched on: index Any duplicates on match values: No Absolute Tolerance: 0 Relative Tolerance: 0 Number of rows in common: 60,427 Number of rows in df1 but not in df2: 5 Number of rows in df2 but not in df1: 5 Number of rows with some compared columns unequal: 1,532 Number of rows with all compared columns equal: 58,895 Column Comparison ----------------- Number of columns compared with some values unequal: 30 Number of columns compared with all values equal: 4 Total number of values which compare unequal: 12,556 Columns with Unequal Values or Types ------------------------------------ Column df1 dtype df2 dtype # Unequal Max Diff # Null Diff 5 concurrrent_sti object object 214 0 214 13 epilink object object 977 0 977 14 exposuresetting object object 77 0 77 2 gender object object 725 0 725 9 health_worker object object 183 0 169 8 hiv_status object object 76 0 63 4 hospitalised object object 812 0 794 6 icu object object 17 0 16 28 immunosuppresssion object object 182 0 138 29 lab_method object object 54 0 54 10 mpx_treatment object object 237 0 237 7 outcome object object 1063 0 1063 0 pregnant object object 729 0 729 3 sexual_orientation object object 75 0 68 1 smallpox_vaccine object object 811 0 811 11 specemin_type object object 659 0 659 15 symp_asy object object 7 0 0 23 symp_chills object object 407 0 407 25 symp_conj object object 18 0 18 27 symp_cough object object 117 0 117 22 symp_fatigue object object 714 0 714 19 symp_fever object object 756 0 756 17 symp_genital object object 949 0 949 24 symp_headache object object 549 0 549 20 symp_musc object object 518 0 518 18 symp_oral object object 544 0 544 16 symp_rash object object 590 0 590 21 symp_throat object object 382 0 382 26 symp_vomit object object 87 0 87 12 transmission object object 27 0 27 Sample Rows with Unequal Values ------------------------------- pregnant (df1) pregnant (df2) recordid reporting_country ON1270497 CANADA NO NaN ON1267946 CANADA NO NaN ON1311057 CANADA NO NaN ON1268102 CANADA NO NaN ON1271816 CANADA NO NaN ON1265714 CANADA NO NaN ON1268057 CANADA NO NaN ON1270068 CANADA NO NaN ON1270320 CANADA NO NaN ON1268332 CANADA NO NaN smallpox_vaccine (df1) smallpox_vaccine (df2) recordid reporting_country QC400106 CANADA NO NaN QC401125 CANADA UNK NaN QC392469 CANADA UNK NaN BCmpx_125 CANADA YES NaN QC399303 CANADA NO NaN BCmpx_065 CANADA NO NaN BCmpx_183 CANADA YES NaN BCmpx_140 CANADA YES NaN BCmpx_044 CANADA UNK NaN BCmpx_010 CANADA NO NaN gender (df1) gender (df2) recordid reporting_country ON1274370 CANADA NaN MALE ON1268859 CANADA NaN MALE ON1298695 CANADA NaN MALE ON1276710 CANADA NaN MALE ON1271351 CANADA NaN MALE ON1265586 CANADA NaN MALE ON1271336 CANADA NaN MALE ON1274767 CANADA NaN MALE ON1273116 CANADA NaN MALE ON1266791 CANADA NaN MALE sexual_orientation (df1) sexual_orientation (df2) recordid reporting_country ON1276952 CANADA HETERO NaN QC402719 CANADA UNK NaN ON1311011 CANADA NaN MSM AB11453082 CANADA HETERO NaN BCmpx_138 CANADA BISEXUAL MSM QC394055 CANADA HETERO NaN QC397686 CANADA HETERO NaN ON1264736 CANADA HETERO NaN ON1267624 CANADA HETERO NaN ON1266728 CANADA HETERO NaN hospitalised (df1) hospitalised (df2) recordid reporting_country ON1271618 CANADA NaN NO ON1270331 CANADA NaN NO ON1268084 CANADA NaN NO ON1272248 CANADA NaN NO ON1268710 CANADA NaN NO ON1265572 CANADA NaN NO QC405221 CANADA NaN YUNK SK20232336 CANADA NaN YUNK ON1267022 CANADA NaN NO ON1268925 CANADA NaN NO concurrrent_sti (df1) concurrrent_sti (df2) recordid reporting_country BCmpx_189 CANADA UNK NaN BCmpx_013 CANADA CHLAM NaN ON1264231 CANADA CHLAM NaN AB11291277 CANADA SYPH NaN ON1271540 CANADA SYPH NaN ON1271838 CANADA SYPH NaN ON1263569 CANADA SYPH NaN ON1268077 CANADA GONO NaN QC396690 CANADA SYPH NaN ON1266147 CANADA CHLAM NaN icu (df1) icu (df2) recordid reporting_country ON1321577 CANADA UNK NaN ON1326815 CANADA UNK NaN ON1315777 CANADA UNK NaN ON1322689 CANADA UNK NaN ON1323194 CANADA UNK NaN ON1316974 CANADA UNK NaN QC401533 CANADA UNK NaN ON1266143 CANADA NO NaN BCmpx_148 CANADA UNK NaN ON1263562 CANADA YES NO outcome (df1) outcome (df2) recordid reporting_country ON1269892 CANADA UNK NaN ON1268882 CANADA A NaN QC398571 CANADA A NaN ON1263574 CANADA UNK NaN ON1267925 CANADA UNK NaN ON1271732 CANADA UNK NaN ON1267320 CANADA UNK NaN NLMPXV-53711 CANADA A NaN ON1266924 CANADA A NaN ON1311202 CANADA UNK NaN hiv_status (df1) hiv_status (df2) recordid reporting_country QC403155 CANADA UNK NaN SK20221229 CANADA UNK NaN ON1267022 CANADA UNK NaN BCmpx_059 CANADA UNK NaN BCmpx_083 CANADA UNK NaN ON1268251 CANADA POS NEG ON1271896 CANADA NaN NEG ON1271540 CANADA POS NEG ON1267723 CANADA UNK NaN SK20221093 CANADA UNK NaN health_worker (df1) health_worker (df2) recordid reporting_country ON1267202 CANADA UNK NaN ON1266026 CANADA UNK NaN ON1275165 CANADA UNK NaN ON1266240 CANADA UNK NaN ON1268387 CANADA UNK NaN ON1270424 CANADA UNK NaN ON1270761 CANADA UNK NaN ON1271351 CANADA UNK NaN ON1267606 CANADA UNK NaN ON1266266 CANADA UNK NaN mpx_treatment (df1) mpx_treatment (df2) recordid reporting_country BCmpx_061 CANADA NO NaN BCmpx_033 CANADA NO NaN BCmpx_004 CANADA NO NaN BCmpx_069 CANADA NO NaN BCmpx_003 CANADA NO NaN BCmpx_006 CANADA UNK NaN BCmpx_104 CANADA NO NaN AB11286360 CANADA NO NaN BCmpx_076 CANADA NO NaN NBR2MP2022-001 CANADA NO NaN specemin_type (df1) specemin_type (df2) recordid reporting_country QC394681 CANADA SWAB NaN QC392926 CANADA OTHER NaN QC396325 CANADA OTHER NaN QC394003 CANADA OTHER NaN BCmpx_191 CANADA RECTAL NaN QC395865 CANADA OTHER NaN QC401506 CANADA OTHER NaN NS188773 CANADA GENITAL NaN QC397692 CANADA OTHER NaN QC399292 CANADA OTHER NaN transmission (df1) transmission (df2) recordid reporting_country BCmpx_110 CANADA UNK NaN BCmpx_042 CANADA UNK NaN BCmpx_097 CANADA UNK NaN ON1316974 CANADA UNK NaN BCmpx_103 CANADA UNK NaN BCmpx_181 CANADA UNK NaN ON1326815 CANADA UNK NaN BCmpx_121 CANADA UNK NaN BCmpx_006 CANADA UNK NaN ON1314245 CANADA UNK NaN epilink (df1) epilink (df2) recordid reporting_country ON1269880 CANADA UNK NaN ON1271189 CANADA NO NaN ON1271332 CANADA NO NaN AB11291139 CANADA NO NaN ON1267841 CANADA YES NaN ON1271456 CANADA NO NaN ON1265767 CANADA NO NaN ON1268532 CANADA NO NaN ON1268882 CANADA UNK NaN BCmpx_192 CANADA UNK NaN exposuresetting (df1) exposuresetting (df2) recordid reporting_country MB1289585 CANADA HOUSE NaN BCmpx_188 CANADA HOUSE NaN BCmpx_021 CANADA HOUSE NaN BCmpx_057 CANADA OTHER NaN BCmpx_059 CANADA BAR NaN BCmpx_026 CANADA HOUSE NaN BCmpx_175 CANADA HOUSE NaN NLMPXV-53711 CANADA PARTY NaN BCmpx_153 CANADA HOUSE NaN AB11372678 CANADA PARTY NaN symp_asy (df1) symp_asy (df2) recordid reporting_country ON1269950 CANADA YES NO ON1272039 CANADA YES NO ON1270424 CANADA YES NO ON1272516 CANADA YES NO ON1312031 CANADA YES NO ON1270183 CANADA YES NO ON1311011 CANADA YES NO symp_rash (df1) symp_rash (df2) recordid reporting_country ON1274395 CANADA YES NaN ON1270557 CANADA YES NaN BCmpx_059 CANADA YES NaN QC417375 CANADA YES NaN QC410667 CANADA YES NaN QC406682 CANADA YES NaN ON1267568 CANADA YES NaN BCmpx_013 CANADA YES NaN QC398443 CANADA YES NaN ON1266515 CANADA YES NaN symp_genital (df1) symp_genital (df2) recordid reporting_country ON1267798 CANADA YES NaN QC407021 CANADA YES NaN ON1266153 CANADA YES NaN ON1305460 CANADA YES NaN ON1267843 CANADA YES NaN ON1269897 CANADA YES NaN ON1272057 CANADA YES NaN ON1267946 CANADA YES NaN QC392458 CANADA YES NaN ON1265586 CANADA YES NaN symp_oral (df1) symp_oral (df2) recordid reporting_country ON1268057 CANADA YES NaN ON1267134 CANADA YES NaN ON1269820 CANADA YES NaN ON1267267 CANADA YES NaN ON1267843 CANADA YES NaN ON1264207 CANADA YES NaN AB11313395 CANADA YES NaN QC393294 CANADA YES NaN ON1269931 CANADA YES NaN AB11292483 CANADA YES NaN symp_fever (df1) symp_fever (df2) recordid reporting_country ON1265173 CANADA YES NaN SK20232336 CANADA YES NaN QC395856 CANADA YES NaN ON1263954 CANADA YES NaN ON1273625 CANADA YES NaN QC399288 CANADA YES NaN ON1267521 CANADA YES NaN ON1268564 CANADA YES NaN QC397370 CANADA YES NaN QC396120 CANADA YES NaN symp_musc (df1) symp_musc (df2) recordid reporting_country ON1275616 CANADA YES NaN ON1266920 CANADA YES NaN BCmpx_177 CANADA YES NaN BCmpx_191 CANADA YES NaN ON1270548 CANADA YES NaN ON1269210 CANADA YES NaN QC398112 CANADA YES NaN ON1271594 CANADA YES NaN AB11301152 CANADA YES NaN ON1276617 CANADA YES NaN symp_throat (df1) symp_throat (df2) recordid reporting_country ON1271690 CANADA YES NaN QC407442 CANADA YES NaN BCmpx_123 CANADA YES NaN QC399676 CANADA YES NaN AB11285203 CANADA YES NaN QC401517 CANADA YES NaN BCmpx_082 CANADA YES NaN NLMPXV-53711 CANADA YES NaN ON1269497 CANADA YES NaN ON1272057 CANADA YES NaN symp_fatigue (df1) symp_fatigue (df2) recordid reporting_country QC403000 CANADA YES NaN ON1267028 CANADA YES NaN ON1269825 CANADA YES NaN QC401519 CANADA YES NaN ON1272449 CANADA YES NaN QC399266 CANADA YES NaN BCmpx_036 CANADA YES NaN ON1269734 CANADA YES NaN ON1271594 CANADA YES NaN ON1270781 CANADA YES NaN symp_chills (df1) symp_chills (df2) recordid reporting_country ON1270070 CANADA YES NaN SK20222289 CANADA YES NaN ON1272507 CANADA YES NaN ON1263574 CANADA YES NaN ON1267624 CANADA YES NaN ON1266147 CANADA YES NaN BCmpx_178 CANADA YES NaN ON1266468 CANADA YES NaN BCmpx_133 CANADA YES NaN ON1267938 CANADA YES NaN symp_headache (df1) symp_headache (df2) recordid reporting_country QC400496 CANADA YES NaN ON1272843 CANADA YES NaN QC399269 CANADA YES NaN ON1267238 CANADA YES NaN ON1267933 CANADA YES NaN ON1267568 CANADA YES NaN QC396945 CANADA YES NaN QC396119 CANADA YES NaN ON1270077 CANADA YES NaN BCmpx_036 CANADA YES NaN symp_conj (df1) symp_conj (df2) recordid reporting_country ON1267861 CANADA YES NaN AB11292483 CANADA YES NaN BCmpx_026 CANADA YES NaN NLMPXV-53725 CANADA YES NaN BCmpx_123 CANADA YES NaN ON1270790 CANADA YES NaN ON1268615 CANADA YES NaN ON1266637 CANADA YES NaN BCmpx_191 CANADA YES NaN BCmpx_185 CANADA YES NaN symp_vomit (df1) symp_vomit (df2) recordid reporting_country BCmpx_186 CANADA YES NaN BCmpx_111 CANADA YES NaN QC399279 CANADA YES NaN BCmpx_169 CANADA YES NaN QC406195 CANADA YES NaN QC406105 CANADA YES NaN BCmpx_063 CANADA YES NaN QC404573 CANADA YES NaN QC397443 CANADA YES NaN QC399288 CANADA YES NaN symp_cough (df1) symp_cough (df2) recordid reporting_country ON1267938 CANADA YES NaN ON1267532 CANADA YES NaN ON1268803 CANADA YES NaN ON1268167 CANADA YES NaN BCmpx_061 CANADA YES NaN NLMPXV-53711 CANADA YES NaN AB11261540 CANADA YES NaN BCmpx_024 CANADA YES NaN ON1273464 CANADA YES NaN ON1269555 CANADA YES NaN immunosuppresssion (df1) immunosuppresssion (df2) recordid reporting_country ON1268233 CANADA UNK NaN ON1275654 CANADA UNK NaN QC395855 CANADA UNK NaN ON1314252 CANADA YD NO ON1277326 CANADA UNK NaN ON1302476 CANADA YD YM ON1266146 CANADA UNK NaN ON1268423 CANADA UNK NaN ON1268381 CANADA UNK NaN BCmpx_148 CANADA UNK NaN lab_method (df1) lab_method (df2) recordid reporting_country AB11293854 CANADA MPXPCR NaN AB11265640 CANADA MPXPCR NaN AB11325102 CANADA MPXPCR NaN AB11285203 CANADA MPXPCR NaN AB11245228 CANADA MPXPCR NaN AB11259740 CANADA MPXPCR NaN AB11294347 CANADA MPXPCR NaN AB11283172 CANADA MPXPCR NaN AB11290606 CANADA MPXPCR NaN NS188773 CANADA MPXPCR NaN Sample Rows Only in df1 (First 10 Columns) ------------------------------------------ pregnant case_class smallpox_vaccine gender sexual_orientation clade hospitalised concurrrent_sti icu outcome recordid reporting_country ON1268701 CANADA NO CONFIRMED NaN NaN MSM NaN NaN NaN NaN A QC417521 CANADA NaN PROBABLE NaN MALE NaN NaN NaN NaN NaN NaN QC417519 CANADA NaN PROBABLE NaN MALE NaN NaN NaN NaN NaN NaN QC417527 CANADA NaN PROBABLE NaN MALE NaN NaN NaN NaN NaN NaN ON1271108 CANADA NO CONFIRMED NaN NaN MSM NaN NaN NaN NaN A Sample Rows Only in df2 (First 10 Columns) ------------------------------------------ pregnant case_class smallpox_vaccine gender sexual_orientation clade hospitalised concurrrent_sti icu outcome recordid reporting_country ON1333276 CANADA NaN CONFIRMED NaN MALE MSM NaN NO NaN NaN NaN BCmpx_219 CANADA NO CONFIRMED NaN MALE NaN NaN NO NaN NaN NaN BCmpx_221 CANADA NaN CONFIRMED NaN MALE MSM NaN NO NaN NaN NaN BCmpx_220 CANADA NO CONFIRMED NaN MALE MSM NaN NO NaN NO NaN BCmpx_218 CANADA NO CONFIRMED NaN MALE MSM NaN NO NaN NO NaN