Before /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_12_04_53_00.csv After /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_13_04_53_02.csv --------------------------- hospitalised --------------------------- self other diff UNK 8.637288 9.692049 1.054761 --------------------------- epilink --------------------------- self other diff YES 17.428365 23.273880 5.845515 UNK 4.926559 6.357591 1.431032 DataComPy Comparison -------------------- DataFrame Summary ----------------- DataFrame Columns Rows 0 df1 34 60218 1 df2 34 60431 Column Summary -------------- Number of columns in common: 34 Number of columns in df1 but not in df2: 0 Number of columns in df2 but not in df1: 0 Row Summary ----------- Matched on: index Any duplicates on match values: No Absolute Tolerance: 0 Relative Tolerance: 0 Number of rows in common: 60,218 Number of rows in df1 but not in df2: 0 Number of rows in df2 but not in df1: 213 Number of rows with some compared columns unequal: 31,100 Number of rows with all compared columns equal: 29,118 Column Comparison ----------------- Number of columns compared with some values unequal: 10 Number of columns compared with all values equal: 24 Total number of values which compare unequal: 138,383 Columns with Unequal Values or Types ------------------------------------ Column df1 dtype df2 dtype # Unequal Max Diff # Null Diff 4 concurrrent_sti object object 1901 0 1882 7 epilink object object 10800 0 10339 2 gender object object 6101 0 4397 6 health_worker object object 16503 0 16131 3 hospitalised object object 13902 0 5413 9 immunosuppresssion object object 14599 0 14599 5 outcome object object 15428 0 15368 0 pregnant object object 17762 0 15543 1 smallpox_vaccine object object 21134 0 15973 8 symp_asy object object 20253 0 14454 Sample Rows with Unequal Values ------------------------------- pregnant (df1) pregnant (df2) recordid reporting_country USA00014352 UNITED STATES OF AMERICA NaN UNK USA00016412 UNITED STATES OF AMERICA UNK NaN USA00015441 UNITED STATES OF AMERICA NO NaN USA00021019 UNITED STATES OF AMERICA UNK NaN USA00015702 UNITED STATES OF AMERICA NaN UNK USA00029566 UNITED STATES OF AMERICA UNK NaN USA00001997 UNITED STATES OF AMERICA NO NaN USA00004442 UNITED STATES OF AMERICA NaN UNK USA00007507 UNITED STATES OF AMERICA NaN NO USA00009125 UNITED STATES OF AMERICA NaN UNK smallpox_vaccine (df1) smallpox_vaccine (df2) recordid reporting_country USA00008820 UNITED STATES OF AMERICA NaN NO USA00031137 UNITED STATES OF AMERICA NO NaN USA00021059 UNITED STATES OF AMERICA NO NaN USA00021743 UNITED STATES OF AMERICA UNK NaN USA00026471 UNITED STATES OF AMERICA NaN UNK USA00019292 UNITED STATES OF AMERICA NaN NO USA00007158 UNITED STATES OF AMERICA NaN PREV USA00021747 UNITED STATES OF AMERICA NaN NO USA00009559 UNITED STATES OF AMERICA NaN NO USA00021400 UNITED STATES OF AMERICA NaN NO gender (df1) gender (df2) recordid reporting_country USA00003233 UNITED STATES OF AMERICA NaN MALE USA00020286 UNITED STATES OF AMERICA FEMALE MALE USA00005510 UNITED STATES OF AMERICA MALE NaN USA00006389 UNITED STATES OF AMERICA NaN MALE USA00018854 UNITED STATES OF AMERICA MALE FEMALE USA00004782 UNITED STATES OF AMERICA FEMALE MALE USA00018458 UNITED STATES OF AMERICA NaN MALE USA00006538 UNITED STATES OF AMERICA FEMALE MALE USA00014703 UNITED STATES OF AMERICA NaN MALE USA00030080 UNITED STATES OF AMERICA MALE NaN hospitalised (df1) hospitalised (df2) recordid reporting_country USA00029143 UNITED STATES OF AMERICA UNK NO USA00017369 UNITED STATES OF AMERICA YUNK NO USA00023586 UNITED STATES OF AMERICA UNK NaN USA00020386 UNITED STATES OF AMERICA NO UNK USA00028220 UNITED STATES OF AMERICA NO UNK USA00021821 UNITED STATES OF AMERICA YUNK NaN USA00016733 UNITED STATES OF AMERICA NaN YUNK USA00030350 UNITED STATES OF AMERICA NO UNK USA00001531 UNITED STATES OF AMERICA NO NaN USA00023344 UNITED STATES OF AMERICA UNK NO concurrrent_sti (df1) concurrrent_sti (df2) recordid reporting_country USA00023280 UNITED STATES OF AMERICA NaN SYPH USA00031069 UNITED STATES OF AMERICA NaN SYPH USA00023875 UNITED STATES OF AMERICA NaN SYPH USA00023881 UNITED STATES OF AMERICA CHLAM NaN USA00025323 UNITED STATES OF AMERICA NaN CHLAM USA00008999 UNITED STATES OF AMERICA SYPH NaN USA00014890 UNITED STATES OF AMERICA CHLAM NaN USA00030186 UNITED STATES OF AMERICA NaN SYPH USA00019002 UNITED STATES OF AMERICA NaN SYPH USA00012847 UNITED STATES OF AMERICA CHLAM NaN outcome (df1) outcome (df2) recordid reporting_country USA00002692 UNITED STATES OF AMERICA NaN A USA00015702 UNITED STATES OF AMERICA NaN A USA00008391 UNITED STATES OF AMERICA NaN A USA00008638 UNITED STATES OF AMERICA NaN A USA00002249 UNITED STATES OF AMERICA NaN A USA00006047 UNITED STATES OF AMERICA A NaN USA00004501 UNITED STATES OF AMERICA NaN A USA00025520 UNITED STATES OF AMERICA A NaN USA00003024 UNITED STATES OF AMERICA A NaN USA00004835 UNITED STATES OF AMERICA A NaN health_worker (df1) health_worker (df2) recordid reporting_country USA00023154 UNITED STATES OF AMERICA NO NaN USA00030598 UNITED STATES OF AMERICA NO NaN USA00018138 UNITED STATES OF AMERICA NaN NO USA00026901 UNITED STATES OF AMERICA NaN NO USA00025244 UNITED STATES OF AMERICA NO NaN USA00025282 UNITED STATES OF AMERICA NaN NO USA00008694 UNITED STATES OF AMERICA NaN NO USA00026057 UNITED STATES OF AMERICA NaN NO USA00020130 UNITED STATES OF AMERICA NaN NO USA00007749 UNITED STATES OF AMERICA NaN NO epilink (df1) epilink (df2) recordid reporting_country USA00003077 UNITED STATES OF AMERICA NO NaN USA00017711 UNITED STATES OF AMERICA NaN YES USA00027919 UNITED STATES OF AMERICA NO NaN USA00027177 UNITED STATES OF AMERICA NO NaN USA00008990 UNITED STATES OF AMERICA NO NaN USA00001561 UNITED STATES OF AMERICA YES NaN USA00009422 UNITED STATES OF AMERICA NO NaN USA00004816 UNITED STATES OF AMERICA NaN NO USA00008779 UNITED STATES OF AMERICA NO NaN USA00016178 UNITED STATES OF AMERICA NO NaN symp_asy (df1) symp_asy (df2) recordid reporting_country USA00023206 UNITED STATES OF AMERICA NaN YES USA00003452 UNITED STATES OF AMERICA NaN YES USA00011604 UNITED STATES OF AMERICA NO NaN USA00028608 UNITED STATES OF AMERICA NaN YES USA00024019 UNITED STATES OF AMERICA YES NaN USA00021139 UNITED STATES OF AMERICA NO NaN USA00021636 UNITED STATES OF AMERICA NaN YES USA00019368 UNITED STATES OF AMERICA YES NaN USA00009780 UNITED STATES OF AMERICA NaN YES USA00030690 UNITED STATES OF AMERICA NaN YES immunosuppresssion (df1) immunosuppresssion (df2) recordid reporting_country USA00009159 UNITED STATES OF AMERICA NO NaN USA00007672 UNITED STATES OF AMERICA NO NaN USA00023948 UNITED STATES OF AMERICA NO NaN USA00021559 UNITED STATES OF AMERICA NO NaN USA00030902 UNITED STATES OF AMERICA NO NaN USA00022535 UNITED STATES OF AMERICA NaN NO USA00027182 UNITED STATES OF AMERICA NO NaN USA00021639 UNITED STATES OF AMERICA NO NaN USA00030725 UNITED STATES OF AMERICA NaN NO USA00029275 UNITED STATES OF AMERICA NaN NO Sample Rows Only in df2 (First 10 Columns) ------------------------------------------ pregnant case_class smallpox_vaccine gender sexual_orientation clade hospitalised concurrrent_sti icu outcome recordid reporting_country USA00031518 UNITED STATES OF AMERICA NaN NaN NO MALE NaN NaN NO NaN NaN A USA00031470 UNITED STATES OF AMERICA UNK NaN NaN MALE NaN NaN NO NaN NaN A USA00031621 UNITED STATES OF AMERICA UNK NaN UNK MALE NaN NaN NO NaN NaN A USA00031582 UNITED STATES OF AMERICA NaN NaN NaN MALE NaN NaN NaN NaN NaN NaN USA00031494 UNITED STATES OF AMERICA NaN NaN NO NaN NaN NaN NO SYPH NaN A USA00031545 UNITED STATES OF AMERICA NaN NaN NaN MALE NaN NaN NO NaN NaN A USA00031586 UNITED STATES OF AMERICA NaN NaN NaN MALE NaN NaN NO NaN NaN A USA00031575 UNITED STATES OF AMERICA NaN NaN PREV MALE NaN NaN NO NaN NaN NaN USA00031523 UNITED STATES OF AMERICA NaN NaN NO MALE NaN NaN NO NaN NaN A USA00031626 UNITED STATES OF AMERICA NaN NaN NaN MALE NaN NaN YUNK NaN NaN NaN