Before /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_29_04_53_00.csv
After /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_01_30_04_52_59.csv

DataComPy Comparison
--------------------

DataFrame Summary
-----------------

  DataFrame  Columns   Rows
0       df1       34  60432
1       df2       34  60549

Column Summary
--------------

Number of columns in common: 34
Number of columns in df1 but not in df2: 0
Number of columns in df2 but not in df1: 0

Row Summary
-----------

Matched on: index
Any duplicates on match values: No
Absolute Tolerance: 0
Relative Tolerance: 0
Number of rows in common: 60,432
Number of rows in df1 but not in df2: 0
Number of rows in df2 but not in df1: 117
Number of rows with some compared columns unequal: 29,592
Number of rows with all compared columns equal: 30,840

Column Comparison
-----------------

Number of columns compared with some values unequal: 10
Number of columns compared with all values equal: 24
Total number of values which compare unequal: 107,597

Columns with Unequal Values or Types
------------------------------------

               Column  df1 dtype  df2 dtype  # Unequal  Max Diff  # Null Diff
4     concurrrent_sti     object     object       1929         0         1909
7             epilink     object     object       6301         0         6044
2              gender     object     object       5568         0         3883
6       health_worker     object     object      11823         0        11231
3        hospitalised     object     object      12965         0         4699
9  immunosuppresssion     object     object      11368         0        11368
5             outcome     object     object      12286         0        12231
0            pregnant     object     object      11136         0        10029
1    smallpox_vaccine     object     object      18084         0        12012
8            symp_asy     object     object      16137         0        11992

Sample Rows with Unequal Values
-------------------------------

                                      pregnant (df1)  pregnant (df2)
recordid    reporting_country
USA00028605 UNITED STATES OF AMERICA             NaN              NO
USA00021635 UNITED STATES OF AMERICA              NO             NaN
USA00025830 UNITED STATES OF AMERICA             UNK              NO
USA00013726 UNITED STATES OF AMERICA              NO             NaN
USA00030283 UNITED STATES OF AMERICA             UNK             NaN
USA00007187 UNITED STATES OF AMERICA             NaN             UNK
USA00007180 UNITED STATES OF AMERICA             UNK             NaN
USA00007966 UNITED STATES OF AMERICA             UNK             NaN
USA00022610 UNITED STATES OF AMERICA              NO             NaN
USA00021141 UNITED STATES OF AMERICA             UNK             NaN

                                      smallpox_vaccine (df1)  smallpox_vaccine (df2)
recordid    reporting_country
USA00019517 UNITED STATES OF AMERICA                    PREV                      NO
USA00029648 UNITED STATES OF AMERICA                     UNK                     NaN
USA00004269 UNITED STATES OF AMERICA                     NaN                      NO
USA00029689 UNITED STATES OF AMERICA                     NaN                      NO
USA00007159 UNITED STATES OF AMERICA                     UNK                     NaN
USA00022633 UNITED STATES OF AMERICA                     NaN                      NO
USA00023143 UNITED STATES OF AMERICA                     NaN                    PREV
USA00018759 UNITED STATES OF AMERICA                    PREV                     NaN
USA00016254 UNITED STATES OF AMERICA                     NaN                      NO
USA00003492 UNITED STATES OF AMERICA                    PREV                     NaN

                                      gender (df1)  gender (df2)
recordid    reporting_country
USA00024254 UNITED STATES OF AMERICA          MALE        FEMALE
USA00000226 UNITED STATES OF AMERICA          MALE           NaN
USA00005147 UNITED STATES OF AMERICA        FEMALE          MALE
USA00004747 UNITED STATES OF AMERICA          MALE        FEMALE
USA00003751 UNITED STATES OF AMERICA           NaN        FEMALE
USA00008458 UNITED STATES OF AMERICA           NaN          MALE
USA00027455 UNITED STATES OF AMERICA           NaN          MALE
USA00021595 UNITED STATES OF AMERICA           NaN          MALE
USA00029318 UNITED STATES OF AMERICA        FEMALE          MALE
USA00005612 UNITED STATES OF AMERICA           NaN        FEMALE

                                      hospitalised (df1)  hospitalised (df2)
recordid    reporting_country
USA00017549 UNITED STATES OF AMERICA                 NaN                  NO
USA00007913 UNITED STATES OF AMERICA                 UNK                  NO
USA00014347 UNITED STATES OF AMERICA                  NO                 UNK
USA00013787 UNITED STATES OF AMERICA                  NO                 NaN
USA00007723 UNITED STATES OF AMERICA                  NO                 NaN
USA00014352 UNITED STATES OF AMERICA                  NO                 UNK
USA00013351 UNITED STATES OF AMERICA                  NO                 NaN
USA00000498 UNITED STATES OF AMERICA                YUNK                 NaN
USA00025839 UNITED STATES OF AMERICA                  NO                 UNK
USA00012018 UNITED STATES OF AMERICA                 UNK                  NO

                                      concurrrent_sti (df1)  concurrrent_sti (df2)
recordid    reporting_country
USA00000987 UNITED STATES OF AMERICA                  CHLAM                    NaN
USA00029676 UNITED STATES OF AMERICA                   SYPH                    NaN
USA00002750 UNITED STATES OF AMERICA                    NaN                  CHLAM
USA00004415 UNITED STATES OF AMERICA                    NaN                   SYPH
USA00011258 UNITED STATES OF AMERICA                   SYPH                    NaN
USA00001190 UNITED STATES OF AMERICA                  CHLAM                    NaN
USA00000830 UNITED STATES OF AMERICA                    NaN                  CHLAM
USA00005542 UNITED STATES OF AMERICA                   SYPH                    NaN
USA00012930 UNITED STATES OF AMERICA                   SYPH                    NaN
USA00027603 UNITED STATES OF AMERICA                   SYPH                    NaN

                                      outcome (df1)  outcome (df2)
recordid    reporting_country
USA00001312 UNITED STATES OF AMERICA              A            NaN
USA00013358 UNITED STATES OF AMERICA              A            NaN
USA00002337 UNITED STATES OF AMERICA              A            NaN
USA00005732 UNITED STATES OF AMERICA              A            NaN
USA00007654 UNITED STATES OF AMERICA            NaN              A
USA00002741 UNITED STATES OF AMERICA            NaN              A
USA00023232 UNITED STATES OF AMERICA              A            NaN
USA00009753 UNITED STATES OF AMERICA              A            NaN
USA00020268 UNITED STATES OF AMERICA              A            NaN
USA00017097 UNITED STATES OF AMERICA            NaN              A

                                      health_worker (df1)  health_worker (df2)
recordid    reporting_country
USA00020675 UNITED STATES OF AMERICA                   NO                  NaN
USA00020005 UNITED STATES OF AMERICA                  NaN                   NO
USA00002665 UNITED STATES OF AMERICA                   NO                  NaN
USA00002782 UNITED STATES OF AMERICA                  NaN                   NO
USA00007702 UNITED STATES OF AMERICA                   NO                  NaN
USA00003094 UNITED STATES OF AMERICA                  NaN                   NO
USA00024405 UNITED STATES OF AMERICA                  NaN                   NO
USA00009531 UNITED STATES OF AMERICA                   NO                  YES
USA00029858 UNITED STATES OF AMERICA                  NaN                   NO
USA00005488 UNITED STATES OF AMERICA                  NaN                   NO

                                      epilink (df1)  epilink (df2)
recordid    reporting_country
USA00029635 UNITED STATES OF AMERICA             NO            NaN
USA00028753 UNITED STATES OF AMERICA            NaN             NO
USA00010280 UNITED STATES OF AMERICA             NO            NaN
USA00003475 UNITED STATES OF AMERICA            NaN            YES
USA00029555 UNITED STATES OF AMERICA            NaN            YES
USA00006875 UNITED STATES OF AMERICA             NO            NaN
USA00008455 UNITED STATES OF AMERICA            NaN             NO
USA00015900 UNITED STATES OF AMERICA            YES            NaN
USA00027788 UNITED STATES OF AMERICA            NaN             NO
USA00019218 UNITED STATES OF AMERICA             NO            NaN

                                      symp_asy (df1)  symp_asy (df2)
recordid    reporting_country
USA00013782 UNITED STATES OF AMERICA             YES             NaN
USA00018634 UNITED STATES OF AMERICA              NO             YES
USA00025235 UNITED STATES OF AMERICA              NO             NaN
USA00031409 UNITED STATES OF AMERICA             YES             NaN
USA00005695 UNITED STATES OF AMERICA             YES             NaN
USA00015473 UNITED STATES OF AMERICA             YES             NaN
USA00021971 UNITED STATES OF AMERICA             NaN             YES
USA00004485 UNITED STATES OF AMERICA             YES              NO
USA00022024 UNITED STATES OF AMERICA             YES             NaN
USA00025316 UNITED STATES OF AMERICA             YES             NaN

                                      immunosuppresssion (df1)  immunosuppresssion (df2)
recordid    reporting_country
USA00000537 UNITED STATES OF AMERICA                        NO                       NaN
USA00020206 UNITED STATES OF AMERICA                       NaN                        NO
USA00007490 UNITED STATES OF AMERICA                       NaN                        NO
USA00017663 UNITED STATES OF AMERICA                        NO                       NaN
USA00021796 UNITED STATES OF AMERICA                       NaN                        NO
USA00023916 UNITED STATES OF AMERICA                        NO                       NaN
USA00025189 UNITED STATES OF AMERICA                       NaN                        NO
USA00015086 UNITED STATES OF AMERICA                       NaN                        NO
USA00021305 UNITED STATES OF AMERICA                       NaN                        NO
USA00020434 UNITED STATES OF AMERICA                       NaN                        NO

Sample Rows Only in df2 (First 10 Columns)
------------------------------------------

                                      pregnant  case_class  smallpox_vaccine  gender  sexual_orientation  clade  hospitalised  concurrrent_sti  icu  outcome
recordid    reporting_country
USA00031682 UNITED STATES OF AMERICA        NO         NaN                NO    MALE                 NaN    NaN            NO              NaN  NaN      NaN
USA00031693 UNITED STATES OF AMERICA       NaN         NaN               NaN    MALE                 NaN    NaN            NO              NaN  NaN        A
USA00031738 UNITED STATES OF AMERICA       NaN         NaN               UNK    MALE                 NaN    NaN           UNK              NaN  NaN      NaN
USA00031715 UNITED STATES OF AMERICA       NaN         NaN               NaN    MALE                 NaN    NaN           UNK              NaN  NaN        A
USA00031751 UNITED STATES OF AMERICA        NO         NaN               NaN    MALE                 NaN    NaN            NO              NaN  NaN        A
USA00031686 UNITED STATES OF AMERICA        NO         NaN              PREV    MALE                 NaN    NaN            NO              NaN  NaN        A
USA00031700 UNITED STATES OF AMERICA       NaN         NaN               NaN    MALE                 NaN    NaN            NO              NaN  NaN        A
USA00031725 UNITED STATES OF AMERICA       NaN         NaN               NaN    MALE                 NaN    NaN           UNK              NaN  NaN      NaN
USA00031676 UNITED STATES OF AMERICA       UNK         NaN                NO    MALE                 NaN    NaN            NO              NaN  NaN      NaN
USA00031668 UNITED STATES OF AMERICA       NaN         NaN               NaN     NaN                 NaN    NaN           NaN              NaN  NaN      NaN
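
Reading note on the "# Unequal" and "# Null Diff" columns: the sketch below illustrates how the two counts relate, assuming "# Null Diff" counts the mismatches in which exactly one side is missing (which is consistent with the sample rows above, where most differences are a value against NaN). It uses only the standard library and tiny made-up records; the function name `summarize_diffs`, the IDs, and the data layout are hypothetical, and this is not DataComPy's own implementation.

```python
from collections import Counter


def summarize_diffs(before, after, columns):
    """Return {column: (n_unequal, n_null_diff)} for keys present in both snapshots.

    Assumption: a "null diff" is a mismatch where exactly one side is None.
    """
    unequal = Counter()
    null_diff = Counter()
    # Only rows present in both snapshots are compared column by column.
    for key in before.keys() & after.keys():
        for col in columns:
            old, new = before[key].get(col), after[key].get(col)
            if old == new:
                continue
            unequal[col] += 1
            if (old is None) != (new is None):
                null_diff[col] += 1
    return {col: (unequal[col], null_diff[col]) for col in columns}


# Toy snapshots keyed by (recordid, reporting_country); values are invented.
before = {
    ("ID1", "X"): {"pregnant": None, "gender": "MALE"},
    ("ID2", "X"): {"pregnant": "UNK", "gender": "MALE"},
}
after = {
    ("ID1", "X"): {"pregnant": "NO", "gender": "MALE"},
    ("ID2", "X"): {"pregnant": "NO", "gender": None},
    ("ID3", "X"): {"pregnant": None, "gender": "MALE"},  # only in "after"
}

summary = summarize_diffs(before, after, ["pregnant", "gender"])
print(summary)
```

Under that assumption, a column whose "# Null Diff" equals its "# Unequal" (as with immunosuppresssion above) changed only between a value and missing, while a large gap between the two (as with hospitalised) points to values being recoded from one category to another.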