Before /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_03_12_04_53_08.csv
After  /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_03_13_04_53_13.csv

DataComPy Comparison
--------------------

DataFrame Summary
-----------------

  DataFrame  Columns   Rows
0       df1       34  60781
1       df2       34  60902

Column Summary
--------------

Number of columns in common: 34
Number of columns in df1 but not in df2: 0
Number of columns in df2 but not in df1: 0

Row Summary
-----------

Matched on: index
Any duplicates on match values: No
Absolute Tolerance: 0
Relative Tolerance: 0
Number of rows in common: 60,781
Number of rows in df1 but not in df2: 0
Number of rows in df2 but not in df1: 121

Number of rows with some compared columns unequal: 29,918
Number of rows with all compared columns equal: 30,863

Column Comparison
-----------------

Number of columns compared with some values unequal: 10
Number of columns compared with all values equal: 24
Total number of values which compare unequal: 106,528

Columns with Unequal Values or Types
------------------------------------

   Column              df1 dtype  df2 dtype  # Unequal  Max Diff  # Null Diff
4  concurrrent_sti     object     object          1931         0         1903
7  epilink             object     object          6307         0         6032
2  gender              object     object          5624         0         3900
6  health_worker       object     object         11451         0        10855
3  hospitalised        object     object         12954         0         4736
9  immunosuppresssion  object     object         11433         0        11433
5  outcome             object     object         11623         0        11562
0  pregnant            object     object         10823         0         9613
1  smallpox_vaccine    object     object         18219         0        12096
8  symp_asy            object     object         16163         0        12011

Sample Rows with Unequal Values
-------------------------------

recordid     reporting_country         pregnant (df1)  pregnant (df2)
USA00004852  UNITED STATES OF AMERICA  NO              NaN
USA00024251  UNITED STATES OF AMERICA  NO              NaN
USA00007332  UNITED STATES OF AMERICA  UNK             NaN
USA00022590  UNITED STATES OF AMERICA  NaN             NO
USA00020585  UNITED STATES OF AMERICA  UNK             NaN
USA00021995  UNITED STATES OF AMERICA  UNK             NaN
USA00006143  UNITED STATES OF AMERICA  NaN             UNK
USA00010857  UNITED STATES OF AMERICA  NO              NaN
USA00010511  UNITED STATES OF AMERICA  UNK             NO
USA00025421  UNITED STATES OF AMERICA  NaN             NO

recordid     reporting_country         smallpox_vaccine (df1)  smallpox_vaccine (df2)
USA00006686  UNITED STATES OF AMERICA  NaN                     PREV
USA00011187  UNITED STATES OF AMERICA  NO                      NaN
USA00024195  UNITED STATES OF AMERICA  PREV                    UNK
USA00014679  UNITED STATES OF AMERICA  NaN                     NO
USA00022969  UNITED STATES OF AMERICA  NaN                     NO
USA00016189  UNITED STATES OF AMERICA  UNK                     NaN
USA00025608  UNITED STATES OF AMERICA  UNK                     NO
USA00026250  UNITED STATES OF AMERICA  UNK                     NaN
USA00004005  UNITED STATES OF AMERICA  PREV                    NO
USA00031120  UNITED STATES OF AMERICA  NO                      UNK

recordid     reporting_country         gender (df1)  gender (df2)
USA00026011  UNITED STATES OF AMERICA  MALE          NaN
USA00000282  UNITED STATES OF AMERICA  NaN           MALE
USA00023145  UNITED STATES OF AMERICA  MALE          NaN
USA00027664  UNITED STATES OF AMERICA  NaN           MALE
USA00029697  UNITED STATES OF AMERICA  MALE          NaN
USA00015699  UNITED STATES OF AMERICA  MALE          NaN
USA00009337  UNITED STATES OF AMERICA  MALE          NaN
USA00015274  UNITED STATES OF AMERICA  NaN           MALE
USA00002894  UNITED STATES OF AMERICA  FEMALE        MALE
USA00018987  UNITED STATES OF AMERICA  MALE          FEMALE

recordid     reporting_country         hospitalised (df1)  hospitalised (df2)
USA00003772  UNITED STATES OF AMERICA  UNK                 NO
USA00025913  UNITED STATES OF AMERICA  NO                  YUNK
USA00002936  UNITED STATES OF AMERICA  NO                  NaN
USA00025904  UNITED STATES OF AMERICA  UNK                 NO
USA00008213  UNITED STATES OF AMERICA  NaN                 NO
USA00004444  UNITED STATES OF AMERICA  NO                  UNK
USA00022213  UNITED STATES OF AMERICA  NO                  NaN
USA00028219  UNITED STATES OF AMERICA  NO                  NaN
USA00000436  UNITED STATES OF AMERICA  NO                  UNK
USA00029057  UNITED STATES OF AMERICA  UNK                 NO

recordid     reporting_country         concurrrent_sti (df1)  concurrrent_sti (df2)
USA00002225  UNITED STATES OF AMERICA  CHLAM                  NaN
USA00002849  UNITED STATES OF AMERICA  NaN                    SYPH
USA00023642  UNITED STATES OF AMERICA  CHLAM                  NaN
USA00019374  UNITED STATES OF AMERICA  SYPH                   NaN
USA00005422  UNITED STATES OF AMERICA  NaN                    SYPH
USA00025966  UNITED STATES OF AMERICA  SYPH                   NaN
USA00026383  UNITED STATES OF AMERICA  CHLAM                  NaN
USA00000644  UNITED STATES OF AMERICA  NaN                    SYPH
USA00016942  UNITED STATES OF AMERICA  NaN                    CHLAM
USA00000890  UNITED STATES OF AMERICA  CHLAM                  NaN

recordid     reporting_country         outcome (df1)  outcome (df2)
USA00026027  UNITED STATES OF AMERICA  NaN            A
USA00024354  UNITED STATES OF AMERICA  NaN            A
USA00020791  UNITED STATES OF AMERICA  A              NaN
USA00017696  UNITED STATES OF AMERICA  NaN            A
USA00031977  UNITED STATES OF AMERICA  A              NaN
USA00023663  UNITED STATES OF AMERICA  NaN            A
USA00006625  UNITED STATES OF AMERICA  A              NaN
USA00024830  UNITED STATES OF AMERICA  A              NaN
USA00031390  UNITED STATES OF AMERICA  A              NaN
USA00031272  UNITED STATES OF AMERICA  NaN            A

recordid     reporting_country         health_worker (df1)  health_worker (df2)
USA00004320  UNITED STATES OF AMERICA  NO                   NaN
USA00004152  UNITED STATES OF AMERICA  NaN                  NO
USA00003796  UNITED STATES OF AMERICA  NaN                  NO
USA00002277  UNITED STATES OF AMERICA  NO                   YES
USA00011442  UNITED STATES OF AMERICA  NaN                  NO
USA00021830  UNITED STATES OF AMERICA  NaN                  NO
USA00027248  UNITED STATES OF AMERICA  NaN                  NO
USA00028242  UNITED STATES OF AMERICA  NaN                  NO
USA00026902  UNITED STATES OF AMERICA  NO                   NaN
USA00003631  UNITED STATES OF AMERICA  NO                   NaN

recordid     reporting_country         epilink (df1)  epilink (df2)
USA00013767  UNITED STATES OF AMERICA  NaN            YES
USA00008010  UNITED STATES OF AMERICA  NaN            YES
USA00015851  UNITED STATES OF AMERICA  YES            NO
USA00021169  UNITED STATES OF AMERICA  NO             YES
USA00023355  UNITED STATES OF AMERICA  NaN            NO
USA00004763  UNITED STATES OF AMERICA  NaN            YES
USA00006817  UNITED STATES OF AMERICA  NaN            NO
USA00016246  UNITED STATES OF AMERICA  NaN            YES
USA00029508  UNITED STATES OF AMERICA  NaN            NO
USA00008090  UNITED STATES OF AMERICA  NaN            NO

recordid     reporting_country         symp_asy (df1)  symp_asy (df2)
USA00023809  UNITED STATES OF AMERICA  YES             NaN
USA00028942  UNITED STATES OF AMERICA  YES             NaN
USA00025500  UNITED STATES OF AMERICA  NaN             YES
USA00003275  UNITED STATES OF AMERICA  NO              YES
USA00010983  UNITED STATES OF AMERICA  YES             NaN
USA00015542  UNITED STATES OF AMERICA  YES             NaN
USA00021150  UNITED STATES OF AMERICA  YES             NaN
USA00030958  UNITED STATES OF AMERICA  NaN             NO
USA00019673  UNITED STATES OF AMERICA  YES             NaN
USA00024191  UNITED STATES OF AMERICA  NaN             YES

recordid     reporting_country         immunosuppresssion (df1)  immunosuppresssion (df2)
USA00003375  UNITED STATES OF AMERICA  NaN                       NO
USA00014071  UNITED STATES OF AMERICA  NaN                       NO
USA00005709  UNITED STATES OF AMERICA  NO                        NaN
USA00013375  UNITED STATES OF AMERICA  NO                        NaN
USA00002935  UNITED STATES OF AMERICA  NaN                       NO
USA00030110  UNITED STATES OF AMERICA  NaN                       NO
USA00006157  UNITED STATES OF AMERICA  NO                        NaN
USA00012176  UNITED STATES OF AMERICA  NaN                       NO
USA00001346  UNITED STATES OF AMERICA  NO                        NaN
USA00022230  UNITED STATES OF AMERICA  NO                        NaN

Sample Rows Only in df2 (First 10 Columns)
------------------------------------------

recordid     reporting_country         pregnant  case_class  smallpox_vaccine  gender  sexual_orientation  clade  hospitalised  concurrrent_sti  icu  outcome
USA00032035  UNITED STATES OF AMERICA  NaN       NaN         PREV              MALE    NaN                 NaN    YUNK          NaN              NaN  NaN
USA00032076  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NO            NaN              NaN  NaN
USA00032124  UNITED STATES OF AMERICA  NO        NaN         UNK               MALE    NaN                 NaN    NaN           NaN              NaN  A
USA00032087  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NO            NaN              NaN  NaN
USA00032101  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NO            NaN              NaN  NaN
USA00032109  UNITED STATES OF AMERICA  NaN       NaN         NO                MALE    NaN                 NaN    NO            NaN              NaN  A
USA00032049  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NO            NaN              NaN  NaN
USA00032116  UNITED STATES OF AMERICA  NO        NaN         NO                MALE    NaN                 NaN    NO            NaN              NaN  A
USA00032090  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NO            NaN              NaN  NaN
USA00032056  UNITED STATES OF AMERICA  NaN       NaN         NaN               MALE    NaN                 NaN    NaN           NaN              NaN  NaN
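
Consistency check (illustrative)
--------------------------------

The summary figures in the report above can be cross-checked against each other with a few lines of arithmetic. The sketch below is not part of the DataComPy output. It copies the counts from the report and tests relations that are assumed here to hold between them (for example, that the per-column "# Unequal" counts sum to the reported total of unequal values). The variable names are hypothetical and chosen only for this example.

```python
# Figures copied by hand from the report above; variable names are illustrative.
rows_df1, rows_df2 = 60_781, 60_902
rows_in_common = 60_781
rows_only_df1, rows_only_df2 = 0, 121
rows_some_unequal, rows_all_equal = 29_918, 30_863
cols_in_common, cols_some_unequal, cols_all_equal = 34, 10, 24
total_unequal_values = 106_528

# "# Unequal" per column, from "Columns with Unequal Values or Types".
unequal_by_column = {
    "concurrrent_sti": 1931,
    "epilink": 6307,
    "gender": 5624,
    "health_worker": 11451,
    "hospitalised": 12954,
    "immunosuppresssion": 11433,
    "outcome": 11623,
    "pregnant": 10823,
    "smallpox_vaccine": 18219,
    "symp_asy": 16163,
}

# Assumed relations between the summary counts (an interpretation of the
# report, not something the report itself states).
checks = {
    "df1 rows = common + only in df1":
        rows_df1 == rows_in_common + rows_only_df1,
    "df2 rows = common + only in df2":
        rows_df2 == rows_in_common + rows_only_df2,
    "common rows = some unequal + all equal":
        rows_in_common == rows_some_unequal + rows_all_equal,
    "common columns = some unequal + all equal":
        cols_in_common == cols_some_unequal + cols_all_equal,
    "one table entry per unequal column":
        len(unequal_by_column) == cols_some_unequal,
    "per-column unequal counts sum to the total":
        sum(unequal_by_column.values()) == total_unequal_values,
}

for label, ok in checks.items():
    print(f"{'OK ' if ok else 'BAD'} {label}")
```

A check of this kind only confirms that the report is internally consistent. It says nothing about why roughly half of the common rows changed between the two daily extracts.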