Before /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_02_26_04_53_08.csv
After /home/pythonscripts/mpxdatacheck/LINELIST_PAHO2024_02_27_04_53_07.csv

DataComPy Comparison
--------------------

DataFrame Summary
-----------------

  DataFrame  Columns   Rows
0       df1       34  60557
1       df2       34  60781

Column Summary
--------------

Number of columns in common: 34
Number of columns in df1 but not in df2: 0
Number of columns in df2 but not in df1: 0

Row Summary
-----------

Matched on: index
Any duplicates on match values: No
Absolute Tolerance: 0
Relative Tolerance: 0
Number of rows in common: 60,557
Number of rows in df1 but not in df2: 0
Number of rows in df2 but not in df1: 224

Number of rows with some compared columns unequal: 30,921
Number of rows with all compared columns equal: 29,636

Column Comparison
-----------------

Number of columns compared with some values unequal: 10
Number of columns compared with all values equal: 24
Total number of values which compare unequal: 124,519

Columns with Unequal Values or Types
------------------------------------

               Column df1 dtype df2 dtype  # Unequal  Max Diff  # Null Diff
4     concurrrent_sti    object    object       1946         0         1924
7             epilink    object    object       6811         0         6591
2              gender    object    object       5902         0         4235
6       health_worker    object    object      14204         0        13713
3        hospitalised    object    object      14034         0         5315
9  immunosuppresssion    object    object      13165         0        13165
5             outcome    object    object      14630         0        14562
0            pregnant    object    object      14955         0        13317
1    smallpox_vaccine    object    object      19998         0        14261
8            symp_asy    object    object      18874         0        13764

Sample Rows with Unequal Values
-------------------------------

                                     pregnant (df1) pregnant (df2)
recordid    reporting_country
USA00017160 UNITED STATES OF AMERICA            NaN            UNK
USA00011201 UNITED STATES OF AMERICA            NaN            UNK
USA00014014 UNITED STATES OF AMERICA            UNK             NO
USA00008266 UNITED STATES OF AMERICA            NaN             NO
USA00030100 UNITED STATES OF AMERICA            NaN             NO
USA00027574 UNITED STATES OF AMERICA            UNK            NaN
USA00014384 UNITED STATES OF AMERICA            NaN            UNK
USA00007895 UNITED STATES OF AMERICA             NO            NaN
USA00029008 UNITED STATES OF AMERICA            UNK            NaN
USA00003597 UNITED STATES OF AMERICA             NO            NaN

                                     smallpox_vaccine (df1) smallpox_vaccine (df2)
recordid    reporting_country
USA00029237 UNITED STATES OF AMERICA                    NaN                     NO
USA00014780 UNITED STATES OF AMERICA                     NO                   PREV
USA00012202 UNITED STATES OF AMERICA                    UNK                     NO
USA00029733 UNITED STATES OF AMERICA                   PREV                    NaN
USA00020614 UNITED STATES OF AMERICA                    NaN                     NO
USA00019245 UNITED STATES OF AMERICA                    UNK                    NaN
USA00018639 UNITED STATES OF AMERICA                     NO                    NaN
USA00000384 UNITED STATES OF AMERICA                     NO                   PREV
USA00016312 UNITED STATES OF AMERICA                     NO                    NaN
USA00021270 UNITED STATES OF AMERICA                     NO                    NaN

                                     gender (df1) gender (df2)
recordid    reporting_country
USA00028596 UNITED STATES OF AMERICA       FEMALE         MALE
USA00004261 UNITED STATES OF AMERICA       FEMALE         MALE
USA00008562 UNITED STATES OF AMERICA          NaN         MALE
USA00000440 UNITED STATES OF AMERICA         MALE       FEMALE
USA00008955 UNITED STATES OF AMERICA         MALE          NaN
USA00013835 UNITED STATES OF AMERICA         MALE          NaN
USA00027195 UNITED STATES OF AMERICA         MALE          NaN
USA00016010 UNITED STATES OF AMERICA          NaN       FEMALE
USA00023858 UNITED STATES OF AMERICA         MALE          NaN
USA00012375 UNITED STATES OF AMERICA          NaN         MALE

                                     hospitalised (df1) hospitalised (df2)
recordid    reporting_country
USA00022323 UNITED STATES OF AMERICA                NaN                 NO
USA00012709 UNITED STATES OF AMERICA                NaN                 NO
USA00020810 UNITED STATES OF AMERICA                NaN                 NO
USA00019062 UNITED STATES OF AMERICA                UNK                 NO
USA00030353 UNITED STATES OF AMERICA                UNK                 NO
USA00009692 UNITED STATES OF AMERICA                NaN                 NO
USA00004108 UNITED STATES OF AMERICA                UNK               YUNK
USA00019961 UNITED STATES OF AMERICA               YUNK                 NO
USA00015206 UNITED STATES OF AMERICA                 NO                UNK
USA00019347 UNITED STATES OF AMERICA                NaN                 NO

                                     concurrrent_sti (df1) concurrrent_sti (df2)
recordid    reporting_country
USA00031738 UNITED STATES OF AMERICA                   NaN                  SYPH
USA00018367 UNITED STATES OF AMERICA                   NaN                 CHLAM
USA00005113 UNITED STATES OF AMERICA                   NaN                 CHLAM
USA00012450 UNITED STATES OF AMERICA                  SYPH                   NaN
USA00031752 UNITED STATES OF AMERICA                  SYPH                   NaN
USA00008701 UNITED STATES OF AMERICA                   NaN                 CHLAM
USA00004839 UNITED STATES OF AMERICA                   NaN                 CHLAM
USA00024633 UNITED STATES OF AMERICA                   NaN                 CHLAM
USA00027913 UNITED STATES OF AMERICA                   NaN                  SYPH
USA00016852 UNITED STATES OF AMERICA                   NaN                  SYPH

                                     outcome (df1) outcome (df2)
recordid    reporting_country
USA00030215 UNITED STATES OF AMERICA           NaN             A
USA00016012 UNITED STATES OF AMERICA           NaN             A
USA00010287 UNITED STATES OF AMERICA             A           NaN
USA00008078 UNITED STATES OF AMERICA             A           NaN
USA00003038 UNITED STATES OF AMERICA             A           NaN
USA00023389 UNITED STATES OF AMERICA           NaN             A
USA00029718 UNITED STATES OF AMERICA           NaN             A
USA00013723 UNITED STATES OF AMERICA             A           NaN
USA00021720 UNITED STATES OF AMERICA           NaN             A
USA00024963 UNITED STATES OF AMERICA           NaN             A

                                     health_worker (df1) health_worker (df2)
recordid    reporting_country
USA00017165 UNITED STATES OF AMERICA                  NO                 NaN
USA00024054 UNITED STATES OF AMERICA                  NO                 NaN
USA00026853 UNITED STATES OF AMERICA                  NO                 NaN
USA00018509 UNITED STATES OF AMERICA                  NO                 NaN
USA00017962 UNITED STATES OF AMERICA                 NaN                  NO
USA00026737 UNITED STATES OF AMERICA                  NO                 NaN
USA00022454 UNITED STATES OF AMERICA                 NaN                  NO
USA00000504 UNITED STATES OF AMERICA                 NaN                  NO
USA00003584 UNITED STATES OF AMERICA                  NO                 NaN
USA00026127 UNITED STATES OF AMERICA                 NaN                  NO

                                     epilink (df1) epilink (df2)
recordid    reporting_country
USA00024467 UNITED STATES OF AMERICA           NaN           YES
USA00021823 UNITED STATES OF AMERICA           YES           NaN
USA00003026 UNITED STATES OF AMERICA           YES           NaN
USA00018226 UNITED STATES OF AMERICA           NaN           YES
USA00024981 UNITED STATES OF AMERICA           YES           NaN
USA00012553 UNITED STATES OF AMERICA            NO           NaN
USA00027880 UNITED STATES OF AMERICA           NaN           YES
USA00031772 UNITED STATES OF AMERICA           YES           NaN
USA00024384 UNITED STATES OF AMERICA           YES           NaN
USA00010428 UNITED STATES OF AMERICA           NaN           YES

                                     symp_asy (df1) symp_asy (df2)
recordid    reporting_country
USA00011877 UNITED STATES OF AMERICA            YES            NaN
USA00015109 UNITED STATES OF AMERICA            YES             NO
USA00031112 UNITED STATES OF AMERICA            YES             NO
USA00004817 UNITED STATES OF AMERICA            NaN            YES
USA00030554 UNITED STATES OF AMERICA            YES             NO
USA00028111 UNITED STATES OF AMERICA            YES            NaN
USA00007508 UNITED STATES OF AMERICA            YES            NaN
USA00023791 UNITED STATES OF AMERICA             NO            YES
USA00027680 UNITED STATES OF AMERICA            YES             NO
USA00027122 UNITED STATES OF AMERICA            YES            NaN

                                     immunosuppresssion (df1) immunosuppresssion (df2)
recordid    reporting_country
USA00017410 UNITED STATES OF AMERICA                       NO                      NaN
USA00019059 UNITED STATES OF AMERICA                       NO                      NaN
USA00010916 UNITED STATES OF AMERICA                      NaN                       NO
USA00029122 UNITED STATES OF AMERICA                      NaN                       NO
USA00030005 UNITED STATES OF AMERICA                       NO                      NaN
USA00018074 UNITED STATES OF AMERICA                       NO                      NaN
USA00001700 UNITED STATES OF AMERICA                      NaN                       NO
USA00023201 UNITED STATES OF AMERICA                      NaN                       NO
USA00015153 UNITED STATES OF AMERICA                       NO                      NaN
USA00016272 UNITED STATES OF AMERICA                       NO                      NaN

Sample Rows Only in df2 (First 10 Columns)
------------------------------------------

                                     pregnant case_class smallpox_vaccine gender sexual_orientation clade hospitalised concurrrent_sti icu outcome
recordid    reporting_country
USA00031982 UNITED STATES OF AMERICA      NaN        NaN              NaN   MALE                NaN   NaN           NO             NaN NaN       A
USA00031892 UNITED STATES OF AMERICA       NO        NaN               NO   MALE                NaN   NaN           NO             NaN NaN     NaN
USA00031938 UNITED STATES OF AMERICA      NaN        NaN              UNK   MALE                NaN   NaN           NO             NaN NaN     NaN
USA00031881 UNITED STATES OF AMERICA      NaN        NaN              NaN   MALE                NaN   NaN           NO             NaN NaN     NaN
USA00031820 UNITED STATES OF AMERICA      NaN        NaN               NO   MALE                NaN   NaN           NO             NaN NaN     NaN
USA00031999 UNITED STATES OF AMERICA      NaN        NaN              NaN   MALE                NaN   NaN         YUNK             NaN NaN       A
USA00031952 UNITED STATES OF AMERICA       NO        NaN               NO   MALE                NaN   NaN           NO             NaN NaN       A
USA00031831 UNITED STATES OF AMERICA      NaN        NaN              UNK   MALE                NaN   NaN           NO             NaN NaN       A
USA00031786 UNITED STATES OF AMERICA      NaN        NaN               NO   MALE                NaN   NaN           NO             NaN NaN     NaN
USA00031922 UNITED STATES OF AMERICA      UNK        NaN              NaN   MALE                NaN   NaN           NO             NaN NaN       A
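
The summary figures in the report above can be cross-checked against each other with simple arithmetic. The following is a minimal sketch in plain Python, not part of the original job: the numbers are copied from the report, while the variable names and the `check_report_totals` helper are hypothetical and introduced here only for illustration.

```python
# Figures copied from the report; all names below are illustrative.
rows_df1, rows_df2 = 60557, 60781
rows_common, rows_only_df1, rows_only_df2 = 60557, 0, 224
rows_some_unequal, rows_all_equal = 30921, 29636
cols_common, cols_some_unequal, cols_all_equal = 34, 10, 24
total_unequal_values = 124519

# "# Unequal" per column, from "Columns with Unequal Values or Types".
unequal_per_column = {
    "concurrrent_sti": 1946,
    "epilink": 6811,
    "gender": 5902,
    "health_worker": 14204,
    "hospitalised": 14034,
    "immunosuppresssion": 13165,
    "outcome": 14630,
    "pregnant": 14955,
    "smallpox_vaccine": 19998,
    "symp_asy": 18874,
}


def check_report_totals() -> dict:
    """Return a mapping of check name to True/False for each consistency check."""
    return {
        # Rows in df1 = common rows + rows found only in df1.
        "df1_rows": rows_common + rows_only_df1 == rows_df1,
        # Rows in df2 = common rows + rows found only in df2.
        "df2_rows": rows_common + rows_only_df2 == rows_df2,
        # Every common row is either fully equal or has some unequal column.
        "common_rows_split": rows_some_unequal + rows_all_equal == rows_common,
        # Every common column is either fully equal or has some unequal value.
        "common_cols_split": cols_some_unequal + cols_all_equal == cols_common,
        # The per-column unequal counts add up to the reported total.
        "unequal_values_total": sum(unequal_per_column.values()) == total_unequal_values,
        # One entry per column reported as having unequal values.
        "unequal_column_count": len(unequal_per_column) == cols_some_unequal,
    }


if __name__ == "__main__":
    for name, ok in check_report_totals().items():
        print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

A check like this only confirms that the report is internally consistent; it says nothing about why the values differ between the two files.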