; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016552 (gene) of Snake gourd v1 genome

Gene IDTan0016552
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA cross-link repair 1A protein
Genome locationLG01:100168284..100180427
RNA-Seq ExpressionTan0016552
SyntenyTan0016552
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0016746 - transferase activity, transferring acyl groups (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR001660 - Sterile alpha motif domain
IPR011084 - DNA repair metallo-beta-lactamase
IPR013761 - Sterile alpha motif/pointed domain superfamily
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591500.1 DNA cross-link repair 1A protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.47Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPP+  A   RH  Q+SAS  PQFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK SAPTATG+ NIPS  HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+GA  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH ANS  E+FDGD DFSGATDEC+GSKVKGGYL NSIESRL+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSDEQRLVHTNDCIDK+D QAQ AALTPD+KQTSGPQQS D SR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDEDLNN+GITALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+G D  EGS +GTN+ P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF
         NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQGLTKSF
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF

Query:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD
         HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TC IHTLILDTTYCD
Subjt:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD

Query:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
        PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
Subjt:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL

Query:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        KH+SN YA+RFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
Subjt:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

KAG7024385.1 DNA cross-link repair 1A protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.54Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPP+  A   RH  Q+SAS  PQFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK S PTATG+ NIPSI HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+GA  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH+ANS  E+FDGD DFSGATDEC+GSK+KGGYL NSIESRL+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSD+QRLVHTNDCIDK+D QAQ  A TPD+KQ SGPQQS DNSR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDE-----DLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSING
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDE     DLNN+GITALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+GSD  EGS +G
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDE-----DLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSING

Query:  TNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQG
        TN+ P NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQG
Subjt:  TNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQG

Query:  LTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILD
        LTKSF HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TC IHTLILD
Subjt:  LTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILD

Query:  TTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLA
        TTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLA
Subjt:  TTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLA

Query:  SFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        SFKRLKH+SN YA+RFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
Subjt:  SFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

XP_022936316.1 DNA cross-link repair 1A protein [Cucurbita moschata]0.0e+0087.59Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPP+  A   RH  Q+SASQ PQFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK SAPTATG+ NIPSI HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+GA  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH+ANS  E+FDGD DFSGATDEC+GSK+KGGYL NSIESRL+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSDEQRLVHTNDCIDK+D QAQ AALTPD+KQTSGPQQS D SR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDEDLNN+GITALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+GSD  E S NGTN+ P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF
         NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQGLTKSF
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF

Query:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD
         HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TCRIHTLILDTTYCD
Subjt:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD

Query:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
        PQ DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVL KKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
Subjt:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL

Query:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        KH+SN YA+RFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
Subjt:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

XP_022976979.1 DNA cross-link repair 1A protein isoform X1 [Cucurbita maxima]0.0e+0087.47Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPPI  A   RH  Q+SASQ  QFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK SAPT TG+ NIPSI HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+ A  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH+ANS  E+F+GD DFSGATDEC+GSKVKGGYL NSIES+L+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSDEQRLVHTNDCIDK+D QAQ AALTPD+KQTSGPQQS DNSR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDEDL+N+GI+ALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+GSD  EGS NGTN+ P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF
         NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN KP  VPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQGLTKSF
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF

Query:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD
         HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TCRIHTLILDTTYCD
Subjt:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD

Query:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
        PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
Subjt:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL

Query:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        KH+SN YANRFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSAR MISLLSS
Subjt:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

XP_038897129.1 DNA cross-link repair 1A protein isoform X1 [Benincasa hispida]0.0e+0088.8Show/hide
Query:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDRHN
        P+  AAAHR R Q+SASQ PQFH PTNAADDDDD LPSTQTVLSSRSS SQK  ATSDLSLHIRAPK+PK SAPTATGKEN+PSI H +VGF+PNF RHN
Subjt:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDRHN

Query:  GAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASSDD
        GA  LD GE F +S+  LGCSLDLIQPSIVGCSYETH+ NSG E+ DGDD FSGA DEC+GSK KGGYL NSIESRL+NSRVDCDVGVSGSGVDK S+D 
Subjt:  GAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASSDD

Query:  FESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSRFS
        FESDTELDLLLNLHS+LDEE  I GVGF T+ T F MDE+ LIQCPLCGVDISDLSDEQRLVHTN+CIDK+D QAQNAALTPD+KQT GPQQSGDNSRFS
Subjt:  FESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSRFS

Query:  TVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAPAN
         VL+WLH LGLS+Y+DIFVREEIDWD LQWLTDEDLNNMGITALGPRRKITHALSELRKES+AVETCTNSHAPSG GQQSN+GSD REGSI GTN+ PAN
Subjt:  TVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAPAN

Query:  KLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGM
        KLITDYFPGFA NKKN C+ISSGQKDVGKK+PDSL KGKTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDC HWFL+HFHMDHYQGLTKSFCHGM
Subjt:  KLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGM

Query:  IYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYD
        IYCS ITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMG LSVF+TCRIHTLILDTTYCDPQYD
Subjt:  IYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYD

Query:  FPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVS
        FPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVS
Subjt:  FPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVS

Query:  NHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        N YANRF+LIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSF+ELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
Subjt:  NHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

TrEMBL top hitse value%identityAlignment
A0A5D3D910 DNA cross-link repair protein SNM10.0e+0084.01Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MP    AA  RH          QF  PTN   D+DD LPSTQT+LS+R   SQK  ATSDLSLHI   K+P+ S P ATGKEN+PSI H +VGFK    +
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
         NGA  LD  E FG+S+IDLGCSLDLIQPSIVGCSYETH+ NSG E+ DGDDDFSGATDEC+GSK KGGYL NSIESRL+NSRVDCDVGVSGSG DK S 
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        D FESDTELDLLLNLHS+LDEED I G GF  + T F +DE+ LIQCPLCGVDISDLSDEQRLVHTNDCIDK D QAQNAALT D+KQTSG +QS +NS+
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH L LSKYED+FVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESS VET TNS A S  GQQSN+GSD REGS NGTNK P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCH
         NKLITDYFPGFA NK N C ISSGQKDVGKK+P SL KGKTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDC HWFL+HFHMDHYQGLTKSFCH
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCH

Query:  GMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQ
        GMIYCS ITAKLVNMKIGIPWERLQVLPLDQKINIAG+DVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMG LSVF+TCRIHTLILDTTYCDPQ
Subjt:  GMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQ

Query:  YDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKH
        YDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTA+KLRILKCLGFSAEDM+WFTVNERESHIHVVPLWTLASFKRLKH
Subjt:  YDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKH

Query:  VSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        VS  YA+RFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAM SLLSS
Subjt:  VSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

A0A6J1CCT1 DNA cross-link repair 1A protein-like isoform X10.0e+0082.86Show/hide
Query:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDD---DLLPSTQTVLSSRSSTSQKLFATSDL-SLHIRAPKKPKCSAPTATGKENIPSIAH---------
        P+  +AA RH  Q SASQ  QFH PTNA DDDD   D LPSTQTVLSSRSS+SQK  ATS+L SLH RAPK+PK S PTA GKENIPSI +         
Subjt:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDD---DLLPSTQTVLSSRSSTSQKLFATSDL-SLHIRAPKKPKCSAPTATGKENIPSIAH---------

Query:  ------CNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSR
                V    NFD  NGA  LDG E FG SEIDLGCSLD I+PSIVGCSY+ H+ANSG E+FDG + FSGA DEC+GSKVKGGYL NSIESRL+NSR
Subjt:  ------CNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSR

Query:  VDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQN-AAL
        VD DVG+SG GVDK +SDDFESDTELDLLLNLHSQ+DEED + GV F T+ET FP+DE+ LIQCPLCGVDISDLSD++R VHTNDCID++D QAQN  AL
Subjt:  VDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQN-AAL

Query:  TPDRKQTSGPQQSGDNSRFSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQS
        T DRKQTS PQQSGDNSR STVLKWLHGLGLSKYEDIF+REEIDW  LQWLTDEDL  MGITALGPRRKITHALSELRK+S AVETCTNSHAPS IGQQS
Subjt:  TPDRKQTSGPQQSGDNSRFSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQS

Query:  NHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWF
        N GSD REGSING NK PANKLITDYFPGFA +KKN+CSI SGQKDV KKL DS  +GKT KRNV+N K  NVPVWSCIPGTPFRVDAFRHLRGDC HWF
Subjt:  NHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWF

Query:  LSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVF
        L+HFHMDHYQGLTKSF HGMIYCS ITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCE++G L VF
Subjt:  LSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVF

Query:  KTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERES
        +TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFS EDMQWFTVNE ES
Subjt:  KTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERES

Query:  HIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAM
        HIHVVPLWTLASFKRLKHVSN YANRFSL+VAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSEL+DFVKLVSP NIIPSVNNHGPDSARAM
Subjt:  HIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAM

Query:  ISLLS
        ISLLS
Subjt:  ISLLS

A0A6J1CG29 DNA cross-link repair 1A protein-like isoform X20.0e+0082.96Show/hide
Query:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDD---DLLPSTQTVLSSRSSTSQKLFATSDL-SLHIRAPKKPKCSAPTATGKENIPSIAH---------
        P+  +AA RH  Q SASQ  QFH PTNA DDDD   D LPSTQTVLSSRSS+SQK  ATS+L SLH RAPK+PK S PTA GKENIPSI +         
Subjt:  PIRTAAAHRHRDQTSASQEPQFHSPTNAADDDD---DLLPSTQTVLSSRSSTSQKLFATSDL-SLHIRAPKKPKCSAPTATGKENIPSIAH---------

Query:  ------CNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSR
                V    NFD  NGA  LDG E FG SEIDLGCSLD I+PSIVGCSY+ H+ANSG E+FDG + FSGA DEC+GSKVKGGYL NSIESRL+NSR
Subjt:  ------CNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSR

Query:  VDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALT
        VD DVG+SG GVDK +SDDFESDTELDLLLNLHSQ+DEED + GV F T+ET FP+DE+ LIQCPLCGVDISDLSD++R VHTNDCID++D QAQN ALT
Subjt:  VDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALT

Query:  PDRKQTSGPQQSGDNSRFSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSN
         DRKQTS PQQSGDNSR STVLKWLHGLGLSKYEDIF+REEIDW  LQWLTDEDL  MGITALGPRRKITHALSELRK+S AVETCTNSHAPS IGQQSN
Subjt:  PDRKQTSGPQQSGDNSRFSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSN

Query:  HGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFL
         GSD REGSING NK PANKLITDYFPGFA +KKN+CSI SGQKDV KKL DS  +GKT KRNV+N K  NVPVWSCIPGTPFRVDAFRHLRGDC HWFL
Subjt:  HGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFL

Query:  SHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFK
        +HFHMDHYQGLTKSF HGMIYCS ITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCE++G L VF+
Subjt:  SHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFK

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFS EDMQWFTVNE ESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESH

Query:  IHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKHVSN YANRFSL+VAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSEL+DFVKLVSP NIIPSVNNHGPDSARAMI
Subjt:  IHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMI

Query:  SLLS
        SLLS
Subjt:  SLLS

A0A6J1F758 DNA cross-link repair 1A protein0.0e+0087.59Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPP+  A   RH  Q+SASQ PQFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK SAPTATG+ NIPSI HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+GA  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH+ANS  E+FDGD DFSGATDEC+GSK+KGGYL NSIESRL+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSDEQRLVHTNDCIDK+D QAQ AALTPD+KQTSGPQQS D SR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDEDLNN+GITALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+GSD  E S NGTN+ P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF
         NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN K  NVPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQGLTKSF
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF

Query:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD
         HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TCRIHTLILDTTYCD
Subjt:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD

Query:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
        PQ DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVL KKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
Subjt:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL

Query:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        KH+SN YA+RFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
Subjt:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

A0A6J1IQ65 DNA cross-link repair 1A protein isoform X10.0e+0087.47Show/hide
Query:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR
        MPPI  A   RH  Q+SASQ  QFH+PTNA  DDDDLLPSTQTVLSSRSSTS K  ATSDLSLHIRA K+PK SAPT TG+ NIPSI HC+VGFKPNFDR
Subjt:  MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDR

Query:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS
        H+ A  LD GE FGSSE+DLGCSLDLIQPS+VGCSYETH+ANS  E+F+GD DFSGATDEC+GSKVKGGYL NSIES+L+NSRVDCDVGVSGSGVDK SS
Subjt:  HNGAPDLDGGETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASS

Query:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR
        DDFESD ELDLLLNLHSQLDEEDRI GVGF T+E+YFP+DED LIQCPLCGVDISDLSDEQRLVHTNDCIDK+D QAQ AALTPD+KQTSGPQQS DNSR
Subjt:  DDFESDTELDLLLNLHSQLDEEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSR

Query:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP
        FSTVLKWLH LGLSKYEDIFVREEIDW+TLQWLTDEDL+N+GI+ALGPRRKIT ALSELRKESSAVETCTN+HAPSG GQ SN+GSD  EGS NGTN+ P
Subjt:  FSTVLKWLHGLGLSKYEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAP

Query:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF
         NKLITDYFPGFA NKKN CSIS+GQ+DVGKKLPDSL   K KTAKRNVRN KP  VPVWSCIPGTPFRVDAFRHLRGDCSHWFL+HFHMDHYQGLTKSF
Subjt:  ANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSL--PKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSF

Query:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD
         HGMIYCS ITAKLVNMK+GIPWERLQVLPL+QK+NIAG DVTCFDANHCPGSIIILFEPP+GKAVLHTGDFR+CEQMG LSVF+TCRIHTLILDTTYCD
Subjt:  CHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCD

Query:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
        PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL
Subjt:  PQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRL

Query:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS
        KH+SN YANRFSLIVAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSAR MISLLSS
Subjt:  KHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLLSS

SwissProt top hitse value%identityAlignment
Q38961 DNA cross-link repair protein SNM12.1e-6938.24Show/hide
Query:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI
        TD F  + + K +    +S   +  KK   +L K +    +  N  PR  P +  +PGTPF VDAFR+  ++G CS +FL+HFH DHY GLTK++ HG I
Subjt:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI

Query:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF
        YCS +T++L+ + + +    +  L LD +  I GI VT  +ANHCPG+ +I F   +G   LHTGDFR  +QM    +    R+H L LDTTYC+P+Y F
Subjt:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF

Query:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK
        P +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ RIL+  G+  +  +  + + + + +HV+P+ +L   +RL 
Subjt:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK

Query:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS
             Y  ++  ++AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL++FV+ + P  IIP+VNN    +   M S
Subjt:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS

Q5QJC4 DNA cross-link repair 1A protein3.0e-6841.24Show/hide
Query:  SSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIP
        S GQ+   K+  +S    + A++       +  P +  IPGT F VDAF++  + G C+ +FL+HFH DHY GLTK+F    +YC+ IT  LV  K+ + 
Subjt:  SSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIP

Query:  WERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQA
         + + VLP+D +  + GI V   DANHCPG+ +ILF  P+G A+LHTGDFR    M R       +IHTL LDTTYC P+Y FP Q+ VIQF ++ A + 
Subjt:  WERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQA

Query:  EAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTG
           NP+TL + G Y+IGKE++FL +A VL  K  ++  K + L+CL  SA      T+N   + +H++P+  + +FK L+   N ++  F  ++AF PTG
Subjt:  EAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTG

Query:  WALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVN
        W  S         + Q +G I  Y +PYSEHSS+ E+K FV+ + P  IIP+VN
Subjt:  WALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVN

Q6PJP8 DNA cross-link repair 1A protein6.2e-6636.98Show/hide
Query:  RRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSI-NGTNKAPANKLITDYFPGFAINKKN--TCSISSGQKDVGKKLPDSLPKGKTAKR
        +RK   +LS+L  ++S +     S   S    Q      R+  S+  G  +  ++ LI        ++K    T S   G +   KK+P+S         
Subjt:  RRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSI-NGTNKAPANKLITDYFPGFAINKKN--TCSISSGQKDVGKKLPDSLPKGKTAKR

Query:  NVRNAKPRNVPVWSCIPGTPFRVDAFRH-LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFD
        NV  ++ +  P +  IPGT F VDAF++ +   C+ +FL+HFH DHY GL+K F    +YCS IT  L+  K+ +  + +  LPLD +  + G+ V   D
Subjt:  NVRNAKPRNVPVWSCIPGTPFRVDAFRH-LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFD

Query:  ANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLE
        ANHCPG+++ILF  PNG  +LHTGDFR    M R S+    ++H L LDTTYC P+Y FP Q+ VI+F I+ A +A   NP  L + G Y+IGKE++FL 
Subjt:  ANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLE

Query:  VARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALS-KGKKKSPGRRWQQGTIIRY
        +A VL  KV ++  K + L+CL    E     T +   S +H++P+  + +FK L+        +++ I+AF PTGW  S K  + +      +G I  Y
Subjt:  VARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALS-KGKKKSPGRRWQQGTIIRY

Query:  EVPYSEHSSFSELKDFVKLVSPVNIIPSVN
         +PYSEHSS+ E+K FV+ + P  IIP+VN
Subjt:  EVPYSEHSSFSELKDFVKLVSPVNIIPSVN

Q86KS1 DNA cross-link repair 1 protein8.7e-6033.57Show/hide
Query:  NSHAPSGIGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQK---DVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFR
        N++  +     +N+ ++      N  NK    K    Y     IN  N  +    +K   D+ K   +   K     R     + +  P +  I GT F 
Subjt:  NSHAPSGIGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQK---DVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFR

Query:  VDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPP--------
        VD F++   D +H+FL+HFH DHY G+TK++  G IYC+  T KLV+ K+G+    +     ++ I I G+ V   D+NHCPGS +ILF  P        
Subjt:  VDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPP--------

Query:  -NGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAK
           +++LHTGDFR+ + M    + K   I  L LD TYCDPQY FP Q  +I+ V   ++ E  + +TLFL G Y IGKER+ LE+A+   K V+V+  K
Subjt:  -NGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAK

Query:  LRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDF
          IL CL  +  D+  FT NE  +    V + ++ S+  +  + +   N++  ++ F PTGW  +K       R    G    Y V YSEHSSF+EL+D 
Subjt:  LRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDF

Query:  VKLVSPVNIIPSVNNHGPDSARAMISLLS
        +    P  IIP+V+   P   + +++  S
Subjt:  VKLVSPVNIIPSVNNHGPDSARAMISLLS

Q9JIC3 DNA cross-link repair 1A protein2.0e-6436.32Show/hide
Query:  RRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVR
        +RK   +LS+L  E  A     + H+    G++  H   R + S N   + P  +       G  +N      +S  +  V +       +G+T + N+ 
Subjt:  RRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVR

Query:  NAKP-------RNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGID
         ++        R  P +  IPGT F VDAF++  + G C+ +FL+HFH DHY GL+K F    +YCS IT  L+  K+ +  + ++ LP+D +  +  + 
Subjt:  NAKP-------RNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGID

Query:  VTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKE
        V   DANHCPG+ +ILF+ PNG  +LHTGDFR    M R S     ++HTL LDTTYC P+Y FP Q+ VIQF I+ A +A   NP+ L + G Y IGKE
Subjt:  VTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKE

Query:  RLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQ-QG
        ++FL +A VL  KV ++  K + L+CL    E     T +  +S +H++P+  + +FK L+        ++  I+AF PTGW  S     +     Q +G
Subjt:  RLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQ-QG

Query:  TIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVN
         I  Y +PYSEHSS+ E+K FV+ + P  IIP+VN
Subjt:  TIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVN

Arabidopsis top hitse value%identityAlignment
AT1G66730.1 DNA LIGASE 62.3e-5536.02Show/hide
Query:  INKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCS-HWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKL
        +N   T   SS    +  + P   PK  ++  ++ N+K         IP T F VD FR      S  +FLSHFH DHY GL+ S+  G+IYCS  TA+L
Subjt:  INKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCS-HWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKL

Query:  VNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILF----EPPNGKAVLHTGDFRFCEQM---GRLSVFKTCRIHTLILDTTYCDPQYDFPK
        V   + +P + +  LP++Q + I G +V   +ANHCPG++  LF    E    +  +HTGDFRFC++M     L+ F  C    + LDTTYC+P++ FP 
Subjt:  VNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILF----EPPNGKAVLHTGDFRFCEQM---GRLSVFKTCRIHTLILDTTYCDPQYDFPK

Query:  QETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVV------PLWTLASFKRLK
        QE  + +V+  I  +    K LFL+  Y +GKE++ +E+AR  ++K+ V A K+ +L  LG   E M  FT +E ES +HVV        W       +K
Subjt:  QETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVV------PLWTLASFKRLK

Query:  HVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSV
                 +  +V F PTGW     + K   R      I  + VPYSEHS++ EL++F+K + P  +IP+V
Subjt:  HVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSV

AT2G45700.1 sterile alpha motif (SAM) domain-containing protein3.5e-21355.5Show/hide
Query:  TNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAP-KKPK-CSAPTATGKENIPSIAHCNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLD
        +N  +DDDD     Q   SS+ S  + L  T+  ++  R P KKP+ C  P   GKEN+           P+ D     PDL     F SS     C LD
Subjt:  TNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAP-KKPK-CSAPTATGKENIPSIAHCNVGFKPNFDRHNGAPDLDGGETFGSSEIDLGCSLD

Query:  LIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKV-KGGYLWNSIESRLINSRVDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDR
         I PS V CS    N    G +    ++     D+C   KV + GYL NS+E+RL+ SR+        SG+ +      ES++ELD+L+NL S  + E R
Subjt:  LIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKV-KGGYLWNSIESRLINSRVDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLDEEDR

Query:  IGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQ-AQNAALTPDRKQTSGPQQSGDN--------SRFSTVLKWLHGLGLSK
         G          F + +D  IQCPLC +DIS LS+EQR VH+N C+DK   Q ++  +L      +S  ++S D+        +  S VLKWL  LGL+K
Subjt:  IGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQ-AQNAALTPDRKQTSGPQQSGDN--------SRFSTVLKWLHGLGLSK

Query:  YEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKE-SSAVETCTNSHAPSG-IGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFA
        YED+F+REEIDWDTLQ LT+EDL ++GIT+LGPR+KI +ALS +R   +S+ E    SH  SG + ++    S  R+ S     K  ANKLIT++FPG A
Subjt:  YEDIFVREEIDWDTLQWLTDEDLNNMGITALGPRRKITHALSELRKE-SSAVETCTNSHAPSG-IGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFA

Query:  INKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLV
                I +  K V +K P      +  +RN  N K + +P W+CIPGTPFRVDAF++L  DC HWFL+HFH+DHYQGLTKSF HG IYCSL+TAKLV
Subjt:  INKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLV

Query:  NMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFV
        NMKIGIPWERLQVL L QK+NI+GIDVTCFDANHCPGSI+ILFEP NGKAVLHTGDFR+ E+M    +     I +LILDTTYC+PQYDFPKQE VIQFV
Subjt:  NMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFV

Query:  IDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIV
        ++AIQAEAFNPKTLFLIG YTIGKERLFLEVARVLR+K+Y+  AKL++L+CLGFS +D+QWFTV E ESHIHVVPLWTLASFKRLKHV+N Y NR+SLIV
Subjt:  IDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIV

Query:  AFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLL
        AFSPTGW   K KKKSPGRR QQGTIIRYEVPYSEHSSF+ELK+FV+ VSP  IIPSVNN GPDSA AM+SLL
Subjt:  AFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMISLL

AT3G26680.1 DNA repair metallo-beta-lactamase family protein1.5e-7038.24Show/hide
Query:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI
        TD F  + + K +    +S   +  KK   +L K +    +  N  PR  P +  +PGTPF VDAFR+  ++G CS +FL+HFH DHY GLTK++ HG I
Subjt:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI

Query:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF
        YCS +T++L+ + + +    +  L LD +  I GI VT  +ANHCPG+ +I F   +G   LHTGDFR  +QM    +    R+H L LDTTYC+P+Y F
Subjt:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF

Query:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK
        P +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ RIL+  G+  +  +  + + + + +HV+P+ +L   +RL 
Subjt:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK

Query:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS
             Y  ++  ++AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL++FV+ + P  IIP+VNN    +   M S
Subjt:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS

AT3G26680.2 DNA repair metallo-beta-lactamase family protein1.5e-7038.24Show/hide
Query:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI
        TD F  + + K +    +S   +  KK   +L K +    +  N  PR  P +  +PGTPF VDAFR+  ++G CS +FL+HFH DHY GLTK++ HG I
Subjt:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI

Query:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF
        YCS +T++L+ + + +    +  L LD +  I GI VT  +ANHCPG+ +I F   +G   LHTGDFR  +QM    +    R+H L LDTTYC+P+Y F
Subjt:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF

Query:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK
        P +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ RIL+  G+  +  +  + + + + +HV+P+ +L   +RL 
Subjt:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK

Query:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS
             Y  ++  ++AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL++FV+ + P  IIP+VNN    +   M S
Subjt:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS

AT3G26680.3 DNA repair metallo-beta-lactamase family protein1.5e-7038.24Show/hide
Query:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI
        TD F  + + K +    +S   +  KK   +L K +    +  N  PR  P +  +PGTPF VDAFR+  ++G CS +FL+HFH DHY GLTK++ HG I
Subjt:  TDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKGKTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRH--LRGDCSHWFLSHFHMDHYQGLTKSFCHGMI

Query:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF
        YCS +T++L+ + + +    +  L LD +  I GI VT  +ANHCPG+ +I F   +G   LHTGDFR  +QM    +    R+H L LDTTYC+P+Y F
Subjt:  YCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPGSIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDF

Query:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK
        P +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ RIL+  G+  +  +  + + + + +HV+P+ +L   +RL 
Subjt:  PKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLK

Query:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS
             Y  ++  ++AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL++FV+ + P  IIP+VNN    +   M S
Subjt:  HVSNHYANRFSLIVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPSVNNHGPDSARAMIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCCCATCAGAACCGCTGCCGCCCACCGCCACCGCGACCAAACTTCTGCATCTCAAGAACCTCAATTCCACTCGCCGACCAATGCCGCCGATGACGACGACGATCT
TCTTCCTTCCACCCAAACAGTCCTCTCCTCGCGATCTTCGACCTCGCAAAAACTCTTCGCGACCTCCGACTTATCGCTCCACATTCGCGCTCCCAAGAAGCCGAAGTGTT
CTGCTCCGACGGCCACTGGTAAGGAAAACATTCCCTCTATAGCGCATTGCAATGTCGGATTTAAGCCAAATTTTGATAGACATAATGGCGCTCCTGATTTGGACGGTGGT
GAAACATTCGGATCGTCGGAAATTGATTTAGGTTGCAGTTTGGATTTGATACAGCCGAGCATTGTCGGCTGTAGTTACGAAACTCATAATGCAAACTCGGGAGGGGAGAT
GTTTGACGGGGATGATGATTTTTCTGGTGCGACGGATGAATGTGAAGGGTCGAAGGTCAAGGGGGGCTACTTATGGAATTCGATTGAATCTAGGTTAATTAATTCCAGAG
TAGATTGCGATGTCGGCGTCAGTGGCAGTGGAGTTGATAAGGCGAGCAGTGATGATTTTGAGTCGGATACAGAGCTAGACTTGTTGCTCAATCTGCATTCGCAACTGGAT
GAGGAAGACCGTATTGGTGGAGTGGGATTTGATACTAAGGAAACATATTTTCCAATGGATGAGGACCGTCTGATTCAATGTCCTCTATGTGGGGTTGATATTTCTGATCT
TAGCGATGAGCAACGCCTTGTACACACCAATGATTGTATTGACAAAGATGACACGCAAGCTCAAAATGCTGCTCTCACACCTGATAGAAAGCAAACTTCAGGTCCTCAAC
AATCTGGTGATAATTCGAGATTTTCAACTGTTCTCAAATGGTTGCATGGTCTTGGTTTATCGAAATATGAAGACATTTTTGTTAGGGAGGAGATCGATTGGGACACATTA
CAGTGGTTGACAGATGAGGATCTCAATAATATGGGTATTACTGCACTTGGGCCCCGAAGGAAGATCACACATGCTCTCTCCGAACTCAGAAAAGAATCTTCTGCGGTTGA
AACTTGCACGAATTCACATGCTCCTTCCGGGATTGGGCAACAGAGTAACCATGGTTCAGATCGGCGGGAAGGATCAATCAATGGAACTAATAAAGCACCTGCGAACAAGT
TGATCACTGATTATTTCCCAGGCTTTGCCATTAATAAGAAGAATACATGCAGCATCTCTAGTGGTCAGAAAGATGTGGGAAAAAAATTACCCGACTCTCTTCCTAAAGGT
AAAACTGCGAAAAGAAATGTAAGAAATGCGAAGCCTAGGAATGTTCCTGTCTGGAGTTGTATACCAGGGACACCATTCAGAGTGGATGCTTTTCGACATCTTAGAGGAGA
TTGTTCTCATTGGTTTCTTTCTCACTTTCATATGGATCATTATCAAGGTCTGACAAAATCATTTTGTCATGGAATGATTTATTGTTCGCTTATAACAGCAAAGCTAGTGA
ACATGAAGATTGGAATTCCGTGGGAAAGGTTACAAGTTTTACCTCTCGACCAAAAAATTAACATTGCGGGAATAGATGTAACCTGTTTTGATGCTAATCATTGCCCTGGT
TCAATAATCATACTCTTTGAACCACCCAACGGTAAGGCTGTTCTTCATACAGGTGATTTCCGATTTTGTGAGCAAATGGGAAGGTTGTCGGTTTTTAAAACATGTCGTAT
CCATACTCTGATCCTTGATACCACTTATTGTGATCCACAGTATGATTTCCCGAAACAGGAGACCGTTATACAATTTGTAATTGATGCCATTCAGGCTGAAGCTTTCAACC
CAAAAACTCTTTTCCTAATTGGATGTTACACCATCGGAAAGGAACGGCTTTTTTTAGAGGTTGCCCGTGTTCTGCGCAAGAAGGTATATGTAACTGCAGCAAAGTTACGA
ATTTTGAAATGCTTGGGGTTCTCAGCGGAGGACATGCAGTGGTTTACAGTGAATGAACGGGAAAGCCACATACATGTTGTACCTCTGTGGACACTTGCGAGCTTCAAACG
ACTAAAGCATGTATCAAATCATTATGCCAATCGATTCAGTCTTATAGTTGCTTTCTCTCCTACCGGCTGGGCACTTTCGAAAGGAAAGAAGAAATCTCCCGGGAGAAGGT
GGCAGCAAGGTACAATTATAAGGTATGAAGTGCCATATAGTGAGCATAGCAGCTTCTCAGAGCTCAAGGATTTTGTAAAGCTTGTATCTCCCGTAAACATAATACCAAGT
GTGAATAATCATGGACCTGACTCAGCCCGTGCCATGATCTCGCTTTTGTCATCTTGA
mRNA sequenceShow/hide mRNA sequence
GCGCCACAAATTTAAAAACCACAATTCCAAAACCCTCTCTGGAATTTCCGGCTTCCTTCGTCCCCAACTCCATTTCGCCTTCACAGATTGTTGCTCAGCCCTCATGCCCC
CCATCAGAACCGCTGCCGCCCACCGCCACCGCGACCAAACTTCTGCATCTCAAGAACCTCAATTCCACTCGCCGACCAATGCCGCCGATGACGACGACGATCTTCTTCCT
TCCACCCAAACAGTCCTCTCCTCGCGATCTTCGACCTCGCAAAAACTCTTCGCGACCTCCGACTTATCGCTCCACATTCGCGCTCCCAAGAAGCCGAAGTGTTCTGCTCC
GACGGCCACTGGTAAGGAAAACATTCCCTCTATAGCGCATTGCAATGTCGGATTTAAGCCAAATTTTGATAGACATAATGGCGCTCCTGATTTGGACGGTGGTGAAACAT
TCGGATCGTCGGAAATTGATTTAGGTTGCAGTTTGGATTTGATACAGCCGAGCATTGTCGGCTGTAGTTACGAAACTCATAATGCAAACTCGGGAGGGGAGATGTTTGAC
GGGGATGATGATTTTTCTGGTGCGACGGATGAATGTGAAGGGTCGAAGGTCAAGGGGGGCTACTTATGGAATTCGATTGAATCTAGGTTAATTAATTCCAGAGTAGATTG
CGATGTCGGCGTCAGTGGCAGTGGAGTTGATAAGGCGAGCAGTGATGATTTTGAGTCGGATACAGAGCTAGACTTGTTGCTCAATCTGCATTCGCAACTGGATGAGGAAG
ACCGTATTGGTGGAGTGGGATTTGATACTAAGGAAACATATTTTCCAATGGATGAGGACCGTCTGATTCAATGTCCTCTATGTGGGGTTGATATTTCTGATCTTAGCGAT
GAGCAACGCCTTGTACACACCAATGATTGTATTGACAAAGATGACACGCAAGCTCAAAATGCTGCTCTCACACCTGATAGAAAGCAAACTTCAGGTCCTCAACAATCTGG
TGATAATTCGAGATTTTCAACTGTTCTCAAATGGTTGCATGGTCTTGGTTTATCGAAATATGAAGACATTTTTGTTAGGGAGGAGATCGATTGGGACACATTACAGTGGT
TGACAGATGAGGATCTCAATAATATGGGTATTACTGCACTTGGGCCCCGAAGGAAGATCACACATGCTCTCTCCGAACTCAGAAAAGAATCTTCTGCGGTTGAAACTTGC
ACGAATTCACATGCTCCTTCCGGGATTGGGCAACAGAGTAACCATGGTTCAGATCGGCGGGAAGGATCAATCAATGGAACTAATAAAGCACCTGCGAACAAGTTGATCAC
TGATTATTTCCCAGGCTTTGCCATTAATAAGAAGAATACATGCAGCATCTCTAGTGGTCAGAAAGATGTGGGAAAAAAATTACCCGACTCTCTTCCTAAAGGTAAAACTG
CGAAAAGAAATGTAAGAAATGCGAAGCCTAGGAATGTTCCTGTCTGGAGTTGTATACCAGGGACACCATTCAGAGTGGATGCTTTTCGACATCTTAGAGGAGATTGTTCT
CATTGGTTTCTTTCTCACTTTCATATGGATCATTATCAAGGTCTGACAAAATCATTTTGTCATGGAATGATTTATTGTTCGCTTATAACAGCAAAGCTAGTGAACATGAA
GATTGGAATTCCGTGGGAAAGGTTACAAGTTTTACCTCTCGACCAAAAAATTAACATTGCGGGAATAGATGTAACCTGTTTTGATGCTAATCATTGCCCTGGTTCAATAA
TCATACTCTTTGAACCACCCAACGGTAAGGCTGTTCTTCATACAGGTGATTTCCGATTTTGTGAGCAAATGGGAAGGTTGTCGGTTTTTAAAACATGTCGTATCCATACT
CTGATCCTTGATACCACTTATTGTGATCCACAGTATGATTTCCCGAAACAGGAGACCGTTATACAATTTGTAATTGATGCCATTCAGGCTGAAGCTTTCAACCCAAAAAC
TCTTTTCCTAATTGGATGTTACACCATCGGAAAGGAACGGCTTTTTTTAGAGGTTGCCCGTGTTCTGCGCAAGAAGGTATATGTAACTGCAGCAAAGTTACGAATTTTGA
AATGCTTGGGGTTCTCAGCGGAGGACATGCAGTGGTTTACAGTGAATGAACGGGAAAGCCACATACATGTTGTACCTCTGTGGACACTTGCGAGCTTCAAACGACTAAAG
CATGTATCAAATCATTATGCCAATCGATTCAGTCTTATAGTTGCTTTCTCTCCTACCGGCTGGGCACTTTCGAAAGGAAAGAAGAAATCTCCCGGGAGAAGGTGGCAGCA
AGGTACAATTATAAGGTATGAAGTGCCATATAGTGAGCATAGCAGCTTCTCAGAGCTCAAGGATTTTGTAAAGCTTGTATCTCCCGTAAACATAATACCAAGTGTGAATA
ATCATGGACCTGACTCAGCCCGTGCCATGATCTCGCTTTTGTCATCTTGATTTGTTGACCAGACTTCTAAGGGACGGGCTCTTCCATTTTCAAGTGCAGTAGATATTGAA
ATGAACATAATACGACTTTTGTGGGTCGAGGCGAGCTGAATCAAATCTGCAGATTCACGTAAGTTCAAATTTTAACCACCATTGACCCTCTTTTAGATAGCCATTTTATA
TAATCATCTCATTTTAGCAAGCTGCTTGTGCTTTACAAGCCTTGTATTGCCAACCAAGCCAAAAGCATGCCAAAATTGTTGCACCTAATGGTCGTAATTATGTCGGAAAA
AGAGGCTTCCTATTGCGTAGTAATCGGCCCTCTTTCTTTCCTCTAGAATAGGCGATTGATGTTTTGATTATTTGAAT
Protein sequenceShow/hide protein sequence
MPPIRTAAAHRHRDQTSASQEPQFHSPTNAADDDDDLLPSTQTVLSSRSSTSQKLFATSDLSLHIRAPKKPKCSAPTATGKENIPSIAHCNVGFKPNFDRHNGAPDLDGG
ETFGSSEIDLGCSLDLIQPSIVGCSYETHNANSGGEMFDGDDDFSGATDECEGSKVKGGYLWNSIESRLINSRVDCDVGVSGSGVDKASSDDFESDTELDLLLNLHSQLD
EEDRIGGVGFDTKETYFPMDEDRLIQCPLCGVDISDLSDEQRLVHTNDCIDKDDTQAQNAALTPDRKQTSGPQQSGDNSRFSTVLKWLHGLGLSKYEDIFVREEIDWDTL
QWLTDEDLNNMGITALGPRRKITHALSELRKESSAVETCTNSHAPSGIGQQSNHGSDRREGSINGTNKAPANKLITDYFPGFAINKKNTCSISSGQKDVGKKLPDSLPKG
KTAKRNVRNAKPRNVPVWSCIPGTPFRVDAFRHLRGDCSHWFLSHFHMDHYQGLTKSFCHGMIYCSLITAKLVNMKIGIPWERLQVLPLDQKINIAGIDVTCFDANHCPG
SIIILFEPPNGKAVLHTGDFRFCEQMGRLSVFKTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR
ILKCLGFSAEDMQWFTVNERESHIHVVPLWTLASFKRLKHVSNHYANRFSLIVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELKDFVKLVSPVNIIPS
VNNHGPDSARAMISLLSS