; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g04590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g04590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA cross-link repair 1A protein
Genome locationchr11:2973455..2979419
RNA-Seq ExpressionMoc11g04590
SyntenyMoc11g04590
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR001660 - Sterile alpha motif domain
IPR011084 - DNA repair metallo-beta-lactamase
IPR013761 - Sterile alpha motif/pointed domain superfamily
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139615.1 DNA cross-link repair 1A protein-like isoform X1 [Momordica charantia]0.0e+0084.44Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER                 QVALT
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT

Query:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
        NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
Subjt:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN

Query:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------
        RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG                   C P +  R+               
Subjt:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------

Query:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ
               +  ++ S  +G +  S         K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQ
Subjt:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLSPLCGTT
        SLLSPLCGTT
Subjt:  SLLSPLCGTT

XP_022139623.1 DNA cross-link repair 1A protein-like isoform X2 [Momordica charantia]0.0e+0084.55Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ----------------VALTN
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ                VALTN
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ----------------VALTN

Query:  DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR
        DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR
Subjt:  DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR

Query:  GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL----------------
        GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG                   C P +  R+                
Subjt:  GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL----------------

Query:  ------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQT
              +  ++ S  +G +  S         K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQT
Subjt:  ------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQT

Query:  CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI
        CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI
Subjt:  CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI

Query:  HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS
        HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS
Subjt:  HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS

Query:  LLSPLCGTT
        LLSPLCGTT
Subjt:  LLSPLCGTT

XP_022139631.1 uncharacterized protein LOC111010471 isoform X3 [Momordica charantia]0.0e+0086.62Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER                 QVALT
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT

Query:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
        NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
Subjt:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN

Query:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSD------------
        RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG   T KR     +L  +  +S   G+  R D            
Subjt:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSD------------

Query:  --------------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY
                      K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY
Subjt:  --------------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY

Query:  DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV
        DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV
Subjt:  DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV

Query:  SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT
        SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT
Subjt:  SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT

XP_022976979.1 DNA cross-link repair 1A protein isoform X1 [Cucurbita maxima]1.4e-28267.04Show/hide
Query:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS
        P+  +   RHQ SASQL+QFH PTNAGDDD    D LPSTQTVLSSRSS+S KPLATS+L SLH RA KRPKRS PT  G+ NIPSI   + D  FK   
Subjt:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS

Query:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD
                   NFD  + AAALD  E FG SE+DLGCSLD I+PS+VGCSY+ HDANS EEIF+G   FSGA DECKGSKVKGGYL NSIES+LMNSRVD
Subjt:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD

Query:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND
         DVG+SG GVDK +SDDFESD ELDLLLNLHSQ+DEED ++GV FGTEE+ FPVDE+ LIQCPLCGVDISDLSD++R V                ALT D
Subjt:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND

Query:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG
        +KQTS PQQS DNSR STVLKWLH LGLSKYEDIF+REEIDW  LQWLTDEDL  +GI+ALGPRRKIT ALSELRK+S AVETCTN+HAPS  GQ SN G
Subjt:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG

Query:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKRRL------------------------------------
        SDG EGS NG N+TP NKLITDYFPGFA++KKN+CSI +GQ+DV KKL DS ++    T+KR +                                    
Subjt:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKRRL------------------------------------

Query:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ
               +  ++ S  +G +  S         K+   W+ L +   N K        +  +  +  G +   F  P  +AVLHTGDFR+CE++GSL VFQ
Subjt:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFS EDMQWFTVNE ESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKH+SNQYANRFSL+VAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSEL+DFVKLVSP NIIPSVNNHGPDSAR MI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLS
        SLLS
Subjt:  SLLS

XP_038897129.1 DNA cross-link repair 1A protein isoform X1 [Benincasa hispida]6.8e-29068.28Show/hide
Query:  MPLTKSAA-VRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKL
        MPL  +AA  RHQ SASQ  QFH+PTNA DDDD   DFLPSTQTVLSSRSS SQKPLATS+L SLH RAPKRPK S PTA GKEN+PSI   + D  F+ 
Subjt:  MPLTKSAA-VRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKL

Query:  SSMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSR
                     NF   NGA  LD  E F  S+  LGCSLD I+PSIVGCSY+ HD NSGEEI DG +GFSGAMDECKGSK KGGYL NSIESRL+NSR
Subjt:  SSMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSR

Query:  VDRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALT
        VD DVG+SG GVDKV++D FESDTELDLLLNLHS++DEE  ++GV FGTE TDF +DEEGLIQCPLCGVDISDLSD++R V                ALT
Subjt:  VDRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALT

Query:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
         D+KQT  PQQSGDNSR S VL+WLH LGLS+Y+DIF+REEIDW ALQWLTDEDL  MGITALGPRRKITHALSELRK+S AVETCTNSHAPS  GQQSN
Subjt:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN

Query:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------
         GSDGREGSI G N+TPANKLITDYFPGFA++KKN+C+I SGQKDV KK+ DS ++G                   C P +  R+               
Subjt:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------

Query:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ
               +  ++ S  +G +  S         K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCE++G L VFQ
Subjt:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFS EDMQWFTVNE ESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKHVSNQYANRF+L+VAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSF+EL+DFVKLVSP NIIPSVNNHGPDSARAMI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLS
        SLLS
Subjt:  SLLS

TrEMBL top hitse value%identityAlignment
A0A6J1CCT1 DNA cross-link repair 1A protein-like isoform X10.0e+0084.44Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER                 QVALT
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT

Query:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
        NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
Subjt:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN

Query:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------
        RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG                   C P +  R+               
Subjt:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL---------------

Query:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ
               +  ++ S  +G +  S         K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQ
Subjt:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLSPLCGTT
        SLLSPLCGTT
Subjt:  SLLSPLCGTT

A0A6J1CD95 uncharacterized protein LOC111010471 isoform X30.0e+0086.62Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER                 QVALT
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKER-----------------QVALT

Query:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
        NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN
Subjt:  NDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSN

Query:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSD------------
        RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG   T KR     +L  +  +S   G+  R D            
Subjt:  RGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSD------------

Query:  --------------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY
                      K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY
Subjt:  --------------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQY

Query:  DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV
        DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV
Subjt:  DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHV

Query:  SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT
        SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT
Subjt:  SNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT

A0A6J1CG29 DNA cross-link repair 1A protein-like isoform X20.0e+0084.55Show/hide
Query:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
        MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS
Subjt:  MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLS

Query:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
        SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV
Subjt:  SMSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRV

Query:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ----------------VALTN
        DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ                VALTN
Subjt:  DRDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQ----------------VALTN

Query:  DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR
        DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR
Subjt:  DRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNR

Query:  GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL----------------
        GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG                   C P +  R+                
Subjt:  GSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG-------------------CFPTSKRRL----------------

Query:  ------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQT
              +  ++ S  +G +  S         K+   W+ L +   + K        +  +  +  G +   F  P+ +AVLHTGDFRFCEEIGSLPVFQT
Subjt:  ------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQT

Query:  CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI
        CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI
Subjt:  CRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHI

Query:  HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS
        HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS
Subjt:  HVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMIS

Query:  LLSPLCGTT
        LLSPLCGTT
Subjt:  LLSPLCGTT

A0A6J1F758 DNA cross-link repair 1A protein4.3e-28267.54Show/hide
Query:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS
        P+  +   RHQ SASQL QFH PTNAGDDD    D LPSTQTVLSSRSS+S KPLATS+L SLH RA KRPKRS PTA G+ NIPSI   + D  FK   
Subjt:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS

Query:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD
                   NFD  +GAAALD  E FG SE+DLGCSLD I+PS+VGCSY+ HDANS EEIFDG   FSGA DECKGSK+KGGYL NSIESRLMNSRVD
Subjt:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD

Query:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND
         DVG+SG GVDK +SDDFESD ELDLLLNLHSQ+DEED ++GV FGTEE+ FPVDE+ LIQCPLCGVDISDLSD++R V                ALT D
Subjt:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND

Query:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG
        +KQTS PQQS D SR STVLKWLH LGLSKYEDIF+REEIDW  LQWLTDEDL  +GITALGPRRKIT ALSELRK+S AVETCTN+HAPS  GQ SN G
Subjt:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG

Query:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSDKV------ISSWDD
        SDG E S NG N+TP NKLITDYFPGFA++KKN+CSI +GQ+DV KKL DS ++    T+KR     +L  +  +S   G+  R D         S W  
Subjt:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKR-----RLFPLVSYSLSYGSLSRSDKV------ISSWDD

Query:  LLFAYN-----SKASKHE------------------DWNSM------GKVNNFFPDI---------------------QAVLHTGDFRFCEEIGSLPVFQ
          F  +     +K+ +H                    W  +       KVN    D+                     +AVLHTGDFR+CE++GSL VFQ
Subjt:  LLFAYN-----SKASKHE------------------DWNSM------GKVNNFFPDI---------------------QAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQ DFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVL KKVYV++AKLRILKCLGFS EDMQWFTVNE ESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKH+SNQYA+RFSL+VAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSEL+DFVKLVSP NIIPSVNNHGPDSARAMI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLS
        SLLS
Subjt:  SLLS

A0A6J1IQ65 DNA cross-link repair 1A protein isoform X16.7e-28367.04Show/hide
Query:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS
        P+  +   RHQ SASQL+QFH PTNAGDDD    D LPSTQTVLSSRSS+S KPLATS+L SLH RA KRPKRS PT  G+ NIPSI   + D  FK   
Subjt:  PLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSS

Query:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD
                   NFD  + AAALD  E FG SE+DLGCSLD I+PS+VGCSY+ HDANS EEIF+G   FSGA DECKGSKVKGGYL NSIES+LMNSRVD
Subjt:  MSFPDAVSGSTNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVD

Query:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND
         DVG+SG GVDK +SDDFESD ELDLLLNLHSQ+DEED ++GV FGTEE+ FPVDE+ LIQCPLCGVDISDLSD++R V                ALT D
Subjt:  RDVGISGGGVDKVASDDFESDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV----------------ALTND

Query:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG
        +KQTS PQQS DNSR STVLKWLH LGLSKYEDIF+REEIDW  LQWLTDEDL  +GI+ALGPRRKIT ALSELRK+S AVETCTN+HAPS  GQ SN G
Subjt:  RKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRG

Query:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKRRL------------------------------------
        SDG EGS NG N+TP NKLITDYFPGFA++KKN+CSI +GQ+DV KKL DS ++    T+KR +                                    
Subjt:  SDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRGCFPTSKRRL------------------------------------

Query:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ
               +  ++ S  +G +  S         K+   W+ L +   N K        +  +  +  G +   F  P  +AVLHTGDFR+CE++GSL VFQ
Subjt:  -------FPLVSYSLSYGSLSRSD--------KVISSWDDL-LFAYNSK-------ASKHEDWNSMGKVNNFF--PDIQAVLHTGDFRFCEEIGSLPVFQ

Query:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH
        TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYV++AKLRILKCLGFS EDMQWFTVNE ESH
Subjt:  TCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESH

Query:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI
        IHVVPLWTLASFKRLKH+SNQYANRFSL+VAFSPTGWALSKGKKKSPGRR QQGTIIRYEVPYSEHSSFSEL+DFVKLVSP NIIPSVNNHGPDSAR MI
Subjt:  IHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMI

Query:  SLLS
        SLLS
Subjt:  SLLS

SwissProt top hitse value%identityAlignment
Q38961 DNA cross-link repair protein SNM12.2e-3638.05Show/hide
Query:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR
        LHTGDFR  +++ + P+    R+H L LDTTYC+P+Y FP +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ R
Subjt:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR

Query:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF
        IL+  G+ +      T  +  + +HV+P+ +L   +RL      Y  ++  V+AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL +F
Subjt:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF

Query:  VKLVSPANIIPSVNNHGPDSARAMIS
        V+ + P  IIP+VNN    +   M S
Subjt:  VKLVSPANIIPSVNNHGPDSARAMIS

Q5QJC4 DNA cross-link repair 1A protein3.9e-3840.37Show/hide
Query:  FFPDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVT
        + P   A+LHTGDFR    +   P     +IHTL LDTTYC P+Y FP Q+ VIQF ++ A +    NP+TL + G Y+IGKE++FL +A VL  K  ++
Subjt:  FFPDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVT

Query:  AAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSE
          K + L+CL  S       T+N   + +H++P+  + +FK L+   N+++  F  V+AF PTGW  S         + Q +G I  Y +PYSEHSS+ E
Subjt:  AAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSE

Query:  LEDFVKLVSPANIIPSVN
        ++ FV+ + P  IIP+VN
Subjt:  LEDFVKLVSPANIIPSVN

Q6PJP8 DNA cross-link repair 1A protein5.9e-3437.61Show/hide
Query:  FFPDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVT
        + P+   +LHTGDFR  +      +    ++H L LDTTYC P+Y FP Q+ VI+F I+ A +A   NP  L + G Y+IGKE++FL +A VL  KV ++
Subjt:  FFPDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVT

Query:  AAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALS-KGKKKSPGRRWQQGTIIRYEVPYSEHSSFSE
          K + L+CL   E +    T +   S +H++P+  + +FK L+    +   +++ ++AF PTGW  S K  + +      +G I  Y +PYSEHSS+ E
Subjt:  AAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALS-KGKKKSPGRRWQQGTIIRYEVPYSEHSSFSE

Query:  LEDFVKLVSPANIIPSVN
        ++ FV+ + P  IIP+VN
Subjt:  LEDFVKLVSPANIIPSVN

Q86KS1 DNA cross-link repair 1 protein2.2e-3336.84Show/hide
Query:  QAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRI
        +++LHTGDFR+ + + + P+ +   I  L LD TYCDPQY FP Q  +I+ V   ++ E  + +TLFL G Y IGKER+ LE+A+   K V+V+  K  I
Subjt:  QAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRI

Query:  LKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKL
        L CL  +  D+  FT NE  +    V + ++ S+  +  + +   N++  V+ F PTGW  +K       R    G    Y V YSEHSSF+EL D +  
Subjt:  LKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKL

Query:  VSPANIIPSVNNHGPDSARAMISLLSPL
          P  IIP+V+   P   + +++  S L
Subjt:  VSPANIIPSVNNHGPDSARAMISLLSPL

Q9JIC3 DNA cross-link repair 1A protein9.0e-3538.89Show/hide
Query:  PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAA
        P+   +LHTGDFR  +           ++HTL LDTTYC P+Y FP Q+ VIQF I+ A +A   NP+ L + G Y IGKE++FL +A VL  KV ++  
Subjt:  PDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVID-AIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAA

Query:  KLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSELE
        K + L+CL   E      T +  +S +H++P+  + +FK L+    +   ++  ++AF PTGW  S     +     Q +G I  Y +PYSEHSS+ E++
Subjt:  KLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQ-QGTIIRYEVPYSEHSSFSELE

Query:  DFVKLVSPANIIPSVN
         FV+ + P  IIP+VN
Subjt:  DFVKLVSPANIIPSVN

Arabidopsis top hitse value%identityAlignment
AT1G66730.1 DNA LIGASE 61.3e-3137.5Show/hide
Query:  LHTGDFRFCEEIGSLPV---FQTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRI
        +HTGDFRFC+E+   P    F  C    + LDTTYC+P++ FP QE  + +V+  I  +    K LFL+  Y +GKE++ +E+AR  ++K+ V A K+ +
Subjt:  LHTGDFRFCEEIGSLPV---FQTCRIHTLILDTTYCDPQYDFPKQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRI

Query:  LKCLGFSEEDMQWFTVNEWESHIHVV------PLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSEL
        L  LG  EE M  FT +E ES +HVV        W       +K         +  VV F PTGW     + K   R      I  + VPYSEHS++ EL
Subjt:  LKCLGFSEEDMQWFTVNEWESHIHVV------PLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSEL

Query:  EDFVKLVSPANIIPSV
         +F+K + P  +IP+V
Subjt:  EDFVKLVSPANIIPSV

AT2G45700.1 sterile alpha motif (SAM) domain-containing protein6.7e-13446.19Show/hide
Query:  CSLDFIEPSIVGCS---YQVHDANSGEEIFDGVNGFSGAMDECKGSKV-KGGYLCNSIESRLMNSRVDRDVGISGGGVDKVASDD---FESDTELDLLLN
        C LD I PS V CS   +    ++ GEE  +         D+C   KV + GYLCNS+E+RL+ SR+   +G   G    +  DD    ES++ELD+L+N
Subjt:  CSLDFIEPSIVGCS---YQVHDANSGEEIFDGVNGFSGAMDECKGSKV-KGGYLCNSIESRLMNSRVDRDVGISGGGVDKVASDD---FESDTELDLLLN

Query:  LHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV-----------------ALTNDRKQTSVPQQSGDN--------SRLSTVL
        L S   E +  +G        +F + ++  IQCPLC +DIS LS+++RQV                 +L      +S+ ++S D+        + LS VL
Subjt:  LHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQV-----------------ALTNDRKQTSVPQQSGDN--------SRLSTVL

Query:  KWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKD-SCAVETCTNSHAPSS-IGQQSNRGSDGREGSINGANKTPANK
        KWL  LGL+KYED+FIREEIDW  LQ LT+EDL+ +GIT+LGPR+KI +ALS +R   + + E    SH  S  + ++    S  R+ S     K  ANK
Subjt:  KWLHGLGLSKYEDIFIREEIDWAALQWLTDEDLIKMGITALGPRRKITHALSELRKD-SCAVETCTNSHAPSS-IGQQSNRGSDGREGSINGANKTPANK

Query:  LITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSS------------------HRGCFPTSKRRL------------FPLVSYSLS-YGSLSRSDKVISSW
        LIT++FPG A++     + P  +   EK  SDSS                  H  C P +  R+            + L  + L  Y  L++S      +
Subjt:  LITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSS------------------HRGCFPTSKRRL------------FPLVSYSLS-YGSLSRSDKVISSW

Query:  DDLLFAYNSKASKHEDWNSM------GKVNNFFPDI---------------------QAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFP
          L+ A          W  +       KVN    D+                     +AVLHTGDFR+ EE+ +  +     I +LILDTTYC+PQYDFP
Subjt:  DDLLFAYNSKASKHEDWNSM------GKVNNFFPDI---------------------QAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFP

Query:  KQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQ
        KQE VIQFV++AIQAEAFNPKTLFLIG YTIGKERLFLEVARVLR+K+Y+  AKL++L+CLGFS++D+QWFTV E ESHIHVVPLWTLASFKRLKHV+N+
Subjt:  KQETVIQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQ

Query:  YANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLL
        Y NR+SL+VAFSPTGW   K KKKSPGRR QQGTIIRYEVPYSEHSSF+EL++FV+ VSP  IIPSVNN GPDSA AM+SLL
Subjt:  YANRFSLVVAFSPTGWALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLL

AT3G26680.1 DNA repair metallo-beta-lactamase family protein1.5e-3738.05Show/hide
Query:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR
        LHTGDFR  +++ + P+    R+H L LDTTYC+P+Y FP +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ R
Subjt:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR

Query:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF
        IL+  G+ +      T  +  + +HV+P+ +L   +RL      Y  ++  V+AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL +F
Subjt:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF

Query:  VKLVSPANIIPSVNNHGPDSARAMIS
        V+ + P  IIP+VNN    +   M S
Subjt:  VKLVSPANIIPSVNNHGPDSARAMIS

AT3G26680.2 DNA repair metallo-beta-lactamase family protein1.5e-3738.05Show/hide
Query:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR
        LHTGDFR  +++ + P+    R+H L LDTTYC+P+Y FP +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ R
Subjt:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR

Query:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF
        IL+  G+ +      T  +  + +HV+P+ +L   +RL      Y  ++  V+AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL +F
Subjt:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF

Query:  VKLVSPANIIPSVNNHGPDSARAMIS
        V+ + P  IIP+VNN    +   M S
Subjt:  VKLVSPANIIPSVNNHGPDSARAMIS

AT3G26680.3 DNA repair metallo-beta-lactamase family protein1.5e-3738.05Show/hide
Query:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR
        LHTGDFR  +++ + P+    R+H L LDTTYC+P+Y FP +E V+ +V+    D ++ +   PKTL ++G Y+IGKE ++L +A+ L  K++  A++ R
Subjt:  LHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETVIQFVI----DAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLR

Query:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF
        IL+  G+ +      T  +  + +HV+P+ +L   +RL      Y  ++  V+AF PTGW  S+  G+     +   +G I  Y VPYSEHSSF+EL +F
Subjt:  ILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTGWALSK--GKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDF

Query:  VKLVSPANIIPSVNNHGPDSARAMIS
        V+ + P  IIP+VNN    +   M S
Subjt:  VKLVSPANIIPSVNNHGPDSARAMIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCTCACCAAATCCGCCGCCGTCCGCCACCAGAAATCTGCATCCCAGTTATCTCAATTCCACTTGCCTACTAATGCCGGTGATGACGACGACGGCGGCGGCGATTT
TCTTCCTTCCACCCAGACAGTCCTCTCCTCGCGCTCTTCATCCTCCCAAAAACCCCTCGCGACTTCCGAGTTATCATCCCTCCACACTCGCGCGCCCAAGAGGCCGAAGC
GCTCTACTCCGACGGCAATTGGCAAGGAAAACATTCCCTCTATAGGATATGGTTATGGTGATACTGAATTTAAGCTTTCGTCGATGTCTTTTCCAGATGCTGTATCAGGA
TCCACAAATTTTGATGGCCAGAATGGTGCTGCTGCTTTGGATGGTAGTGAAGCATTCGGATTGTCGGAAATAGATTTAGGTTGCAGTCTGGACTTTATAGAGCCGAGCAT
TGTTGGCTGCAGTTACCAAGTTCATGATGCAAATTCGGGAGAGGAGATATTTGACGGGGTTAATGGTTTTTCTGGTGCGATGGATGAATGTAAGGGGTCGAAGGTGAAAG
GGGGCTATCTATGCAATTCGATTGAATCTAGGTTAATGAATTCGAGAGTAGATCGCGATGTCGGCATCAGTGGCGGTGGAGTTGACAAGGTAGCCAGTGATGATTTTGAG
TCGGACACAGAGCTGGACTTGTTGCTCAATCTTCATTCGCAAGTGGATGAAGAAGACCATATGAACGGAGTAAGATTTGGTACTGAGGAAACAGATTTTCCAGTGGATGA
AGAAGGTTTGATTCAATGTCCTCTCTGCGGAGTTGATATTTCTGATCTTAGCGACAAGGAACGTCAGGTTGCTCTCACCAATGATAGGAAGCAAACTTCAGTTCCTCAAC
AATCTGGTGATAATTCAAGACTTTCTACTGTTCTCAAATGGTTGCATGGTCTTGGTTTATCGAAATATGAAGACATTTTTATAAGGGAGGAGATCGATTGGGCCGCATTA
CAGTGGCTGACAGATGAGGATCTCATCAAGATGGGTATTACTGCACTTGGCCCTCGAAGGAAGATCACACATGCTCTCTCCGAACTAAGAAAAGATTCTTGTGCGGTTGA
AACTTGCACGAATTCACATGCCCCTTCTAGTATTGGGCAACAGAGTAACCGTGGTTCAGATGGGCGAGAAGGATCAATCAATGGAGCTAATAAAACACCTGCGAACAAGT
TGATCACTGATTATTTTCCAGGCTTTGCCTCTGATAAGAAAAATTCATGCAGCATCCCTAGTGGTCAGAAAGATGTGGAAAAAAAATTATCTGACTCTTCACACAGAGGA
TGCTTTCCGACATCTAAGAGGAGATTGTTTCCATTGGTTTCTTACTCACTTTCATATGGATCATTATCAAGGTCTGACAAAGTCATTTCGTCATGGGATGATTTATTGTT
CGCCTATAACAGCAAAGCTAGTAAACATGAAGATTGGAATTCCATGGGAAAGGTGAACAACTTTTTTCCCGACATCCAGGCTGTTCTACATACGGGGGATTTCCGATTTT
GCGAGGAAATTGGAAGCTTGCCAGTTTTTCAAACATGTCGTATCCATACTCTGATCCTTGATACCACTTATTGTGATCCACAGTATGATTTTCCAAAGCAGGAGACTGTA
ATACAGTTTGTAATTGATGCAATTCAAGCTGAAGCTTTCAACCCAAAAACACTTTTCCTGATTGGATGCTATACTATCGGAAAAGAACGGCTTTTTTTGGAGGTTGCCCG
TGTTCTGCGTAAGAAGGTGTATGTAACTGCAGCGAAGTTACGCATTTTAAAATGCTTGGGATTCTCAGAGGAGGATATGCAGTGGTTTACAGTGAATGAATGGGAAAGCC
ACATACATGTCGTACCTTTGTGGACACTCGCAAGCTTCAAACGACTGAAGCATGTATCAAATCAATATGCTAATCGATTCAGTCTTGTAGTTGCTTTCTCTCCTACTGGC
TGGGCGCTCTCAAAAGGAAAGAAGAAATCTCCTGGTAGAAGGTGGCAGCAAGGTACAATTATAAGGTACGAAGTGCCATATAGCGAGCACAGCAGCTTCTCAGAGCTTGA
GGATTTTGTAAAGCTCGTATCTCCTGCAAACATAATACCAAGCGTGAATAATCATGGACCTGATTCAGCCCGTGCCATGATCTCGCTCTTATCACCTCTTTGTGGGACGA
CCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCCTCACCAAATCCGCCGCCGTCCGCCACCAGAAATCTGCATCCCAGTTATCTCAATTCCACTTGCCTACTAATGCCGGTGATGACGACGACGGCGGCGGCGATTT
TCTTCCTTCCACCCAGACAGTCCTCTCCTCGCGCTCTTCATCCTCCCAAAAACCCCTCGCGACTTCCGAGTTATCATCCCTCCACACTCGCGCGCCCAAGAGGCCGAAGC
GCTCTACTCCGACGGCAATTGGCAAGGAAAACATTCCCTCTATAGGATATGGTTATGGTGATACTGAATTTAAGCTTTCGTCGATGTCTTTTCCAGATGCTGTATCAGGA
TCCACAAATTTTGATGGCCAGAATGGTGCTGCTGCTTTGGATGGTAGTGAAGCATTCGGATTGTCGGAAATAGATTTAGGTTGCAGTCTGGACTTTATAGAGCCGAGCAT
TGTTGGCTGCAGTTACCAAGTTCATGATGCAAATTCGGGAGAGGAGATATTTGACGGGGTTAATGGTTTTTCTGGTGCGATGGATGAATGTAAGGGGTCGAAGGTGAAAG
GGGGCTATCTATGCAATTCGATTGAATCTAGGTTAATGAATTCGAGAGTAGATCGCGATGTCGGCATCAGTGGCGGTGGAGTTGACAAGGTAGCCAGTGATGATTTTGAG
TCGGACACAGAGCTGGACTTGTTGCTCAATCTTCATTCGCAAGTGGATGAAGAAGACCATATGAACGGAGTAAGATTTGGTACTGAGGAAACAGATTTTCCAGTGGATGA
AGAAGGTTTGATTCAATGTCCTCTCTGCGGAGTTGATATTTCTGATCTTAGCGACAAGGAACGTCAGGTTGCTCTCACCAATGATAGGAAGCAAACTTCAGTTCCTCAAC
AATCTGGTGATAATTCAAGACTTTCTACTGTTCTCAAATGGTTGCATGGTCTTGGTTTATCGAAATATGAAGACATTTTTATAAGGGAGGAGATCGATTGGGCCGCATTA
CAGTGGCTGACAGATGAGGATCTCATCAAGATGGGTATTACTGCACTTGGCCCTCGAAGGAAGATCACACATGCTCTCTCCGAACTAAGAAAAGATTCTTGTGCGGTTGA
AACTTGCACGAATTCACATGCCCCTTCTAGTATTGGGCAACAGAGTAACCGTGGTTCAGATGGGCGAGAAGGATCAATCAATGGAGCTAATAAAACACCTGCGAACAAGT
TGATCACTGATTATTTTCCAGGCTTTGCCTCTGATAAGAAAAATTCATGCAGCATCCCTAGTGGTCAGAAAGATGTGGAAAAAAAATTATCTGACTCTTCACACAGAGGA
TGCTTTCCGACATCTAAGAGGAGATTGTTTCCATTGGTTTCTTACTCACTTTCATATGGATCATTATCAAGGTCTGACAAAGTCATTTCGTCATGGGATGATTTATTGTT
CGCCTATAACAGCAAAGCTAGTAAACATGAAGATTGGAATTCCATGGGAAAGGTGAACAACTTTTTTCCCGACATCCAGGCTGTTCTACATACGGGGGATTTCCGATTTT
GCGAGGAAATTGGAAGCTTGCCAGTTTTTCAAACATGTCGTATCCATACTCTGATCCTTGATACCACTTATTGTGATCCACAGTATGATTTTCCAAAGCAGGAGACTGTA
ATACAGTTTGTAATTGATGCAATTCAAGCTGAAGCTTTCAACCCAAAAACACTTTTCCTGATTGGATGCTATACTATCGGAAAAGAACGGCTTTTTTTGGAGGTTGCCCG
TGTTCTGCGTAAGAAGGTGTATGTAACTGCAGCGAAGTTACGCATTTTAAAATGCTTGGGATTCTCAGAGGAGGATATGCAGTGGTTTACAGTGAATGAATGGGAAAGCC
ACATACATGTCGTACCTTTGTGGACACTCGCAAGCTTCAAACGACTGAAGCATGTATCAAATCAATATGCTAATCGATTCAGTCTTGTAGTTGCTTTCTCTCCTACTGGC
TGGGCGCTCTCAAAAGGAAAGAAGAAATCTCCTGGTAGAAGGTGGCAGCAAGGTACAATTATAAGGTACGAAGTGCCATATAGCGAGCACAGCAGCTTCTCAGAGCTTGA
GGATTTTGTAAAGCTCGTATCTCCTGCAAACATAATACCAAGCGTGAATAATCATGGACCTGATTCAGCCCGTGCCATGATCTCGCTCTTATCACCTCTTTGTGGGACGA
CCTGA
Protein sequenceShow/hide protein sequence
MPLTKSAAVRHQKSASQLSQFHLPTNAGDDDDGGGDFLPSTQTVLSSRSSSSQKPLATSELSSLHTRAPKRPKRSTPTAIGKENIPSIGYGYGDTEFKLSSMSFPDAVSG
STNFDGQNGAAALDGSEAFGLSEIDLGCSLDFIEPSIVGCSYQVHDANSGEEIFDGVNGFSGAMDECKGSKVKGGYLCNSIESRLMNSRVDRDVGISGGGVDKVASDDFE
SDTELDLLLNLHSQVDEEDHMNGVRFGTEETDFPVDEEGLIQCPLCGVDISDLSDKERQVALTNDRKQTSVPQQSGDNSRLSTVLKWLHGLGLSKYEDIFIREEIDWAAL
QWLTDEDLIKMGITALGPRRKITHALSELRKDSCAVETCTNSHAPSSIGQQSNRGSDGREGSINGANKTPANKLITDYFPGFASDKKNSCSIPSGQKDVEKKLSDSSHRG
CFPTSKRRLFPLVSYSLSYGSLSRSDKVISSWDDLLFAYNSKASKHEDWNSMGKVNNFFPDIQAVLHTGDFRFCEEIGSLPVFQTCRIHTLILDTTYCDPQYDFPKQETV
IQFVIDAIQAEAFNPKTLFLIGCYTIGKERLFLEVARVLRKKVYVTAAKLRILKCLGFSEEDMQWFTVNEWESHIHVVPLWTLASFKRLKHVSNQYANRFSLVVAFSPTG
WALSKGKKKSPGRRWQQGTIIRYEVPYSEHSSFSELEDFVKLVSPANIIPSVNNHGPDSARAMISLLSPLCGTT