CuGenDBv2

Clc07G00120 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc07G00120
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Endo/exonuclease/phosphatase domain-containing protein
Genome location: ClcChr07:162077..164529
RNA-Seq Expression: Clc07G00120
Synteny: Clc07G00120
Gene Ontology terms:
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
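The genome location above is written as `chromosome:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and used only for illustration.

```python
import re

# Pattern for "chromosome:start..end" as it appears in this record.
LOCATION_RE = re.compile(r"^(\S+):(\d+)\.\.(\d+)$")


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. Adjust if the database uses another scheme.
    """
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("ClcChr07:162077..164529"))
```

Under the inclusive-coordinate assumption, the gene region for Clc07G00120 spans 2453 bp.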


Homology
GenBank top hits | e value | % identity | Alignment
XP_004140218.2 uncharacterized protein LOC101212223 [Cucumis sativus] | 9.5e-227 | 84.51
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
        MLKFLNR LRRLCSRLRWPRRR IRPRV+++K+FGKT S+T + P+K+IDSFVN  SS SAVHPNSQFH L TQRPIRIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN

Query:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKS
        SSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNS++             A+ KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSSS NDRKGMR  KS
Subjt:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKS

Query:  GLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR
        G PLRW+VSM SER    +YRCSRTVVEVLRELDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLK++MQYRDAKEF
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QSHLHSHSLPPW-KRWT
        GGECESVVMIAKGQ+VQGTCKYGTRVDYI+ASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPRP      Q+ LHSHS+ PW KRWT
Subjt:  GGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QSHLHSHSLPPW-KRWT

XP_008449463.1 PREDICTED: uncharacterized protein LOC103491341 [Cucumis melo] | 1.9e-227 | 86.03
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
        MLKFLNR LRRLCSRLRWPRRRRIRPRV+V+K+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQRPIRIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN

Query:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK
        SSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNS+ SGV           AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSSS NDR+GM   K
Subjt:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
        SG PLRW+VSM SER    SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
Subjt:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLK++MQYRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT
        +GGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQ+ LHSHSL PW KRWT
Subjt:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT

XP_022963872.1 uncharacterized protein LOC111464053 [Cucurbita moschata] | 3.4e-224 | 87.18
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNS
        ML FLNRSLRRLCSRLRWPR RR+RPRVVV+K+FGKT SK  ADP  ++DSFVN S AS VHP  QFHG NTQRP+RIATFNAASFSMAPAVP AEKSNS
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNS

Query:  SAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDRKGMRRVK
        SAKFRRSLDS+ RTKS ND PKSILKQSPLH NS++G V NH+L TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SSS ND  GMR  K
Subjt:  SAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
        S  PLR  VSM  ERE GESYRCSRTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TDF
Subjt:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLK+ M YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+ADYEFV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

XP_022967537.1 uncharacterized protein LOC111467014 [Cucurbita maxima] | 3.4e-224 | 86.11
Query:  THSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVP
        T S +S MLKFLNRSLRRLC+RLRWPR RR+RPRVVV+K+FGKT SK  ADP  ++DSFVNGS AS VHP  QF GLN  RP+RIATFNAASFSMAPAVP
Subjt:  THSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVP

Query:  CAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDR
         AEKSNSSAKFRRSLDS+ RTKS ND PKSILKQSPLH N+++G V NH+L TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SSS ND 
Subjt:  CAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDR

Query:  KGMRRVKSGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK
         GMR  KS  PLR  VSM SERE GESYRC+RTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK
Subjt:  KGMRRVKSGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK

Query:  IFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSM
        IFD TDFRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLK+SM
Subjt:  IFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSM

Query:  QYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
         YRDAKEFGGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+ADY+FV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  QYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

XP_038887606.1 uncharacterized protein LOC120077715 [Benincasa hispida] | 4.9e-255 | 91.02
Query:  MLSPTHSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMA
        MLSPTHSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVV++K+FGKT SKTK++P KSIDSFVN SS SAVHPNSQFHGLNTQRPIRIATFNAASFSMA
Subjt:  MLSPTHSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMA

Query:  PAVPCAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSN
        PAVP  EKSNSSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNSM+GVENH+L      AKAKPRVSINLPDNEISLLRNRQASFSEYEMEE+ SSS N
Subjt:  PAVPCAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSN

Query:  DRKGMRRVKSGLPLRWSVSMASERENG-ESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWK
        DRK MRR KS  PLRW+VSM SERENG ESYRCSRT+VEVLRELDADILALQDVKA EEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWK
Subjt:  DRKGMRRVKSGLPLRWSVSMASERENG-ESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWK

Query:  VEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLK
         EKIFDDTDFRNVLK TIDVEEVGEVNVQCT+LDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYS++RWMDIVKYYEEIGKPTPEAKVTKFLK
Subjt:  VEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLK

Query:  NSMQYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK-PPHPPQPRPQSHLHSHSLPPWKRW
        +SMQYRDAKEFGGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+ADYEFVQGSYSVLSSKGTSDHHIVKVDFLK PPHPPQPRPQ+ LHSHSL PWKRW
Subjt:  NSMQYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK-PPHPPQPRPQSHLHSHSLPPWKRW

Query:  T
        T
Subjt:  T

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KDU0 Endo/exonuclease/phosphatase domain-containing protein | 4.6e-227 | 84.51
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
        MLKFLNR LRRLCSRLRWPRRR IRPRV+++K+FGKT S+T + P+K+IDSFVN  SS SAVHPNSQFH L TQRPIRIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN

Query:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKS
        SSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNS++             A+ KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSSS NDRKGMR  KS
Subjt:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKS

Query:  GLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR
        G PLRW+VSM SER    +YRCSRTVVEVLRELDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLK++MQYRDAKEF
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QSHLHSHSLPPW-KRWT
        GGECESVVMIAKGQ+VQGTCKYGTRVDYI+ASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPRP      Q+ LHSHS+ PW KRWT
Subjt:  GGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QSHLHSHSLPPW-KRWT

A0A1S3BLG5 uncharacterized protein LOC103491341 | 9.3e-228 | 86.03
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
        MLKFLNR LRRLCSRLRWPRRRRIRPRV+V+K+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQRPIRIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN

Query:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK
        SSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNS+ SGV           AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSSS NDR+GM   K
Subjt:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
        SG PLRW+VSM SER    SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
Subjt:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLK++MQYRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT
        +GGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQ+ LHSHSL PW KRWT
Subjt:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT

A0A5A7UUR9 DNAse I-like superfamily protein | 9.3e-228 | 86.03
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
        MLKFLNR LRRLCSRLRWPRRRRIRPRV+V+K+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQRPIRIATFNAASFSMAPAVP  EKSN
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN

Query:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK
        SSAKFRRSLDSNSRTKS ND PKSILKQSPLHTNS+ SGV           AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSSS NDR+GM   K
Subjt:  SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSM-SGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
        SG PLRW+VSM SER    SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
Subjt:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKVTKFLK++MQYRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT
        +GGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQ+ LHSHSL PW KRWT
Subjt:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQSHLHSHSLPPW-KRWT

A0A6J1HGD4 uncharacterized protein LOC111464053 | 1.6e-224 | 87.18
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNS
        ML FLNRSLRRLCSRLRWPR RR+RPRVVV+K+FGKT SK  ADP  ++DSFVN S AS VHP  QFHG NTQRP+RIATFNAASFSMAPAVP AEKSNS
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNS

Query:  SAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDRKGMRRVK
        SAKFRRSLDS+ RTKS ND PKSILKQSPLH NS++G V NH+L TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SSS ND  GMR  K
Subjt:  SAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF
        S  PLR  VSM  ERE GESYRCSRTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFD TDF
Subjt:  SGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLK+ M YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+ADYEFV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

A0A6J1HR38 uncharacterized protein LOC111467014 | 1.6e-224 | 86.11
Query:  THSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVP
        T S +S MLKFLNRSLRRLC+RLRWPR RR+RPRVVV+K+FGKT SK  ADP  ++DSFVNGS AS VHP  QF GLN  RP+RIATFNAASFSMAPAVP
Subjt:  THSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVP

Query:  CAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDR
         AEKSNSSAKFRRSLDS+ RTKS ND PKSILKQSPLH N+++G V NH+L TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SSS ND 
Subjt:  CAEKSNSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSG-VENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSSSNDR

Query:  KGMRRVKSGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK
         GMR  KS  PLR  VSM SERE GESYRC+RTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK
Subjt:  KGMRRVKSGLPLRWSVSMASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEK

Query:  IFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSM
        IFD TDFRNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEEIGKPTPEAKV KFLK+SM
Subjt:  IFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSM

Query:  QYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
         YRDAKEFGGECESVVMIAKGQ+VQGTCKYGTRVDYILASP+ADY+FV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  QYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G48030.1 DNAse I-like superfamily protein | 2.5e-116 | 56.28
Query:  RRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNG-SSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNSSAKFRRSLDSNSRTKSGN
        RRR  RPR     R          D         NG SSA+A+HP       N  + I +ATFNAA FSMAPAVP    SN    F        R+KS  
Subjt:  RRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNG-SSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSNSSAKFRRSLDSNSRTKSGN

Query:  DHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKP-RVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKSGLPLRWSVSMASERENGE
        D PKSILK  P++    +    H    QQ+FAK++P RVSINLPDNEIS    RQ SF E      L                            R    
Subjt:  DHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKP-RVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKSGLPLRWSVSMASERENGE

Query:  SYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQ
          R +RT +EVL ELDAD+LALQDVKA E  +MRPLSDLA ALGM YVFAESWAPEYGNA+LS+WPIK   V +IFD TDFRNVLKA+I+V   GEV   
Subjt:  SYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQ

Query:  CTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKGQNVQG
        CTHLDHLDE WRMKQ+ +II+ST N PHIL G LNSLD +DYS +RW DIVKYYEE+GKP P+A+V +FLK S +Y DAK+F GECESVV++AKGQ+VQG
Subjt:  CTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKGQNVQG

Query:  TCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK
        TCKYGTRVDYILAS ++ Y FV GSYSVLSSKGTSDHHIVKVD +K
Subjt:  TCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK

AT3G21530.1 DNAse I-like superfamily protein | 1.7e-120 | 55.03
Query:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKT--ASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKS
        ML    R L  L SRLRW  ++R+R R V+V+RF K    ++ K  PE         S  S++H +S     N+ R IR+ATFN A FS+AP V   E++
Subjt:  MLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKT--ASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKS

Query:  NSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK
             F   LDS++ T      PK ILKQSPLH++                A  KP+V INLPDNEISL +    S+S   M E      ND  G +  +
Subjt:  NSSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVK

Query:  SGLPLRWSVSMAS---ERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDD
          L +R  V + S   ++E+   Y   R++ E+LRELDADILALQDVKA EE  M+PLSDLA ALGMKYVFAESWAPEYGNA+LS+WPIK+W+V++I D 
Subjt:  SGLPLRWSVSMAS---ERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDD

Query:  TDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRD
         DFRNVLK T+++   G+VNV CT LDHLDENWRMKQI +I R  +  PHILLGGLNSLD +DYS  RW  IVKYYE+ GKPTP  +V +FLK    Y D
Subjt:  TDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRD

Query:  AKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFL
        +KEF GECE VV+IAKGQNVQGTCKYGTRVDYILASP + YEFV GSYSV+SSKGTSDHHIVKVD +
Subjt:  AKEFGGECESVVMIAKGQNVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFL
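In the alignments above, the unlabeled middle line marks each column: a letter where query and subject are identical, `+` where the substitution is conservative, and a space otherwise. As a rough sketch of how a percent-identity figure relates to such a block, the snippet below counts those marks for one query/midline pair. The function name `midline_stats` is hypothetical. The values it produces for a single block will not necessarily match the whole-hit percentages listed in the tables, and whitespace in a scraped midline may have been altered.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    query   -- the aligned query string (may contain '-' gaps)
    midline -- the match line shown between Query and Subjt
    Returns (identities, positives, columns, percent_identity).
    """
    # Trailing spaces are often trimmed from the midline; pad it back
    # so that it has one character per alignment column.
    midline = midline.ljust(len(query))
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # A letter on the midline marks an identical residue.
    identities = sum(1 for m in midline[:columns] if m.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline[:columns].count("+")
    percent_identity = 100.0 * identities / columns
    return identities, positives, columns, percent_identity


if __name__ == "__main__":
    # Short slice taken from one of the alignment blocks above.
    print(midline_stats("KGQNVQGT", "KGQ+VQGT"))
```

Summing identities and columns over all blocks of one hit before dividing gives a per-hit figure, under the assumption that the midline whitespace survived extraction intact.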


Sequences
CDS sequence
ATGCTTTCACCCACCCATTCTTCCACTTCTACAATGCTCAAGTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCCCGTCGTCGGAGAATCAGACC
TAGGGTAGTCGTCGTCAAGAGGTTTGGAAAAACCGCCTCCAAAACTAAGGCTGATCCCGAGAAAAGTATCGATTCCTTCGTTAATGGGTCGTCGGCGTCAGCGGTTCATC
CCAATTCTCAATTTCACGGTCTTAATACACAGAGACCCATACGAATTGCGACATTTAATGCCGCCTCCTTCTCTATGGCACCTGCTGTTCCTTGCGCTGAAAAATCCAAT
TCTTCTGCCAAATTCCGACGGAGTTTAGATTCCAATTCACGGACAAAATCCGGAAATGATCACCCCAAAAGCATTTTGAAACAGTCTCCATTGCATACCAATTCCATGAG
TGGAGTTGAAAATCATAGCCTCTCGACACAACAGAAATTCGCGAAAGCCAAGCCGCGGGTTTCGATCAACCTTCCTGATAACGAAATATCTTTACTAAGAAATCGACAGG
CGAGCTTTTCGGAGTACGAAATGGAGGAGGACCTATCTTCTTCAAGTAACGATAGGAAGGGAATGCGGAGAGTTAAGAGTGGACTTCCTCTGAGATGGTCTGTAAGCATG
GCTTCGGAGCGGGAGAATGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGAGAGTTGGATGCTGACATATTGGCGTTGCAAGATGTGAAGGCGGTGGA
AGAGAAAGAGATGAGACCACTTTCAGATTTGGCCGATGCTTTAGGAATGAAGTACGTTTTTGCTGAGAGCTGGGCGCCGGAGTACGGAAATGCGGTCTTGTCTCGGTGGC
CCATCAAACGCTGGAAAGTTGAGAAGATCTTCGACGACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGAAGAAGTAGGAGAGGTAAATGTGCAGTGTACC
CATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAGTCCATAATCCGATCAACCAACAATGAACCCCACATCTTATTAGGAGGCCTCAATTCTCTGGATCC
CACGGATTACTCTCAACAAAGGTGGATGGACATTGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTCACCAAGTTTCTAAAAAACAGTATGCAAT
ATAGGGATGCAAAAGAGTTTGGAGGAGAATGCGAGTCAGTGGTGATGATCGCCAAAGGACAAAATGTTCAAGGGACGTGCAAGTACGGGACTCGGGTGGACTACATATTG
GCCTCTCCCAACGCAGATTACGAGTTTGTACAAGGATCCTACTCCGTCCTTTCCTCCAAAGGAACCTCCGATCATCACATTGTCAAAGTCGATTTCCTCAAACCTCCTCA
TCCTCCACAGCCTCGGCCTCAAAGTCACCTCCATTCACATTCCCTTCCCCCTTGGAAGAGATGGACATGA
mRNA sequence
ATGCTTTCACCCACCCATTCTTCCACTTCTACAATGCTCAAGTTCCTCAACCGGAGCCTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCCCGTCGTCGGAGAATCAGACC
TAGGGTAGTCGTCGTCAAGAGGTTTGGAAAAACCGCCTCCAAAACTAAGGCTGATCCCGAGAAAAGTATCGATTCCTTCGTTAATGGGTCGTCGGCGTCAGCGGTTCATC
CCAATTCTCAATTTCACGGTCTTAATACACAGAGACCCATACGAATTGCGACATTTAATGCCGCCTCCTTCTCTATGGCACCTGCTGTTCCTTGCGCTGAAAAATCCAAT
TCTTCTGCCAAATTCCGACGGAGTTTAGATTCCAATTCACGGACAAAATCCGGAAATGATCACCCCAAAAGCATTTTGAAACAGTCTCCATTGCATACCAATTCCATGAG
TGGAGTTGAAAATCATAGCCTCTCGACACAACAGAAATTCGCGAAAGCCAAGCCGCGGGTTTCGATCAACCTTCCTGATAACGAAATATCTTTACTAAGAAATCGACAGG
CGAGCTTTTCGGAGTACGAAATGGAGGAGGACCTATCTTCTTCAAGTAACGATAGGAAGGGAATGCGGAGAGTTAAGAGTGGACTTCCTCTGAGATGGTCTGTAAGCATG
GCTTCGGAGCGGGAGAATGGGGAGAGTTACAGATGCAGTAGGACGGTTGTGGAGGTGCTTAGAGAGTTGGATGCTGACATATTGGCGTTGCAAGATGTGAAGGCGGTGGA
AGAGAAAGAGATGAGACCACTTTCAGATTTGGCCGATGCTTTAGGAATGAAGTACGTTTTTGCTGAGAGCTGGGCGCCGGAGTACGGAAATGCGGTCTTGTCTCGGTGGC
CCATCAAACGCTGGAAAGTTGAGAAGATCTTCGACGACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGAAGAAGTAGGAGAGGTAAATGTGCAGTGTACC
CATTTGGATCATCTGGACGAGAATTGGAGGATGAAACAGATAAAGTCCATAATCCGATCAACCAACAATGAACCCCACATCTTATTAGGAGGCCTCAATTCTCTGGATCC
CACGGATTACTCTCAACAAAGGTGGATGGACATTGTGAAGTATTACGAAGAGATAGGAAAGCCAACTCCGGAAGCTAAAGTCACCAAGTTTCTAAAAAACAGTATGCAAT
ATAGGGATGCAAAAGAGTTTGGAGGAGAATGCGAGTCAGTGGTGATGATCGCCAAAGGACAAAATGTTCAAGGGACGTGCAAGTACGGGACTCGGGTGGACTACATATTG
GCCTCTCCCAACGCAGATTACGAGTTTGTACAAGGATCCTACTCCGTCCTTTCCTCCAAAGGAACCTCCGATCATCACATTGTCAAAGTCGATTTCCTCAAACCTCCTCA
TCCTCCACAGCCTCGGCCTCAAAGTCACCTCCATTCACATTCCCTTCCCCCTTGGAAGAGATGGACATGA
Protein sequence
MLSPTHSSTSTMLKFLNRSLRRLCSRLRWPRRRRIRPRVVVVKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRPIRIATFNAASFSMAPAVPCAEKSN
SSAKFRRSLDSNSRTKSGNDHPKSILKQSPLHTNSMSGVENHSLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSSSNDRKGMRRVKSGLPLRWSVSM
ASERENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAVLSRWPIKRWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQCT
HLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEEIGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKGQNVQGTCKYGTRVDYIL
ASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPHPPQPRPQSHLHSHSLPPWKRWT
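
The CDS and protein sequences above are related by translation. As a minimal sketch, the snippet below translates a CDS using the standard genetic code, which is an assumption suitable for a nuclear plant gene, and stops at the first stop codon. The helper name `translate` is hypothetical, and no external library is used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string, reading from position 0 in frame.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Unknown codons
    (for example those containing N) become 'X'. Translation
    stops at the first stop codon.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS listed above.
    print(translate("ATGCTTTCACCCACCCATTCTTCC"))
```

Translating the first eight codons of the CDS gives `MLSPTHSS`, which matches the start of the protein sequence listed above. Passing the full CDS as one string allows the same comparison over the entire length.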