CuGenDBv2

HG10011931 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10011931
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Endo/exonuclease/phosphatase domain-containing protein
Genome location: Chr01:15516232..15520533
RNA-Seq Expression: HG10011931
Synteny: HG10011931
Gene Ontology terms:
  GO:0006506 - GPI anchor biosynthetic process (biological process)
  GO:0005783 - endoplasmic reticulum (cellular component)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains:
  IPR005135 - Endonuclease/exonuclease/phosphatase
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
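
The genome location above uses a "sequence:start..end" notation. As a minimal sketch of how such a string could be split into its parts, the following assumes the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

# Pattern for strings such as "Chr01:15516232..15520533".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Chr01:15516232..15520533")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene model spans 4302 bp on Chr01.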


Homology
GenBank top hits (e value, % identity, alignment)
XP_004140218.2 uncharacterized protein LOC101212223 [Cucumis sativus] (e value: 1.9e-224; % identity: 83.3)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN
        ML FLNR +RRLCSRLRWPRRR I+P+V++IK+FGKT S+T + P+K+IDSFVN  SS SAVHPNSQFH L TQR IRIATFNAASFSMAP VP  EKSN
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN

Query:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS
        SSA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNS+N             A+ KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSS GNDRKGMR AKS
Subjt:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS

Query:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
        G PLRW+VSM SE     +YRCSRTVVEVLRELDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEE GKPTPEAKVTKFLK++MQYRDAKEF
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QTHLHSHSLSPW-KIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYI+ASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPRP      QT LHSHS+SPW K WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QTHLHSHSLSPW-KIWT

XP_008449463.1 PREDICTED: uncharacterized protein LOC103491341 [Cucumis melo] (e value: 8.7e-225; % identity: 84.38)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN
        ML FLNR +RRLCSRLRWPRRRRI+P+V+VIK+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQR IRIATFNAASFSMAP VP  EKSN
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN

Query:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS
        SSA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNS+N             AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSS GNDR+GM  AKS
Subjt:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS

Query:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
        G PLRW+VSM SE     SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKVTKFLK++MQYRDAKE+
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQT LHSHSLSPW K WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT

XP_022963872.1 uncharacterized protein LOC111464053 [Cucurbita moschata] (e value: 4.3e-224; % identity: 86.54)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS
        MLNFLNRS+RRLCSRLRWPR RR++P+VVVIK+FGKT SK  ADP  ++DSFVN S AS VHP  QFHG NTQR +RIATFNAASFSMAP VPYAEKSNS
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS

Query:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK
        SA+FRRSLDS+ RTKS NDRPK ILKQSPLH NS+NG V NHNL TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SS GND  GMR AK
Subjt:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK

Query:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF
        S  PLR  VSM  E E GESYRCSRTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFD TDF
Subjt:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKV KFLK+ M YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQSVQGTCKYGTRVDYILASP+ADYEFV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

XP_023554061.1 uncharacterized protein LOC111811442 [Cucurbita pepo subsp. pepo] (e value: 1.6e-223; % identity: 86.32)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS
        MLNFLNR++RRLCSRLRWPR RR++P+VVVIK+FGKT SK  ADP  ++DSFVN S AS VHP  QF GLNT R +RIATFNAASFSMAP VPYAEKSNS
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS

Query:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK
        SA+FRRSLDS+ RT S NDRPK ILKQSPLH NS+NG V NHNL TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SS GND  GMR AK
Subjt:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK

Query:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF
        S  PLR  VSM SE E GESYRCSRTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFD TDF
Subjt:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKV KFLK+SM YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQSVQGTCKYGTRVDYILASP+ADYEFV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

XP_038887606.1 uncharacterized protein LOC120077715 [Benincasa hispida] (e value: 4.8e-247; % identity: 89.8)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS
        ML FLNRS+RRLCSRLRWPRRRRI+P+VV+IK+FGKT SKTK++P KSIDSFVN SS SAVHPNSQFHGLNTQR IRIATFNAASFSMAP VP  EKSNS
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS

Query:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKSG
        SA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNSMNGVENHNL      AKAKPRVSINLPDNEISLLRNRQASFSEYEMEE+ SS GNDRK MRRAKS 
Subjt:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKSG

Query:  LPLRWSVSMASEWENG-ESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
         PLRW+VSM SE ENG ESYRCSRT+VEVLRELDADILALQDVKA EEKEMRPLSDLADALGMKYVFAESWAPEYGNA+LSRWPIK WK EKIFDDTDFR
Subjt:  LPLRWSVSMASEWENG-ESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLK TIDVEEVGEVNVQCT+LDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYS++RWMDIVKYYEE GKPTPEAKVTKFLK+SMQYRDAKEF
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK-PPHPPQPRPQTHLHSHSLSPWKIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYILASP+ADYEFVQGSYSVLSSKGTSDHHIVKVDFLK PPHPPQPRPQT LHSHSLSPWK WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK-PPHPPQPRPQTHLHSHSLSPWKIWT
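
The % identity column can be related to the alignment blocks above: in BLAST-style pairwise output, the middle line repeats a residue where query and subject are identical, shows "+" for a conservative substitution, and a space otherwise. The sketch below counts these for a single block. The function name midline_stats is hypothetical, and dividing by the block length (gaps included) is an assumption about how the percentage is defined.

```python
def midline_stats(query, midline):
    """Count identities and '+' positives in one BLAST-style alignment block.

    query   -- the aligned query string (may contain '-' gap characters)
    midline -- the match line printed between query and subject
    """
    # Trailing spaces of the midline are often lost in copied text; restore them.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")

    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positive = sum(1 for m in midline if m == "+")
    length = len(query)
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        # Assumption: identity is reported relative to the aligned block length.
        "percent_identity": 100.0 * identical / length if length else 0.0,
    }


# Short excerpt taken from one of the alignment blocks above.
stats = midline_stats("ASPNADYEFVQG", "ASP+ADYEFV+G")
print(stats)
```

To approximate the reported value for a whole hit, the counts from all of its blocks would be summed before dividing.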

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KDU0 Endo/exonuclease/phosphatase domain-containing protein (e value: 9.4e-225; % identity: 83.3)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN
        ML FLNR +RRLCSRLRWPRRR I+P+V++IK+FGKT S+T + P+K+IDSFVN  SS SAVHPNSQFH L TQR IRIATFNAASFSMAP VP  EKSN
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVN-GSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN

Query:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS
        SSA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNS+N             A+ KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSS GNDRKGMR AKS
Subjt:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS

Query:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
        G PLRW+VSM SE     +YRCSRTVVEVLRELDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEE GKPTPEAKVTKFLK++MQYRDAKEF
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QTHLHSHSLSPW-KIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYI+ASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPRP      QT LHSHS+SPW K WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPRP------QTHLHSHSLSPW-KIWT

A0A1S3BLG5 uncharacterized protein LOC103491341 (e value: 4.2e-225; % identity: 84.38)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN
        ML FLNR +RRLCSRLRWPRRRRI+P+V+VIK+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQR IRIATFNAASFSMAP VP  EKSN
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN

Query:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS
        SSA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNS+N             AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSS GNDR+GM  AKS
Subjt:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS

Query:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
        G PLRW+VSM SE     SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKVTKFLK++MQYRDAKE+
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQT LHSHSLSPW K WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT

A0A5A7UUR9 DNAse I-like superfamily protein (e value: 4.2e-225; % identity: 84.38)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN
        ML FLNR +RRLCSRLRWPRRRRI+P+V+VIK+FGKT S +T + PEK+IDSFVN SS SAVHPNSQF+ LNTQR IRIATFNAASFSMAP VP  EKSN
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTAS-KTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSN

Query:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS
        SSA+FRRSLDSNSRTKS NDRPK ILKQSPLHTNS+N             AK KPRVSINLPDNEISLLRNRQA  SEYEMEE+LSS GNDR+GM  AKS
Subjt:  SSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKS

Query:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR
        G PLRW+VSM SE     SYRCSRTVVEVLR+LDADILALQDVKA EEK+MRPLSDLA+ALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFDDTDFR
Subjt:  GLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFR

Query:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF
        NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRS NNEPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKVTKFLK++MQYRDAKE+
Subjt:  NVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEF

Query:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT
        GGECESVVMIAKGQSVQGTCKYGTRVDYILASP+A+YEFVQGSYSV+SSKGTSDHHIVKVDFLK PH PPQPR  PQT LHSHSLSPW K WT
Subjt:  GGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH-PPQPR--PQTHLHSHSLSPW-KIWT

A0A6J1HGD4 uncharacterized protein LOC111464053 (e value: 2.1e-224; % identity: 86.54)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS
        MLNFLNRS+RRLCSRLRWPR RR++P+VVVIK+FGKT SK  ADP  ++DSFVN S AS VHP  QFHG NTQR +RIATFNAASFSMAP VPYAEKSNS
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS

Query:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK
        SA+FRRSLDS+ RTKS NDRPK ILKQSPLH NS+NG V NHNL TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SS GND  GMR AK
Subjt:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK

Query:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF
        S  PLR  VSM  E E GESYRCSRTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFD TDF
Subjt:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKV KFLK+ M YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQSVQGTCKYGTRVDYILASP+ADYEFV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

A0A6J1HR38 uncharacterized protein LOC111467014 (e value: 2.0e-222; % identity: 85.68)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS
        ML FLNRS+RRLC+RLRWPR RR++P+VVVIK+FGKT SK  ADP  ++DSFVNGS AS VHP  QF GLN  R +RIATFNAASFSMAP VPYAEKSNS
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNS

Query:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK
        SA+FRRSLDS+ RTKS NDRPK ILKQSPLH N++NG V NHNL TQ KF K KPRVSINLPDNEISLLRNRQASFSEYEME ED SS GND  GMR AK
Subjt:  SAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNG-VENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEME-EDLSSLGNDRKGMRRAK

Query:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF
        S  PLR  VSM SE E GESYRC+RTVVEVLRELDADILALQDVKAVEEK MRPLSDLADALGMKYVFAESWAPEYGNA+LSRWPIK WKVEKIFD TDF
Subjt:  SGLPLRWSVSMASEWENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDF

Query:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE
        RNVLKATIDV EVGEVNVQCTHLDHLDENWRMKQIKSIIRSTN+EPHILLGGLNSLDPTDYSQQRW DIVKYYEE GKPTPEAKV KFLK+SM YRDAKE
Subjt:  RNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKE

Query:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH
        FGGECESVVMIAKGQSVQGTCKYGTRVDYILASP+ADY+FV+GSYSVLSSKGTSDHHIVKVDFLKPPH
Subjt:  FGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLKPPH

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G48030.1 DNAse I-like superfamily protein (e value: 2.4e-116; % identity: 56.32)
Query:  RLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNG-SSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNSSAQFRRSLDSNSR
        RLR PR+ RI                   D         NG SSA+A+HP       N  + I +ATFNAA FSMAP VP    SN    F        R
Subjt:  RLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNG-SSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNSSAQFRRSLDSNSR

Query:  TKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKP-RVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKSGLPLRWSVSMASE
        +KS  DRPK ILK  P++  +      H+   QQ+FAK++P RVSINLPDNEIS    RQ SF E      L          R  + GL           
Subjt:  TKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKP-RVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKSGLPLRWSVSMASE

Query:  WENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFRNVLKATIDVEEVG
               R +RT +EVL ELDAD+LALQDVKA E  +MRPLSDLA ALGM YVFAESWAPEYGNAILS+WPIK   V +IFD TDFRNVLKA+I+V   G
Subjt:  WENGESYRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFRNVLKATIDVEEVG

Query:  EVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKG
        EV   CTHLDHLDE WRMKQ+ +II+ST N PHIL G LNSLD +DYS +RW DIVKYYEE GKP P+A+V +FLK S +Y DAK+F GECESVV++AKG
Subjt:  EVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKG

Query:  QSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK
        QSVQGTCKYGTRVDYILAS ++ Y FV GSYSVLSSKGTSDHHIVKVD +K
Subjt:  QSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFLK

AT3G21530.1 DNAse I-like superfamily protein (e value: 3.5e-123; % identity: 55.25)
Query:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKT--ASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKS
        ML    R +  L SRLRW  ++R++ +V+V +RF K    ++ K  PE         S  S++H +S     N+ RHIR+ATFN A FS+APVV   E++
Subjt:  MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKT--ASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKS

Query:  NSSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAK
             F   LDS++ T      PKGILKQSPLH++++                 KP+V INLPDNEISL ++   SF        LS + ND  G +  +
Subjt:  NSSAQFRRSLDSNSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAK

Query:  SGLPLRWSVSMASEWENGES---YRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDD
          L +R  V + S W + ES   Y   R++ E+LRELDADILALQDVKA EE  M+PLSDLA ALGMKYVFAESWAPEYGNAILS+WPIK W+V++I D 
Subjt:  SGLPLRWSVSMASEWENGES---YRCSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDD

Query:  TDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRD
         DFRNVLK T+++   G+VNV CT LDHLDENWRMKQI +I R  +  PHILLGGLNSLD +DYS  RW  IVKYYE++GKPTP  +V +FLK    Y D
Subjt:  TDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRMKQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRD

Query:  AKEFGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFL
        +KEF GECE VV+IAKGQ+VQGTCKYGTRVDYILASP + YEFV GSYSV+SSKGTSDHHIVKVD +
Subjt:  AKEFGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQGSYSVLSSKGTSDHHIVKVDFL


Sequences
CDS sequence
ATGCTCAACTTCCTCAACCGGAGCGTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTCGCCGGAGAATCAAACCTAAAGTAGTTGTCATCAAGAGGTTTGGAAAAAC
CGCCTCCAAAACTAAGGCCGATCCCGAGAAAAGTATCGATTCCTTCGTTAATGGGTCGTCGGCGTCAGCGGTTCATCCCAATTCTCAATTTCACGGTCTTAATACACAGA
GACACATACGAATTGCGACATTTAATGCTGCCTCCTTCTCCATGGCACCTGTTGTTCCTTACGCAGAAAAATCTAATTCGTCTGCTCAATTCCGACGGAGTTTAGATTCC
AATTCACGGACAAAATCCGGAAATGATCGCCCCAAAGGCATTTTGAAACAGTCTCCATTGCATACCAATTCCATGAATGGAGTTGAAAATCATAACCTCTCGACACAACA
GAAATTCGCGAAAGCCAAGCCGCGGGTTTCGATTAACCTGCCTGATAACGAAATATCTTTACTAAGAAATCGACAGGCGAGCTTTTCTGAGTATGAAATGGAGGAAGACC
TATCTTCTTTAGGTAACGATAGGAAGGGGATGCGGAGAGCTAAGAGTGGACTTCCTCTGAGGTGGTCTGTAAGCATGGCTTCGGAGTGGGAGAATGGGGAGAGTTACAGA
TGCAGTAGGACGGTTGTGGAGGTGCTTAGAGAGTTGGATGCTGACATATTGGCGTTGCAAGATGTGAAGGCGGTGGAAGAGAAAGAGATGAGACCGCTTTCAGATTTGGC
AGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAATACGGAAATGCGATTTTGTCTCGGTGGCCCATCAAACACTGGAAAGTCGAGAAGATCTTCG
ACGACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGAAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCACTTGGATCATCTGGATGAGAATTGGAGGATG
AAACAGATAAAGTCCATAATCCGATCAACCAACAATGAACCCCACATCTTATTAGGAGGCCTCAATTCTCTGGATCCCACGGATTACTCTCAACAAAGGTGGATGGACAT
TGTGAAGTATTACGAAGAGACAGGAAAGCCAACTCCGGAAGCTAAAGTCACCAAGTTTCTGAAAAACAGTATGCAATATAGGGATGCGAAAGAGTTTGGAGGAGAATGCG
AGTCGGTGGTGATGATCGCCAAAGGACAAAGTGTTCAAGGGACGTGCAAGTATGGGACTCGGGTGGACTACATATTGGCGTCTCCCAACGCAGATTATGAGTTTGTACAA
GGATCCTACTCCGTCCTTTCCTCCAAAGGAACCTCCGATCATCACATTGTCAAGGTCGATTTTCTCAAACCTCCCCATCCTCCACAGCCTCGGCCTCAAACTCACCTTCA
TTCACATTCCCTTTCCCCTTGGAAGATATGGACATGA
mRNA sequence
ATGCTCAACTTCCTCAACCGGAGCGTCCGCCGCCTCTGCTCCCGTCTCCGATGGCCTCGTCGCCGGAGAATCAAACCTAAAGTAGTTGTCATCAAGAGGTTTGGAAAAAC
CGCCTCCAAAACTAAGGCCGATCCCGAGAAAAGTATCGATTCCTTCGTTAATGGGTCGTCGGCGTCAGCGGTTCATCCCAATTCTCAATTTCACGGTCTTAATACACAGA
GACACATACGAATTGCGACATTTAATGCTGCCTCCTTCTCCATGGCACCTGTTGTTCCTTACGCAGAAAAATCTAATTCGTCTGCTCAATTCCGACGGAGTTTAGATTCC
AATTCACGGACAAAATCCGGAAATGATCGCCCCAAAGGCATTTTGAAACAGTCTCCATTGCATACCAATTCCATGAATGGAGTTGAAAATCATAACCTCTCGACACAACA
GAAATTCGCGAAAGCCAAGCCGCGGGTTTCGATTAACCTGCCTGATAACGAAATATCTTTACTAAGAAATCGACAGGCGAGCTTTTCTGAGTATGAAATGGAGGAAGACC
TATCTTCTTTAGGTAACGATAGGAAGGGGATGCGGAGAGCTAAGAGTGGACTTCCTCTGAGGTGGTCTGTAAGCATGGCTTCGGAGTGGGAGAATGGGGAGAGTTACAGA
TGCAGTAGGACGGTTGTGGAGGTGCTTAGAGAGTTGGATGCTGACATATTGGCGTTGCAAGATGTGAAGGCGGTGGAAGAGAAAGAGATGAGACCGCTTTCAGATTTGGC
AGATGCTTTGGGAATGAAGTACGTTTTTGCAGAGAGCTGGGCGCCGGAATACGGAAATGCGATTTTGTCTCGGTGGCCCATCAAACACTGGAAAGTCGAGAAGATCTTCG
ACGACACCGATTTCAGGAATGTGTTAAAAGCGACCATTGATGTGGAAGAAGTAGGAGAGGTAAATGTGCAGTGTACCCACTTGGATCATCTGGATGAGAATTGGAGGATG
AAACAGATAAAGTCCATAATCCGATCAACCAACAATGAACCCCACATCTTATTAGGAGGCCTCAATTCTCTGGATCCCACGGATTACTCTCAACAAAGGTGGATGGACAT
TGTGAAGTATTACGAAGAGACAGGAAAGCCAACTCCGGAAGCTAAAGTCACCAAGTTTCTGAAAAACAGTATGCAATATAGGGATGCGAAAGAGTTTGGAGGAGAATGCG
AGTCGGTGGTGATGATCGCCAAAGGACAAAGTGTTCAAGGGACGTGCAAGTATGGGACTCGGGTGGACTACATATTGGCGTCTCCCAACGCAGATTATGAGTTTGTACAA
GGATCCTACTCCGTCCTTTCCTCCAAAGGAACCTCCGATCATCACATTGTCAAGGTCGATTTTCTCAAACCTCCCCATCCTCCACAGCCTCGGCCTCAAACTCACCTTCA
TTCACATTCCCTTTCCCCTTGGAAGATATGGACATGA
Protein sequence
MLNFLNRSVRRLCSRLRWPRRRRIKPKVVVIKRFGKTASKTKADPEKSIDSFVNGSSASAVHPNSQFHGLNTQRHIRIATFNAASFSMAPVVPYAEKSNSSAQFRRSLDS
NSRTKSGNDRPKGILKQSPLHTNSMNGVENHNLSTQQKFAKAKPRVSINLPDNEISLLRNRQASFSEYEMEEDLSSLGNDRKGMRRAKSGLPLRWSVSMASEWENGESYR
CSRTVVEVLRELDADILALQDVKAVEEKEMRPLSDLADALGMKYVFAESWAPEYGNAILSRWPIKHWKVEKIFDDTDFRNVLKATIDVEEVGEVNVQCTHLDHLDENWRM
KQIKSIIRSTNNEPHILLGGLNSLDPTDYSQQRWMDIVKYYEETGKPTPEAKVTKFLKNSMQYRDAKEFGGECESVVMIAKGQSVQGTCKYGTRVDYILASPNADYEFVQ
GSYSVLSSKGTSDHHIVKVDFLKPPHPPQPRPQTHLHSHSLSPWKIWT
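
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on the page. The function name translate is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGCTCAACTTCCTCAACCGGAGCGTCCGC"))
```

Applied to the first ten codons, this yields MLNFLNRSVR, which matches the start of the protein sequence. Passing the full CDS (with its line breaks) should give a string that can be compared directly against the listed protein.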