; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg02435 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg02435
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPlastid envelope DNA binding protein
Genome locationCarg_Chr15:2382629..2384911
RNA-Seq ExpressionCarg02435
SyntenyCarg02435
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578787.1 hypothetical protein SDJN03_23235, partial [Cucurbita argyrosperma subsp. sororia]6.1e-203100Show/hide
Query:  MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE
        MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE
Subjt:  MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE

Query:  PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT
        PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT
Subjt:  PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT

Query:  MELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK
        MELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK
Subjt:  MELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK

Query:  AFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE
        AFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE
Subjt:  AFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE

KAG7016318.1 hypothetical protein SDJN02_21425, partial [Cucurbita argyrosperma subsp. argyrosperma]5.0e-229100Show/hide
Query:  EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL
        EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL
Subjt:  EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL

Query:  EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH
        EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH
Subjt:  EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH

Query:  VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGS
        VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGS
Subjt:  VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGS

Query:  TRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPL
        TRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPL
Subjt:  TRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPL

Query:  LEIFKAFISALVKFWSE
        LEIFKAFISALVKFWSE
Subjt:  LEIFKAFISALVKFWSE

XP_022939659.1 uncharacterized protein LOC111445486 [Cucurbita moschata]7.2e-22098.29Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
        VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS RKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSK KEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI

Query:  SALVKFWSE
         A VKFWSE
Subjt:  SALVKFWSE

XP_022992975.1 uncharacterized protein LOC111489141 [Cucurbita maxima]1.4e-21596.09Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALA+SNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLN+THKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISS+EVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVT+INGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFT DV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
        VVETFPL+SISWA++SSDVRSET ISTSASEKQPS TMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFD+TSKIKEET +ENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI

Query:  SALVKFWSE
         A VKFWSE
Subjt:  SALVKFWSE

XP_023549620.1 uncharacterized protein LOC111808067 [Cucurbita pepo subsp. pepo]2.9e-21396.09Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTD+S
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVD PVNYNQYIN EPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDK+SDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
        VVETFPL+SIS+A+DSSDVRSE  ISTSASEKQPSHTMELGSDVGLVN KGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGST+KE SIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSK KEETG+ENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI

Query:  SALVKFWSE
        SA VKFWSE
Subjt:  SALVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X12.6e-15468.61Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH
        MHAIKGGW GRPLALAK+NE+EGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGK LL EEH+TDH
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH

Query:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------
        SL++NPLHSIAIEPQSPLT+SS+EV  P+NYN+YINEEPIFVSDEQCT+TNIQGSQN ++INGS  D+S++DSDE                         
Subjt:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------

Query:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY
                         KVEEVV+EESGMP NHVTP  TDVVVETFPL+ + W ++ SDVRSE  IST+ASEKQ S ++EL SDVGL NI     TAS  
Subjt:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY

Query:  VVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEET
        VV+KA EN A PLSETKSDLVEVAQIVE SNGST KEGS++EV GP      DTPI V  EQGQKSS+ K+P AS    +NLNK FSN FDQ SKI+   
Subjt:  VVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEET

Query:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE
         IEN+V+  Q GGS      TLNRINLESWEGMSKNSSKPENNPLLEI K+FI+A VKFWSE
Subjt:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein5.7e-15468.4Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH
        MHAIKGGW GRPLALAK+NE+EGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGK LL EEH+TDH
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH

Query:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------
        SL++NPLHSIAIEPQSPLT+SS+EV  P+NYN+YINEEPIFVSDEQCT+TNIQGSQN ++INGS  D+S++DSDE                         
Subjt:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------

Query:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY
                         KVEEVV+EESGMP NHVTP  TDVVVETFPL+ + W ++ SDVRSE  IST+ASEKQ S ++EL SDVGL NI     TAS  
Subjt:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY

Query:  VVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEET
        VV+KA EN A PLSETKSDLVEVAQIVE SNGST KEGS++EV GP      DTPI V  EQGQKSS+ K+P AS    +NLNK FSN FDQ SKI+   
Subjt:  VVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEET

Query:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE
         IEN+V+  Q GGS      TLNRINLESWEGMSKNSSKPENNPLLEI K+FI+A VKFWS+
Subjt:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE

A0A6J1E1K4 uncharacterized protein LOC1114297501.9e-14971.59Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGW G PLALAK NESEGRKTR RRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGK  LEEHSTDH 
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSD-EQCTSTNIQGSQNVTVINGSPADMSDKDSDEL----------KKVEEVVREESGMP
        LEENPLHSIAIEPQSPLT  SEE D P+N+N  INEEPI VSD EQ TS NIQGSQN  +INGS  D SDKDSDE+          KK+EEV++EESGMP
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSD-EQCTSTNIQGSQNVTVINGSPADMSDKDSDEL----------KKVEEVVREESGMP

Query:  FNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNST-ASGYVVKKADENCAAPLSETKSDLVEVAQIVET
         NHVTP   DV V TFPL+S SWA + SDV SET IST ASEK+ S  +EL SDV L N + NNST ASG    +ADE     LSET SDLVEVAQIVE 
Subjt:  FNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNST-ASGYVVKKADENCAAPLSETKSDLVEVAQIVET

Query:  SNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGS------TLNRINLES
        +NG+  K+G I+EVEGP      DTPI V  EQGQKSSE KAPNASPS TKNLN + +N  DQ SKIKEET ++N+VEAEQ GGS      TLNR+NL+S
Subjt:  SNGSTRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGS------TLNRINLES

Query:  WEGMSKNSSKPENNPLLEIFKAFISALVKFWSE
        W G SK+SSKPENNPLLEI  AFI+A VKFWSE
Subjt:  WEGMSKNSSKPENNPLLEIFKAFISALVKFWSE

A0A6J1FHV2 uncharacterized protein LOC1114454863.5e-22098.29Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
        VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS RKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSK KEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI

Query:  SALVKFWSE
         A VKFWSE
Subjt:  SALVKFWSE

A0A6J1JX82 uncharacterized protein LOC1114891416.8e-21696.09Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALA+SNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLN+THKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISS+EVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVT+INGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFT DV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
        VVETFPL+SISWA++SSDVRSET ISTSASEKQPS TMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFD+TSKIKEET +ENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFI

Query:  SALVKFWSE
         A VKFWSE
Subjt:  SALVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding5.1e-3829.54Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E
        MH++K   +G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG  LLE      +
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E

Query:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------
         S   S+  +P+  +++ P          L  SSE  +  VN +Q             + +E I +  +   ST+I  +Q  T                 
Subjt:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------

Query:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST
               ++  P     D+ +KD          SD  K V    R  ++G     +         + + VVETFPL S++  +DS D +    ++     
Subjt:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST

Query:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSE
          +E +        + V L  I  + S+A      +  +V +   + + P+ +   + +  +  V+      ++   +  V G        S  G  ++E
Subjt:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSE

Query:  TKAPNASPSVTKNLN------KAFSNSFDQTSKIKEETGIE-------NEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWS
         K P +S       N         S + ++ + ++++  +E       +   +++   +TLNRI  ESW+G S N  + E NPLL + K+F++A VKFWS
Subjt:  TKAPNASPSVTKNLN------KAFSNSFDQTSKIKEETGIE-------NEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWS

Query:  E
        E
Subjt:  E

AT3G52170.2 DNA binding5.1e-3829.54Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E
        MH++K   +G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG  LLE      +
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E

Query:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------
         S   S+  +P+  +++ P          L  SSE  +  VN +Q             + +E I +  +   ST+I  +Q  T                 
Subjt:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------

Query:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST
               ++  P     D+ +KD          SD  K V    R  ++G     +         + + VVETFPL S++  +DS D +    ++     
Subjt:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST

Query:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSE
          +E +        + V L  I  + S+A      +  +V +   + + P+ +   + +  +  V+      ++   +  V G        S  G  ++E
Subjt:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSE

Query:  TKAPNASPSVTKNLN------KAFSNSFDQTSKIKEETGIE-------NEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWS
         K P +S       N         S + ++ + ++++  +E       +   +++   +TLNRI  ESW+G S N  + E NPLL + K+F++A VKFWS
Subjt:  TKAPNASPSVTKNLN------KAFSNSFDQTSKIKEETGIE-------NEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWS

Query:  E
        E
Subjt:  E

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.9e-0823.48Show/hide
Query:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI
        K   R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +
Subjt:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI

Query:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV
        S      P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++     
Subjt:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV

Query:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD
           + +      +  S  +E+GS+                 N +GNN   +    K+  E   A      S+  E   +    N    +E     +EG D
Subjt:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD

Query:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE
            V+  + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.9e-0823.48Show/hide
Query:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI
        K   R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +
Subjt:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI

Query:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV
        S      P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++     
Subjt:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV

Query:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD
           + +      +  S  +E+GS+                 N +GNN   +    K+  E   A      S+  E   +    N    +E     +EG D
Subjt:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD

Query:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE
            V+  + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.9e-0823.48Show/hide
Query:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI
        K   R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +
Subjt:  KTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTI

Query:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV
        S      P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++     
Subjt:  SSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDV

Query:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD
           + +      +  S  +E+GS+                 N +GNN   +    K+  E   A      S+  E   +    N    +E     +EG D
Subjt:  RSETSISTSASEKQPSHTMELGSDV-------------GLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPD

Query:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE
            V+  + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  TPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKIKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAGTGTTGAATTTTTTGGATTTCATGCATGCTATAAAGGGTGGGTGGATAGGGCGTCCTCTTGCCCTAGCCAAAAGCAATGAGTCCGAAGGGAGGAAGACTAGAACTCG
GCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTG
GATCTTTCTATACAGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCCGGAAAGTGGTTATTAGAAGAGCACAGCACTGATCATTCACTTGAA
GAGAATCCACTCCATTCGATTGCTATTGAACCTCAATCTCCTTTAACGATATCATCAGAGGAAGTTGATTCTCCAGTCAACTACAACCAATACATAAATGAAGAGCCAAT
CTTCGTGTCAGATGAGCAATGCACTTCAACAAATATTCAGGGATCACAGAATGTGACAGTAATTAATGGCAGCCCGGCAGATATGAGTGACAAGGATTCTGATGAACTCA
AGAAAGTAGAGGAAGTGGTGAGAGAGGAATCAGGAATGCCATTTAATCATGTAACTCCTTTTACAACAGATGTCGTGGTAGAGACATTCCCATTGAATTCGATTTCTTGG
GCCATCGATAGTTCAGATGTAAGATCTGAGACATCGATTTCAACCAGCGCCTCAGAAAAGCAACCGAGTCATACCATGGAGTTAGGATCAGATGTTGGCTTGGTTAACAT
TAAAGGTAATAATTCCACGGCTTCTGGTTATGTAGTCAAGAAAGCAGATGAAAATTGTGCTGCGCCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTAGCACAAATTG
TTGAAACCTCTAATGGATCAACTAGGAAAGAAGGTAGCATATATGAAGTTGAGGGTCCTGATACTCCAATACCTGTAATCTCTGAACAAGGCCAGAAATCTAGTGAAACC
AAGGCTCCTAATGCTTCTCCAAGTGTTACCAAGAATCTCAACAAGGCATTCAGCAATAGCTTTGACCAGACCTCAAAAATCAAAGAGGAGACCGGGATCGAAAACGAAGT
AGAGGCTGAACAGAATGGTGGCTCAACATTAAATAGAATCAATCTTGAATCATGGGAAGGGATGTCCAAAAATTCTTCGAAACCCGAAAACAACCCGCTTTTGGAAATCT
TCAAAGCATTCATCAGTGCATTGGTGAAGTTTTGGTCCGAGTAA
mRNA sequenceShow/hide mRNA sequence
GAAGTGTTGAATTTTTTGGATTTCATGCATGCTATAAAGGGTGGGTGGATAGGGCGTCCTCTTGCCCTAGCCAAAAGCAATGAGTCCGAAGGGAGGAAGACTAGAACTCG
GCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTTGGTG
GATCTTTCTATACAGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCCGGAAAGTGGTTATTAGAAGAGCACAGCACTGATCATTCACTTGAA
GAGAATCCACTCCATTCGATTGCTATTGAACCTCAATCTCCTTTAACGATATCATCAGAGGAAGTTGATTCTCCAGTCAACTACAACCAATACATAAATGAAGAGCCAAT
CTTCGTGTCAGATGAGCAATGCACTTCAACAAATATTCAGGGATCACAGAATGTGACAGTAATTAATGGCAGCCCGGCAGATATGAGTGACAAGGATTCTGATGAACTCA
AGAAAGTAGAGGAAGTGGTGAGAGAGGAATCAGGAATGCCATTTAATCATGTAACTCCTTTTACAACAGATGTCGTGGTAGAGACATTCCCATTGAATTCGATTTCTTGG
GCCATCGATAGTTCAGATGTAAGATCTGAGACATCGATTTCAACCAGCGCCTCAGAAAAGCAACCGAGTCATACCATGGAGTTAGGATCAGATGTTGGCTTGGTTAACAT
TAAAGGTAATAATTCCACGGCTTCTGGTTATGTAGTCAAGAAAGCAGATGAAAATTGTGCTGCGCCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTAGCACAAATTG
TTGAAACCTCTAATGGATCAACTAGGAAAGAAGGTAGCATATATGAAGTTGAGGGTCCTGATACTCCAATACCTGTAATCTCTGAACAAGGCCAGAAATCTAGTGAAACC
AAGGCTCCTAATGCTTCTCCAAGTGTTACCAAGAATCTCAACAAGGCATTCAGCAATAGCTTTGACCAGACCTCAAAAATCAAAGAGGAGACCGGGATCGAAAACGAAGT
AGAGGCTGAACAGAATGGTGGCTCAACATTAAATAGAATCAATCTTGAATCATGGGAAGGGATGTCCAAAAATTCTTCGAAACCCGAAAACAACCCGCTTTTGGAAATCT
TCAAAGCATTCATCAGTGCATTGGTGAAGTTTTGGTCCGAGTAAGTTCTATGATTGTTAGGTAGACGAGTAGAGAGGAGTTATTTTCCTACCACAGAACCTGTCCTGTCT
CCGTACCAAGTTGCAGTCAGTTTCCCCGTTCGCTCCAGTCGAGATCCTCGATGTTTATGAGTTAGGAAACTGGATATTACTGGATGTGGGTTGGCACTTCTGTAGTCTGC
AGTAAGAAAGAAGGTTTAGGAGTAGTAGGCATATCCCACCCCTCAGTTACAAAGTAAGTAAAAGCTTGTGTGTTTTCTTTTACTGTAACCATGAGCTATGATGCCCCATC
CCCAATGATTTTCTTCCAGATTTGAAGCTTTCCTATTATTTTTTCACAACCATATGATAAGAAAAAGGAATTTATATCTACGTAATCTTTCGACTC
Protein sequenceShow/hide protein sequence
EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRTRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLE
ENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISW
AIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADENCAAPLSETKSDLVEVAQIVETSNGSTRKEGSIYEVEGPDTPIPVISEQGQKSSET
KAPNASPSVTKNLNKAFSNSFDQTSKIKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPENNPLLEIFKAFISALVKFWSE