; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G005010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G005010
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPlastid envelope DNA binding protein
Genome locationCmo_Chr15:2300655..2304527
RNA-Seq ExpressionCmoCh15G005010
SyntenyCmoCh15G005010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578787.1 hypothetical protein SDJN03_23235, partial [Cucurbita argyrosperma subsp. sororia]4.0e-19998.39Show/hide
Query:  MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE
        MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE
Subjt:  MVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEE

Query:  PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT
        PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT
Subjt:  PIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHT

Query:  MELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK
        MELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS RKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK
Subjt:  MELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNK

Query:  AFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE
        AFSNSFDQTSK KEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI A VKFWSE
Subjt:  AFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE

KAG7016318.1 hypothetical protein SDJN02_21425, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-22498.32Show/hide
Query:  EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL
        EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTR RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL
Subjt:  EVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL

Query:  EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH
        EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH
Subjt:  EEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNH

Query:  VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGS
        VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS
Subjt:  VTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGS

Query:  PRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPL
         RKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSK KEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPL
Subjt:  PRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPL

Query:  LEIFKAFIGAFVKFWSE
        LEIFKAFI A VKFWSE
Subjt:  LEIFKAFIGAFVKFWSE

XP_022939659.1 uncharacterized protein LOC111445486 [Cucurbita moschata]9.4e-225100Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
        VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI

Query:  GAFVKFWSE
        GAFVKFWSE
Subjt:  GAFVKFWSE

XP_022992975.1 uncharacterized protein LOC111489141 [Cucurbita maxima]6.8e-21595.84Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALA+SNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLN+THKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISS+EVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVT+INGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFT DV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
        VVETFPL+SISWA++SSDVRSET ISTSASEKQPS TMELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS RKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFD+TSK KEET +ENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI

Query:  GAFVKFWSE
        GAFVKFWSE
Subjt:  GAFVKFWSE

XP_023549620.1 uncharacterized protein LOC111808067 [Cucurbita pepo subsp. pepo]1.4e-21295.84Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTD+S
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVD PVNYNQYIN EPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDK+SDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
        VVETFPL+SIS+A+DSSDVRSE  ISTSASEKQPSHTMELGSDVGLVN KGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS +KE SIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETG+ENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI

Query:  GAFVKFWSE
         AFVKFWSE
Subjt:  GAFVKFWSE

TrEMBL top hitse value%identityAlignment
A0A1S3C473 uncharacterized protein LOC103496473 isoform X17.9e-15368.18Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH
        MHAIKGGW GRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGK LL EEH+TDH
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH

Query:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------
        SL++NPLHSIAIEPQSPLT+SS+EV  P+NYN+YINEEPIFVSDEQCT+TNIQGSQN ++INGS  D+S++DSDE                         
Subjt:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------

Query:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY
                         KVEEVV+EESGMP NHVTP  TDVVVETFPL+ + W ++ SDVRSE  IST+ASEKQ S ++EL SDVGL NI     TAS  
Subjt:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY

Query:  VVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEET
        VV+KA  N A PLSETKSDLVEVAQIVE SNGS  KEGS++EV GP      DTPI V  EQGQKSS+ K+P AS    +NLNK FSN FDQ SK +   
Subjt:  VVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEET

Query:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE
         IEN+V+  Q GGS      TLNRINLESWEGMSKNSSKP+NNPLLEI K+FI AFVKFWSE
Subjt:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE

A0A5A7UUF2 Plastid envelope DNA binding protein7.7e-15666.67Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH
        MHAIKGGW GRPLALAK+NE+EGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENR+LGPGK LL EEH+TDH
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLL-EEHSTDH

Query:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------
        SL++NPLHSIAIEPQSPLT+SS+EV  P+NYN+YINEEPIFVSDEQCT+TNIQGSQN ++INGS  D+S++DSDE                         
Subjt:  SLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDEL------------------------

Query:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY
                         KVEEVV+EESGMP NHVTP  TDVVVETFPL+ + W ++ SDVRSE  IST+ASEKQ S ++EL SDVGL NI     TAS  
Subjt:  ----------------KKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGY

Query:  VVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEET
        VV+KA  N A PLSETKSDLVEVAQIVE SNGS  KEGS++EV GP      DTPI V  EQGQKSS+ K+P AS    +NLNK FSN FDQ SK +   
Subjt:  VVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEET

Query:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSEQVERSYFPTTEPVRLRTKLQS
         IEN+V+  Q GGS      TLNRINLESWEGMSKNSSKP+NNPLLEI K+FI AFVKFWS++V RS       + +RTKL S
Subjt:  GIENEVEAEQNGGS------TLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSEQVERSYFPTTEPVRLRTKLQS

A0A6J1E1K4 uncharacterized protein LOC1114297504.1e-14971.36Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGW G PLALAK NESEGRKTRIRRSKEERKAMVEVFIKKYQESN GSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGK  LEEHSTDH 
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSD-EQCTSTNIQGSQNVTVINGSPADMSDKDSDEL----------KKVEEVVREESGMP
        LEENPLHSIAIEPQSPLT  SEE D P+N+N  INEEPI VSD EQ TS NIQGSQN  +INGS  D SDKDSDE+          KK+EEV++EESGMP
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSD-EQCTSTNIQGSQNVTVINGSPADMSDKDSDEL----------KKVEEVVREESGMP

Query:  FNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNST-ASGYVVKKADGNCAAPLSETKSDLVEVAQIVET
         NHVTP   DV V TFPL+S SWA + SDV SET IST ASEK+ S  +EL SDV L N + NNST ASG   +KA       LSET SDLVEVAQIVE 
Subjt:  FNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNST-ASGYVVKKADGNCAAPLSETKSDLVEVAQIVET

Query:  SNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGS------TLNRINLES
        +NG+  K+G I+EVEGP      DTPI V  EQGQKSSE KAPNASPS TKNLN + +N  DQ SK KEET ++N+VEAEQ GGS      TLNR+NL+S
Subjt:  SNGSPRKEGSIYEVEGP------DTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGS------TLNRINLES

Query:  WEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE
        W G SK+SSKP+NNPLLEI  AFI AFVKFWSE
Subjt:  WEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWSE

A0A6J1FHV2 uncharacterized protein LOC1114454864.6e-225100Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
        VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI

Query:  GAFVKFWSE
        GAFVKFWSE
Subjt:  GAFVKFWSE

A0A6J1JX82 uncharacterized protein LOC1114891413.3e-21595.84Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
        MHAIKGGWIGRPLALA+SNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLN+THKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHS

Query:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV
        LEENPLHSIAIEPQSPLTISS+EVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVT+INGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFT DV
Subjt:  LEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDV

Query:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY
        VVETFPL+SISWA++SSDVRSET ISTSASEKQPS TMELGSDVGLVNIKGNNSTASGYVVKKAD NCAAPLSETKSDLVEVAQIVETSNGS RKEGSIY
Subjt:  VVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKEGSIY

Query:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI
        EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFD+TSK KEET +ENEVEAEQNGGSTLNRINLESWEGMSKNSSKP+NNPLLEIFKAFI
Subjt:  EVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFI

Query:  GAFVKFWSE
        GAFVKFWSE
Subjt:  GAFVKFWSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52170.1 DNA binding7.1e-3729.54Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E
        MH++K   +G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG  LLE      +
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E

Query:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------
         S   S+  +P+  +++ P          L  SSE  +  VN +Q             + +E I +  +   ST+I  +Q  T                 
Subjt:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------

Query:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST
               ++  P     D+ +KD          SD  K V    R  ++G     +         + + VVETFPL S++  +DS D +    ++     
Subjt:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST

Query:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKE-------GSIYEVEGPDTPIPVISE
          +E +        + V L  I  + S+A      +  +V +   + + P+ +   + +  +  V+      ++        G+++E +       + +E
Subjt:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKE-------GSIYEVEGPDTPIPVISE

Query:  QGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEA------EQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWS
        Q   +S T++ +      K    +     +  S  K+ T  + +++A      ++   +TLNRI  ESW+G S N  + + NPLL + K+F+ AFVKFWS
Subjt:  QGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEA------EQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWS

Query:  E
        E
Subjt:  E

AT3G52170.2 DNA binding7.1e-3729.54Show/hide
Query:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E
        MH++K   +G+  ALAK ++S G++TR R  KEERK +VE FIKK+Q+ NNGSFPSL+LTHKEVGGSFYT+REIVR+IIQENRVLGPG  LLE      +
Subjt:  MHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGPGKWLLE------E

Query:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------
         S   S+  +P+  +++ P          L  SSE  +  VN +Q             + +E I +  +   ST+I  +Q  T                 
Subjt:  HSTDHSLEENPLHSIAIEPQ-------SPLTISSEEVDSPVNYNQ------------YINEEPIFVSDEQCTSTNIQGSQNVT-----------------

Query:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST
               ++  P     D+ +KD          SD  K V    R  ++G     +         + + VVETFPL S++  +DS D +    ++     
Subjt:  ------VINGSP----ADMSDKD----------SDELKKVEEVVR-EESGMPFNHV------TPFTTDVVVETFPLNSISWAIDSSDVR----SETSIST

Query:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKE-------GSIYEVEGPDTPIPVISE
          +E +        + V L  I  + S+A      +  +V +   + + P+ +   + +  +  V+      ++        G+++E +       + +E
Subjt:  SASEKQPSHTMELGSDVGLVNIKGNNSTA------SGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRKE-------GSIYEVEGPDTPIPVISE

Query:  QGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEA------EQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWS
        Q   +S T++ +      K    +     +  S  K+ T  + +++A      ++   +TLNRI  ESW+G S N  + + NPLL + K+F+ AFVKFWS
Subjt:  QGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEA------EQNGGSTLNRINLESWEGMSKNSSKPKNNPLLEIFKAFIGAFVKFWS

Query:  E
        E
Subjt:  E

AT5G58210.1 hydroxyproline-rich glycoprotein family protein1.1e-0823.08Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +S   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE

Query:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET
           P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++        +
Subjt:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET

Query:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS
         +      +  S  +E+GS+    +++   +NS           GN     ++      E  Q +E S     +        E     +EG D    V+ 
Subjt:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS

Query:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE
         + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE

AT5G58210.2 hydroxyproline-rich glycoprotein family protein1.1e-0823.08Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +S   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE

Query:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET
           P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++        +
Subjt:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET

Query:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS
         +      +  S  +E+GS+    +++   +NS           GN     ++      E  Q +E S     +        E     +EG D    V+ 
Subjt:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS

Query:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE
         + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE

AT5G58210.3 hydroxyproline-rich glycoprotein family protein1.1e-0823.08Show/hide
Query:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE
        R SK++R+A+VE F+ +Y+ +N G FPSL+ THK+VGGS+Y VR+I +++  + +   P   K L E  S+   D S   +P     +E ++   +S   
Subjt:  RRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEVGGSFYTVREIVRDIIQENRVLGP--GKWLLEEHST---DHSLEENPLHSIAIEPQSPLTISSEE

Query:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET
           P + + +++  P+ + + +  S         T  + SPA +   +++ L  V   V  ++    +H +P     +VE   L+ +S ++        +
Subjt:  VDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDELKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSET

Query:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS
         +      +  S  +E+GS+    +++   +NS           GN     ++      E  Q +E S     +        E     +EG D    V+ 
Subjt:  SISTSASEKQPSHTMELGSD--VGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQIVETSNGSPRK--------EGSIYEVEGPDTPIPVIS

Query:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE
         + ++ SET A     + T  + +  S+  +  S  KE
Subjt:  EQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACATATATTTCAACGTAAAGTTCTTTCAGTTTTTGCATCATTGTCTTGCTTTTACACTGCGATTCTATCACTGTTGGCTCCGCCCCCTTCCATGGCGATCATTTC
TCCTGAAGTGTTGAATTTTTTGGATTTCATGCATGCTATAAAGGGTGGGTGGATAGGGCGTCCTCTTGCCCTAGCCAAAAGCAATGAGTCCGAAGGGAGGAAGACTAGAA
TTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTT
GGTGGATCTTTCTATACAGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCCGGAAAGTGGTTATTAGAAGAGCACAGCACTGATCATTCACT
TGAAGAGAATCCACTCCACTCGATTGCTATTGAACCTCAATCTCCTTTAACGATATCATCAGAGGAAGTTGATTCTCCAGTCAACTACAACCAATACATAAATGAAGAGC
CAATCTTCGTGTCAGATGAGCAATGCACTTCAACAAATATTCAGGGATCACAGAATGTGACAGTAATTAATGGCAGCCCGGCAGATATGAGTGACAAGGATTCTGATGAA
CTCAAGAAAGTAGAGGAAGTGGTGAGAGAGGAATCAGGAATGCCATTTAATCATGTAACTCCTTTTACAACAGATGTCGTGGTAGAGACATTCCCATTGAATTCGATTTC
TTGGGCCATCGATAGTTCAGATGTAAGATCTGAGACATCGATTTCAACCAGCGCCTCAGAAAAGCAACCGAGTCATACCATGGAGTTAGGATCAGATGTTGGCTTGGTTA
ACATTAAAGGTAATAATTCCACGGCTTCTGGTTATGTAGTCAAGAAAGCAGATGGAAATTGTGCTGCTCCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTAGCACAA
ATTGTTGAAACCTCTAATGGATCACCTAGGAAAGAAGGTAGCATATATGAAGTTGAGGGTCCTGATACTCCAATACCTGTAATCTCTGAACAAGGCCAGAAATCTAGTGA
AACCAAGGCTCCTAATGCTTCTCCAAGTGTTACCAAGAATCTCAACAAGGCATTCAGCAATAGCTTTGACCAGACCTCAAAAACCAAAGAGGAGACAGGGATCGAAAATG
AAGTAGAGGCAGAACAGAATGGTGGCTCAACATTAAATAGAATCAATCTTGAATCATGGGAAGGGATGTCCAAAAATTCTTCGAAACCCAAAAACAACCCGCTTTTGGAA
ATCTTTAAAGCATTCATCGGTGCATTCGTGAAGTTTTGGTCCGAACAAGTAGAGAGGAGTTATTTTCCTACCACAGAACCTGTCCGTCTCCGTACCAAGTTGCAGTCGGT
TTCCCCGTTCGCTCGAGTCGATTCCATCCTCGATGAAACTGGATATTACTGGATGTGGGTTGGCACTTCTGTATTCTGCAGTAAGAAAGAAGGTTTAGGAGTAGTAGGCA
TATCCCACCCCTCAGTTGCAAGTTTCATTTTGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGACATATATTTCAACGTAAAGTTCTTTCAGTTTTTGCATCATTGTCTTGCTTTTACACTGCGATTCTATCACTGTTGGCTCCGCCCCCTTCCATGGCGATCATTTC
TCCTGAAGTGTTGAATTTTTTGGATTTCATGCATGCTATAAAGGGTGGGTGGATAGGGCGTCCTCTTGCCCTAGCCAAAAGCAATGAGTCCGAAGGGAGGAAGACTAGAA
TTCGGCGTTCAAAGGAGGAAAGGAAGGCAATGGTTGAAGTCTTCATAAAAAAGTATCAGGAATCAAATAATGGGAGCTTCCCCTCGCTCAACCTTACTCACAAGGAAGTT
GGTGGATCTTTCTATACAGTGCGAGAGATTGTACGTGATATAATCCAAGAAAATAGAGTTCTTGGTCCCGGAAAGTGGTTATTAGAAGAGCACAGCACTGATCATTCACT
TGAAGAGAATCCACTCCACTCGATTGCTATTGAACCTCAATCTCCTTTAACGATATCATCAGAGGAAGTTGATTCTCCAGTCAACTACAACCAATACATAAATGAAGAGC
CAATCTTCGTGTCAGATGAGCAATGCACTTCAACAAATATTCAGGGATCACAGAATGTGACAGTAATTAATGGCAGCCCGGCAGATATGAGTGACAAGGATTCTGATGAA
CTCAAGAAAGTAGAGGAAGTGGTGAGAGAGGAATCAGGAATGCCATTTAATCATGTAACTCCTTTTACAACAGATGTCGTGGTAGAGACATTCCCATTGAATTCGATTTC
TTGGGCCATCGATAGTTCAGATGTAAGATCTGAGACATCGATTTCAACCAGCGCCTCAGAAAAGCAACCGAGTCATACCATGGAGTTAGGATCAGATGTTGGCTTGGTTA
ACATTAAAGGTAATAATTCCACGGCTTCTGGTTATGTAGTCAAGAAAGCAGATGGAAATTGTGCTGCTCCATTATCAGAAACAAAGTCTGATTTGGTGGAGGTAGCACAA
ATTGTTGAAACCTCTAATGGATCACCTAGGAAAGAAGGTAGCATATATGAAGTTGAGGGTCCTGATACTCCAATACCTGTAATCTCTGAACAAGGCCAGAAATCTAGTGA
AACCAAGGCTCCTAATGCTTCTCCAAGTGTTACCAAGAATCTCAACAAGGCATTCAGCAATAGCTTTGACCAGACCTCAAAAACCAAAGAGGAGACAGGGATCGAAAATG
AAGTAGAGGCAGAACAGAATGGTGGCTCAACATTAAATAGAATCAATCTTGAATCATGGGAAGGGATGTCCAAAAATTCTTCGAAACCCAAAAACAACCCGCTTTTGGAA
ATCTTTAAAGCATTCATCGGTGCATTCGTGAAGTTTTGGTCCGAACAAGTAGAGAGGAGTTATTTTCCTACCACAGAACCTGTCCGTCTCCGTACCAAGTTGCAGTCGGT
TTCCCCGTTCGCTCGAGTCGATTCCATCCTCGATGAAACTGGATATTACTGGATGTGGGTTGGCACTTCTGTATTCTGCAGTAAGAAAGAAGGTTTAGGAGTAGTAGGCA
TATCCCACCCCTCAGTTGCAAGTTTCATTTTGAGCTAA
Protein sequenceShow/hide protein sequence
MGHIFQRKVLSVFASLSCFYTAILSLLAPPPSMAIISPEVLNFLDFMHAIKGGWIGRPLALAKSNESEGRKTRIRRSKEERKAMVEVFIKKYQESNNGSFPSLNLTHKEV
GGSFYTVREIVRDIIQENRVLGPGKWLLEEHSTDHSLEENPLHSIAIEPQSPLTISSEEVDSPVNYNQYINEEPIFVSDEQCTSTNIQGSQNVTVINGSPADMSDKDSDE
LKKVEEVVREESGMPFNHVTPFTTDVVVETFPLNSISWAIDSSDVRSETSISTSASEKQPSHTMELGSDVGLVNIKGNNSTASGYVVKKADGNCAAPLSETKSDLVEVAQ
IVETSNGSPRKEGSIYEVEGPDTPIPVISEQGQKSSETKAPNASPSVTKNLNKAFSNSFDQTSKTKEETGIENEVEAEQNGGSTLNRINLESWEGMSKNSSKPKNNPLLE
IFKAFIGAFVKFWSEQVERSYFPTTEPVRLRTKLQSVSPFARVDSILDETGYYWMWVGTSVFCSKKEGLGVVGISHPSVASFILS