; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G020100 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G020100
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr04:27198560..27201362
RNA-Seq ExpressionLsi04G020100
SyntenyLsi04G020100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]3.1e-26182.25Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAA NRNPSNKRP DPKNR NKKKKPR EP QDS PEWPCPEP+QNQPSTSSGWP IEPVA
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP   PVSSEERANLAALQLQYKGS+ACRGFFARNADSGSDEEGEEEE      +GEMMESEEYKFFLKLFVENDELRGYYEKN ESGLFCCLVCGGM K
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        +K GK+FKNC+GLVQHSISISRTKKKRAHRAFGQVVCRVFGWDI+RLPTIVLKGEPL R+LADSG+LKV PEENHVAK+HDSGVQNENVAIS DDINKKN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN
        +VV +D K+QKLEEE+TAEDPT N+KDLISG+NDDAC  NDV LQAENTD+S+ G+ ESNAEMD LP     VPE ILKACKEF AAF TSMSD+DVSEN
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN

Query:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL
        NLI+G+GVEE EE+KFFLKLFTENESLRRYYENNYDDGEFFCLAC GAGKKMLKSFKTCGRLLQH+TSLG  KI KKPVQKPHIAKMLK+KM+AHRA   
Subjt:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL

Query:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK
        VICKVLGWDIEK PAVVLKGE LGRSLTK+D +K  DESVGN+VDNTKE D      STKINKMQ +SVGNAV     I EDD  K
Subjt:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]9.5e-26382.53Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAA NRNPSNKRP DPKNR NKKKKPR EP QDS PEWPCPEP+QNQPSTSSGWP IEPVA
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP   PVSSEERANLAALQLQYKGS+ACRGFFARNADSGSDEEGEEEE      +GEMMESEEYKFFLKLFVENDELRGYYEKN ESGLFCCLVCGGM K
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        +K GK+FKNC+GLVQHSISISRTKKKRAHRAFGQVVCRVFGWDI+RLPTIVLKGEPL R+LADSG+LKV PEENHVAK+HDSGVQNENVAIS DDINKKN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN
        +VV +D K+QKLEEE+TAEDPT N+KDLISG+NDDAC  NDV LQAENTD+S+ G+ ESNAEMD LP     VPE ILKACKEF AAF TSMSD+DVSEN
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN

Query:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL
        NLI+G+GVEE EE+KFFLKLFTENESLRRYYENNYDDGEFFCLAC GAGKKMLKSFKTCGRLLQH+TSLG  KI KKPVQKPHIAKMLK+KM+AHRA   
Subjt:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL

Query:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK
        VICKVLGWDIEK PAVVLKGE LGRSLTK+D +KDESVGN+VDNTKE D      STKINKMQ +SVGNAV     I EDD  K
Subjt:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]1.7e-25982.08Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAA NRNPSNKRP DPKNR NKKKKPR EP QDS PEWPCPEP+QNQPSTSSGWP IEPVA
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP   PVSSEERANLAALQLQYKGS+ACRGFFARNADSGSDEEGEEEE      +GEMMESEEYKFFLKLFVENDELRGYYEKN ESGLFCCLVCGGM K
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        +K GK+FKNC+GLVQHSISISRTKKKRAHRAFGQVVCRVFGWDI+RLPTIVLKGEPL R+LADSG+LK  PEENHVAK+HDSGVQNENVAIS DDINKKN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN
        +VV +D K+QKLEEE+TAEDPT N+KDLISG+NDDAC  NDV LQAENTD+S+ G+ ESNAEMD LP     VPE ILKACKEF AAF TSMSD+DVSEN
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLP-----VPELILKACKEFFAAFLTSMSDDDVSEN

Query:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL
        NLI+G+GVEE EE+KFFLKLFTENESLRRYYENNYDDGEFFCLAC GAGKKMLKSFKTCGRLLQH+TSLG  KI KKPVQKPHIAKMLK+KM+AHRA   
Subjt:  NLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGL

Query:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK
        VICKVLGWDIEK PAVVLKGE LGRSLTK+D +K  DESVGN+VDNTKE D      STKINKMQ +SVGNAV     I EDD  K
Subjt:  VICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]4.3e-26382.96Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAA NRNPSNKRP DPKNR NKKKKPR EP QDS PEWPCPEP+QNQPSTSSGWP IEPVA
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP   PVSSEERANLAALQLQYKGS+ACRGFFARNADSGSDEEGEEEE      +GEMMESEEYKFFLKLFVENDELRGYYEKN ESGLFCCLVCGGM K
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        +K GK+FKNC+GLVQHSISISRTKKKRAHRAFGQVVCRVFGWDI+RLPTIVLKGEPL R+LADSG+LKV PEENHVAK+HDSGVQNENVAIS DDINKKN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING
        +VV +D K+QKLEEE+TAEDPT N+KDLISG+NDDAC  NDV LQAENTD+S+ G+ ESNAEMD LPVPE ILKACKEF AAF TSMSD+DVSENNLI+G
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING

Query:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV
        +GVEE EE+KFFLKLFTENESLRRYYENNYDDGEFFCLAC GAGKKMLKSFKTCGRLLQH+TSLG  KI KKPVQKPHIAKMLK+KM+AHRA   VICKV
Subjt:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV

Query:  LGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK
        LGWDIEK PAVVLKGE LGRSLTK+D +K  DESVGN+VDNTKE D      STKINKMQ +SVGNAV     I EDD  K
Subjt:  LGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]1.7e-24378.49Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        M+PYSE+ LTEEVL+LH+LWRRGPPRNPKP HNHSSTVVAAA NRNPSNKRP DPKNR NKKKKPR EP QDS PEWPCPEP+QNQPSTSSGWP IEPVA
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP   PVSSEERANLAALQLQYKGS+ACRGFFARNADSGSDEEGEEEE      +GEMMESEEYKFFLKLFVENDELRGYYEKN ESGLFCCLVCGGM K
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        +K GK+FKNC+GLVQHSISISRTKKKRAHRAFGQVVCRVFGWDI+RLPTIVLKGEPL R+LADSG+LKV PEENHVAK+HDSGVQNENVAIS DDINKKN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING
        +VV +D K+QKLEEE+TAEDPT N+KDLISG+                                   VPE ILKACKEF AAF TSMSD+DVSENNLI+G
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING

Query:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV
        +GVEE EE+KFFLKLFTENESLRRYYENNYDDGEFFCLAC GAGKKMLKSFKTCGRLLQH+TSLG  KI KKPVQKPHIAKMLK+KM+AHRA   VICKV
Subjt:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV

Query:  LGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK
        LGWDIEK PAVVLKGE LGRSLTK+D +K  DESVGN+VDNTKE D      STKINKMQ +SVGNAV     I EDD  K
Subjt:  LGWDIEKFPAVVLKGEALGRSLTKSDVSK--DESVGNAVDNTKEADDLVKENSTKINKMQGKSVGNAV-----IEEDDKNK

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X12.3e-20169.93Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST VA   + NPSNKRP DP  RKN   KKKKPRS+PPQDS PEWPCPEP+QNQPSTSSGWP I+
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE

Query:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG
        PVATP  Q VSSEER NLAALQLQYKGS+ACR FFARNADSGSDEE EEEEE+    DGEMMES+EY FFLK+FVEN+ELR YYEKN ESGLFCCLVC G
Subjt:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN
        MGKKK GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDI+RLPTIVLKGEPL R+LA+SGDLKV PEE H        V N+N  +S     
Subjt:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN

Query:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL
             VSV+E EQKLEE KTAEDPT N+KDLISGENDDA  D DV LQ EN D+SI G+GESN EMD L V   IL+ACKEF AAF  SM+DDDVSE   
Subjt:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL

Query:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI
         + DG EE EE+KFFLKLFTENE+LRRYYEN+Y DGEF CLACE AG+K +K FKTC RLLQHST LG   I +K  QKP   K+LK+ MLAHRAY  V+
Subjt:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI

Query:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG
        CKVLG DI+  PA+VL GEALG SLTKSDVSK +   +    +  ADD+V+++ST++N+++G
Subjt:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG

A0A1S3CJZ1 uncharacterized protein LOC103501816 isoform X31.6e-19969.75Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST VA   + NPSNKRP DP  RKN   KKKKPRS+PPQDS PEWPCPEP+QNQPSTSSGWP I+
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE

Query:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG
        PVATP  Q VSSEER NLAALQLQYKGS+ACR FFARNADSGSDEE EEEEE+    DGEMMES+EY FFLK+FVEN+ELR YYEKN ESGLFCCLVC G
Subjt:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN
        MGKKK GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDI+RLPTIVLKGEPL R+LA+SGDLK  PEE H        V N+N  +S     
Subjt:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN

Query:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL
             VSV+E EQKLEE KTAEDPT N+KDLISGENDDA  D DV LQ EN D+SI G+GESN EMD L V   IL+ACKEF AAF  SM+DDDVSE   
Subjt:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL

Query:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI
         + DG EE EE+KFFLKLFTENE+LRRYYEN+Y DGEF CLACE AG+K +K FKTC RLLQHST LG   I +K  QKP   K+LK+ MLAHRAY  V+
Subjt:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI

Query:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG
        CKVLG DI+  PA+VL GEALG SLTKSDVSK +   +    +  ADD+V+++ST++N+++G
Subjt:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X21.3e-20170.28Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST VA   + NPSNKRP DP  RKN   KKKKPRS+PPQDS PEWPCPEP+QNQPSTSSGWP I+
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE

Query:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG
        PVATP  Q VSSEER NLAALQLQYKGS+ACR FFARNADSGSDEE EEEEE+    DGEMMES+EY FFLK+FVEN+ELR YYEKN ESGLFCCLVC G
Subjt:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN
        MGKKK GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDI+RLPTIVLKGEPL R+LA+SGDLKV PEE H        V N+N  +S     
Subjt:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN

Query:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL
             VSV+E EQKLEE KTAEDPT N+KDLISGENDDA  D DV LQ EN D+SI G+GESN EMD L V   IL+ACKEF AAF  SM+DDDVSE   
Subjt:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL

Query:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI
         + DG EE EE+KFFLKLFTENE+LRRYYEN+Y DGEF CLACE AG+K +K FKTC RLLQHST LG   I +K  QKP   K+LK+ MLAHRAY  V+
Subjt:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI

Query:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG
        CKVLG DI+  PA+VL GEALG SLTKSDVSKD+S  +    +  ADD+V+++ST++N+++G
Subjt:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKINKMQG

A0A5D3DXE1 Uncharacterized protein5.7e-19767.01Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE
        M+PYS++ LT+EVLYLHSLW RGPPRNPKPTH+HSST VA   + NPSNKRP DP  RKN   KKKKPRS+PPQDS PEWPCPEP+QNQPSTSSGWP I+
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKN---KKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIE

Query:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG
        PVATP  Q VSSEER NLAALQLQYKGS+ACR FFARNADSGSDEE EEEEE+    DGEMMES+EY FFLK+FVEN+ELR YYEKN ESGLFCCLVC G
Subjt:  PVATPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN
        MGKKK GK+FKNC+ LVQHSISIS TKKKRAHRAFG VV RVFGWDI+RLPTIVLKGEPL R+LA+SGDLKV PEE H        V N+N  +S     
Subjt:  MGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDIN

Query:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL
             VSV+E EQKLEE KTAEDPT N+KDLISGENDDA  D DV LQ EN D+SI G+GESN EMD L V   IL+ACKEF AAF  SM+DDDVSE   
Subjt:  KKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNL

Query:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI
         + DG EE EE+KFFLKLFTENE+LRRYYEN+Y DGEF CLACE AG+K +K FKTC RLLQHST LG   I +K  QKP   K+LK+ MLAHRAY  V+
Subjt:  INGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVI

Query:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK---------------------DESVGNAVDNTK--------EADDLVKENSTKINKMQG
        CKVLG DI+  PA+VL GEALG SLTKSDVSK                     +E+   A    K         ADD+V+++ST++N+++G
Subjt:  CKVLGWDIEKFPAVVLKGEALGRSLTKSDVSK---------------------DESVGNAVDNTK--------EADDLVKENSTKINKMQG

A0A6J1FFD4 uncharacterized protein LOC111443568 isoform X12.0e-19469.19Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA
        MNPYSE+ LTEEVLYLHSLW+RGPPR PKPT  + ST VAAA     +NKRPRD KNRK KKKKPR EP QD+ PEWPCPEP+QNQPSTSSGWP + P A
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVA

Query:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK
        TP  + VSSEERAN  ALQLQYKG EACR F  RNADSGSDEE EEE    EGNDGE+MESEEYKFFL LF+ENDELRGYYEKN E GLFCCLVCGGMGK
Subjt:  TPVPQPVSSEERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGK

Query:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN
        KKSGKRFKNCIGLV HS SISRTKKK AHRAFGQ VCRVFGWDI+RLPTIVL GEPL R+LA SGD K  PEEN VA++HDS V NENVAI ND+I+ KN
Subjt:  KKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKN

Query:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING
                EQK EEEKTAE       DLISGE                                   VPE I +AC+EFFAAFLTSM+DDDVSENN    
Subjt:  DVVSVDEKEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLING

Query:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV
          +EE EE+KFFLKLF ENESLRRYY+N YDDGEF CL CEGAGKK L+SFKTC RLL+H+T  G  K  KK V KPHIAKMLK+KMLAHRAY LVIC+V
Subjt:  DGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKV

Query:  LGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKIN
        LGWDIEK PA+VLKGE  G SLTK DV KD  VGNA DNT E DD V+++ST+I+
Subjt:  LGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKENSTKIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein4.2e-5130.07Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPP-RNPKPTHNHS---STVVAAAENRNPS-----------------NKRPRDPKNRKNKKKKPRSEPPQDSSPEWPC
        MN Y +++L +EV+YLHSLW +GPP R P P+ N +     +     N  P                  ++ P +P+N  N  K+PR     DS  EWP 
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPP-RNPKPTHNHS---STVVAAAENRNPS-----------------NKRPRDPKNRKNKKKKPRSEPPQDSSPEWPC

Query:  PEPLQNQPSTSSGWPSIEPVATPVPQPVSSEERANLAALQLQYKGSEACRGFFAR-NADSGSDEEGEEEEEEAEGNDGEMME------SEEYKFFLKLFV
         + +   PST SGWP   P      +P+S+EE+  LAA  LQ      CR FF R + +  S   G +E E  EG++ + +E      S+E++F  ++F 
Subjt:  PEPLQNQPSTSSGWPSIEPVATPVPQPVSSEERANLAALQLQYKGSEACRGFFAR-NADSGSDEEGEEEEEEAEGNDGEMME------SEEYKFFLKLFV

Query:  ENDELRGYYEKNSESGLFCCLVCGGMGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPE
        EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N  P +  +                   
Subjt:  ENDELRGYYEKNSESGLFCCLVCGGMGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPE

Query:  ENHVAKDHDSGVQNENVAISNDDI-NKKNDVVSVDE--KEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVP
             KD  + V+  +   S+  I  +K  V+SV+E  K   L+ ++ A +     KD+   +   A +  +     EN D+++                
Subjt:  ENHVAKDHDSGVQNENVAISNDDI-NKKNDVVSVDE--KEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVP

Query:  ELILKACKEFFAAFLTSMSDDDVSENNLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGA-GKKMLKSFKTCGRLLQHSTSLGNCK
                                             EE +   K+F+EN  L+ YYE NY+ G F CL C  A  KKMLK FK C  ++QH T      
Subjt:  ELILKACKEFFAAFLTSMSDDDVSENNLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGA-GKKMLKSFKTCGRLLQHSTSLGNCK

Query:  IWKKPVQKPHIAKMLKLKMLAHRAYGLVICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKEN--STKINKMQGKSVGNAV
                    K+ K+K+ AH+ +   +C++LGWD E  P  V+KG A              ++ NA +N +    +V+E+    K    Q  +   A 
Subjt:  IWKKPVQKPHIAKMLKLKMLAHRAYGLVICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKEN--STKINKMQGKSVGNAV

Query:  IE
        +E
Subjt:  IE

AT1G78810.2 unknown protein3.2e-5130.41Show/hide
Query:  MNPYSEKTLTEEVLYLHSLWRRGPP-RNPKPTHNHS---STVVAAAENRNPS-----------------NKRPRDPKNRKNKKKKPRSEPPQDSSPEWPC
        MN Y +++L +EV+YLHSLW +GPP R P P+ N +     +     N  P                  ++ P +P+N  N  K+PR     DS  EWP 
Subjt:  MNPYSEKTLTEEVLYLHSLWRRGPP-RNPKPTHNHS---STVVAAAENRNPS-----------------NKRPRDPKNRKNKKKKPRSEPPQDSSPEWPC

Query:  PEPLQNQPSTSSGWPSIEPVATPVPQPVSSEERANLAALQLQYKGSEACRGFFAR-NADSGSDEEGEEEEEEAEGNDGEMME------SEEYKFFLKLFV
         + +   PST SGWP   P      +P+S+EE+  LAA  LQ      CR FF R + +  S   G +E E  EG++ + +E      S+E++F  ++F 
Subjt:  PEPLQNQPSTSSGWPSIEPVATPVPQPVSSEERANLAALQLQYKGSEACRGFFAR-NADSGSDEEGEEEEEEAEGNDGEMME------SEEYKFFLKLFV

Query:  ENDELRGYYEKNSESGLFCCLVCGGMGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPE
        EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N  P +  +                   
Subjt:  ENDELRGYYEKNSESGLFCCLVCGGMGKKKSGKRFKNCIGLVQHSISISRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPE

Query:  ENHVAKDHDSGVQNENVAISNDDI-NKKNDVVSVDE--KEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVP
             KD  + V+  +   S+  I  +K  V+SV+E  K   L+ ++ A +     KD+   +   A +  +     EN D+++                
Subjt:  ENHVAKDHDSGVQNENVAISNDDI-NKKNDVVSVDE--KEQKLEEEKTAEDPTCNAKDLISGENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVP

Query:  ELILKACKEFFAAFLTSMSDDDVSENNLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGA-GKKMLKSFKTCGRLLQHSTSLGNCK
                                             EE +   K+F+EN  L+ YYE NY+ G F CL C  A  KKMLK FK C  ++QH T      
Subjt:  ELILKACKEFFAAFLTSMSDDDVSENNLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLACEGA-GKKMLKSFKTCGRLLQHSTSLGNCK

Query:  IWKKPVQKPHIAKMLKLKMLAHRAYGLVICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKEN
                    K+ K+K+ AH+ +   +C++LGWD E  P  V+KG A              ++ NA +N +    +V+E+
Subjt:  IWKKPVQKPHIAKMLKLKMLAHRAYGLVICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKEN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTACTCCGAGAAAACACTCACCGAAGAGGTCCTGTATCTTCACTCTCTGTGGCGCCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGTCGTCGCCGCTGCCGAGAATCGGAACCCCTCCAACAAGAGACCCAGAGATCCAAAGAACCGAAAGAACAAGAAGAAAAAACCACGCTCCGAGCCACCGCAAGACTCCA
GCCCTGAATGGCCCTGTCCGGAGCCGCTTCAAAATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTGAGCCCGTTGCCACTCCGGTGCCTCAGCCGGTGTCGTCTGAA
GAGCGAGCAAATCTTGCGGCGTTGCAATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGGGGAGGAAGA
GGAGGAGGAAGCTGAGGGTAATGATGGGGAAATGATGGAAAGTGAAGAATATAAGTTCTTTTTGAAGCTGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGA
ATTCTGAAAGTGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCAACATTCGATTTCGATA
TCGAGGACGAAGAAGAAGCGGGCTCATAGGGCTTTTGGGCAGGTTGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCGCT
CGGTCGAGCATTAGCCGATTCTGGAGACTTGAAGGTTCTGCCAGAGGAAAATCATGTGGCTAAAGATCATGATTCTGGGGTTCAGAATGAAAATGTAGCTATTTCAAATG
ATGACATTAATAAGAAGAATGACGTGGTTTCTGTGGATGAGAAGGAACAGAAATTGGAGGAAGAAAAGACAGCTGAAGATCCTACTTGTAATGCTAAAGATTTGATTTCT
GGAGAGAATGATGATGCTTGCAATGATAACGATGTCAATCTGCAAGCAGAAAATACAGATGATTCAATTCCAGGCATTGGAGAAAGCAATGCGGAAATGGATAAATTGCC
TGTTCCGGAGTTGATTTTGAAAGCATGTAAAGAATTTTTTGCAGCCTTCTTAACATCTATGAGCGACGATGATGTTAGTGAAAACAACTTAATCAACGGGGATGGAGTTG
AGGAATGCGAAGAGTACAAATTCTTTTTAAAGTTGTTCACCGAGAACGAAAGCTTGAGAAGATATTACGAGAACAACTATGATGATGGAGAATTTTTCTGTTTAGCTTGT
GAAGGAGCAGGAAAGAAAATGTTAAAGAGTTTTAAGACATGTGGTCGCCTTCTCCAGCATTCAACTTCCCTAGGGAATTGCAAAATATGGAAAAAACCGGTTCAGAAGCC
TCACATTGCTAAAATGTTGAAACTGAAAATGCTGGCTCATAGGGCATATGGTTTAGTTATATGTAAGGTTCTTGGTTGGGACATTGAAAAGTTTCCTGCAGTCGTGTTAA
AAGGCGAAGCTCTTGGTCGTTCCTTAACAAAGTCAGACGTGTCGAAGGACGAATCTGTCGGCAATGCAGTTGATAATACAAAGGAAGCAGATGATCTTGTAAAAGAAAAC
TCTACAAAGATTAACAAAATGCAGGGCAAATCTGTTGGCAATGCAGTCATCGAAGAAGATGACAAGAACAAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATTAAATGAAAATTAGTGAGAGGTTGGGACTGGAAGTTGGGAATGGACAATGGAGACGATGATACCAAACCTCCATTACACTGTCTCTTGATTCTGCCATTTTTCCACCA
ATGAATCCCTACTCCGAGAAAACACTCACCGAAGAGGTCCTGTATCTTCACTCTCTGTGGCGCCGAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGTCGTCGCCGCTGCCGAGAATCGGAACCCCTCCAACAAGAGACCCAGAGATCCAAAGAACCGAAAGAACAAGAAGAAAAAACCACGCTCCGAGCCACCGCAAGACTCCA
GCCCTGAATGGCCCTGTCCGGAGCCGCTTCAAAATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTGAGCCCGTTGCCACTCCGGTGCCTCAGCCGGTGTCGTCTGAA
GAGCGAGCAAATCTTGCGGCGTTGCAATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGGGGAGGAAGA
GGAGGAGGAAGCTGAGGGTAATGATGGGGAAATGATGGAAAGTGAAGAATATAAGTTCTTTTTGAAGCTGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGA
ATTCTGAAAGTGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCAACATTCGATTTCGATA
TCGAGGACGAAGAAGAAGCGGGCTCATAGGGCTTTTGGGCAGGTTGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCGCT
CGGTCGAGCATTAGCCGATTCTGGAGACTTGAAGGTTCTGCCAGAGGAAAATCATGTGGCTAAAGATCATGATTCTGGGGTTCAGAATGAAAATGTAGCTATTTCAAATG
ATGACATTAATAAGAAGAATGACGTGGTTTCTGTGGATGAGAAGGAACAGAAATTGGAGGAAGAAAAGACAGCTGAAGATCCTACTTGTAATGCTAAAGATTTGATTTCT
GGAGAGAATGATGATGCTTGCAATGATAACGATGTCAATCTGCAAGCAGAAAATACAGATGATTCAATTCCAGGCATTGGAGAAAGCAATGCGGAAATGGATAAATTGCC
TGTTCCGGAGTTGATTTTGAAAGCATGTAAAGAATTTTTTGCAGCCTTCTTAACATCTATGAGCGACGATGATGTTAGTGAAAACAACTTAATCAACGGGGATGGAGTTG
AGGAATGCGAAGAGTACAAATTCTTTTTAAAGTTGTTCACCGAGAACGAAAGCTTGAGAAGATATTACGAGAACAACTATGATGATGGAGAATTTTTCTGTTTAGCTTGT
GAAGGAGCAGGAAAGAAAATGTTAAAGAGTTTTAAGACATGTGGTCGCCTTCTCCAGCATTCAACTTCCCTAGGGAATTGCAAAATATGGAAAAAACCGGTTCAGAAGCC
TCACATTGCTAAAATGTTGAAACTGAAAATGCTGGCTCATAGGGCATATGGTTTAGTTATATGTAAGGTTCTTGGTTGGGACATTGAAAAGTTTCCTGCAGTCGTGTTAA
AAGGCGAAGCTCTTGGTCGTTCCTTAACAAAGTCAGACGTGTCGAAGGACGAATCTGTCGGCAATGCAGTTGATAATACAAAGGAAGCAGATGATCTTGTAAAAGAAAAC
TCTACAAAGATTAACAAAATGCAGGGCAAATCTGTTGGCAATGCAGTCATCGAAGAAGATGACAAGAACAAAGGTTAACGAATTGCAGGGTGAATCTGTGGGGAATGCAA
TTGGTAATATGAATGAAACTGATTCTATGAAGGTTGATGGCAATGGTGAAGTTATATTTTGAAGGATGATGGCTGTGGATGTGTAAAGTGAGGTGTAAATGACAGTTAAC
CAAGTATAGAATATAGATAGGACTTTATCAGGCAGCAGCCTTAGTTAATAGGAAAATCTGCCTTCAACAAAGTACTTTTCAATTATTCTTCTTTGTTATTTCATTTTTCT
TAGACTGTTAGATGAAGGTTTTGTGAGATTTTGAGATATATGAGATACAGGCTTACTCAAAA
Protein sequenceShow/hide protein sequence
MNPYSEKTLTEEVLYLHSLWRRGPPRNPKPTHNHSSTVVAAAENRNPSNKRPRDPKNRKNKKKKPRSEPPQDSSPEWPCPEPLQNQPSTSSGWPSIEPVATPVPQPVSSE
ERANLAALQLQYKGSEACRGFFARNADSGSDEEGEEEEEEAEGNDGEMMESEEYKFFLKLFVENDELRGYYEKNSESGLFCCLVCGGMGKKKSGKRFKNCIGLVQHSISI
SRTKKKRAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRALADSGDLKVLPEENHVAKDHDSGVQNENVAISNDDINKKNDVVSVDEKEQKLEEEKTAEDPTCNAKDLIS
GENDDACNDNDVNLQAENTDDSIPGIGESNAEMDKLPVPELILKACKEFFAAFLTSMSDDDVSENNLINGDGVEECEEYKFFLKLFTENESLRRYYENNYDDGEFFCLAC
EGAGKKMLKSFKTCGRLLQHSTSLGNCKIWKKPVQKPHIAKMLKLKMLAHRAYGLVICKVLGWDIEKFPAVVLKGEALGRSLTKSDVSKDESVGNAVDNTKEADDLVKEN
STKINKMQGKSVGNAVIEEDDKNKG