; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028142 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028142
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153056:3917622..3923337
RNA-Seq ExpressionSgr028142
SyntenySgr028142
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]1.3e-19867.26Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  E RLTEEVLHLH+LWRRGPPRN +PIH+HS+T  A  ANRNPSN   KRP D K R  K K PR  P+  +DSGPEWPCPEPV+NQ STSSGWPP
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        IEP  TPA  PVSSE+RA L+A++LQYK S ACRGFFARN DSGSDE  EEE  +GE+ E+EEYKFFLKLF EN++LR YYE N E G FCCLVCGGM K
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        +  GKKFK+C+GLVQHSISISRTKK+RAHRAFG+V+CRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVAKEHDSGVQ+EN AI  D+ ++KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN
        E + +DG + KLE+E+ AEDP  N KDLISG+N++ACK ND K  A+N DNS+  M     EM+NLP NVLQVP+ IL ACKEF A F  S SD+DVSEN
Subjt:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN

Query:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL
        N+ DG+G+EER EFKFFLKLF ENE LRRYYENNY++GEFFCLAC G GKK  K FKTCG LLQH+TSL           K H ++M+KMK +AHRA S 
Subjt:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL

Query:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE
        V+CKVLGWDI+ +PA+VLKGEPLGRSL+K    K  + +VG  N +D   E++S K+N   E
Subjt:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]9.8e-19963.03Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  E RLTEEVLHLH+LWRRGPPRN +PIH+HS+T  A  ANRNPSN   KRP D K R  K K PR  P+  +DSGPEWPCPEPV+NQ STSSGWPP
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        IEP  TPA  PVSSE+RA L+A++LQYK S ACRGFFARN DSGSDE  EEE  +GE+ E+EEYKFFLKLF EN++LR YYE N E G FCCLVCGGM K
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        +  GKKFK+C+GLVQHSISISRTKK+RAHRAFG+V+CRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVAKEHDSGVQ+EN AI  D+ ++KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN
        E + +DG + KLE+E+ AEDP  N KDLISG+N++ACK ND K  A+N DNS+  M     EM+NLP NVLQVP+ IL ACKEF A F  S SD+DVSEN
Subjt:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN

Query:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL
        N+ DG+G+EER EFKFFLKLF ENE LRRYYENNY++GEFFCLAC G GKK  K FKTCG LLQH+TSL           K H ++M+KMK +AHRA S 
Subjt:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL

Query:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPG---------------------------------------------------MPKGGESAVGNMNDLDEL
        V+CKVLGWDI+ +PA+VLKGEPLGRSL+K                                                       K   +A+GNMNDLD +
Subjt:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPG---------------------------------------------------MPKGGESAVGNMNDLDEL

Query:  IENESMKVNSNGEA
         E +SMKV+SNGEA
Subjt:  IENESMKVNSNGEA

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida]4.6e-19666.9Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  E RLTEEVLHLH+LWRRGPPRN +PIH+HS+T  A  ANRNPSN   KRP D K R  K K PR  P+  +DSGPEWPCPEPV+NQ STSSGWPP
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        IEP  TPA  PVSSE+RA L+A++LQYK S ACRGFFARN DSGSDE  EEE  +GE+ E+EEYKFFLKLF EN++LR YYE N E G FCCLVCGGM K
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        +  GKKFK+C+GLVQHSISISRTKK+RAHRAFG+V+CRVFGWDIDRLPTIVLKGEPL RSLADSG+LK   EENHVAKEHDSGVQ+EN AI  D+ ++KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN
        E + +DG + KLE+E+ AEDP  N KDLISG+N++ACK ND K  A+N DNS+  M     EM+NLP NVLQVP+ IL ACKEF A F  S SD+DVSEN
Subjt:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN

Query:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL
        N+ DG+G+EER EFKFFLKLF ENE LRRYYENNY++GEFFCLAC G GKK  K FKTCG LLQH+TSL           K H ++M+KMK +AHRA S 
Subjt:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL

Query:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE
        V+CKVLGWDI+ +PA+VLKGEPLGRSL+K    K  + +VG  N +D   E++S K+N   E
Subjt:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]6.6e-19566.55Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  E RLTEEVLHLH+LWRRGPPRN +PIH+HS+T  A  ANRNPSN   KRP D K R  K K PR  P+  +DSGPEWPCPEPV+NQ STSSGWPP
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        IEP  TPA  PVSSE+RA L+A++LQYK S ACRGFFARN DSGSDE  EEE  +GE+ E+EEYKFFLKLF EN++LR YYE N E G FCCLVCGGM K
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        +  GKKFK+C+GLVQHSISISRTKK+RAHRAFG+V+CRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVAKEHDSGVQ+EN AI  D+ ++KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN
        E + +DG + KLE+E+ AEDP  N KDLISG+N++ACK ND K  A+N DNS+  M     EM+NLP     VP+ IL ACKEF A F  S SD+DVSEN
Subjt:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSEN

Query:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL
        N+ DG+G+EER EFKFFLKLF ENE LRRYYENNY++GEFFCLAC G GKK  K FKTCG LLQH+TSL           K H ++M+KMK +AHRA S 
Subjt:  NVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSL

Query:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE
        V+CKVLGWDI+ +PA+VLKGEPLGRSL+K    K  + +VG  N +D   E++S K+N   E
Subjt:  VVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida]2.2e-18263.57Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  E RLTEEVLHLH+LWRRGPPRN +PIH+HS+T  A  ANRNPSN   KRP D K R  K K PR  P+  +DSGPEWPCPEPV+NQ STSSGWPP
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSAT--ADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        IEP  TPA  PVSSE+RA L+A++LQYK S ACRGFFARN DSGSDE  EEE  +GE+ E+EEYKFFLKLF EN++LR YYE N E G FCCLVCGGM K
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDE--EEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        +  GKKFK+C+GLVQHSISISRTKK+RAHRAFG+V+CRVFGWDIDRLPTIVLKGEPL RSLADSG+LKVQ EENHVAKEHDSGVQ+EN AI  D+ ++KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSENNV
        E + +DG + KLE+E+ AEDP  N KDLISG                                      +VP+ IL ACKEF A F  S SD+DVSENN+
Subjt:  EEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSENNV

Query:  KDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSLVV
         DG+G+EER EFKFFLKLF ENE LRRYYENNY++GEFFCLAC G GKK  K FKTCG LLQH+TSL           K H ++M+KMK +AHRA S V+
Subjt:  KDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL---------AHKTHSSRMVKMKALAHRAYSLVV

Query:  CKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE
        CKVLGWDI+ +PA+VLKGEPLGRSL+K    K  + +VG  N +D   E++S K+N   E
Subjt:  CKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGE

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X14.1e-15859.54Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI
        MDP  + RLT+EVL+LHSLW RGPPRN +P H HS+TA VA+ NPSN   KRP D   RK KN K  +P   PP+DSGPEWPCPEPV+NQ STSSGWPPI
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI

Query:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        +P  TPA Q VSSE+R  L+A++LQYK S ACR FFARN DSGSD   EEEE +DGE+ E++EY FFLK+F EN +LR YYE N E G FCCLVC GMGK
Subjt:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        K  GKKFK+C+ LVQHSISIS TKK+RAHRAFG V+ RVFGWDIDRLPTIVLKGEPL RSLA+SGDLKVQ EE HV                    D KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS
        E +SV  N  E KLE+ K AEDP  N KDLISGEN++A K+ D K   +N DNSIS MG    EM+NL + +L+       ACKEF A F  S +DDDVS
Subjt:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS

Query:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS
        E   +  DG EER EFKFFLKLF ENE LRRYYEN+Y +GEF CLAC+  G+K  K FKTC  LLQHST L          K   ++++KM  LAHRAY+
Subjt:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS

Query:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNM--NDLDELIENESMKVN
         VVCKVLG DIK +PAIVL GE LG SL+K  + K  + +   M  ++ D+++E++S +VN
Subjt:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNM--NDLDELIENESMKVN

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X24.8e-15959.57Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI
        MDP  + RLT+EVL+LHSLW RGPPRN +P H HS+TA VA+ NPSN   KRP D   RK KN K  +P   PP+DSGPEWPCPEPV+NQ STSSGWPPI
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI

Query:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        +P  TPA Q VSSE+R  L+A++LQYK S ACR FFARN DSGSD   EEEE +DGE+ E++EY FFLK+F EN +LR YYE N E G FCCLVC GMGK
Subjt:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        K  GKKFK+C+ LVQHSISIS TKK+RAHRAFG V+ RVFGWDIDRLPTIVLKGEPL RSLA+SGDLKVQ EE HV                    D KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS
        E +SV  N  E KLE+ K AEDP  N KDLISGEN++A K+ D K   +N DNSIS MG    EM+NL + +L+       ACKEF A F  S +DDDVS
Subjt:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS

Query:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS
        E   +  DG EER EFKFFLKLF ENE LRRYYEN+Y +GEF CLAC+  G+K  K FKTC  LLQHST L          K   ++++KM  LAHRAY+
Subjt:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS

Query:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVN
         VVCKVLG DIK +PAIVL GE LG SL+K  + K         ++ D+++E++S +VN
Subjt:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVN

A0A5D3DXE1 Uncharacterized protein2.2e-15661.31Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI
        MDP  + RLT+EVL+LHSLW RGPPRN +P H HS+TA VA+ NPSN   KRP D   RK KN K  +P   PP+DSGPEWPCPEPV+NQ STSSGWPPI
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKN-KNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPI

Query:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK
        +P  TPA Q VSSE+R  L+A++LQYK S ACR FFARN DSGSD   EEEE +DGE+ E++EY FFLK+F EN +LR YYE N E G FCCLVC GMGK
Subjt:  EPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSD---EEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGK

Query:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN
        K  GKKFK+C+ LVQHSISIS TKK+RAHRAFG V+ RVFGWDIDRLPTIVLKGEPL RSLA+SGDLKVQ EE HV                    D KN
Subjt:  KNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKN

Query:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS
        E +SV  N  E KLE+ K AEDP  N KDLISGEN++A K+ D K   +N DNSIS MG    EM+NL + +L+       ACKEF A F  S +DDDVS
Subjt:  EEISVDGN--EHKLEKEKAAEDPDFNDKDLISGENENACKENDHK--AKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVS

Query:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS
        E   +  DG EER EFKFFLKLF ENE LRRYYEN+Y +GEF CLAC+  G+K  K FKTC  LLQHST L          K   ++++KM  LAHRAY+
Subjt:  ENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSL--------AHKTHSSRMVKMKALAHRAYS

Query:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPK
         VVCKVLG DIK +PAIVL GE LG SL+K  + K
Subjt:  LVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPK

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X25.3e-18263.49Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHS--ATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  ERRLTEEVLHLHSLWRRGPP+N + I +HS  A A+VANR PSN     P+  K +K K K PRPAP  P++SGPEWPCPEPV+NQ STSSGWP 
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHS--ATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGS------DEEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCG
        I+P  TPA QPVSSE+RAKLSA++LQYKE +ACRGFFARN DSGS      +EEEE NDG +T+ EEYKFFLK+F EN++L +YYE N E GSFCCLVCG
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGS------DEEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCG

Query:  GMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNN
        GMGKK SGK+FKSC+GLVQHSISISRTKK+RAHRAFG VICRV GWD+DRLP IVLKGEPL RSLADSG+ +VQ E+NHVAKE   GV+SEN+       
Subjt:  GMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNN

Query:  DQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKEND--HKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDD
        DQKNEE        KLE++KAAEDPD N K+  SGEN N CKEND   + +N DNSI  MG+ K EM+NLP     V Q I  ACKEFFA FS STSD+ 
Subjt:  DQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKEND--HKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDD

Query:  VSENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSLA---------HKTHSSRMVKMKALAHR
             + DGDGLEER EFKFFLKLF EN+ LR YYE+NYE+GEF CLAC+G GKKT K FKTCG LLQHSTSLA         H    ++M+KMK LAHR
Subjt:  VSENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSLA---------HKTHSSRMVKMKALAHR

Query:  AYSLVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMN--DLDELIENESMKVNS-NGEAVLKDGSLI
        AYS  VCKVLGWD++ +P++VLKGEPLGRSL+KPG+ K     +GN+N     + +EN S++ +    +AV K   L+
Subjt:  AYSLVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMN--DLDELIENESMKVNS-NGEAVLKDGSLI

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X11.2e-18163.07Show/hide
Query:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHS--ATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP
        MDP  ERRLTEEVLHLHSLWRRGPP+N + I +HS  A A+VANR PSN     P+  K +K K K PRPAP  P++SGPEWPCPEPV+NQ STSSGWP 
Subjt:  MDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHS--ATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPP

Query:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGS------DEEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCG
        I+P  TPA QPVSSE+RAKLSA++LQYKE +ACRGFFARN DSGS      +EEEE NDG +T+ EEYKFFLK+F EN++L +YYE N E GSFCCLVCG
Subjt:  IEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGS------DEEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCG

Query:  GMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNN
        GMGKK SGK+FKSC+GLVQHSISISRTKK+RAHRAFG VICRV GWD+DRLP IVLKGEPL RSLADSG+ +VQ E+NHVAKE   GV+SEN+       
Subjt:  GMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNN

Query:  DQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKEND--HKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDD
        DQKNEE        KLE++KAAEDPD N K+  SGEN N CKEND   + +N DNSI  MG+ K EM+NLP     V Q I  ACKEFFA FS STSD+ 
Subjt:  DQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKEND--HKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDD

Query:  VSENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSLA---------HKTHSSRMVKMKALAHR
             + DGDGLEER EFKFFLKLF EN+ LR YYE+NYE+GEF CLAC+G GKKT K FKTCG LLQHSTSLA         H    ++M+KMK LAHR
Subjt:  VSENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLACKGTGKKTSKRFKTCGSLLQHSTSLA---------HKTHSSRMVKMKALAHR

Query:  AYSLVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGEAVLKDGSL
        AYS  VCKVLGWD++ +P++VLKGEPLGRSL+KPG+ K        + +++  I  + M+  S   + L+D ++
Subjt:  AYSLVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGEAVLKDGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein1.5e-4831.59Show/hide
Query:  ERRLTEEVLHLHSLWRRGPPRNREPIHS-----------------------------HSATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDS
        +  L +EV++LHSLW +GPP  R+PI S                              + T  + +RNP+N        Q L    NK PRP      DS
Subjt:  ERRLTEEVLHLHSLWRRGPPRNREPIHS-----------------------------HSATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDS

Query:  GPEWPCPEPVENQASTSSGWPPIEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGD------SGSDEEE--EGNDGEVTENE------EYKF
        G EWP  + V    ST SGWP   P      +P+S+E++ KL+A  LQ    R CR FF R         +G DE E  EG++ +  E E      E++F
Subjt:  GPEWPCPEPVENQASTSSGWPPIEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGD------SGSDEEE--EGNDGEVTENE------EYKF

Query:  FLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGD
          ++F EN  L+ YYE N+  G F CLVCGG+G+K S +KFKSC+ L+QHS++I +T  +  HRA  +V+C V GWD++  P +                
Subjt:  FLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGD

Query:  LKVQSEENHVAKEHDSGVQSENEAILDDNNDQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPL
                                     + QK+ +  V+G          A +P  + K  I  E +      +H AK                     
Subjt:  LKVQSEENHVAKEHDSGVQSENEAILDDNNDQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPL

Query:  NVLQVPQLILTACKEFFADFSLSTSDDDVSENNVKDG----DGLEER------REFKFFLKLFMENEGLRRYYENNYENGEFFCL-ACKGTGKKTSKRFK
         VLQ+ Q    A K+ F                VKDG    DG EE        E +   K+F EN  L+ YYE NYE G F CL  C  T KK  KRFK
Subjt:  NVLQVPQLILTACKEFFADFSLSTSDDDVSENNVKDG----DGLEER------REFKFFLKLFMENEGLRRYYENNYENGEFFCL-ACKGTGKKTSKRFK

Query:  TCGSLLQHSTSLAHKTHSSRMVKMKALAHRAYSLVVCKVLGWDIKNIPAIVLKG
         C  ++QH T         ++ KMK  AH+ ++  VC++LGWD + +P  V+KG
Subjt:  TCGSLLQHSTSLAHKTHSSRMVKMKALAHRAYSLVVCKVLGWDIKNIPAIVLKG

AT1G78810.2 unknown protein1.5e-4831.59Show/hide
Query:  ERRLTEEVLHLHSLWRRGPPRNREPIHS-----------------------------HSATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDS
        +  L +EV++LHSLW +GPP  R+PI S                              + T  + +RNP+N        Q L    NK PRP      DS
Subjt:  ERRLTEEVLHLHSLWRRGPPRNREPIHS-----------------------------HSATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDS

Query:  GPEWPCPEPVENQASTSSGWPPIEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGD------SGSDEEE--EGNDGEVTENE------EYKF
        G EWP  + V    ST SGWP   P      +P+S+E++ KL+A  LQ    R CR FF R         +G DE E  EG++ +  E E      E++F
Subjt:  GPEWPCPEPVENQASTSSGWPPIEPSTTPAVQPVSSEDRAKLSAMKLQYKESRACRGFFARNGD------SGSDEEE--EGNDGEVTENE------EYKF

Query:  FLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGD
          ++F EN  L+ YYE N+  G F CLVCGG+G+K S +KFKSC+ L+QHS++I +T  +  HRA  +V+C V GWD++  P +                
Subjt:  FLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGKKNSGKKFKSCIGLVQHSISISRTKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGD

Query:  LKVQSEENHVAKEHDSGVQSENEAILDDNNDQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPL
                                     + QK+ +  V+G          A +P  + K  I  E +      +H AK                     
Subjt:  LKVQSEENHVAKEHDSGVQSENEAILDDNNDQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGENENACKENDHKAKNVDNSISDMGAGKVEMENLPL

Query:  NVLQVPQLILTACKEFFADFSLSTSDDDVSENNVKDG----DGLEER------REFKFFLKLFMENEGLRRYYENNYENGEFFCL-ACKGTGKKTSKRFK
         VLQ+ Q    A K+ F                VKDG    DG EE        E +   K+F EN  L+ YYE NYE G F CL  C  T KK  KRFK
Subjt:  NVLQVPQLILTACKEFFADFSLSTSDDDVSENNVKDG----DGLEER------REFKFFLKLFMENEGLRRYYENNYENGEFFCL-ACKGTGKKTSKRFK

Query:  TCGSLLQHSTSLAHKTHSSRMVKMKALAHRAYSLVVCKVLGWDIKNIPAIVLKG
         C  ++QH T         ++ KMK  AH+ ++  VC++LGWD + +P  V+KG
Subjt:  TCGSLLQHSTSLAHKTHSSRMVKMKALAHRAYSLVVCKVLGWDIKNIPAIVLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATGGATCCCAGCTTGGAGAGAAGACTCACTGAAGAGGTTCTTCATCTCCACTCACTGTGGCGGCGAGGCCCGCCAAGGAACCGCGAACCCATTCACAGTCATTC
AGCAACCGCCGATGTCGCGAATCGCAACCCCTCGAACAATGCAAGGAAGAGACCCAGAGACCAAAAGCTTCGAAAGGGCAAGAATAAGAATCCACGCCCTGCCCCCAAGC
CACCGGAAGACTCTGGCCCCGAGTGGCCTTGCCCGGAGCCGGTCGAAAATCAGGCTTCGACGTCATCTGGGTGGCCGCCGATCGAGCCCAGTACCACTCCGGCTGTTCAG
CCGGTGTCGTCTGAAGACCGCGCAAAGCTTTCCGCGATGAAATTGCAGTACAAGGAGTCCCGGGCTTGCCGGGGATTCTTCGCTAGAAATGGCGATTCGGGGAGTGACGA
AGAGGAGGAGGGTAACGACGGGGAGGTAACGGAAAATGAAGAGTATAAGTTCTTTCTGAAGCTGTTTGGGGAGAATAACGACCTTAGGAGTTATTACGAGACGAATTCTG
AAGGTGGGTCGTTTTGTTGCTTGGTTTGCGGTGGAATGGGGAAAAAGAATTCTGGGAAAAAGTTTAAGAGCTGCATTGGGCTCGTTCAACATTCGATTTCGATATCAAGG
ACGAAGAAGAGGCGGGCTCACAGGGCTTTCGGACGGGTCATATGCAGGGTTTTTGGTTGGGATATCGATCGACTTCCCACGATTGTGTTGAAGGGTGAGCCTCTTGGTCG
CTCATTAGCCGATTCTGGAGACTTGAAGGTTCAGTCCGAGGAAAATCATGTGGCTAAAGAGCATGATTCTGGGGTTCAGAGTGAAAATGAGGCCATTTTGGATGACAACA
ATGATCAGAAGAATGAAGAGATTTCTGTGGACGGAAATGAGCATAAGTTAGAGAAAGAAAAGGCAGCAGAAGATCCTGATTTTAATGATAAAGATTTGATTTCTGGTGAG
AATGAAAATGCTTGCAAGGAGAATGATCACAAAGCAAAAAATGTTGATAATTCAATTTCAGACATGGGAGCAGGAAAAGTTGAAATGGAAAACTTGCCTTTAAATGTGCT
GCAGGTACCGCAGCTGATTTTGACAGCCTGTAAAGAATTTTTTGCGGACTTCTCCCTGTCTACGAGTGACGATGATGTTAGTGAAAATAACGTAAAGGATGGAGATGGAC
TTGAGGAACGCAGAGAGTTCAAGTTCTTTTTGAAGTTGTTTATGGAGAATGAAGGCTTGAGAAGATATTATGAGAACAACTACGAAAACGGGGAATTTTTCTGTTTAGCT
TGTAAAGGAACAGGAAAGAAAACGTCGAAGAGATTTAAGACGTGTGGCAGCCTTCTCCAGCATTCAACTTCTCTGGCTCACAAGACTCACAGTAGTAGAATGGTGAAAAT
GAAGGCGCTGGCTCATAGGGCTTATAGTTTAGTTGTATGCAAGGTTCTTGGGTGGGACATCAAAAATATTCCAGCAATCGTGTTAAAAGGCGAACCTCTTGGCCGTTCCT
TATCAAAGCCAGGCATGCCGAAGGGTGGTGAATCTGCAGTTGGTAATATGAATGATTTAGATGAGCTCATAGAAAATGAATCTATGAAGGTTAATAGCAATGGTGAAGCA
GTTTTGAAGGATGGTTCTCTGATTAATGTCGTCAATTTTTCCTCCTCTTTCCCGTCGACCGGCAGCCCGAGCCGCCACCGCAAATGGGCTTTCGTACTTCTCATTGTGCG
AGTTGTACATTGCCAGCCACGCCAGTTGACGCGCCAAATTCTCCGGCCCCTCCGTCTCCTCCGCCTTCTCCAAAACCCCTACGATCTCCTCCGCGAACAAGTTGTCCAAC
AACCCATCTGTTCCGATCACCACAATGTCTCCGGCCATCAACGGAACTTCCCTCAGCCACGCCTTATCCGGCCCATCGGAACCCTGCTCATTTCCCAACTGAAACGGATG
GTTGAACCGCCGCTGCTGAATCGGCGACTTGAACGTGCACCTCTTATTCCTGAACACCATGAACCCGCTGTCTCCGACGAACACCGACCGGACGCACTTTCGAGGTCGAC
GACGGCGGAGCGTTCTCCGGAAACGGCTTCGGCGCAGTTCGCCATCAGCTCTCTGGCGTACTCGCCGGCGTCGATACCTTTCGAAGCCCAGCCGCCGACGCCGTCCGCCA
CGCCGACGGTCTGCTTCTCTGCGCAGACGAAGTGGGCGTCTTCTCCGAGCGGTTTGAAAGGGTTGTCTTTGGGAATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATGGATCCCAGCTTGGAGAGAAGACTCACTGAAGAGGTTCTTCATCTCCACTCACTGTGGCGGCGAGGCCCGCCAAGGAACCGCGAACCCATTCACAGTCATTC
AGCAACCGCCGATGTCGCGAATCGCAACCCCTCGAACAATGCAAGGAAGAGACCCAGAGACCAAAAGCTTCGAAAGGGCAAGAATAAGAATCCACGCCCTGCCCCCAAGC
CACCGGAAGACTCTGGCCCCGAGTGGCCTTGCCCGGAGCCGGTCGAAAATCAGGCTTCGACGTCATCTGGGTGGCCGCCGATCGAGCCCAGTACCACTCCGGCTGTTCAG
CCGGTGTCGTCTGAAGACCGCGCAAAGCTTTCCGCGATGAAATTGCAGTACAAGGAGTCCCGGGCTTGCCGGGGATTCTTCGCTAGAAATGGCGATTCGGGGAGTGACGA
AGAGGAGGAGGGTAACGACGGGGAGGTAACGGAAAATGAAGAGTATAAGTTCTTTCTGAAGCTGTTTGGGGAGAATAACGACCTTAGGAGTTATTACGAGACGAATTCTG
AAGGTGGGTCGTTTTGTTGCTTGGTTTGCGGTGGAATGGGGAAAAAGAATTCTGGGAAAAAGTTTAAGAGCTGCATTGGGCTCGTTCAACATTCGATTTCGATATCAAGG
ACGAAGAAGAGGCGGGCTCACAGGGCTTTCGGACGGGTCATATGCAGGGTTTTTGGTTGGGATATCGATCGACTTCCCACGATTGTGTTGAAGGGTGAGCCTCTTGGTCG
CTCATTAGCCGATTCTGGAGACTTGAAGGTTCAGTCCGAGGAAAATCATGTGGCTAAAGAGCATGATTCTGGGGTTCAGAGTGAAAATGAGGCCATTTTGGATGACAACA
ATGATCAGAAGAATGAAGAGATTTCTGTGGACGGAAATGAGCATAAGTTAGAGAAAGAAAAGGCAGCAGAAGATCCTGATTTTAATGATAAAGATTTGATTTCTGGTGAG
AATGAAAATGCTTGCAAGGAGAATGATCACAAAGCAAAAAATGTTGATAATTCAATTTCAGACATGGGAGCAGGAAAAGTTGAAATGGAAAACTTGCCTTTAAATGTGCT
GCAGGTACCGCAGCTGATTTTGACAGCCTGTAAAGAATTTTTTGCGGACTTCTCCCTGTCTACGAGTGACGATGATGTTAGTGAAAATAACGTAAAGGATGGAGATGGAC
TTGAGGAACGCAGAGAGTTCAAGTTCTTTTTGAAGTTGTTTATGGAGAATGAAGGCTTGAGAAGATATTATGAGAACAACTACGAAAACGGGGAATTTTTCTGTTTAGCT
TGTAAAGGAACAGGAAAGAAAACGTCGAAGAGATTTAAGACGTGTGGCAGCCTTCTCCAGCATTCAACTTCTCTGGCTCACAAGACTCACAGTAGTAGAATGGTGAAAAT
GAAGGCGCTGGCTCATAGGGCTTATAGTTTAGTTGTATGCAAGGTTCTTGGGTGGGACATCAAAAATATTCCAGCAATCGTGTTAAAAGGCGAACCTCTTGGCCGTTCCT
TATCAAAGCCAGGCATGCCGAAGGGTGGTGAATCTGCAGTTGGTAATATGAATGATTTAGATGAGCTCATAGAAAATGAATCTATGAAGGTTAATAGCAATGGTGAAGCA
GTTTTGAAGGATGGTTCTCTGATTAATGTCGTCAATTTTTCCTCCTCTTTCCCGTCGACCGGCAGCCCGAGCCGCCACCGCAAATGGGCTTTCGTACTTCTCATTGTGCG
AGTTGTACATTGCCAGCCACGCCAGTTGACGCGCCAAATTCTCCGGCCCCTCCGTCTCCTCCGCCTTCTCCAAAACCCCTACGATCTCCTCCGCGAACAAGTTGTCCAAC
AACCCATCTGTTCCGATCACCACAATGTCTCCGGCCATCAACGGAACTTCCCTCAGCCACGCCTTATCCGGCCCATCGGAACCCTGCTCATTTCCCAACTGAAACGGATG
GTTGAACCGCCGCTGCTGAATCGGCGACTTGAACGTGCACCTCTTATTCCTGAACACCATGAACCCGCTGTCTCCGACGAACACCGACCGGACGCACTTTCGAGGTCGAC
GACGGCGGAGCGTTCTCCGGAAACGGCTTCGGCGCAGTTCGCCATCAGCTCTCTGGCGTACTCGCCGGCGTCGATACCTTTCGAAGCCCAGCCGCCGACGCCGTCCGCCA
CGCCGACGGTCTGCTTCTCTGCGCAGACGAAGTGGGCGTCTTCTCCGAGCGGTTTGAAAGGGTTGTCTTTGGGAATGTAG
Protein sequenceShow/hide protein sequence
MAMDPSLERRLTEEVLHLHSLWRRGPPRNREPIHSHSATADVANRNPSNNARKRPRDQKLRKGKNKNPRPAPKPPEDSGPEWPCPEPVENQASTSSGWPPIEPSTTPAVQ
PVSSEDRAKLSAMKLQYKESRACRGFFARNGDSGSDEEEEGNDGEVTENEEYKFFLKLFGENNDLRSYYETNSEGGSFCCLVCGGMGKKNSGKKFKSCIGLVQHSISISR
TKKRRAHRAFGRVICRVFGWDIDRLPTIVLKGEPLGRSLADSGDLKVQSEENHVAKEHDSGVQSENEAILDDNNDQKNEEISVDGNEHKLEKEKAAEDPDFNDKDLISGE
NENACKENDHKAKNVDNSISDMGAGKVEMENLPLNVLQVPQLILTACKEFFADFSLSTSDDDVSENNVKDGDGLEERREFKFFLKLFMENEGLRRYYENNYENGEFFCLA
CKGTGKKTSKRFKTCGSLLQHSTSLAHKTHSSRMVKMKALAHRAYSLVVCKVLGWDIKNIPAIVLKGEPLGRSLSKPGMPKGGESAVGNMNDLDELIENESMKVNSNGEA
VLKDGSLINVVNFSSSFPSTGSPSRHRKWAFVLLIVRVVHCQPRQLTRQILRPLRLLRLLQNPYDLLREQVVQQPICSDHHNVSGHQRNFPQPRLIRPIGTLLISQLKRM
VEPPLLNRRLERAPLIPEHHEPAVSDEHRPDALSRSTTAERSPETASAQFAISSLAYSPASIPFEAQPPTPSATPTVCFSAQTKWASSPSGLKGLSLGM