; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021242 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021242
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold358:406574..408882
RNA-Seq ExpressionMS021242
SyntenyMS021242
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141998.1 uncharacterized protein LOC111012232 isoform X1 [Momordica charantia]6.2e-29099.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSEL AYYEKNYECGSFCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGG GVKSENDDQKNEEKL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
        EEDKAAEDPDSNAKNSSSGEN NGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE

Query:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
        NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
Subjt:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL

Query:  GRSLTKPGVSK
        GRSLTKPGVSK
Subjt:  GRSLTKPGVSK

XP_022142005.1 uncharacterized protein LOC111012232 isoform X2 [Momordica charantia]6.2e-29099.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSEL AYYEKNYECGSFCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGG GVKSENDDQKNEEKL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
        EEDKAAEDPDSNAKNSSSGEN NGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE

Query:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
        NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
Subjt:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL

Query:  GRSLTKPGVSK
        GRSLTKPGVSK
Subjt:  GRSLTKPGVSK

XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]2.3e-18367.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP    P     KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS    +EE EEEE +G + + EEYKFFLK+FVEN ELR YYEKN E G FCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--
        M K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE  SGV++EN     DD  
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--

Query:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSD---
        +KNE        +KLEE++ AEDP SN+K+  SG+N++ CK NDV +Q ENTDNS+ GM     EM NLP     V + I KACKEF A F  S SD   
Subjt:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSD---

Query:  ---ELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRA
            L DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GAGKK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHRA
Subjt:  ---ELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRA

Query:  YSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK
         S  +CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K
Subjt:  YSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]2.3e-18367.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP    P     KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS    +EE EEEE +G + + EEYKFFLK+FVEN ELR YYEKN E G FCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--
        M K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE  SGV++EN     DD  
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--

Query:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSD---
        +KNE        +KLEE++ AEDP SN+K+  SG+N++ CK NDV +Q ENTDNS+ GM     EM NLP     V + I KACKEF A F  S SD   
Subjt:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSD---

Query:  ---ELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRA
            L DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GAGKK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHRA
Subjt:  ---ELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRA

Query:  YSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK
         S  +CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K
Subjt:  YSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]3.2e-18568.05Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP    P     KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS    +EE EEEE +G + + EEYKFFLK+FVEN ELR YYEKN E G FCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--
        M K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE  SGV++EN     DD  
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSEN-----DD--

Query:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD------EL
        +KNE        +KLEE++ AEDP SN+K+  SG+N++ CK NDV +Q ENTDNS+ GM     EM NLPV + I KACKEF A F  S SD       L
Subjt:  QKNE--------EKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD------EL

Query:  NDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAV
         DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GAGKK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHRA S  +
Subjt:  NDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAV

Query:  CKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK
        CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K
Subjt:  CKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X12.2e-16365.05Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS   +EEEEEEEE+DG + + +EY FFLKMFVEN ELR YYEKN E G FCCLVC G
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+KL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK
        EE K AEDP SN+K+  SGEN++  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFLK
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK

Query:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK
        LFTEN++LR YYE++Y DGEF CLACE AG+K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL 
Subjt:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK

Query:  GEPLGRSLTKPGVSK
        GE LG SLTK  VSK
Subjt:  GEPLGRSLTKPGVSK

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X22.2e-16365.05Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS   +EEEEEEEE+DG + + +EY FFLKMFVEN ELR YYEKN E G FCCLVC G
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+KL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK
        EE K AEDP SN+K+  SGEN++  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFLK
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK

Query:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK
        LFTEN++LR YYE++Y DGEF CLACE AG+K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL 
Subjt:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK

Query:  GEPLGRSLTKPGVSK
        GE LG SLTK  VSK
Subjt:  GEPLGRSLTKPGVSK

A0A5D3DXE1 Uncharacterized protein2.2e-16365.05Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS   +EEEEEEEE+DG + + +EY FFLKMFVEN ELR YYEKN E G FCCLVC G
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+KL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK
        EE K AEDP SN+K+  SGEN++  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFLK
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFLK

Query:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK
        LFTEN++LR YYE++Y DGEF CLACE AG+K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL 
Subjt:  LFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLK

Query:  GEPLGRSLTKPGVSK
        GE LG SLTK  VSK
Subjt:  GEPLGRSLTKPGVSK

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X23.0e-29099.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSEL AYYEKNYECGSFCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGG GVKSENDDQKNEEKL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
        EEDKAAEDPDSNAKNSSSGEN NGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE

Query:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
        NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
Subjt:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL

Query:  GRSLTKPGVSK
        GRSLTKPGVSK
Subjt:  GRSLTKPGVSK

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X13.0e-29099.41Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSEL AYYEKNYECGSFCCLVCGG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGG

Query:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL
        MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGG GVKSENDDQKNEEKL
Subjt:  MGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKL

Query:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
        EEDKAAEDPDSNAKNSSSGEN NGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE
Subjt:  EEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTE

Query:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
        NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL
Subjt:  NDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPL

Query:  GRSLTKPGVSK
        GRSLTKPGVSK
Subjt:  GRSLTKPGVSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein3.6e-5433.08Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE
        M+ YD+  L +EV++LHSLW +GPP   K IP+ +   + +   R   N  P              PQ        P       K+PRP      +SG E
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE

Query:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSGSEVEEEEEEEEEENDGGITKIE-----EYKFFLK
        WP  + V   PST SGWP  +PC     +P+S+EE+ KL+A  LQ    + CR FF R +   DS     +E E +E + D  + K E     E++F  +
Subjt:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSGSEVEEEEEEEEEENDGGITKIE-----EYKFFLK

Query:  MFVENSELRAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEV
        +F EN +L+ YYEKN   G F CLVCGG+G +KS ++FKSC+ L+QHS++I +T  K  HRA   V+C VLGWDV+  P+                    
Subjt:  MFVENSELRAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEV

Query:  QPEDNHVAKEGGSGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFS
                      V S+ D Q   E       A +P S++K           ++  V   EE+   ++            L + Q  S+A K+ F    
Subjt:  QPEDNHVAKEGGSGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFS

Query:  PSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGA-GKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLA
           +D   +       EE +   K+F+EN +L+ YYE NYE G F+CL C  A  KK  K FK C  ++QH T                  K+ KMK  A
Subjt:  PSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGA-GKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLA

Query:  HRAYSSAVCKVLGWDVEELPSVVLKG
        H+ ++  VC++LGWD E LP  V+KG
Subjt:  HRAYSSAVCKVLGWDVEELPSVVLKG

AT1G78810.2 unknown protein3.6e-5433.08Show/hide
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE
        M+ YD+  L +EV++LHSLW +GPP   K IP+ +   + +   R   N  P              PQ        P       K+PRP      +SG E
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE

Query:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSGSEVEEEEEEEEEENDGGITKIE-----EYKFFLK
        WP  + V   PST SGWP  +PC     +P+S+EE+ KL+A  LQ    + CR FF R +   DS     +E E +E + D  + K E     E++F  +
Subjt:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSGSEVEEEEEEEEEENDGGITKIE-----EYKFFLK

Query:  MFVENSELRAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEV
        +F EN +L+ YYEKN   G F CLVCGG+G +KS ++FKSC+ L+QHS++I +T  K  HRA   V+C VLGWDV+  P+                    
Subjt:  MFVENSELRAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEV

Query:  QPEDNHVAKEGGSGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFS
                      V S+ D Q   E       A +P S++K           ++  V   EE+   ++            L + Q  S+A K+ F    
Subjt:  QPEDNHVAKEGGSGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENENGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFS

Query:  PSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGA-GKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLA
           +D   +       EE +   K+F+EN +L+ YYE NYE G F+CL C  A  KK  K FK C  ++QH T                  K+ KMK  A
Subjt:  PSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGA-GKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLA

Query:  HRAYSSAVCKVLGWDVEELPSVVLKG
        H+ ++  VC++LGWD E LP  V+KG
Subjt:  HRAYSSAVCKVLGWDVEELPSVVLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCTACGACGAGAGAAGACTCACCGAAGAGGTTCTTCATCTCCACTCTCTCTGGCGGCGAGGTCCGCCGAAGAACTGTAAATCCATTCCCAATCATTCAGCAAT
CGCCGTCGCCAACGTCGCGAATCGCATCCCTTCGAACAAGAGACCCGGAGGCCCACAACCCCCAAAGGCCAAGAAGAAGAAGAAGAAGCCACGCCCTGCCCCCGACCACC
CGCAAGAATCCGGACCCGAATGGCCGTGTCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGGCGATTCAGCCCTGTGCCACTCCGGCGGCTCAGCCC
GTGTCGTCGGAAGAGCGAGCGAAGCTCTCGGCGTTGCAATTGCAGTACAAGGAATTCAAGGCCTGCCGGGGATTCTTCGCGAGGAATGCCGATTCGGGGAGTGAGGTAGA
GGAGGAAGAGGAGGAGGAGGAGGAGGAAAATGATGGGGGGATTACGAAAATTGAGGAGTACAAGTTTTTTCTGAAGATGTTTGTGGAGAATAGTGAACTTAGGGCTTATT
ACGAGAAGAATTATGAATGTGGGTCGTTTTGTTGCTTGGTCTGCGGCGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTCAAGAGCTGCGTTGGGCTCGTTCAGCATTCC
ATTTCGATATCGAGGACGAAGAAGAAGCGGGCTCACAGGGCTTTTGGACTGGTCATATGCAGGGTTCTTGGTTGGGATGTTGATCGACTTCCGATTATTGTGTTGAAAGG
CGAGCCTCTTAGTCGCTCATTAGCTGATTCTGGAGAACCAGAGGTTCAGCCTGAGGATAATCATGTGGCTAAAGAGGGTGGTTCTGGGGTTAAGAGTGAGAACGATGATC
AGAAGAATGAAGAGAAATTGGAGGAAGACAAGGCAGCAGAAGATCCTGATTCTAATGCTAAAAATTCGAGTTCTGGTGAGAATGAAAATGGCTGCAAGGAGAATGATGTC
AATATGCAAGAAGAAAATACTGATAATTCAATTCCAGGCATGGGATCAGACAAAGAGGAAATGAAAAACTTGCCTGTACTGCAGCCGATCTCGAAAGCCTGTAAAGAATT
TTTTGCAGGCTTCTCTCCATCTACGAGCGATGAATTGAACGATGGAGATGGACTCGAGGAACGCGAAGAGTTCAAGTTCTTCTTGAAGTTGTTTACTGAGAATGATGACT
TGAGGGGATATTACGAGAGCAACTATGAAGACGGGGAATTTGTCTGCTTAGCTTGTGAAGGAGCAGGGAAGAAAACACCAAAGGGATTTAAGACGTGTGGTCGTCTTCTC
CAACATTCAACTTCTCTAGCGAAGAATAGAATAGGGGAAAATCTGCCTCACGATGCTGACCGTGCTAAAATGTTGAAGATGAAGACACTGGCTCATAGAGCATATAGTTC
GGCTGTGTGCAAGGTTCTTGGTTGGGACGTCGAAGAGCTTCCATCAGTCGTGTTGAAAGGCGAACCTCTGGGTCGTTCCTTAACAAAGCCAGGCGTGTCAAAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCTACGACGAGAGAAGACTCACCGAAGAGGTTCTTCATCTCCACTCTCTCTGGCGGCGAGGTCCGCCGAAGAACTGTAAATCCATTCCCAATCATTCAGCAAT
CGCCGTCGCCAACGTCGCGAATCGCATCCCTTCGAACAAGAGACCCGGAGGCCCACAACCCCCAAAGGCCAAGAAGAAGAAGAAGAAGCCACGCCCTGCCCCCGACCACC
CGCAAGAATCCGGACCCGAATGGCCGTGTCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGGCGATTCAGCCCTGTGCCACTCCGGCGGCTCAGCCC
GTGTCGTCGGAAGAGCGAGCGAAGCTCTCGGCGTTGCAATTGCAGTACAAGGAATTCAAGGCCTGCCGGGGATTCTTCGCGAGGAATGCCGATTCGGGGAGTGAGGTAGA
GGAGGAAGAGGAGGAGGAGGAGGAGGAAAATGATGGGGGGATTACGAAAATTGAGGAGTACAAGTTTTTTCTGAAGATGTTTGTGGAGAATAGTGAACTTAGGGCTTATT
ACGAGAAGAATTATGAATGTGGGTCGTTTTGTTGCTTGGTCTGCGGCGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTCAAGAGCTGCGTTGGGCTCGTTCAGCATTCC
ATTTCGATATCGAGGACGAAGAAGAAGCGGGCTCACAGGGCTTTTGGACTGGTCATATGCAGGGTTCTTGGTTGGGATGTTGATCGACTTCCGATTATTGTGTTGAAAGG
CGAGCCTCTTAGTCGCTCATTAGCTGATTCTGGAGAACCAGAGGTTCAGCCTGAGGATAATCATGTGGCTAAAGAGGGTGGTTCTGGGGTTAAGAGTGAGAACGATGATC
AGAAGAATGAAGAGAAATTGGAGGAAGACAAGGCAGCAGAAGATCCTGATTCTAATGCTAAAAATTCGAGTTCTGGTGAGAATGAAAATGGCTGCAAGGAGAATGATGTC
AATATGCAAGAAGAAAATACTGATAATTCAATTCCAGGCATGGGATCAGACAAAGAGGAAATGAAAAACTTGCCTGTACTGCAGCCGATCTCGAAAGCCTGTAAAGAATT
TTTTGCAGGCTTCTCTCCATCTACGAGCGATGAATTGAACGATGGAGATGGACTCGAGGAACGCGAAGAGTTCAAGTTCTTCTTGAAGTTGTTTACTGAGAATGATGACT
TGAGGGGATATTACGAGAGCAACTATGAAGACGGGGAATTTGTCTGCTTAGCTTGTGAAGGAGCAGGGAAGAAAACACCAAAGGGATTTAAGACGTGTGGTCGTCTTCTC
CAACATTCAACTTCTCTAGCGAAGAATAGAATAGGGGAAAATCTGCCTCACGATGCTGACCGTGCTAAAATGTTGAAGATGAAGACACTGGCTCATAGAGCATATAGTTC
GGCTGTGTGCAAGGTTCTTGGTTGGGACGTCGAAGAGCTTCCATCAGTCGTGTTGAAAGGCGAACCTCTGGGTCGTTCCTTAACAAAGCCAGGCGTGTCAAAG
Protein sequenceShow/hide protein sequence
MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAIQPCATPAAQP
VSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELRAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHS
ISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGSGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENENGCKENDV
NMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAGKKTPKGFKTCGRLL
QHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSK