CuGenDBv2

MC11g0823 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0823
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC11:6988344..6992285
RNA-Seq Expression: MC11g0823
Synteny: MC11g0823
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022141998.1 uncharacterized protein LOC111012232 isoform X1 [Momordica charantia]; e value: 0.0; % identity: 99.65
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEE NDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
        LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT

Query:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
        ENDDLRGYYESNYEDGEFVCLACEGA KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
Subjt:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP

Query:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
        LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
Subjt:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG

XP_022142005.1 uncharacterized protein LOC111012232 isoform X2 [Momordica charantia]; e value: 0.0; % identity: 98.59
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEE NDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
        LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT

Query:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
        ENDDLRGYYESNYEDGEFVCLACEGA KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
Subjt:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP

Query:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
        LGRSLTKPGVSK      DEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
Subjt:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG

XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida]; e value: 9.95e-230; % identity: 62.81
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP  P+      KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS+     EE EEEE +G + + EEYKFFLK+FVEN EL  YYEKN E G FCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-
        GM K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE   GV++EN     DD 
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-

Query:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSDE-
         +KNE        +KLEE++ AEDP SN+K+  SG+N + CK NDV +Q ENTDNS+ GM     EM NLP     V + I KACKEF A F  S SD  
Subjt:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSDE-

Query:  -----LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHR
             L DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GA KK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHR
Subjt:  -----LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHR

Query:  AYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG
        A S  +CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K+    QDE  +V  S+  +  E+ S + +K++++    AV  M+D+V + S +  Q+ G
Subjt:  AYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida]; e value: 1.07e-228; % identity: 62.48
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP  P+      KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS+     EE EEEE +G + + EEYKFFLK+FVEN EL  YYEKN E G FCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-
        GM K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE   GV++EN     DD 
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-

Query:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSDE-
         +KNE        +KLEE++ AEDP SN+K+  SG+N + CK NDV +Q ENTDNS+ GM     EM NLP     V + I KACKEF A F  S SD  
Subjt:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLP-----VLQPISKACKEFFAGFSPSTSDE-

Query:  -----LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHR
             L DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GA KK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHR
Subjt:  -----LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHR

Query:  AYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG
        A S  +CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K      + +GN   S+  +  E+ S + +K++++    AV  M+D+V + S +  Q+ G
Subjt:  AYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida]; e value: 3.13e-232; % identity: 63.34
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY E RLTEEVLHLH+LWRRGPP+N K I NHS+  VA  ANR PSNKRP  P+      KKKKPR  P   Q+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        +P ATPAA PVSSEERA L+ALQLQYK   ACRGFFARNADSGS+     EE EEEE +G + + EEYKFFLK+FVEN EL  YYEKN E G FCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-
        GM K+K GK+FK+CVGLVQHSISISRTKKKRAHRAFG V+CRV GWD+DRLP IVLKGEPLSRSLADSG  +VQPE+NHVAKE   GV++EN     DD 
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSEN-----DD-

Query:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDE------
         +KNE        +KLEE++ AEDP SN+K+  SG+N + CK NDV +Q ENTDNS+ GM     EM NLPV + I KACKEF A F  S SD       
Subjt:  -QKNE--------EKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDE------

Query:  LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSA
        L DG+G+EEREEFKFFLKLFTEN+ LR YYE+NY+DGEF CLAC GA KK  K FKTCGRLLQH+TSL KN+I +        AKMLKMK +AHRA S  
Subjt:  LNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSA

Query:  VCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG
        +CKVLGWD+E+LP+VVLKGEPLGRSLTK   +K+    QDE  +V  S+  +  E+ S + +K++++    AV  M+D+V + S +  Q+ G
Subjt:  VCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDD----AVCKMNDLVGNYSDRDQQIAG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X1; e value: 4.20e-203; % identity: 62.16
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS+    EEEEEEEE+DG + + +EY FFLKMFVEN EL  YYEKN E G FCCLVC 
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+K
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL
        LEE K AEDP SN+K+  SGEN +  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFL
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL

Query:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL
        KLFTEN++LR YYE++Y DGEF CLACE A +K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL
Subjt:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL

Query:  KGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKL
         GE LG SLTK  VSK+  K   ++ + N   + D +E+ S E ++L
Subjt:  KGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKL

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X2; e value: 1.12e-202; % identity: 62.25
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS+    EEEEEEEE+DG + + +EY FFLKMFVEN EL  YYEKN E G FCCLVC 
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+K
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL
        LEE K AEDP SN+K+  SGEN +  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFL
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL

Query:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL
        KLFTEN++LR YYE++Y DGEF CLACE A +K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL
Subjt:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL

Query:  KGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIE
         GE LG SLTK  VSK     Q +  N +  +  D  E   +E
Subjt:  KGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIE

A0A5D3DXE1 Uncharacterized protein; e value: 3.77e-201; % identity: 64.6
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPY + RLT+EVL+LHSLW RGPP+N K   +HS+ AVA+     PSNKRP  P   K K KKKK +P  D PQ+SGPEWPCPEPVQNQPSTSSGWP I
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QP ATPAAQ VSSEER  L+ALQLQYK   ACR FFARNADSGS+    EEEEEEEE+DG + + +EY FFLKMFVEN EL  YYEKN E G FCCLVC 
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKK GK+FK+C+ LVQHSISIS TKKKRAHRAFG V+ RV GWD+DRLP IVLKGEPLSRSLA+SG+ +VQPE+ HV  +      S N+D   E+K
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL
        LEE K AEDP SN+K+  SGEN +  K+ DV +Q EN DNSI GMG    EM NL V   I +ACKEF A F  S +D    E    DG EEREEFKFFL
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSD----ELNDGDGLEEREEFKFFL

Query:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL
        KLFTEN++LR YYE++Y DGEF CLACE A +K  K FKTC RLLQHST L KN I E       + K+LKM  LAHRAY+S VCKVLG D++ LP++VL
Subjt:  KLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVL

Query:  KGEPLGRSLTKPGVSKV
         GE LG SLTK  VSKV
Subjt:  KGEPLGRSLTKPGVSKV

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X2; e value: 0.0; % identity: 98.59
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEE NDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
        LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT

Query:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
        ENDDLRGYYESNYEDGEFVCLACEGA KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
Subjt:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP

Query:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
        LGRSLTKPGVSK      DEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
Subjt:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG

A0A6J1CM54 uncharacterized protein LOC111012232 isoform X1; e value: 0.0; % identity: 99.65
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
        MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAI

Query:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
        QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEE NDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG
Subjt:  QPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCG

Query:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
        GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK
Subjt:  GMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEK

Query:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
        LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT
Subjt:  LEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFT

Query:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
        ENDDLRGYYESNYEDGEFVCLACEGA KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP
Subjt:  ENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEP

Query:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
        LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG
Subjt:  LGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCKMNDLVGNYSDRDQQIAG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G78810.1 unknown protein; e value: 3.1e-54; % identity: 32.77
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE
        M+ YD+  L +EV++LHSLW +GPP   K IP+ +   + +   R   N  P              PQ        P       K+PRP      +SG E
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE

Query:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSG------SEVEEEEEEEEEEENDGGITKIEEYKFF
        WP  + V   PST SGWP  +PC     +P+S+EE+ KL+A  LQ    + CR FF R +   DS       SE++E +E++  E+ +   +K  E++F 
Subjt:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSG------SEVEEEEEEEEEEENDGGITKIEEYKFF

Query:  LKMFVENSELGAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEP
         ++F EN +L  YYEKN   G F CLVCGG+G +KS ++FKSC+ L+QHS++I +T  K  HRA   V+C VLGWDV+  P+                  
Subjt:  LKMFVENSELGAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEP

Query:  EVQPEDNHVAKEGGCGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAG
                        V S+ D Q   E       A +P S++K     + V   +E+                     +   L + Q  S+A K+ F  
Subjt:  EVQPEDNHVAKEGGCGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAG

Query:  FSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAR-KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKT
             +D   +       EE +   K+F+EN +L+ YYE NYE G F+CL C  A  KK  K FK C  ++QH T                  K+ KMK 
Subjt:  FSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAR-KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKT

Query:  LAHRAYSSAVCKVLGWDVEELPSVVLKG
         AH+ ++  VC++LGWD E LP  V+KG
Subjt:  LAHRAYSSAVCKVLGWDVEELPSVVLKG

AT1G78810.2 unknown protein; e value: 3.1e-54; % identity: 32.77
Query:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE
        M+ YD+  L +EV++LHSLW +GPP   K IP+ +   + +   R   N  P              PQ        P       K+PRP      +SG E
Subjt:  MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGG------------PQ--------PPKAKKKKKKPRPAPDHPQESGPE

Query:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSG------SEVEEEEEEEEEEENDGGITKIEEYKFF
        WP  + V   PST SGWP  +PC     +P+S+EE+ KL+A  LQ    + CR FF R +   DS       SE++E +E++  E+ +   +K  E++F 
Subjt:  WPCPEPVQNQPSTSSGWPAIQPCATPAAQPVSSEERAKLSALQLQYKEFKACRGFFARNA---DSG------SEVEEEEEEEEEEENDGGITKIEEYKFF

Query:  LKMFVENSELGAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEP
         ++F EN +L  YYEKN   G F CLVCGG+G +KS ++FKSC+ L+QHS++I +T  K  HRA   V+C VLGWDV+  P+                  
Subjt:  LKMFVENSELGAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQHSISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEP

Query:  EVQPEDNHVAKEGGCGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAG
                        V S+ D Q   E       A +P S++K     + V   +E+                     +   L + Q  S+A K+ F  
Subjt:  EVQPEDNHVAKEGGCGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENVNGCKENDVNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAG

Query:  FSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAR-KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKT
             +D   +       EE +   K+F+EN +L+ YYE NYE G F+CL C  A  KK  K FK C  ++QH T                  K+ KMK 
Subjt:  FSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGAR-KKTPKGFKTCGRLLQHSTSLAKNRIGENLPHDADRAKMLKMKT

Query:  LAHRAYSSAVCKVLGWDVEELPSVVLKG
         AH+ ++  VC++LGWD E LP  V+KG
Subjt:  LAHRAYSSAVCKVLGWDVEELPSVVLKG


Sequences
CDS sequence
ATGGATCCCTACGACGAGAGAAGACTCACCGAAGAGGTTCTTCATCTCCACTCTCTCTGGCGGCGAGGTCCGCCGAAGAACTGTAAATCCATTCCCAATCATTCAGCAAT
CGCCGTCGCCAATGTCGCGAATCGCATCCCTTCGAACAAGAGACCCGGAGGCCCACAACCCCCAAAGGCCAAGAAGAAGAAGAAGAAGCCACGCCCTGCCCCCGACCACC
CGCAAGAATCCGGACCCGAATGGCCGTGTCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGGCGATTCAGCCCTGTGCCACTCCGGCGGCTCAGCCC
GTGTCGTCGGAAGAGCGAGCGAAGCTCTCGGCGTTGCAATTGCAGTACAAGGAATTCAAGGCCTGCCGGGGATTCTTCGCGAGGAATGCCGATTCTGGGAGTGAGGTAGA
GGAGGAAGAGGAGGAGGAGGAGGAGGAGGAAAATGATGGGGGGATTACGAAAATTGAGGAGTACAAGTTTTTTCTGAAGATGTTTGTGGAGAATAGTGAACTTGGGGCTT
ATTACGAGAAGAATTATGAATGTGGGTCGTTTTGTTGCTTGGTCTGCGGCGGAATGGGGAAAAAGAAATCTGGGAAAAGGTTCAAGAGCTGCGTTGGGCTCGTTCAGCAT
TCCATTTCGATATCGAGGACGAAGAAGAAGCGGGCTCACAGGGCTTTTGGACTGGTCATATGCAGGGTTCTTGGTTGGGATGTTGATCGACTTCCGATTATTGTGTTGAA
AGGCGAGCCTCTTAGTCGCTCATTAGCTGATTCTGGAGAACCAGAGGTTCAGCCTGAGGATAATCATGTGGCTAAAGAGGGTGGTTGTGGGGTTAAGAGTGAGAACGATG
ATCAGAAGAATGAAGAGAAATTGGAGGAAGACAAGGCAGCAGAAGATCCTGATTCTAATGCTAAAAATTCGAGTTCTGGTGAGAATGTAAATGGCTGCAAGGAGAATGAT
GTCAATATGCAAGAAGAAAATACTGATAATTCAATTCCAGGCATGGGATCAGACAAAGAGGAAATGAAAAACTTGCCTGTACTGCAGCCGATCTCGAAAGCCTGTAAAGA
ATTTTTTGCAGGCTTCTCTCCATCTACGAGCGATGAATTGAACGATGGAGATGGACTCGAGGAACGCGAAGAATTCAAGTTCTTCTTGAAGTTGTTTACTGAGAATGATG
ACTTGAGGGGATATTACGAGAGCAACTATGAAGACGGGGAATTTGTCTGCTTAGCTTGTGAAGGAGCACGGAAGAAAACACCAAAGGGATTTAAGACGTGTGGTCGTCTT
CTCCAACATTCAACTTCTCTAGCGAAGAATAGAATAGGGGAAAATCTGCCTCACGATGCTGACCGTGCTAAAATGTTGAAGATGAAGACACTGGCTCATAGAGCATATAG
TTCGGCTGTGTGCAAGGTTCTTGGTTGGGACGTCGAAGAGCTTCCATCAGTCGTGTTGAAAGGCGAACCTCTGGGTCGTTCCTTAACAAAGCCAGGCGTGTCAAAGGTTT
CGCCTAAATGGCAGGATGAAATTGGTAATGTGAATTATTCGATTTCGGGTGATCCTATGGAAAATGGCTCTATAGAGGCTAGCAAATTGCGGGACGATGCAGTTTGTAAG
ATGAATGATCTCGTCGGAAATTACTCAGACAGAGATCAACAAATTGCTGGGTGA
mRNA sequence
CTCTTTCTTTCCTTCTTTATTTATTTATTTATTTATTTCTGAAAAATGTTACTTCGCGGGAGCAAAACGATGGAAGAAGAGTGATCGGTATAAAGAGAACCCAAGCTTCA
GTTTCTTGTTGATTTCGACGTTTTCGTAAGTTATTAGCACCAATGGATCCCTACGACGAGAGAAGACTCACCGAAGAGGTTCTTCATCTCCACTCTCTCTGGCGGCGAGG
TCCGCCGAAGAACTGTAAATCCATTCCCAATCATTCAGCAATCGCCGTCGCCAATGTCGCGAATCGCATCCCTTCGAACAAGAGACCCGGAGGCCCACAACCCCCAAAGG
CCAAGAAGAAGAAGAAGAAGCCACGCCCTGCCCCCGACCACCCGCAAGAATCCGGACCCGAATGGCCGTGTCCGGAGCCGGTTCAAAATCAGCCCTCGACGTCATCTGGG
TGGCCGGCGATTCAGCCCTGTGCCACTCCGGCGGCTCAGCCCGTGTCGTCGGAAGAGCGAGCGAAGCTCTCGGCGTTGCAATTGCAGTACAAGGAATTCAAGGCCTGCCG
GGGATTCTTCGCGAGGAATGCCGATTCTGGGAGTGAGGTAGAGGAGGAAGAGGAGGAGGAGGAGGAGGAGGAAAATGATGGGGGGATTACGAAAATTGAGGAGTACAAGT
TTTTTCTGAAGATGTTTGTGGAGAATAGTGAACTTGGGGCTTATTACGAGAAGAATTATGAATGTGGGTCGTTTTGTTGCTTGGTCTGCGGCGGAATGGGGAAAAAGAAA
TCTGGGAAAAGGTTCAAGAGCTGCGTTGGGCTCGTTCAGCATTCCATTTCGATATCGAGGACGAAGAAGAAGCGGGCTCACAGGGCTTTTGGACTGGTCATATGCAGGGT
TCTTGGTTGGGATGTTGATCGACTTCCGATTATTGTGTTGAAAGGCGAGCCTCTTAGTCGCTCATTAGCTGATTCTGGAGAACCAGAGGTTCAGCCTGAGGATAATCATG
TGGCTAAAGAGGGTGGTTGTGGGGTTAAGAGTGAGAACGATGATCAGAAGAATGAAGAGAAATTGGAGGAAGACAAGGCAGCAGAAGATCCTGATTCTAATGCTAAAAAT
TCGAGTTCTGGTGAGAATGTAAATGGCTGCAAGGAGAATGATGTCAATATGCAAGAAGAAAATACTGATAATTCAATTCCAGGCATGGGATCAGACAAAGAGGAAATGAA
AAACTTGCCTGTACTGCAGCCGATCTCGAAAGCCTGTAAAGAATTTTTTGCAGGCTTCTCTCCATCTACGAGCGATGAATTGAACGATGGAGATGGACTCGAGGAACGCG
AAGAATTCAAGTTCTTCTTGAAGTTGTTTACTGAGAATGATGACTTGAGGGGATATTACGAGAGCAACTATGAAGACGGGGAATTTGTCTGCTTAGCTTGTGAAGGAGCA
CGGAAGAAAACACCAAAGGGATTTAAGACGTGTGGTCGTCTTCTCCAACATTCAACTTCTCTAGCGAAGAATAGAATAGGGGAAAATCTGCCTCACGATGCTGACCGTGC
TAAAATGTTGAAGATGAAGACACTGGCTCATAGAGCATATAGTTCGGCTGTGTGCAAGGTTCTTGGTTGGGACGTCGAAGAGCTTCCATCAGTCGTGTTGAAAGGCGAAC
CTCTGGGTCGTTCCTTAACAAAGCCAGGCGTGTCAAAGGTTTCGCCTAAATGGCAGGATGAAATTGGTAATGTGAATTATTCGATTTCGGGTGATCCTATGGAAAATGGC
TCTATAGAGGCTAGCAAATTGCGGGACGATGCAGTTTGTAAGATGAATGATCTCGTCGGAAATTACTCAGACAGAGATCAACAAATTGCTGGGTGAATCTGTTGGACGAT
GTTATAGAAAATGAATGTATGAAGATTAATAGCTGTGGTAGAGCAGTTTTGTCATGATTCAGTGATATGCTATTAGTAGAATTCTACTTCAAATTTCTTGTCAAATATTT
ATGCTGTTTCTCTCTGGTCAGAGAGATGGCAAATTTACTTACATTCGAACTTTCATGATATATGTCCCACGCTCGAGACCAGCAAAAAGTAAATAAGATACAAATTTCTT
TAATCTTTCCATATTTTCCATGTTATCTATAACATTGCTGATAAAAGTTTTAGAAAGTAAACTAATCTTCTTTATATTTGCAATTGAAAAGTCCAAAATCTAAACTAGAA
ATCCAAAATCCAAAATGTTCTTTCTAATAATCTTTGTGAACTCAATGCGAACCCACGATTCGTGCAACAATTATAGTAATATCATCAATTTTTCCTCCGGTGTGCTGTCG
ACCGGCGGCCCGAGCCGCCACCGCAAATGGGCTATCATACTTGTCATCGTTCGAATTGTACAAGGCCAGCGCCGCCAGTTCCCAGGCCAAATCCCCCGGCTCAGGCGTCT
CCGAAAGCTCTCCGATCTCATCCGTGTATGCGTTGTCGAACAACCCATCAGTTCCGACCACCACAACATCTCCGGCCGCCACCGGAACCTCCTCCCGCGACGCATAACCC
AGGTCACTCGGACCCGGTTCTCTCTTCAATTGATACGGATGGTTGAACCCGCGCTGCTGAACCGGCGATCTGTGAATACACTTTTTATCCCTGAACACCGCGTACCCGCT
GTCTCCGACCAGCGCCGACCGGAGAACCGCGCCGTCGAGCGACACCATGCACGCCGTCGACGATCCGCGAGCGGCGGTCCGGCTGCAAGCCATGGCGAGAACCTTTCCGA
GGTCGACGCGGCCGCCGCGTTCGTTGGCGACGACGGCGGCGCAGTTGGCCATCAGCTCTCTGGCGTACACGCCGGCGTCGATTCCTTTCGCCGCCCAGCCGCCGACGCCG
TCAGCCACGCCGGCGGTCTGTTTGGCGGCGGAGATGAAGTGGGCGTCTTCTCCGAGCGGTTTCGAAGGGTTGTCTTTGGGAATGTAGAAAGAACCGAACTTCATTTTTAG
GATCGGATTCGGAGTTGCCATTTTTTTGGCAGAGGAAAACTGAAATTGAAATTCGTCGGAGATTTGATCGTCGAGCTTCCGCTTCATATCAGTCATTGTGGAAATTGTGA
AAAAGGGAAAAGTGATC
Protein sequence
MDPYDERRLTEEVLHLHSLWRRGPPKNCKSIPNHSAIAVANVANRIPSNKRPGGPQPPKAKKKKKKPRPAPDHPQESGPEWPCPEPVQNQPSTSSGWPAIQPCATPAAQP
VSSEERAKLSALQLQYKEFKACRGFFARNADSGSEVEEEEEEEEEEENDGGITKIEEYKFFLKMFVENSELGAYYEKNYECGSFCCLVCGGMGKKKSGKRFKSCVGLVQH
SISISRTKKKRAHRAFGLVICRVLGWDVDRLPIIVLKGEPLSRSLADSGEPEVQPEDNHVAKEGGCGVKSENDDQKNEEKLEEDKAAEDPDSNAKNSSSGENVNGCKEND
VNMQEENTDNSIPGMGSDKEEMKNLPVLQPISKACKEFFAGFSPSTSDELNDGDGLEEREEFKFFLKLFTENDDLRGYYESNYEDGEFVCLACEGARKKTPKGFKTCGRL
LQHSTSLAKNRIGENLPHDADRAKMLKMKTLAHRAYSSAVCKVLGWDVEELPSVVLKGEPLGRSLTKPGVSKVSPKWQDEIGNVNYSISGDPMENGSIEASKLRDDAVCK
MNDLVGNYSDRDQQIAG