CuGenDBv2

MC04g0661 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0661
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: PB1 domain-containing protein
Genome location: MC04:6107894..6111567
RNA-Seq Expression: MC04g0661
Synteny: MC04g0661
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000270 - PB1 domain
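The genome location field follows a `chromosome:start..end` pattern. The snippet below is a minimal sketch of how such a string could be parsed and its span computed. It assumes 1-based, inclusive coordinates, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("MC04:6107894..6111567")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span for this gene is 3674 bp; with 0-based half-open coordinates it would be one base shorter.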


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571886.1 hypothetical protein SDJN03_28614, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.22e-289; % identity: 84.46
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVL SG +HG D PKQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPVALKLHDP+PE VAPP+ESAARP PGDRIAV DP VHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TLAAD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

KAG7011570.1 hypothetical protein SDJN02_26476, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 7.39e-291; % identity: 84.87
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D PKQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPVALKLHDP+PE VAPP+ESAARP PGDRIAV DP VHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPGA DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TLAAD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

XP_022952220.1 uncharacterized protein LOC111454962 [Cucurbita moschata]; e value: 3.00e-290; % identity: 84.66
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D PKQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPVALKLHDP+PE VAPP+ESAARP PGDRIAV DP VHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TLAAD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

XP_022971931.1 uncharacterized protein LOC111470603 [Cucurbita maxima]; e value: 4.97e-289; % identity: 84.25
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D  KQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPV LKLHDP+PE VAPP+ESAARP PGDRIAV DPVVHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TL+AD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

XP_023554167.1 uncharacterized protein LOC111811512 [Cucurbita pepo subsp. pepo]; e value: 7.05e-289; % identity: 84.46
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D PKQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
         PPVALKLHDP+PE VAPP+ESAARP PGDRIAV DP VHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPV-AAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V A PPNQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPV-AAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TLAAD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2B0 PB1 domain-containing protein; e value: 4.23e-277; % identity: 81.54
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGGDTKIFAVDRSIKFASM+AKLSS  D DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSF SDG RSDR+RFVEVL+S  +H ADAPKQSVPNKVDFLFGLDK G+APP
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT
        PPP VA+KLHDP+PE VA P+E  ARP PGDRIA+ DPVVHPAEI RQLQELQRLHISEQEQAAAY+RKSEENNL+GGY G++YAQK MEK+P  NA   
Subjt:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT

Query:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAA--PPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSP
        +Q PP GYW EKQVSSGGF AT+TAT G PDQ PVYMIH PG VYH+ QHPMVRP+ A  PPNQGYYAVQRMASD+YR+QPVYNVVQ PPQ PYPATSSP
Subjt:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAA--PPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSP

Query:  SLQQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVGPPPP--YQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
        +L QQPP KVAAYPGGG+TLAADAGP  YTQVAYDSSTGRQVYYTA G  ++  PPP  YQ VS    G++RTGAVGQDGK +  AK+SQG V
Subjt:  SLQQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVGPPPP--YQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

A0A1S3C038 LOW QUALITY PROTEIN: formin-like protein 20; e value: 6.44e-283; % identity: 82.48
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGGDTKIFAVDRSIKFASM+AKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMR FLFPANQSPSF SDG RSDR+RFVEVL+S  +H  DAPKQSVPNKVDFLFGLDK G+APP
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT
        PPP VA+KLHDP+PE VA P+E  ARP PGDRIA+ DPVVHP EI RQLQELQRLHISEQEQAAAY+RKSEENNL+GGY G+YY QK MEK+P  NA P+
Subjt:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT

Query:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        LQ PP GYW EKQVSSGGF AT+TATPG PDQ PVYMIH PG VYH+ QHPMVRPV  PPNQGYYAVQRMASDVYR+QPVYNVVQ PPQ PYPATSSP L
Subjt:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVG--PPPPYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KV AYPGGGMTLAAD+GP  YTQVAYDSSTGRQVYYT GGA ++   PPPPYQ VS    G++RTGAVGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVG--PPPPYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

A0A5A7SPJ4 Formin-like protein 20; e value: 1.12e-283; % identity: 82.69
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGGDTKIFAVDRSIKFASM+AKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSF SDG RSDR+RFVEVL+S  +H  DAPKQSVPNKVDFLFGLDK G+APP
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT
        PPP VA+KLHDP+PE VA P+E  ARP PGDRIA+ DPVVHP EI RQLQELQRLHISEQEQAAAY+RKSEENNL+GGY G+YY QK MEK+P  NA P+
Subjt:  PPP-VALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPT

Query:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        LQ PP GYW EKQVSSGGF AT+TATPG PDQ PVYMIH PG VYH+ QHPMVRPV  PPNQGYYAVQRMASDVYR+QPVYNVVQ PPQ PYPATSSP L
Subjt:  LQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVG--PPPPYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KV AYPGGGMTLAAD+GP  YTQVAYDSSTGRQVYYT GGA ++   PPPPYQ VS    G++RTGAVGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGP--YTQVAYDSSTGRQVYYTAGGAAMVG--PPPPYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

A0A6J1GL59 uncharacterized protein LOC111454962; e value: 1.45e-290; % identity: 84.66
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D PKQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPVALKLHDP+PE VAPP+ESAARP PGDRIAV DP VHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TLAAD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

A0A6J1I4J8 uncharacterized protein LOC111470603; e value: 2.40e-289; % identity: 84.25
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ
        MDNYAYNSYAESGDSSPRSREIDFENPPPWDDA QLQ+ NYKVKFMCSYGGKIHPRPHDN+LSYVGG+TKIFAVDRS+KFASMLAKLSS CD DVTFKYQ
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQ

Query:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP
        LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG+PARMRLFLFPANQSPSF S+G RSDR+RFVEVLSSG +HG D  KQSVPNKVDFLFGLDKGG+A P
Subjt:  LPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPP

Query:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL
        PPPV LKLHDP+PE VAPP+ESAARP PGDRIAV DPVVHPAEIQRQLQELQRLHISEQEQAAAYRRK EE+NL+GGY GD+YAQKM+EK+PP +AQPTL
Subjt:  PPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTL

Query:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL
        Q PP GYW EKQVSSGGF AT+TATPG  DQ PVYMIHAPG VYH+ QHPMVR V APP NQGYYAVQRMASDVYR+QPVYNVVQ PPQP YP TSSPSL
Subjt:  QAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMIHAPGTVYHAQQHPMVRPVAAPP-NQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSL

Query:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV
         QQPP KVAAYPGGG+TL+AD+GPYTQVAYDS+TGRQVYYTAGGAAMV  PP  PYQ VS    G++RTG VGQDGK +  AK+SQG V
Subjt:  QQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPP--PYQAVS----GEMRTGAVGQDGKQLISAKVSQGPV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein; e value: 9.1e-31; % identity: 44.44
Query:  NSYAESGDSSPRSREIDFENPPPWDDAAQLQSS---------NYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSS--FCDADV
        +SY ES DSSPRSR  D      WDD                + K++FMCSYGG I PRPHD  L Y+GGDT+I  VDR+    S++A+LS+        
Subjt:  NSYAESGDSSPRSREIDFENPPPWDDAAQLQSS---------NYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSS--FCDADV

Query:  TFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG--RPARMRLFLF----PANQSPSFASDGARSDRERFVEVLSS
        T KYQLP EDLD+LISVT D+DL++M+ EYDR   A    +P+R+RLFLF     A QS     + +    + F+  L+S
Subjt:  TFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPG--RPARMRLFLF----PANQSPSFASDGARSDRERFVEVLSS

AT4G05150.1 Octicosapeptide/Phox/Bem1p family protein; e value: 6.1e-27; % identity: 32.88
Query:  PPPWDDAAQLQS-------SNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFC-DADVTFKYQLPGEDLDALISVTNDDDL
        PPP  D   L S       S  +V+FMC++GG+I PRP DN L YVGGD ++ AV R   FAS+L+KL+     ++++ KYQLP EDLDALISV+ D+D+
Subjt:  PPPWDDAAQLQS-------SNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFC-DADVTFKYQLPGEDLDALISVTNDDDL

Query:  EHMMHEYDRLYRAPG-RPARMRLFLFPAN-----------QSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKV----------------DFLF
        E+MM EYDR+ +    R +R+RLFLF  N            S S   D + +  + F++ L+ G S  A A       +V                D+LF
Subjt:  EHMMHEYDRLYRAPG-RPARMRLFLFPAN-----------QSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKV----------------DFLF

Query:  GLDKGGVAPPPPPVALKLHDPVPEA-VAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMME
        GLD      PP     +L D  P A +   V + + PG   R  VP P                + IS  E       K E    +     +   +++M+
Subjt:  GLDKGGVAPPPPPVALKLHDPVPEA-VAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMME

Query:  KSP-PANAQPTLQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMI--HAPGTVYHAQQHPMVRP
        +S  P N+Q    AP PG    +QV   G             Q PVY +    PG     Q + MV+P
Subjt:  KSP-PANAQPTLQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMI--HAPGTVYHAQQHPMVRP

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein; e value: 2.1e-80; % identity: 40.44
Query:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCD-----ADV
        MD ++YNSY +S +SSPRSR+++FENP PW+D    Q  NYKVK MCSYGGKI PRPHDN L+YV GDTKI +VDR I+F ++++KLS+ C       ++
Subjt:  MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCD-----ADV

Query:  TFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQ-SPSFASDGA-RSDRERFVEVLSSGPSH-GADAPKQSVPNKVDFLFGL
        +FKYQLPGEDLDALISVTND+DLEHMMHEYDRL R   +PARMRLFLFP++  S  F S+G+ +SDR+    + S   S     AP    PN  DFLFG 
Subjt:  TFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQ-SPSFASDGA-RSDRERFVEVLSSGPSH-GADAPKQSVPNKVDFLFGL

Query:  DKGGVAPPPPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQE--------------------------------
        +K     P PP  +K+  PVPE   P V    +     R+  P+  V+PAEIQRQ+QE Q + I +QE                                
Subjt:  DKGGVAPPPPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQE--------------------------------

Query:  ---------------------QAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTLQAPPPGYWPEKQVSSGGFQATV--TATPGAPD----QQP
                             Q A YRRK+E+       AG Y+     +   P     T Q PP GYW +   ++   Q  +  T +   P+    QQ 
Subjt:  ---------------------QAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTLQAPPPGYWPEKQVSSGGFQATV--TATPGAPD----QQP

Query:  VYMI----HAPGTVYHAQQHPMVRPVAAPPNQGYY--AVQRM-ASDVY---RDQPVYNVVQQPPQPPYPATSSPSLQQQPPPKVAAYPGGGMTLAADAGP
        VYMI     APGT+Y +   P V+      NQGYY   VQR+   D Y   ++QP YNVVQ  PQP +  +  P +     P+V    G  M L     P
Subjt:  VYMI----HAPGTVYHAQQHPMVRPVAAPPNQGYY--AVQRM-ASDVY---RDQPVYNVVQQPPQPPYPATSSPSLQQQPPPKVAAYPGGGMTLAADAGP

Query:  YTQVAYDSSTGRQVYYTAGGAAMVGPPPPYQAVSGEMRTGAVGQ
        Y+Q+      G+ VYYT  G  M+  PPP Q    + +   +GQ
Subjt:  YTQVAYDSSTGRQVYYTAGGAAMVGPPPPYQAVSGEMRTGAVGQ

AT5G49920.1 Octicosapeptide/Phox/Bem1p family protein; e value: 3.0e-26; % identity: 50
Query:  QSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPA
        + S  KVKFMCS+GG+I PRP D+VL YVGG+T++ AV   I F+ ++ KL++  + D+  KYQ+  EDLDAL+SV +D+D++HM+ EY+R +  P    
Subjt:  QSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPA

Query:  RMRLFLFPAN
        ++R FLFPAN
Subjt:  RMRLFLFPAN

AT5G64430.1 Octicosapeptide/Phox/Bem1p family protein; e value: 4.9e-101; % identity: 47.82
Query:  MDNYAYNSYAESGDSSPRSREIDFEN-PPPWDDAAQ-LQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFC------D
        M+ ++YNSY +S DSSPRSREI+F+N PPPWDD  Q  Q  +YKVKFMCSYGGKI PRPHDN L+YV G+TKI +VDR I+F  + +KLS+ C       
Subjt:  MDNYAYNSYAESGDSSPRSREIDFEN-PPPWDDAAQ-LQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFC------D

Query:  ADVTFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGA-RSDRERFVEVLSSGPS-HGADAPKQSVPNKVDFLF
         +VTFKYQLPGEDLDALISVTNDDDLEHMMHEYDRL R   +PARMRLFLFPA  S  F S  + +SDR+RFVE L++ P    ++    + PN  DFLF
Subjt:  ADVTFKYQLPGEDLDALISVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGA-RSDRERFVEVLSSGPS-HGADAPKQSVPNKVDFLF

Query:  GLDKGGVAPPPPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQE----QAAAYRRKSEENNLL---GGYAGDYY
        G +K    PPPPP  +KL  PVP A+ PP+ +        R+  PD VV+P EIQRQ+QE QR+HI +QE    Q A YRRKS E+ L+   GGY    Y
Subjt:  GLDKGGVAPPPPPVALKLHDPVPEAVAPPVESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQE----QAAAYRRKSEENNLL---GGYAGDYY

Query:  AQKMMEKSPPAN-AQPTLQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMI--HAPGTVYHAQQHP----MVRPV-AAPPNQG--YYAVQRMASDV
         Q      P     Q   QAPP   + +   + G    T T   G P +QPVYMI   +P  VYHA   P    ++RP+     NQG  Y  VQR+ASD 
Subjt:  AQKMMEKSPPAN-AQPTLQAPPPGYWPEKQVSSGGFQATVTATPGAPDQQPVYMI--HAPGTVYHAQQHP----MVRPV-AAPPNQG--YYAVQRMASDV

Query:  YRD--QPVYNVV-------------QQPPQPPYPATSSPSLQQQPPPKVAAYPGGGMTLA-ADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPPPYQAVS
        YR+  Q  YNV              QQ P PP P TS P     PPP+  A P     +   D   YTQV Y    G+QVYYT        PPP Y  V 
Subjt:  YRD--QPVYNVV-------------QQPPQPPYPATSSPSLQQQPPPKVAAYPGGGMTLA-ADAGPYTQVAYDSSTGRQVYYTAGGAAMVGPPPPYQAVS

Query:  ------GEMRTGAVGQDGKQLISAKVS
               E+RTG  G+     ++ KVS
Subjt:  ------GEMRTGAVGQDGKQLISAKVS


Sequences
CDS sequence
ATGGATAATTACGCCTACAACTCCTACGCCGAGTCCGGCGACTCCTCCCCCCGCTCCCGCGAGATCGACTTCGAAAACCCCCCTCCCTGGGACGACGCCGCCCAGCTCCA
GAGCTCCAACTACAAGGTCAAGTTCATGTGCAGCTATGGCGGCAAGATCCACCCCCGCCCCCACGACAACGTCCTCTCCTACGTCGGCGGCGACACCAAGATCTTCGCCG
TCGACCGCTCCATCAAATTCGCCTCTATGTTGGCTAAGCTCTCCTCCTTCTGCGACGCCGATGTCACCTTCAAGTACCAGCTCCCCGGCGAGGATCTCGACGCCCTGATT
TCGGTCACCAACGACGACGATCTCGAGCACATGATGCACGAGTACGATCGCCTCTACAGGGCACCCGGGAGGCCGGCGCGGATGCGGCTTTTTCTGTTTCCGGCTAATCA
GAGCCCTAGCTTCGCCTCCGACGGGGCTCGGTCCGATCGGGAGCGATTTGTGGAGGTTTTGAGTTCCGGTCCCTCCCACGGGGCCGATGCGCCGAAGCAATCGGTTCCGA
ATAAGGTGGATTTTCTGTTTGGATTGGATAAGGGCGGGGTCGCACCTCCGCCGCCGCCTGTTGCGCTTAAATTGCACGATCCGGTGCCTGAGGCGGTCGCTCCGCCCGTT
GAATCTGCTGCTAGACCTGGTCCGGGAGATCGGATCGCGGTTCCGGATCCTGTGGTTCATCCGGCCGAGATTCAGAGGCAATTGCAGGAATTACAGAGGTTGCATATTAG
CGAGCAGGAGCAAGCGGCGGCGTACAGGAGGAAATCCGAAGAGAATAATCTTCTCGGAGGATACGCCGGCGATTACTACGCGCAGAAGATGATGGAGAAATCTCCACCGG
CGAACGCGCAGCCGACCTTGCAGGCTCCGCCGCCAGGATATTGGCCGGAGAAGCAGGTATCCAGCGGAGGTTTTCAGGCGACGGTGACGGCCACTCCCGGCGCACCGGAC
CAACAGCCGGTCTACATGATCCACGCTCCCGGAACGGTTTATCACGCGCAGCAACATCCGATGGTGAGACCCGTCGCCGCGCCGCCCAATCAAGGCTACTACGCCGTACA
GCGCATGGCCTCAGATGTCTACCGCGACCAACCGGTCTACAATGTCGTACAACAACCTCCACAGCCGCCGTACCCTGCGACGTCATCGCCGTCGCTGCAGCAACAACCAC
CTCCGAAGGTCGCCGCCTATCCCGGCGGCGGCATGACTCTGGCAGCAGACGCAGGGCCGTACACACAGGTGGCGTACGACAGCAGCACCGGGAGACAAGTGTACTACACA
GCCGGCGGAGCCGCCATGGTAGGGCCACCACCGCCTTACCAGGCGGTGAGCGGCGAAATGAGAACAGGAGCGGTGGGGCAGGACGGGAAGCAGCTGATTTCAGCAAAGGT
TTCACAAGGCCCAGTCTGA
mRNA sequence
TGGTGATGGTTTCCAACATTGCTATATATATAAAATGAATTTCTTGAGATAATTATGTATGATATTTTTTAATTCTTTTTTTACATATATATAATATACATATGAGCCGC
TAAAGGGATCCAACAGCATCCCACCACCTGTCCGAAACACGGAGGAGATCACTCTGCTGTCACTCTCTCTCTCTCTCCATCTCTCTGTTTCCATTCACAATTCCTTCAAG
AAAATCCCCAAATCCAAAACCCCTTTCTCTCTGCAAATAAATTCCCATGAAATTTCTCCATTGAAATTTGCTCTCTCCTCCGCAATCCGCCATGGATAATTACGCCTACA
ACTCCTACGCCGAGTCCGGCGACTCCTCCCCCCGCTCCCGCGAGATCGACTTCGAAAACCCCCCTCCCTGGGACGACGCCGCCCAGCTCCAGAGCTCCAACTACAAGGTC
AAGTTCATGTGCAGCTATGGCGGCAAGATCCACCCCCGCCCCCACGACAACGTCCTCTCCTACGTCGGCGGCGACACCAAGATCTTCGCCGTCGACCGCTCCATCAAATT
CGCCTCTATGTTGGCTAAGCTCTCCTCCTTCTGCGACGCCGATGTCACCTTCAAGTACCAGCTCCCCGGCGAGGATCTCGACGCCCTGATTTCGGTCACCAACGACGACG
ATCTCGAGCACATGATGCACGAGTACGATCGCCTCTACAGGGCACCCGGGAGGCCGGCGCGGATGCGGCTTTTTCTGTTTCCGGCTAATCAGAGCCCTAGCTTCGCCTCC
GACGGGGCTCGGTCCGATCGGGAGCGATTTGTGGAGGTTTTGAGTTCCGGTCCCTCCCACGGGGCCGATGCGCCGAAGCAATCGGTTCCGAATAAGGTGGATTTTCTGTT
TGGATTGGATAAGGGCGGGGTCGCACCTCCGCCGCCGCCTGTTGCGCTTAAATTGCACGATCCGGTGCCTGAGGCGGTCGCTCCGCCCGTTGAATCTGCTGCTAGACCTG
GTCCGGGAGATCGGATCGCGGTTCCGGATCCTGTGGTTCATCCGGCCGAGATTCAGAGGCAATTGCAGGAATTACAGAGGTTGCATATTAGCGAGCAGGAGCAAGCGGCG
GCGTACAGGAGGAAATCCGAAGAGAATAATCTTCTCGGAGGATACGCCGGCGATTACTACGCGCAGAAGATGATGGAGAAATCTCCACCGGCGAACGCGCAGCCGACCTT
GCAGGCTCCGCCGCCAGGATATTGGCCGGAGAAGCAGGTATCCAGCGGAGGTTTTCAGGCGACGGTGACGGCCACTCCCGGCGCACCGGACCAACAGCCGGTCTACATGA
TCCACGCTCCCGGAACGGTTTATCACGCGCAGCAACATCCGATGGTGAGACCCGTCGCCGCGCCGCCCAATCAAGGCTACTACGCCGTACAGCGCATGGCCTCAGATGTC
TACCGCGACCAACCGGTCTACAATGTCGTACAACAACCTCCACAGCCGCCGTACCCTGCGACGTCATCGCCGTCGCTGCAGCAACAACCACCTCCGAAGGTCGCCGCCTA
TCCCGGCGGCGGCATGACTCTGGCAGCAGACGCAGGGCCGTACACACAGGTGGCGTACGACAGCAGCACCGGGAGACAAGTGTACTACACAGCCGGCGGAGCCGCCATGG
TAGGGCCACCACCGCCTTACCAGGCGGTGAGCGGCGAAATGAGAACAGGAGCGGTGGGGCAGGACGGGAAGCAGCTGATTTCAGCAAAGGTTTCACAAGGCCCAGTCTGA
ATAACGACAATCAGCATGTGGTATAAAAATTTTTGAGCTTTTTTTTCTCTTTCAATATGAATACTGAATTTTTAATTACATGCTATGATGCTGATGCTGCTGCTGCTGCT
CTTTTTGCTTCTCTCTGCCGTTTTCTTGTCTTCTTGGGTAAAAAAATAAGCATATTATGTTGTGTTATCATATGTCTTAATCATCCATCTATCTGGGTCGTCGTCTGTGT
AAAATATGAATATTTTAGAGATGAAAACCATTTTGGTCAGTTCCATCTTTTCTCCATTCCTGTTGGTGCTTAGAGTACCCAAAAAATGACCCAATTTTATGGGTAACCTT
TGAGATTTAAGTAGTCAAATTGGTTGGACATCCCTATATCTCCTCATTGATTGAGAGACTCTACATAGGTTTATAATAGATATTGACTACTCTTCTCATAGCAAATTGAT
TTTGAAGTAGAACCACATAAATATTAATATAATATCATAGCCCAAAAGCTCAAAACAAGCATCCTGTCCAAAAATGGAATCTGATCAAAATCAACGAGCCAAAAGAGCTA
CGATATTGAGGGGCATGACCAATTGATTTTGAGATGAAATCTCATGAATATTAATACGGGGTGAATAGTCACTAGTAGTTTCAGGATGACCAAGAAGACAAAATTTTACC
TTCTAACTTTTAGGTACGCTATATTTTTTTTCCCTTGTTTCGATCTTGCTTATAGAGTGTTGGGTGTGGGGTGTATGAGAATGCTAGTGTTACCACGAGTATTGATTTGA
GGTGCAGAATATATGGATTTGCATGGTTTTAGTTTTGGGTGATAATCGTGGAGATTAGTTGGAATTTGGAAAAGGGCATGCAAATGGAGAATCCTGTCCAATTACCAAGG
CTGTGTGTTCTGATTTGGAGTGTGTGTTGTTTTATTTTACTTTTACCAATTCGATTGTTAAAGAAGCAAAGCAACATCCATCGTCTCCCTCAGAATCCAATTCCAGAGGG
AACACTTCAAAATGAAATCTGAAGAAATCTAAATCCAATCATTACCTTTTTCTCAAGTTGACATGAGAGATTCTACTTCTTTTTTTTTTTTTCTTTTTAATATGCCCATT
ATAATAATTCTATTTTTATTCTTTTTTTCCCTTGTTCTCTTTGGATTAGTTGGGTGTTTGAATAGCTTTTGATTTGAAATAATGTCACATAATTCATTCGACCAAAAGGT
TGGGGTTCGAATTTTCACTTTGGATCTCTGAGGTTGTAGAGATGTATAGGTTATGGATAAAAATATTATGATTTATGATCAAGGAGAGTCTTAATATATTATGATGAACT
TTTGGATTAATAAATGAGTATTTGATTTTACAATTTAATCAACATGTGGATTGATGAAAGCTTAACTTAATTAAAATTGACTCCTTGAGTAAAATGAAATTTTAGTTTAT
AAAAACATAAAACTATTTTTTGTTGGACAAGGACATGTGCAGTGAGAATTCAAATATATAGGATCCCTTTGGTGGGGGAATATTTCTTAACGAGTTGAATTATGTTCGAG
TTGGCTATAAAACTATTCTTTTAACTTGAGTTTTGGTTAAAATTGTATCCTCACAAATTCATTTAATTTTGTTTTTTATTTTTATTGGTGAAAGAATGGGTGCATCCAAA
GTTTTATTGTAGCTGAGTAAGTTCATATGATCCACACTCAAATATCTTATAAGAGACGAAACAGACGTGGCATGTCTTTTATCTAATTTCATATTAACTTAGATTAAAAA
AAAGTGGGTATATCATGTTAACGTTTCTTTAATAATAGACAAAATGAAGGTATAATAACAAGTTCATATATATTATAAAAAAAGTAATTAAAAAACTTGTAGACTGGATA
TTAAAGTAGAGTAACAAAACAGTTTAAATATTATGTTGGTCCTG
Protein sequence
MDNYAYNSYAESGDSSPRSREIDFENPPPWDDAAQLQSSNYKVKFMCSYGGKIHPRPHDNVLSYVGGDTKIFAVDRSIKFASMLAKLSSFCDADVTFKYQLPGEDLDALI
SVTNDDDLEHMMHEYDRLYRAPGRPARMRLFLFPANQSPSFASDGARSDRERFVEVLSSGPSHGADAPKQSVPNKVDFLFGLDKGGVAPPPPPVALKLHDPVPEAVAPPV
ESAARPGPGDRIAVPDPVVHPAEIQRQLQELQRLHISEQEQAAAYRRKSEENNLLGGYAGDYYAQKMMEKSPPANAQPTLQAPPPGYWPEKQVSSGGFQATVTATPGAPD
QQPVYMIHAPGTVYHAQQHPMVRPVAAPPNQGYYAVQRMASDVYRDQPVYNVVQQPPQPPYPATSSPSLQQQPPPKVAAYPGGGMTLAADAGPYTQVAYDSSTGRQVYYT
AGGAAMVGPPPPYQAVSGEMRTGAVGQDGKQLISAKVSQGPV