; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G044110 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G044110
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationGy14Chr3:40892596..40893602
RNA-Seq ExpressionCsGy3G044110
SyntenyCsGy3G044110
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652649.2 histone H3.v1 [Cucumis sativus]9.89e-168100Show/hide
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR

Query:  DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]3.26e-7357.91Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK           +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]1.02e-7257.45Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK+     DEE          D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]3.26e-7357.5Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK  +D           E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida]4.85e-10371.97Show/hide
Query:  MSNPIQE--------QPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP
        MSN IQE        +P+DPF S FSTLCLN S   AVDPSLCSSC R H RS+ATPMKRP+PTPP          SKNL LD QQP+S  FSKI+LPIP
Subjt:  MSNPIQE--------QPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP

Query:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEH
        F PSV PLRRS+SDPT+ARNFSP    QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI KD+PESKRL+RIKDRLKEMN WWNEVMSEE+  
Subjt:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEH

Query:  NDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
         DE E KK D  +EEEE       DEETVGVERVGDS+ L LKCSCGK F+ILLSGR+CFYKLL
Subjt:  NDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein2.11e-16295.49Show/hide
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK-
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK 
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK-

Query:  ----------RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
                  RDDEEEEEEE EEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  ----------RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X16.99e-7357.24Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK-----RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK     ++DE           D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK-----RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X21.58e-7357.91Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK           +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X14.94e-7357.45Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK+     DEE          D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X21.58e-7357.5Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK  +D           E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein4.4e-0830.68Show/hide
Query:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ
        ++ +P+KRPS   P S+Q       K  +  P++   PN + +SKI LP + F P+   SPL +RSLSD      F+ P+                 Q  
Subjt:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ

Query:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDE----E
        SP      + P  P   RR+VSD +PAP   S           +  +   + S  +K L  IKD ++E++ W N+++   E  +      K+DD     +
Subjt:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDE----E

Query:  EEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        E  ++EE+ K+ +E V V R+G++  +++ C CG+ +  L SGR+C+YKLL
Subjt:  EEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAATCCTATTCAAGAACAGCCTTATGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCCTCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATG
CTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACAACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTG
ATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGG
AATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTGCCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCC
TGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAAGGATCGACTGAAGGAGATGAATCATTGGTGGAACGAAG
TAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACAGTG
GGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATTCTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTAATCCTATTCAAGAACAGCCTTATGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCCTCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATG
CTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACAACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTG
ATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGG
AATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTGCCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCC
TGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAAGGATCGACTGAAGGAGATGAATCATTGGTGGAACGAAG
TAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACAGTG
GGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATTCTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTAG
Protein sequenceShow/hide protein sequence
MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLRRSLSDPTDAR
NFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETV
GVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL