; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G13035 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G13035
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationctg1838:9377881..9378494
RNA-Seq ExpressionCucsat.G13035
SyntenyCucsat.G13035
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652649.2 histone H3.v1 [Cucumis sativus]9.50e-168100Show/hide
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKR

Query:  DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]3.14e-7357.91Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK           +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]9.84e-7357.45Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK+     DEE          D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]3.14e-7357.5Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK  +D           E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida]4.67e-10371.97Show/hide
Query:  MSNPIQE--------QPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP
        MSN IQE        +P+DPF S FSTLCLN S   AVDPSLCSSC R H RS+ATPMKRP+PTPP          SKNL LD QQP+S  FSKI+LPIP
Subjt:  MSNPIQE--------QPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP

Query:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEH
        F PSV PLRRS+SDPT+ARNFSP    QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI KD+PESKRL+RIKDRLKEMN WWNEVMSEE+  
Subjt:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEH

Query:  NDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
         DE E KK D  +EEEE       DEETVGVERVGDS+ L LKCSCGK F+ILLSGR+CFYKLL
Subjt:  NDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein2.03e-16295.49Show/hide
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK-
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK 
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKK-

Query:  ----------RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
                  RDDEEEEEEE EEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  ----------RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X16.75e-7357.24Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK-----RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK     ++DE           D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK-----RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X21.52e-7357.91Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK           +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X14.76e-7357.45Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK+     DEE          D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKKR----DDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X21.52e-7357.5Show/hide
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WNEVMSE   EEE  DE E KK  +D           E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNEVMSE---EEEHNDEKEIKK--RDDEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein7.5e-0830.68Show/hide
Query:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ
        ++ +P+KRPS   P S+Q       K  +  P++   PN + +SKI LP + F P+   SPL +RSLSD      F+ P+                 Q  
Subjt:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ

Query:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDE----E
        SP      + P  P   RR+VSD +PAP   S           +  +   + S  +K L  IKD ++E++ W N+++   E  +      K+DD     +
Subjt:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDE----E

Query:  EEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        E  ++EE+ K+ +E V V R+G++  +++ C CG+ +  L SGR+C+YKLL
Subjt:  EEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAATCCTATTCAAGAACAGCCTTATGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCCTCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATG
CTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACAACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTG
ATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGG
AATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTGCCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCC
TGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAAGGATCGACTGAAGGAGATGAATCATTGGTGGAACGAAG
TAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACAGTG
GGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATTCTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTAATCCTATTCAAGAACAGCCTTATGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCCTCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATG
CTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACAACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTG
ATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGG
AATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTGCCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCC
TGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAAGGATCGACTGAAGGAGATGAATCATTGGTGGAACGAAG
TAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACAGTG
GGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATTCTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTAG
Protein sequenceShow/hide protein sequence
MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLRRSLSDPTDAR
NFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNEVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEKDDEETV
GVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL