CuGenDBv2

CSPI03G46670 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G46670
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: proline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome location: Chr3:39847054..39848107
RNA-Seq Expression: CSPI03G46670
Synteny: CSPI03G46670
Gene Ontology terms: NA
InterPro domains: NA
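
The genome location above follows a `chromosome:start..end` notation. A minimal Python sketch for splitting such a string into its parts is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, counting both end positions."""
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("Chr3:39847054..39848107")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for this gene covers 1054 positions.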


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.7e-57 | % identity: 57.91
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEE--EEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +E    KRD++ E +++ E  +EE+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEE--EEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_011652649.2 histone H3.v1 [Cucumis sativus] | e value: 3.6e-129 | % identity: 99.22
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKR
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWN+VMSEEEEHNDEKEIKKR
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKR

Query:  DDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        DD EEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  DDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata] | e value: 1.8e-56 | % identity: 57.25
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +EK       +E E ++  +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima] | e value: 1.1e-56 | % identity: 57.4
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +E    KRD+ E +++ E  ++E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida] | e value: 1.1e-80 | % identity: 71.32
Query:  MSNPIQ--------EQPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP
        MSN IQ        E+P+DPF S FSTLCLN    SAVDPSLCSSC R H RS+ATPMKRP+PTPP          SKNL LD QQP+S  FSKI+LPIP
Subjt:  MSNPIQ--------EQPYDPFQS-FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIP

Query:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEH
        F PSV PLRRS+SDPT+ARNFSP    QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI KD+PESKRL+RIKDRLKEMN WWN+VMSEE+  
Subjt:  FPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEH

Query:  NDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
         DE E KK D  +EEEE        DEETVGVERVGDS+ L LKCSCGK F+ILLSGR+CFYKLL
Subjt:  NDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LI25 Uncharacterized protein | e value: 6.9e-126 | % identity: 94.34
Query:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
        MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR
Subjt:  MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLR

Query:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKR
        RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWN+VMSEEEEHNDEKEIKK 
Subjt:  RSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKR

Query:  ---------DDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
                   ++EEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
Subjt:  ---------DDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 | e value: 1.2e-56 | % identity: 57.76
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +E    KRD+ E ++  E  +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X2 | e value: 8.9e-57 | % identity: 57.25
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ +KTS SP+        I++DSP+SKRL++IKDRLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +EK       +E E ++  +E++D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 | e value: 5.2e-57 | % identity: 57.4
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +E    KRD+ E +++ E  ++E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEE-EEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 | e value: 8.9e-57 | % identity: 56.88
Query:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP
        MSN IQE  +P +P Q      FSTLCLN   +    P LCSSC R   R +AT  KR SPT    Q      T+K  LLDP+Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--QPYDPFQS-----FSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW
        PS       SPL RS+SDPT+ARNFSPP    SPAKRLC NS LPPLPLRRTVSDP P+ E+TS+SP+        I++DSP+SKRL++IK+RLKEMN W
Subjt:  PS------VSPLRRSLSDPTDARNFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPI-------KIQKDSPESKRLKRIKDRLKEMNHW

Query:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
        WN+VMS E+EH +EK       +E E ++  ++E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSG +CFYKLL
Subjt:  WNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G32235.1 unknown protein | e value: 4.4e-08 | % identity: 30.8
Query:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ
        ++ +P+KRPS   P S+Q       K  +  P++   PN + +SKI LP + F P+   SPL +RSLSD      F+ P+                 Q  
Subjt:  SSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQ---PNSIPFSKINLP-IPFPPSV--SPL-RRSLSDPTDARNFSPPL-----------------QTQ

Query:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKRDDEEEEEE
        SP      + P  P   RR+VSD +PAP   S           +  +   + S  +K L  IKD ++E++ W NK++   E  +    +K+ D  +  +E
Subjt:  SPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTS-----------DSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKRDDEEEEEE

Query:  --EEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
          ++EE+ K+ +E V V R+G++  +++ C CG+ +  L SGR+C+YKLL
Subjt:  --EEEEEEKDDEETVGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
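
In the alignment blocks above, the unlabeled middle line marks each column: a repeated residue letter for an identity, `+` for a conservative substitution, and a blank for a mismatch or gap. The sketch below counts identities from one Query/midline pair. The function name `percent_identity` is hypothetical, and the search tool behind this page may compute its reported % identity with a different denominator, so treat the result as an approximation.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline shows an identical residue."""
    # Trailing blanks are often stripped from the midline; restore them so
    # both strings cover the same number of columns.
    midline = midline.ljust(len(query))
    if not query or len(midline) != len(query):
        raise ValueError("query and midline must cover the same columns")
    # A letter in the midline marks an identity; '+' and ' ' do not count.
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# Toy example: five identities across seven columns.
print(round(percent_identity("MSNPIQE", "MSN IQ+"), 2))
```

For a multi-block alignment, concatenate the Query lines and the midlines (after removing the `Query:` label and its indentation) before calling the function.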


Sequences
CDS sequence
ATGAGTAATCCTATTCAAGAACAGCCTTACGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCCTCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATG
CTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACAACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTG
ATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGG
AATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTGCCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCC
TGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAAGGATCGACTGAAGGAGATGAATCATTGGTGGAACAAAG
TAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACA
GTGGGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATTCTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTA
G
mRNA sequence
AAAATAGGTGAGATTTTAGATCAATTTCCATTGGCGCCATGAGTAATCCTATTCAAGAACAGCCTTACGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACTCCTCC
TCCTCCTCCGCCGTCGACCCTTCACTCTGTTCTTCATGCTTCCGTCCTCACTCTCGCTCCTCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCCTCTTCTCAACA
ACTCTCCACCGTCACCACTTCCAAGAACCTCCTTCTTGATCCTCAACAACCCAATTCCATCCCCTTCTCCAAGATCAATCTTCCCATTCCTTTTCCTCCCTCTGTTTCCC
CTCTCCGCCGCTCTCTTTCCGACCCCACTGATGCCCGGAATTTCTCCCCTCCCCTACAAACTCAATCCCCGGCAAAGCGATTATGCCTAAACTCACCACTCCCTCCCTTG
CCTCTCCGCCGTACTGTCTCTGACCCAAATCCCGCCCCTGAGAAAACTTCCGATTCCCCTATTAAAATTCAGAAAGACAGCCCTGAATCGAAGAGGCTGAAAAGGATAAA
GGATCGACTGAAGGAGATGAATCATTGGTGGAACAAAGTAATGAGTGAAGAAGAAGAACACAATGATGAAAAGGAGATAAAAAAGAGAGACGATGAAGAAGAAGAAGAAG
AAGAAGAAGAAGAAGAAGAAAAAGATGATGAAGAAACAGTGGGGGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGATATT
CTTCTATCTGGAAGAAACTGCTTCTACAAATTGTTGTAG
Protein sequence
MSNPIQEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTVTTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLRRSLSDPTDAR
NFSPPLQTQSPAKRLCLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQKDSPESKRLKRIKDRLKEMNHWWNKVMSEEEEHNDEKEIKKRDDEEEEEEEEEEEEKDDEET
VGVERVGDSMTLKLKCSCGKRFDILLSGRNCFYKLL
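
The CDS and protein sequences listed above can be cross-checked by translating one into the other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The name `translate_cds` is hypothetical, and the example input is only the first ten codons of the CDS with a stop codon appended for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(cds):
    """Translate a CDS (line breaks allowed) and drop the final stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if not protein.endswith("*"):
        raise ValueError("CDS does not end with a stop codon")
    return protein[:-1]


# First ten codons of the CDS above, plus a stop codon for illustration.
print(translate_cds("ATGAGTAATCCTATTCAAGAACAGCCTTAC" + "TAG"))
```

Passing the full CDS text, wrapped lines included, should yield a string that can be compared directly with the protein sequence once its line breaks are removed.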