; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G011900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G011900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationCG_Chr09:12432595..12433507
RNA-Seq ExpressionClCG09G011900
SyntenyClCG09G011900
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652649.2 histone H3.v1 [Cucumis sativus]7.6e-8772.62Show/hide
Query:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPSVSPL
        + NP  EQ +DPF S FSTLCLN    SA DP LCSSC R   RS+ATPMKRPSPTP  SQ  ST TTSK L LD QQPNS PFSKI LPIPF PSVSPL
Subjt:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPSVSPL

Query:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H
        RRS+SDPT+ARNFSPP   QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI K       DSPESKRL+RIKDRLKEMN WWNEVMSEEE  +
Subjt:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H

Query:  DEIDTKKRDCWKDGEE-------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        DE + KKRD  ++ EE       DEETVGVERVGDS+ L+LKC CGK F+IL+SGR+CFYKLL
Subjt:  DEIDTKKRDCWKDGEE-------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]4.6e-8469.74Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]7.2e-8569.49Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK+ +C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]2.1e-8469.74Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida]4.2e-10984.5Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPT-PSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPS
        MSNLIQESSE QNPE+ FDPFHSRFSTLCLNPSA DP LCSSCARR PRSAATPMKRP+PT P QHP     SK LFLDHQQP+ST FSKI LPIPFDPS
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPT-PSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPS

Query:  VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEE
        V PLRRSVSDPTEARNFSP P+IQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGK       D+PESKRLRRIKDRLKEMNQWWNEVMSEE
Subjt:  VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEE

Query:  EHDEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        + DE +TKK DC K+ EEDEETVGVERVGDSLAL LKC CGKGFEIL+SGRSCFYKLL
Subjt:  EHDEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein1.0e-8469.96Show/hide
Query:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPSVSPL
        + NP  EQ +DPF S FSTLCLN    SA DP LCSSC R   RS+ATPMKRPSPTP  SQ  ST TTSK L LD QQPNS PFSKI LPIPF PSVSPL
Subjt:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPSVSPL

Query:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H
        RRS+SDPT+ARNFSPP   QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI K       DSPESKRL+RIKDRLKEMN WWNEVMSEEE  +
Subjt:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H

Query:  DEIDTKK-----------RDCWKDGEE------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        DE + KK           RD  ++ EE      DEETVGVERVGDS+ L+LKC CGK F+IL+SGR+CFYKLL
Subjt:  DEIDTKK-----------RDCWKDGEE------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X12.9e-8469.49Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKK-RDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  +C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKK-RDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X22.2e-8469.74Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X13.5e-8569.49Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK+ +C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X21.0e-8469.74Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKI LPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein1.6e-1330.77Show/hide
Query:  SAATPMKRPSPTPSQHPSTTTTSKKLFL----DHQQPNSTPFSKIALP-IPFDPSV--SPL-RRSVSDP----------TEARNFSPPPLIQSPAKRLCL
        +  +P+KRPSP   Q        KKLF+    + + PN   +SKI LP + F+P+   SPL +RS+SD           +    ++   + Q  +     
Subjt:  SAATPMKRPSPTPSQHPSTTTTSKKLFL----DHQQPNSTPFSKIALP-IPFDPSV--SPL-RRSVSDP----------TEARNFSPPPLIQSPAKRLCL

Query:  NSPLPPLP--LRRTVSDPNPSPEKTSDSPIKIGKSTDLII--------EDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRD-----------
           LPP P   RR+VSD +P+P   S     +G S    I        E S  +K L  IKD ++E++QW N+++   E     + K+D           
Subjt:  NSPLPPLP--LRRTVSDPNPSPEKTSDSPIKIGKSTDLII--------EDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRD-----------

Query:  CWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
          +  +E +E V V R+G++  + + CPCG+ ++ L SGR C+YKLL
Subjt:  CWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATCTGATTCAAGAATCTTCCGAACTCCAAAACCCAGAACAAACTTTCGATCCCTTCCATTCTCGTTTCTCCACCCTCTGTCTCAACCCCTCCGCCGGCGACCC
ACCACTCTGTTCTTCATGCGCTCGCCGTCAACCTCGCTCCGCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCACCTCCAAGA
AGCTCTTTCTTGATCATCAACAACCCAATTCCACTCCCTTCTCCAAGATCGCTCTTCCCATTCCTTTTGATCCTTCCGTTTCCCCTCTCCGCCGCTCTGTTTCCGACCCC
ACCGAAGCCCGGAATTTCTCCCCTCCGCCGCTGATTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCC
AAATCCCTCCCCTGAAAAAACTTCCGATTCCCCAATTAAAATTGGGAAATCCACCGATTTGATCATAGAAGACAGCCCCGAATCAAAGAGACTGAGAAGAATCAAGGATC
GATTGAAGGAGATGAATCAATGGTGGAACGAAGTGATGAGTGAAGAAGAACACGATGAAATTGACACAAAAAAGAGGGATTGCTGGAAGGATGGAGAAGAAGATGAGGAA
ACAGTGGGAGTGGAGAGAGTTGGAGATTCATTGGCGCTACGTTTAAAGTGTCCATGTGGGAAAGGATTTGAGATTCTTATTTCTGGAAGAAGCTGTTTCTACAAGCTGCT
GTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAATCTGATTCAAGAATCTTCCGAACTCCAAAACCCAGAACAAACTTTCGATCCCTTCCATTCTCGTTTCTCCACCCTCTGTCTCAACCCCTCCGCCGGCGACCC
ACCACTCTGTTCTTCATGCGCTCGCCGTCAACCTCGCTCCGCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCACCTCCAAGA
AGCTCTTTCTTGATCATCAACAACCCAATTCCACTCCCTTCTCCAAGATCGCTCTTCCCATTCCTTTTGATCCTTCCGTTTCCCCTCTCCGCCGCTCTGTTTCCGACCCC
ACCGAAGCCCGGAATTTCTCCCCTCCGCCGCTGATTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCC
AAATCCCTCCCCTGAAAAAACTTCCGATTCCCCAATTAAAATTGGGAAATCCACCGATTTGATCATAGAAGACAGCCCCGAATCAAAGAGACTGAGAAGAATCAAGGATC
GATTGAAGGAGATGAATCAATGGTGGAACGAAGTGATGAGTGAAGAAGAACACGATGAAATTGACACAAAAAAGAGGGATTGCTGGAAGGATGGAGAAGAAGATGAGGAA
ACAGTGGGAGTGGAGAGAGTTGGAGATTCATTGGCGCTACGTTTAAAGTGTCCATGTGGGAAAGGATTTGAGATTCTTATTTCTGGAAGAAGCTGTTTCTACAAGCTGCT
GTAG
Protein sequenceShow/hide protein sequence
MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIALPIPFDPSVSPLRRSVSDP
TEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRDCWKDGEEDEE
TVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL