; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G12740 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G12740
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationClcChr09:11928074..11929309
RNA-Seq ExpressionClc09G12740
SyntenyClc09G12740
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011652649.2 histone H3.v1 [Cucumis sativus]4.5e-8772.62Show/hide
Query:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPSVSPL
        + NP  EQ +DPF S FSTLCLN    SA DP LCSSC R   RS+ATPMKRPSPTP  SQ  ST TTSK L LD QQPNS PFSKI+LPIPF PSVSPL
Subjt:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPSVSPL

Query:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H
        RRS+SDPT+ARNFSPP   QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI K       DSPESKRL+RIKDRLKEMN WWNEVMSEEE  +
Subjt:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H

Query:  DEIDTKKRDCWKDGEE-------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        DE + KKRD  ++ EE       DEETVGVERVGDS+ L+LKC CGK F+IL+SGR+CFYKLL
Subjt:  DEIDTKKRDCWKDGEE-------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]5.5e-8570.11Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]1.1e-8569.85Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK+ +C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]2.5e-8570.11Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida]6.4e-11084.88Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPT-PSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPS
        MSNLIQESSE QNPE+ FDPFHSRFSTLCLNPSA DP LCSSCARR PRSAATPMKRP+PT P QHP     SK LFLDHQQP+ST FSKIDLPIPFDPS
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPT-PSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPS

Query:  VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEE
        V PLRRSVSDPTEARNFSP P+IQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGK       D+PESKRLRRIKDRLKEMNQWWNEVMSEE
Subjt:  VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEE

Query:  EHDEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        + DE +TKK DC K+ EEDEETVGVERVGDSLAL LKC CGKGFEIL+SGRSCFYKLL
Subjt:  EHDEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein5.9e-8569.96Show/hide
Query:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPSVSPL
        + NP  EQ +DPF S FSTLCLN    SA DP LCSSC R   RS+ATPMKRPSPTP  SQ  ST TTSK L LD QQPNS PFSKI+LPIPF PSVSPL
Subjt:  LQNP--EQTFDPFHSRFSTLCLN---PSAGDPPLCSSCARRQPRSAATPMKRPSPTP--SQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPSVSPL

Query:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H
        RRS+SDPT+ARNFSPP   QSPAKRLCLNSPLPPLPLRRTVSDPNP+PEKTSDSPIKI K       DSPESKRL+RIKDRLKEMN WWNEVMSEEE  +
Subjt:  RRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEE--H

Query:  DEIDTKK-----------RDCWKDGEE------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        DE + KK           RD  ++ EE      DEETVGVERVGDS+ L+LKC CGK F+IL+SGR+CFYKLL
Subjt:  DEIDTKK-----------RDCWKDGEE------DEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X14.5e-8569.85Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKK-RDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  +C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKK-RDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X22.7e-8570.11Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P  T T+KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS +KTS SP+ IG+  D I EDSP+SKRLR+IKDRLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C K+ E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X15.3e-8669.85Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK+ +C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKR-DCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X21.2e-8570.11Show/hide
Query:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD
        MSNLIQES+E QNPEQ    F SRFSTLCLNP       PPLCSSC RR PR AAT  KR SPT  Q P+ TT  KK  LD +Q N T FSKIDLPIPF 
Subjt:  MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGD---PPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFD

Query:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW
        PS       SPL RSVSDPTEARNFSPP    SPAKRLC NS LPPLPLRRTVSDP PS E+TS+SP+ IG+  D I EDSP+SKRLR+IK+RLKEMN+W
Subjt:  PS------VSPLRRSVSDPTEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQW

Query:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
        WNEVMSE+EH     DE +TKK  C KD E++EETVGVERVGDSL LRLKCPCGKGFEIL+SG SCFYKLL
Subjt:  WNEVMSEEEH-----DEIDTKKRDCWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein1.6e-1330.77Show/hide
Query:  SAATPMKRPSPTPSQHPSTTTTSKKLFL----DHQQPNSTPFSKIDLP-IPFDPSV--SPL-RRSVSDP----------TEARNFSPPPLIQSPAKRLCL
        +  +P+KRPSP   Q        KKLF+    + + PN   +SKI LP + F+P+   SPL +RS+SD           +    ++   + Q  +     
Subjt:  SAATPMKRPSPTPSQHPSTTTTSKKLFL----DHQQPNSTPFSKIDLP-IPFDPSV--SPL-RRSVSDP----------TEARNFSPPPLIQSPAKRLCL

Query:  NSPLPPLP--LRRTVSDPNPSPEKTSDSPIKIGKSTDLII--------EDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRD-----------
           LPP P   RR+VSD +P+P   S     +G S    I        E S  +K L  IKD ++E++QW N+++   E     + K+D           
Subjt:  NSPLPPLP--LRRTVSDPNPSPEKTSDSPIKIGKSTDLII--------EDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRD-----------

Query:  CWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL
          +  +E +E V V R+G++  + + CPCG+ ++ L SGR C+YKLL
Subjt:  CWKDGEEDEETVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATCTGATTCAAGAATCTTCCGAACTCCAAAACCCAGAACAAACTTTCGATCCCTTCCATTCTCGTTTCTCCACCCTCTGTCTCAACCCCTCCGCCGGCGACCC
ACCACTCTGTTCTTCATGCGCTCGCCGTCAACCTCGCTCCGCCGCCACTCCCATGAAACGCCCCTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCACCTCCAAGA
AGCTCTTTCTTGATCATCAACAACCCAATTCCACTCCCTTCTCCAAGATCGATCTTCCCATTCCTTTTGATCCTTCCGTTTCCCCTCTCCGCCGCTCTGTTTCCGACCCC
ACCGAAGCCCGGAATTTCTCCCCTCCGCCGCTGATTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCC
AAATCCCTCCCCTGAAAAAACTTCCGATTCCCCAATTAAAATTGGGAAATCCACCGATTTGATCATAGAAGACAGCCCCGAATCAAAGAGACTGAGAAGAATCAAGGATC
GATTGAAGGAGATGAATCAATGGTGGAACGAAGTGATGAGTGAAGAAGAACACGATGAAATTGACACAAAAAAGAGAGATTGCTGGAAGGATGGAGAAGAAGATGAGGAA
ACAGTGGGAGTGGAGAGAGTTGGAGATTCATTGGCGCTACGTTTAAAGTGTCCATGTGGGAAAGGATTTGAGATTCTTATTTCTGGAAGAAGCTGTTTCTACAAGCTGCT
GTAG
mRNA sequenceShow/hide mRNA sequence
TTTTACTTCGGATTCCAACAACTTCATCATCAACATTCAAATCTTTAGCCGCCATGAGCAATCTGATTCAAGAATCTTCCGAACTCCAAAACCCAGAACAAACTTTCGAT
CCCTTCCATTCTCGTTTCTCCACCCTCTGTCTCAACCCCTCCGCCGGCGACCCACCACTCTGTTCTTCATGCGCTCGCCGTCAACCTCGCTCCGCCGCCACTCCCATGAA
ACGCCCCTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCACCTCCAAGAAGCTCTTTCTTGATCATCAACAACCCAATTCCACTCCCTTCTCCAAGATCGATCTTC
CCATTCCTTTTGATCCTTCCGTTTCCCCTCTCCGCCGCTCTGTTTCCGACCCCACCGAAGCCCGGAATTTCTCCCCTCCGCCGCTGATTCAGTCCCCTGCAAAACGGTTA
TGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCCTCCCCTGAAAAAACTTCCGATTCCCCAATTAAAATTGGGAAATCCACCGA
TTTGATCATAGAAGACAGCCCCGAATCAAAGAGACTGAGAAGAATCAAGGATCGATTGAAGGAGATGAATCAATGGTGGAACGAAGTGATGAGTGAAGAAGAACACGATG
AAATTGACACAAAAAAGAGAGATTGCTGGAAGGATGGAGAAGAAGATGAGGAAACAGTGGGAGTGGAGAGAGTTGGAGATTCATTGGCGCTACGTTTAAAGTGTCCATGT
GGGAAAGGATTTGAGATTCTTATTTCTGGAAGAAGCTGTTTCTACAAGCTGCTGTAGATCAATTTTTTCGATTTTGTGTTTGATTTTCTCAAGTTCTAACGAATACCCAT
TTCAGATTTATCAATTTCTTCTTTCCTACATTCATTTGTTTTCTCGTTTTCCTTTTTGTGTAAACACAATAGTCCATACAAATCAATGTGTTTTCAAGTTTTGTCAAACT
CATGGAGAATATCTCTTATGACTTCTAAAGTTTATCATTTAATTAGTTACTATCAAGTTAGATATTAAGCTTGATACTTGTTGCATTTTTTAAAGGTTACATTTAGC
Protein sequenceShow/hide protein sequence
MSNLIQESSELQNPEQTFDPFHSRFSTLCLNPSAGDPPLCSSCARRQPRSAATPMKRPSPTPSQHPSTTTTSKKLFLDHQQPNSTPFSKIDLPIPFDPSVSPLRRSVSDP
TEARNFSPPPLIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGKSTDLIIEDSPESKRLRRIKDRLKEMNQWWNEVMSEEEHDEIDTKKRDCWKDGEEDEE
TVGVERVGDSLALRLKCPCGKGFEILISGRSCFYKLL