CuGenDBv2

Pay0021030 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0021030
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: proline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome location: chr04:831521..832517
RNA-Seq Expression: Pay0021030
Synteny: Pay0021030
Gene Ontology terms: NA
InterPro domains: NA
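
The genome location field above can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper written for illustration, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("chr04:831521..832517")
# Span length under the ASSUMED 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 997 bp on chr04; with a 0-based, half-open convention the same string would give 996 bp.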


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 2.8e-52 | identity: 57.65%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT    Q  TA T K+ LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ +K+S SP+         ++DSPDSKRLR+IKDRLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD + E +++ E  ++E+D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

XP_011652649.2 histone H3.v1 [Cucumis sativus] | e-value: 2.9e-110 | identity: 89.31%
Query:  MSNPIQEHPYDPFQSFSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPP-SQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFPPSVSPL
        MSNPIQE PYDPFQSFSTLCL NSSSSSAVDPSLCSSCFRPHSRS+ATPMKRPSPTPP SQQ ST  TSKNLLLD QQPNSIPFSKINLPIPFPPSVSPL
Subjt:  MSNPIQEHPYDPFQSFSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPP-SQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFPPSVSPL

Query:  RRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEEEKHDDEME
        RRSLSDPTDA NFS PP  TQSPAKRLCLNSPLPPLPLRRTVSD  PNPAPEK+SDSPIK QKDSP+SKRL+RIKDRLKEMN WWNEVMSEEE+H+DE E
Subjt:  RRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEEEKHDDEME

Query:  TKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
         KKRD+EEEEEEEEEE   EKDDEETVGVERVGDSMTLKLKCSCGKRF+ILLSGRNCFYKLL
Subjt:  TKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima] | e-value: 1.1e-51 | identity: 57.3%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT   Q    A T+K  LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ E++S+SP+         ++DSPDSKRLR+IK+RLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD E E +++ E  + E+D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima] | e-value: 8.1e-52 | identity: 56.94%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT   Q    A T+K  LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ E++S+SP+         ++DSPDSKRLR+IK+RLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD  E ++  ++EE    D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida] | e-value: 7.0e-80 | identity: 71.48%
Query:  MSNPIQ--------EHPYDPFQS-FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIP
        MSN IQ        E P+DPF S FSTLCLN     SAVDPSLCSSC R H RS ATPMKRP+PTPP Q P     SKNL LDHQQP+S  FSKI+LPIP
Subjt:  MSNPIQ--------EHPYDPFQS-FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIP

Query:  FPPSVSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEE
        F PSV PLRRS+SDPT+A NFSP P   QSPAKRLCLNSPLPPLPLRRTVSD  PNP+PEK+SDSPIK  KD+P+SKRLRRIKDRLKEMN+WWNEVMSEE
Subjt:  FPPSVSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEE

Query:  EKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        +   DE ETKK D  +EEEE          DEETVGVERVGDS+ L LKCSCGK FEILLSGR+CFYKLL
Subjt:  EKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0LI25 Uncharacterized protein | e-value: 1.5e-107 | identity: 85.5%
Query:  MSNPIQEHPYDPFQSFSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPP-SQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFPPSVSPL
        MSNPIQE PYDPFQSFSTLCL NSSSSSAVDPSLCSSCFRPHSRS+ATPMKRPSPTPP SQQ ST  TSKNLLLD QQPNSIPFSKINLPIPFPPSVSPL
Subjt:  MSNPIQEHPYDPFQSFSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPP-SQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFPPSVSPL

Query:  RRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEEEKHDDEME
        RRSLSDPTDA NFS PP  TQSPAKRLCLNSPLPPLPLRRTVSD  PNPAPEK+SDSPIK QKDSP+SKRL+RIKDRLKEMN WWNEVMSEEE+H+DE E
Subjt:  RRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEEEKHDDEME

Query:  TKKR-------DNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
         KK        + + ++EEEEEEEE+EKDDEETVGVERVGDSMTLKLKCSCGKRF+ILLSGRNCFYKLL
Subjt:  TKKR-------DNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 | e-value: 4.3e-51 | identity: 57.65%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT    Q  TA T K+ LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ +K+S SP+         ++DSPDSKRLR+IKDRLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD E E ++  E  ++++D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X2 | e-value: 1.1e-51 | identity: 57.65%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT    Q  TA T K+ LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ +K+S SP+         ++DSPDSKRLR+IKDRLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD  E ++  +E+E    D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 | e-value: 5.1e-52 | identity: 57.3%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT   Q    A T+K  LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ E++S+SP+         ++DSPDSKRLR+IK+RLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD E E +++ E  + E+D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 | e-value: 3.9e-52 | identity: 56.94%
Query:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP
        MSN IQE   P +P Q      FSTLCLN   +     P LCSSC R   R  AT  KR SPT   Q    A T+K  LLD +Q N   FSKI+LPIPF 
Subjt:  MSNPIQE--HPYDPFQS-----FSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFP

Query:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM
        PS       SPL RS+SDPT+A NFSPP     SPAKRLC NS LPPLPLRRTVSD  P P+ E++S+SP+         ++DSPDSKRLR+IK+RLKEM
Subjt:  PS------VSPLRRSLSDPTDACNFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPI-------KFQKDSPDSKRLRRIKDRLKEM

Query:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        N+WWNEVMSE+E H++E    KRD  E ++  ++EE    D+EETVGVERVGDS+ L+LKC CGK FEILLSG +CFYKLL
Subjt:  NKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL

SwissProt top hits (hit; e-value; % identity; alignment)
No hits found

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT2G32235.1 unknown protein | e-value: 5.5e-06 | identity: 31.29%
Query:  EHPYDPFQ---SFSTLCLNNSSSSS------AVDPSLCSSCFRPHSRSTAT--PMKRPSPTPPSQQPSTAPTSKNLLL----DHQQPNSIPFSKINLP-I
        ++ YDP +     S L LN+  +SS      +  P   S      S +TAT  P+KRPS   P  +    P  K L +    + + PN + +SKI LP +
Subjt:  EHPYDPFQ---SFSTLCLNNSSSSS------AVDPSLCSSCFRPHSRSTAT--PMKRPSPTPPSQQPSTAPTSKNLLL----DHQQPNSIPFSKINLP-I

Query:  PFPPSV--SPL-RRSLSD----PTDACNFSPPPPHTQSPAKRLCL----NSP-LPPLP--LRRTVSDPNPNPAPE------KSSDSP---IKFQKDSPDS
         F P+   SPL +RSLSD    P  +   S    +T++   +       N P LPP P   RR+VSD +P P+ +      +S+  P   +   + S  +
Subjt:  PFPPSV--SPL-RRSLSD----PTDACNFSPPPPHTQSPAKRLCL----NSP-LPPLP--LRRTVSDPNPNPAPE------KSSDSP---IKFQKDSPDS

Query:  KRLRRIKDRLKEMNKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
        K L  IKD ++E+++W N+++   E        K+ D+ +  +E  ++EE+ K+ +E V V R+G++  +++ C CG+ ++ L SGR+C+YKLL
Subjt:  KRLRRIKDRLKEMNKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEKDDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL


Sequences
CDS sequence
ATGAGTAATCCTATTCAAGAACACCCTTACGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACAACTCCTCCTCCTCCTCCGCTGTCGACCCTTCACTCTGTTCTTC
ATGCTTCCGTCCTCACTCTCGCTCCACCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCGTCTCAACAACCCTCCACCGCCCCCACTTCCAAGAACCTCCTTCTTG
ATCATCAACAACCCAATTCCATCCCTTTCTCCAAGATTAATCTCCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACCGATGCCTGC
AATTTCTCCCCTCCTCCGCCGCATACTCAATCCCCGGCAAAACGATTATGCCTAAACTCACCACTCCCTCCCCTGCCTCTCCGCCGTACTGTCTCTGACCCAAACCCAAA
CCCCGCCCCTGAAAAATCTTCCGATTCCCCAATTAAATTTCAGAAAGACAGCCCTGACTCGAAGAGGCTGAGAAGAATTAAGGATCGACTGAAGGAGATGAATAAGTGGT
GGAACGAAGTAATGAGTGAAGAAGAAAAACACGATGATGAAATGGAGACGAAAAAGAGAGACAATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAAAAA
GATGATGAAGAAACAGTGGGAGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGAGATTCTTCTATCTGGAAGAAACTGCTT
CTACAAATTGTTGTAG
mRNA sequence
ATGAGTAATCCTATTCAAGAACACCCTTACGACCCTTTCCAATCCTTCTCCACTCTCTGTCTCAACAACTCCTCCTCCTCCTCCGCTGTCGACCCTTCACTCTGTTCTTC
ATGCTTCCGTCCTCACTCTCGCTCCACCGCCACTCCCATGAAACGCCCCTCCCCCACGCCCCCGTCTCAACAACCCTCCACCGCCCCCACTTCCAAGAACCTCCTTCTTG
ATCATCAACAACCCAATTCCATCCCTTTCTCCAAGATTAATCTCCCCATTCCTTTTCCTCCCTCTGTTTCCCCTCTCCGCCGCTCTCTTTCCGACCCCACCGATGCCTGC
AATTTCTCCCCTCCTCCGCCGCATACTCAATCCCCGGCAAAACGATTATGCCTAAACTCACCACTCCCTCCCCTGCCTCTCCGCCGTACTGTCTCTGACCCAAACCCAAA
CCCCGCCCCTGAAAAATCTTCCGATTCCCCAATTAAATTTCAGAAAGACAGCCCTGACTCGAAGAGGCTGAGAAGAATTAAGGATCGACTGAAGGAGATGAATAAGTGGT
GGAACGAAGTAATGAGTGAAGAAGAAAAACACGATGATGAAATGGAGACGAAAAAGAGAGACAATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAAAAA
GATGATGAAGAAACAGTGGGAGTGGAAAGAGTTGGAGATTCAATGACACTAAAATTGAAGTGCTCATGTGGGAAGCGATTTGAGATTCTTCTATCTGGAAGAAACTGCTT
CTACAAATTGTTGTAG
Protein sequence
MSNPIQEHPYDPFQSFSTLCLNNSSSSSAVDPSLCSSCFRPHSRSTATPMKRPSPTPPSQQPSTAPTSKNLLLDHQQPNSIPFSKINLPIPFPPSVSPLRRSLSDPTDAC
NFSPPPPHTQSPAKRLCLNSPLPPLPLRRTVSDPNPNPAPEKSSDSPIKFQKDSPDSKRLRRIKDRLKEMNKWWNEVMSEEEKHDDEMETKKRDNEEEEEEEEEEEEKEK
DDEETVGVERVGDSMTLKLKCSCGKRFEILLSGRNCFYKLL
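
The relationship between the CDS and protein sequences listed above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 18 nt of the CDS sequence in this record.
print(translate("ATGAGTAATCCTATTCAAGAACACCCTTAC"))
print(translate("TTCTACAAATTGTTGTAG"))
```

The two fragments translate to the beginning (MSNPIQEHPY) and end (FYKLL) of the listed protein sequence. Pasting the full CDS into `translate` would allow the same comparison over the whole record.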