; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002407 (gene) of Snake gourd v1 genome

Gene IDTan0002407
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationLG07:71541625..71542598
RNA-Seq ExpressionTan0002407
SyntenyTan0002407
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia]4.5e-8771.48Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DPTATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS D T + SP+ IGR NDSIKE+SPDSKRLR++KDRLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDE-NDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDE N+TKK+   CK EED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDE-NDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

XP_022930994.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita moschata]7.0e-8871.74Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DPTATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS D T + SP+ IGR NDSIKE+SPDSKRLR++KDRLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKK---RCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK    CKE ED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKK---RCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]1.7e-8972.53Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DPTATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS D T + SP+ IGR NDSIKE+SPDSKRLR++KDRLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKRCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK CKE ED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKRCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]1.2e-8771.01Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DP ATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS + T +ESP+ IGR NDSIKE+SPDSKRLR++K+RLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK+   CK EED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]1.1e-8871.79Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DP ATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS + T +ESP+ IGR NDSIKE+SPDSKRLR++K+RLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKRCK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK CK EED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKRCK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein6.0e-6155.75Show/hide
Query:  MSNLIQESPEEQFEPFDSRFSTLCLNPSAVDGGAHHPPLCSSCARRPPRS----MKRRSPTPFED---PTATTKKLFLDHQQQHNPTSFSKIDLPIPFVP
        MSN IQE P   ++PF S FSTLCLN S+    A  P LCSSC R   RS    MKR SPTP       T TT K  L   QQ N   FSKI+LPIPF P
Subjt:  MSNLIQESPEEQFEPFDSRFSTLCLNPSAVDGGAHHPPLCSSCARRPPRS----MKRRSPTPFED---PTATTKKLFLDHQQQHNPTSFSKIDLPIPFVP

Query:  SSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKEMN
                S+SPLRRS+SDPT+AR FSPP  T SPAKRLC NS   PLPLRRTVSDPNP+P+ T ++SPI+       I+++SP+SKRL+R+KDRLKEMN
Subjt:  SSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKEMN

Query:  QWWNKLMSEEEQ-----------------EDRNRDENDTKKRCKEEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
         WWN++MSEEE+                 E +  DE + ++  +E+DDEETVGVERVGDS+ L+LKC CG+ F+ILLSGRNCFYKLL
Subjt:  QWWNKLMSEEEQ-----------------EDRNRDENDTKKRCKEEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X13.4e-8871.74Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DPTATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS D T + SP+ IGR NDSIKE+SPDSKRLR++KDRLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKK---RCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK    CKE ED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKK---RCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X28.1e-9072.53Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DPTATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS D T + SP+ IGR NDSIKE+SPDSKRLR++KDRLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKRCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK CKE ED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKRCKE-EDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X15.8e-8871.01Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DP ATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS + T +ESP+ IGR NDSIKE+SPDSKRLR++K+RLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK+   CK EED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKR---CK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X25.2e-8971.79Show/hide
Query:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF
        MSNLIQES E Q    + FDSRFSTLCLNP    GG HH  PPLCSSC RRPPR      KRRSPT  +DP ATTKK  LD  +QHN TSFSKIDLPIPF
Subjt:  MSNLIQESPEEQ---FEPFDSRFSTLCLNPSAVDGGAHH--PPLCSSCARRPPR----SMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPF

Query:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE
         PSSA   P   SPL RSVSDPTEAR FSP    PSPAKRLC NS   PLPLRRTVSDP PS + T +ESP+ IGR NDSIKE+SPDSKRLR++K+RLKE
Subjt:  VPSSAQSQPISLSPLRRSVSDPTEARKFSPPPLTPSPAKRLCSNS---PLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKE

Query:  MNQWWNKLMSEEEQEDRNRDENDTKKRCK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
        MN+WWN++MSE+E E+  RDEN+TKK CK EED+EETVGVERVGDSLELRLKCPCG+GFEILLSG +CFYKLL
Subjt:  MNQWWNKLMSEEEQEDRNRDENDTKKRCK-EEDDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein1.9e-1129.6Show/hide
Query:  SSCARRPPRSMKRRSPTPFEDPTATTKKLFL---DHQQQHNPTSFSKIDLP-IPFVPSSAQSQPISLSPLRRSVSDPTEA---------RKFSPPPLTPS
        SS        +KR SP   +      KKLF+   + ++  N   +SKI LP + F P+  +S P+    L  + + P  +          + S    T  
Subjt:  SSCARRPPRSMKRRSPTPFEDPTATTKKLFL---DHQQQHNPTSFSKIDLP-IPFVPSSAQSQPISLSPLRRSVSDPTEA---------RKFSPPPLTPS

Query:  PAKRLCSNSPLP--LRRTVSDPNPSPDITFAESPIQIGRAN-----DSIKEESPD-SKRLRRMKDRLKEMNQWWNKLMSEEEQEDRNRDENDTKKRCKEE
        P+  + S  P P   RR+VSD +P+P    ++S +   R+N     D    ES D +K L  +KD ++E++QW NKL+   E       + D   +  +E
Subjt:  PAKRLCSNSPLP--LRRTVSDPNPSPDITFAESPIQIGRAN-----DSIKEESPD-SKRLRRMKDRLKEMNQWWNKLMSEEEQEDRNRDENDTKKRCKEE

Query:  ---------DDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL
                 + +E V V R+G++  + + CPCGR ++ L SGR+C+YKLL
Subjt:  ---------DDEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATCTGATTCAAGAATCACCAGAAGAGCAATTCGAACCCTTCGATTCCCGCTTCTCCACGCTCTGCCTCAACCCCTCCGCCGTCGACGGCGGCGCCCACCACCC
TCCACTCTGTTCTTCATGCGCCCGCCGTCCACCTCGCTCCATGAAACGACGCTCCCCGACGCCGTTTGAAGACCCCACCGCCACCACCAAGAAGCTCTTTCTTGATCATC
AACAACAACACAATCCCACTTCCTTCTCCAAGATCGATCTCCCAATTCCTTTTGTGCCTTCTTCAGCCCAATCCCAGCCCATTTCCCTCTCCCCTCTCCGCCGCTCTGTT
TCCGACCCGACCGAAGCCCGTAAGTTCTCCCCTCCACCGCTCACTCCGTCGCCGGCCAAACGCTTATGTTCCAACTCCCCGCTCCCCCTCCGGCGCACGGTCTCGGACCC
AAATCCCTCCCCTGACATAACCTTCGCCGAATCCCCAATTCAAATTGGCAGAGCCAATGATTCGATCAAAGAAGAAAGCCCCGATTCAAAGAGGCTGAGAAGAATGAAGG
ATCGATTGAAGGAGATGAATCAATGGTGGAACAAATTGATGAGTGAAGAAGAACAGGAAGATAGAAACAGAGATGAAAATGACACCAAAAAGCGTTGCAAGGAAGAAGAT
GATGAAGAAACAGTGGGAGTGGAGAGGGTGGGAGATTCATTGGAGCTTCGTTTGAAGTGTCCCTGTGGGAGAGGCTTTGAAATTCTTCTTTCTGGAAGAAACTGTTTCTA
CAAGCTGCTCTAG
mRNA sequenceShow/hide mRNA sequence
GTCATCAATCGATCTTTGAATCTCTTGGCGCCATGAGCAATCTGATTCAAGAATCACCAGAAGAGCAATTCGAACCCTTCGATTCCCGCTTCTCCACGCTCTGCCTCAAC
CCCTCCGCCGTCGACGGCGGCGCCCACCACCCTCCACTCTGTTCTTCATGCGCCCGCCGTCCACCTCGCTCCATGAAACGACGCTCCCCGACGCCGTTTGAAGACCCCAC
CGCCACCACCAAGAAGCTCTTTCTTGATCATCAACAACAACACAATCCCACTTCCTTCTCCAAGATCGATCTCCCAATTCCTTTTGTGCCTTCTTCAGCCCAATCCCAGC
CCATTTCCCTCTCCCCTCTCCGCCGCTCTGTTTCCGACCCGACCGAAGCCCGTAAGTTCTCCCCTCCACCGCTCACTCCGTCGCCGGCCAAACGCTTATGTTCCAACTCC
CCGCTCCCCCTCCGGCGCACGGTCTCGGACCCAAATCCCTCCCCTGACATAACCTTCGCCGAATCCCCAATTCAAATTGGCAGAGCCAATGATTCGATCAAAGAAGAAAG
CCCCGATTCAAAGAGGCTGAGAAGAATGAAGGATCGATTGAAGGAGATGAATCAATGGTGGAACAAATTGATGAGTGAAGAAGAACAGGAAGATAGAAACAGAGATGAAA
ATGACACCAAAAAGCGTTGCAAGGAAGAAGATGATGAAGAAACAGTGGGAGTGGAGAGGGTGGGAGATTCATTGGAGCTTCGTTTGAAGTGTCCCTGTGGGAGAGGCTTT
GAAATTCTTCTTTCTGGAAGAAACTGTTTCTACAAGCTGCTCTAG
Protein sequenceShow/hide protein sequence
MSNLIQESPEEQFEPFDSRFSTLCLNPSAVDGGAHHPPLCSSCARRPPRSMKRRSPTPFEDPTATTKKLFLDHQQQHNPTSFSKIDLPIPFVPSSAQSQPISLSPLRRSV
SDPTEARKFSPPPLTPSPAKRLCSNSPLPLRRTVSDPNPSPDITFAESPIQIGRANDSIKEESPDSKRLRRMKDRLKEMNQWWNKLMSEEEQEDRNRDENDTKKRCKEED
DEETVGVERVGDSLELRLKCPCGRGFEILLSGRNCFYKLL