; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0033055 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0033055
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationchr11:40411641..40412599
RNA-Seq ExpressionLag0033055
SyntenyLag0033055
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia]1.4e-9171.58Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDPT            +   +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS DKTS SP+  GRVNDSIKE+SPDSKRLR+IKDRLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE RD  +++ETKK   C  +EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

XP_022930994.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita moschata]2.3e-9171.58Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDPT            +   +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS DKTS SP+  GRVNDSIKE+SPDSKRLR+IKDRLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   C  ++EDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]2.3e-9172.56Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDPT            +   +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS DKTS SP+  GRVNDSIKE+SPDSKRLR+IKDRLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCKE-EDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   CKE EDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCKE-EDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]4.0e-9171.94Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDP  T KK           +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS ++TSESP+  GRVNDSIKE+SPDSKRLR+IK+RLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   C   EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]8.9e-9172.56Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDP  T KK           +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS ++TSESP+  GRVNDSIKE+SPDSKRLR+IK+RLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCK-EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   CK EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCK-EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein3.1e-6557.61Show/hide
Query:  EEHFDPFDSRFSMLCLNNPSAVDGAAGHPPLCSSCGRRQPLSAATPMKRRSPTP---------------FQDPTN-KKLAFSKIDLPIPFGPSSAQPTPF
        E+ +DPF S FS LCLN+ S+   +A  P LCSSC R    S+ATPMKR SPTP                 DP     + FSKI+LPIPF PS       
Subjt:  EEHFDPFDSRFSMLCLNNPSAVDGAAGHPPLCSSCGRRQPLSAATPMKRRSPTP---------------FQDPTN-KKLAFSKIDLPIPFGPSSAQPTPF

Query:  SPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEMNQWWNEVMSEEE
        SPLRRS+SDPT+ARNFSPP  T SPAKR C NS LPPLPLRRTVSDPNP+P+KTS+SPIK       I+++SP+SKRL+RIKDRLKEMN WWNEVMSEEE
Subjt:  SPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEMNQWWNEVMSEEE

Query:  HEEENRDTIDE------------DETKKSERCKEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
           + ++   E            DE ++ E  +E+D+EETVGVERVGDS+ L+LKC CGK F+ILLSGRNCFYKLL
Subjt:  HEEENRDTIDE------------DETKKSERCKEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X11.1e-9171.58Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDPT            +   +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS DKTS SP+  GRVNDSIKE+SPDSKRLR+IKDRLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   C  ++EDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X21.1e-9172.56Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDPT            +   +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDPT------------NKKLAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS DKTS SP+  GRVNDSIKE+SPDSKRLR+IKDRLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCKE-EDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   CKE EDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCKE-EDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X11.9e-9171.94Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDP  T KK           +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS ++TSESP+  GRVNDSIKE+SPDSKRLR+IK+RLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   C   EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERC--KEEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X24.3e-9172.56Show/hide
Query:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI
        MS+ IQE ++ QNPE+    FDSRFS LCLN      G   H  PPLCSSCGRR P  AAT  KRRSPT  QDP  T KK           +FSKIDLPI
Subjt:  MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGH--PPLCSSCGRRQPLSAATPMKRRSPTPFQDP--TNKK----------LAFSKIDLPI

Query:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM
        PFGPSSA PTPFSPL RSVSDPTEARNFSP    PSPAKR CPNS+LPPLPLRRTVSDP PS ++TSESP+  GRVNDSIKE+SPDSKRLR+IK+RLKEM
Subjt:  PFGPSSAQPTPFSPLRRSVSDPTEARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEM

Query:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCK-EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL
        N+WWNEVMSE+EHEEE R   DE+ETKK   CK EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSG +CFYKLL
Subjt:  NQWWNEVMSEEEHEEENRDTIDEDETKKSERCK-EEDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein8.8e-1231.49Show/hide
Query:  QEPSQSQNPEEHFDPFDSRFSMLCLN---NPSAVDGAAGHPP---LCSSCGRRQPLSAATPMKRRSPTPFQ--DPTNKKL--------------AFSKID
        QE   S +PEE     D   S+L LN   N S    A   PP      S G     +  +P+KR SP   Q  +P  KKL               +SKI 
Subjt:  QEPSQSQNPEEHFDPFDSRFSMLCLN---NPSAVDGAAGHPP---LCSSCGRRQPLSAATPMKRRSPTPFQ--DPTNKKL--------------AFSKID

Query:  LP-IPFGPSSAQPTPFSPL-RRSVSDPTEARNFSPPP-----------LTPSPAKRFCPNS----SLPPLP--LRRTVSDPNPSPDKTSESPIKTGRVND
        LP + F P+  +    SPL +RS+SD      F+ P               S A+   P S    SLPP P   RR+VSD +P+P  +S+S + + R N 
Subjt:  LP-IPFGPSSAQPTPFSPL-RRSVSDPTEARNFSPPP-----------LTPSPAKRFCPNS----SLPPLP--LRRTVSDPNPSPDKTSESPIKTGRVND

Query:  ------SIKENSPDSKRLRRIKDRLKEMNQWWNEVMSEEEHEEENRDTIDEDETKKSERCKEEDE----EETVGVERVGDSLELRLKCPCGKGFEILLSG
              +  E+S  +K L  IKD ++E++QW N+++   E         D+      E  ++E++    +E V V R+G++  + + CPCG+ ++ L SG
Subjt:  ------SIKENSPDSKRLRRIKDRLKEMNQWWNEVMSEEEHEEENRDTIDEDETKKSERCKEEDE----EETVGVERVGDSLELRLKCPCGKGFEILLSG

Query:  RNCFYKLL
        R+C+YKLL
Subjt:  RNCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCAGATTCAAGAACCATCCCAATCCCAAAACCCAGAAGAGCATTTCGATCCTTTCGATTCCCGCTTCTCCATGCTCTGCCTCAACAACCCCTCCGCCGTCGA
CGGCGCCGCCGGCCACCCTCCACTCTGTTCTTCATGCGGCCGCCGTCAACCTCTCTCCGCCGCCACTCCCATGAAACGACGCTCCCCAACGCCATTTCAAGACCCCACCA
ACAAGAAGCTCGCCTTCTCCAAGATCGATCTCCCAATTCCTTTTGGGCCTTCTTCGGCCCAGCCCACTCCCTTCTCCCCTCTCCGCCGCTCTGTTTCCGACCCCACCGAA
GCCCGGAATTTCTCCCCTCCGCCGCTCACTCCGTCGCCGGCCAAGCGCTTCTGTCCCAACTCCTCGCTCCCGCCGCTGCCCCTCCGCCGCACCGTCTCCGACCCAAATCC
CTCCCCTGACAAAACTTCCGAATCCCCAATCAAAACCGGGAGAGTCAATGATTCGATCAAAGAAAACAGCCCCGATTCAAAGAGGCTGAGAAGAATCAAGGATCGACTGA
AGGAGATGAATCAATGGTGGAACGAAGTCATGAGCGAAGAAGAACACGAAGAAGAAAACAGAGATACCATCGATGAAGACGAAACCAAAAAGAGTGAGCGTTGCAAGGAA
GAAGACGAGGAAGAAACTGTGGGAGTGGAGAGAGTGGGGGATTCGTTGGAGCTTCGTTTGAAGTGCCCCTGTGGAAAAGGATTTGAAATCCTTCTTTCTGGGAGAAATTG
TTTCTACAAGCTGCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCAGATTCAAGAACCATCCCAATCCCAAAACCCAGAAGAGCATTTCGATCCTTTCGATTCCCGCTTCTCCATGCTCTGCCTCAACAACCCCTCCGCCGTCGA
CGGCGCCGCCGGCCACCCTCCACTCTGTTCTTCATGCGGCCGCCGTCAACCTCTCTCCGCCGCCACTCCCATGAAACGACGCTCCCCAACGCCATTTCAAGACCCCACCA
ACAAGAAGCTCGCCTTCTCCAAGATCGATCTCCCAATTCCTTTTGGGCCTTCTTCGGCCCAGCCCACTCCCTTCTCCCCTCTCCGCCGCTCTGTTTCCGACCCCACCGAA
GCCCGGAATTTCTCCCCTCCGCCGCTCACTCCGTCGCCGGCCAAGCGCTTCTGTCCCAACTCCTCGCTCCCGCCGCTGCCCCTCCGCCGCACCGTCTCCGACCCAAATCC
CTCCCCTGACAAAACTTCCGAATCCCCAATCAAAACCGGGAGAGTCAATGATTCGATCAAAGAAAACAGCCCCGATTCAAAGAGGCTGAGAAGAATCAAGGATCGACTGA
AGGAGATGAATCAATGGTGGAACGAAGTCATGAGCGAAGAAGAACACGAAGAAGAAAACAGAGATACCATCGATGAAGACGAAACCAAAAAGAGTGAGCGTTGCAAGGAA
GAAGACGAGGAAGAAACTGTGGGAGTGGAGAGAGTGGGGGATTCGTTGGAGCTTCGTTTGAAGTGCCCCTGTGGAAAAGGATTTGAAATCCTTCTTTCTGGGAGAAATTG
TTTCTACAAGCTGCTGTAG
Protein sequenceShow/hide protein sequence
MSDQIQEPSQSQNPEEHFDPFDSRFSMLCLNNPSAVDGAAGHPPLCSSCGRRQPLSAATPMKRRSPTPFQDPTNKKLAFSKIDLPIPFGPSSAQPTPFSPLRRSVSDPTE
ARNFSPPPLTPSPAKRFCPNSSLPPLPLRRTVSDPNPSPDKTSESPIKTGRVNDSIKENSPDSKRLRRIKDRLKEMNQWWNEVMSEEEHEEENRDTIDEDETKKSERCKE
EDEEETVGVERVGDSLELRLKCPCGKGFEILLSGRNCFYKLL