CuGenDBv2

Lag0000900 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000900
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LEA_2 domain-containing protein
Genome location: chr4:19237254..19240144
RNA-Seq Expression: Lag0000900
Synteny: Lag0000900
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
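
The genome location string can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and assuming both coordinates are 1-based and inclusive (the page itself does not state the convention). The names `GenomeLocation`, `parse_location`, and `span_length` are illustrative and not part of any CuGenDB API.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    chrom: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


def span_length(loc: GenomeLocation) -> int:
    """Length of the interval, assuming both coordinates are inclusive."""
    return loc.end - loc.start + 1


loc = parse_location("chr4:19237254..19240144")
print(loc.chrom, loc.start, loc.end, span_length(loc))
```

Under the inclusive-coordinate assumption, the gene spans 2,891 bp on chr4.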


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588521.1 hypothetical protein SDJN03_17086, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.7e-98; % identity: 86.73)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LS+S HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P +P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

KAG7022372.1 hypothetical protein SDJN02_16103, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.2e-98; % identity: 86.73)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LS+S HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P  P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

XP_022931679.1 uncharacterized protein LOC111437836 [Cucurbita moschata] (e value: 2.2e-98; % identity: 86.73)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LS+S HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P  P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

XP_022969932.1 uncharacterized protein LOC111468981 [Cucurbita maxima] (e value: 7.7e-99; % identity: 87.17)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LSSS HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P +P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

XP_023529478.1 uncharacterized protein LOC111792323 [Cucurbita pepo subsp. pepo] (e value: 1.3e-98; % identity: 89.04)
Query:  SLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNP
        +LS+S HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P  P  ASLSLNIRM+FTAVNP
Subjt:  SLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNP

Query:  NKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI
        NKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI
Subjt:  NKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVI

Query:  SPRKQSLTYKQCGFDGLNV
        SPRKQSLTYKQCGFDGL+V
Subjt:  SPRKQSLTYKQCGFDGLNV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5H7 LEA_2 domain-containing protein (e value: 3.2e-98; % identity: 92.86)
Query:  HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPNKVGIKYGE
        HP YNPR SSSSSA+FKGCCCCLFLLFSFLALLVLAIVL+VVLALKPKKPQFDLQQV VQY+GIT P    NPTTASLSLNIRMIFTAVNPNKVGIKY E
Subjt:  HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPNKVGIKYGE

Query:  SHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTY
        S FTVMYRG+PLGRASVPGF QDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTY
Subjt:  SHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTY

Query:  KQCGFDGLNV
        KQCGFDGLNV
Subjt:  KQCGFDGLNV

A0A1S3BGU8 uncharacterized protein LOC103489700 (e value: 5.4e-98; % identity: 88.5)
Query:  NGNSLSSSGHR-----HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        N +++SSS        HP YNPR SSSSSA+FKGCCCCLFLLFSFLALLVLAIVL+VVLALKPKKPQFDLQQV VQYMGIT P    NPTTASLSLNIRM
Subjt:  NGNSLSSSGHR-----HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        IFTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGF QDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGLNV
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

A0A6J1EUX4 uncharacterized protein LOC111437836 (e value: 1.1e-98; % identity: 86.73)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LS+S HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P  P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

A0A6J1HXP7 uncharacterized protein LOC111468981 (e value: 3.7e-99; % identity: 87.17)
Query:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM
        S  +   +LSSS HR HP Y+PRSSS SSASFKGCCCCLFLL SFLALLVLA+VL+VVLALKPKKPQFDLQQV VQYMGITTPN P +P  ASLSLNIRM
Subjt:  SSTVNGNSLSSSGHR-HPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRM

Query:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
        +FTAVNPNKVGIKY ES FTVMYRG+PLGRASVPGFVQ+PHSQRQVD T+AVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS
Subjt:  IFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVS

Query:  VDCAIVISPRKQSLTYKQCGFDGLNV
        VDCAIVISPRKQSLTYKQCGFDGL+V
Subjt:  VDCAIVISPRKQSLTYKQCGFDGLNV

A0A6J1I8L5 uncharacterized protein LOC111470648 (e value: 6.6e-96; % identity: 89.15)
Query:  HRHPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPNKVGIKY
        H  P Y PR  SSSSASFKGCCCCLFLLFSFLALL+LA+VL+VVLALKPKKPQFDLQQV +QYM ITTP    NPT ASLSLNIRMIFTAVNPNKVGIKY
Subjt:  HRHPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPNKVGIKY

Query:  GESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSL
        GES FTVMYRG+PLGRAS+PGF QDPHSQRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQVSVDCAIVISPRKQSL
Subjt:  GESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSL

Query:  TYKQCGFDGLNV
        TYKQCGFDGLNV
Subjt:  TYKQCGFDGLNV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 2.4e-05; % identity: 22.57)
Query:  TVNGNSLSSSGHRHPTYNPRSSSSSSASFKGCC--CCLFLLF--SFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIR
        T   N         P Y P +    ++  +GCC  CC + +F    L L+V A   +V L  +P++P F + ++ +  +  T        +   L+  I 
Subjt:  TVNGNSLSSSGHRHPTYNPRSSSSSSASFKGCC--CCLFLLF--SFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIR

Query:  MIFTAVNPNK-VGIKYGESHFTVMYRG-------VPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
        +   A NPNK VG  Y  +  T +Y+        V +G+ ++  F     +   + +TI      L +  A  L  D      V ++I+ +   K+++ +
Subjt:  MIFTAVNPNK-VGIKYGESHFTVMYRG-------VPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS

Query:  FNSP--GVQVSVDCAIVISPRKQSLT
          +P  G++V+ +   V++P  +  T
Subjt:  FNSP--GVQVSVDCAIVISPRKQSLT

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.4e-84; % identity: 70.74)
Query:  PSST---VNGNSLSSSGHRHPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLN
        PSS+   +NG+ +++   +   Y    SSSSSAS KGCCCCLFLLF+FLALLVLA+VL+V+LA+KPKKPQFDLQQV+V YMGI+ P+   +PTTASLSL 
Subjt:  PSST---VNGNSLSSSGHRHPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLN

Query:  IRMIFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGV
        IRM+FTAVNPNKVGI+YGES FTVMY+G+PLGRA+VPGF QD HS + V+ATI+VDRVNL+QA AADL+RDASLNDRVEL + GDV AKIR+++F+SPGV
Subjt:  IRMIFTAVNPNKVGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGV

Query:  QVSVDCAIVISPRKQSLTYKQCGFDGLNV
        QVSV+C I ISPRKQ+L YKQCGFDGL+V
Subjt:  QVSVDCAIVISPRKQSLTYKQCGFDGLNV

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.6e-06; % identity: 23.56)
Query:  KGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTAS--LSLNIRMIFTAVNPNKVGIKYGESHFTVMYRGVPLGR
        + C  C+      + L+ + IV+L     KPK+P   +  V+V  +  +      NP      L+L + +  +  NPN++G  Y  S   + YRG  +G 
Subjt:  KGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTAS--LSLNIRMIFTAVNPNKVGIKYGESHFTVMYRGVPLGR

Query:  ASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGF
        A +P           ++ T+ +    LL      L+ D  +   + L     V  K+ +L      VQ S  C + IS   +++T + C +
Subjt:  ASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQCGF

AT5G53730.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 9.0e-05; % identity: 24.42)
Query:  LFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPN-KVGIKYGESHFTVMYRGVPL-GRASVPGF
        LF  FS     +L I+ LV L L P++P+F L +  +  + +TT       +T  L+ ++++   + NPN KVGI Y +      YRG  +   AS+P F
Subjt:  LFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPN-KVGIKYGESHFTVMYRGVPL-GRASVPGF

Query:  VQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS
         Q       + A +    + + Q+    + R+ S   ++ + +  D   + ++ ++ S   + +V+C  +++
Subjt:  VQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVIS
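
The middle line of each alignment block above encodes the per-column match state. The following is a minimal sketch of how such a line could be tallied, assuming the usual BLAST-style convention (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch or gap). The function name `summarize_midline` is illustrative. The % identity values reported on this page cover whole alignments, so a single block will not generally reproduce them, and leading or trailing spaces in a midline may have been lost in a text dump.

```python
def summarize_midline(midline: str) -> dict:
    """Count identical and conservative columns in a BLAST-style midline.

    Assumed convention: a letter marks an identical residue, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    columns = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / columns, 2) if columns else 0.0
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "percent_identity": percent,
    }


# Midline of the final block of the first GenBank hit (KAG6588521.1).
final_block = "VDCAIVISPRKQSLTYKQCGFDGL+V"
print(summarize_midline(final_block))
```

To approximate a whole-alignment figure, the midlines of all blocks for one hit would need to be concatenated before counting, with internal spacing preserved.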


Sequences
CDS sequence
ATGCCATCTTCAACAGTAAATGGCAACAGCCTGTCGAGCTCCGGCCACCGCCACCCTACGTACAACCCCAGATCATCGTCGTCGTCGTCGGCCTCCTTCAAGGGTTGCTG
CTGCTGCCTCTTCCTCCTCTTCTCCTTCCTAGCCCTCCTCGTCTTGGCCATCGTCCTCCTCGTCGTCCTCGCCCTCAAGCCCAAGAAGCCCCAGTTCGATCTCCAACAGG
TCAGCGTCCAATACATGGGAATCACCACTCCCAACGTTCCCAACAATCCCACCACCGCCTCCCTCTCCCTCAACATTCGAATGATATTCACCGCCGTGAATCCAAACAAG
GTCGGCATCAAATACGGCGAGTCCCACTTCACCGTCATGTACCGAGGAGTTCCTCTCGGCAGAGCCTCCGTTCCCGGCTTCGTCCAAGACCCCCACAGCCAGCGCCAGGT
CGACGCTACCATCGCCGTGGATCGCGTCAATCTCCTCCAGGCCGACGCCGCCGACTTGATCCGCGACGCCTCCTTGAACGACCGTGTCGAGCTCAGAATCCTCGGCGATG
TCGCGGCTAAGATCCGCCTCTTGTCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTAGATTGTGCAATTGTGATCAGTCCAAGAAAGCAGTCTCTCACATACAAGCAATGT
GGTTTTGATGGCTTAAATGTATGA
mRNA sequence
ATGCCATCTTCAACAGTAAATGGCAACAGCCTGTCGAGCTCCGGCCACCGCCACCCTACGTACAACCCCAGATCATCGTCGTCGTCGTCGGCCTCCTTCAAGGGTTGCTG
CTGCTGCCTCTTCCTCCTCTTCTCCTTCCTAGCCCTCCTCGTCTTGGCCATCGTCCTCCTCGTCGTCCTCGCCCTCAAGCCCAAGAAGCCCCAGTTCGATCTCCAACAGG
TCAGCGTCCAATACATGGGAATCACCACTCCCAACGTTCCCAACAATCCCACCACCGCCTCCCTCTCCCTCAACATTCGAATGATATTCACCGCCGTGAATCCAAACAAG
GTCGGCATCAAATACGGCGAGTCCCACTTCACCGTCATGTACCGAGGAGTTCCTCTCGGCAGAGCCTCCGTTCCCGGCTTCGTCCAAGACCCCCACAGCCAGCGCCAGGT
CGACGCTACCATCGCCGTGGATCGCGTCAATCTCCTCCAGGCCGACGCCGCCGACTTGATCCGCGACGCCTCCTTGAACGACCGTGTCGAGCTCAGAATCCTCGGCGATG
TCGCGGCTAAGATCCGCCTCTTGTCCTTCAATTCCCCCGGCGTTCAGGTTTCAGTAGATTGTGCAATTGTGATCAGTCCAAGAAAGCAGTCTCTCACATACAAGCAATGT
GGTTTTGATGGCTTAAATGTATGA
Protein sequence
MPSSTVNGNSLSSSGHRHPTYNPRSSSSSSASFKGCCCCLFLLFSFLALLVLAIVLLVVLALKPKKPQFDLQQVSVQYMGITTPNVPNNPTTASLSLNIRMIFTAVNPNK
VGIKYGESHFTVMYRGVPLGRASVPGFVQDPHSQRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQVSVDCAIVISPRKQSLTYKQC
GFDGLNV
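
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are illustrative, and the two short inputs are the first and last 24 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Codons containing ambiguous
    bases (for example N) raise KeyError in this sketch.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 24 nucleotides of the CDS listed above.
print(translate("ATGCCATCTTCAACAGTAAATGGC"))
print(translate("GGTTTTGATGGCTTAAATGTATGA"))
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record.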