CuGenDBv2

HG10010175 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10010175
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: CASP-like protein
Genome location: Chr06:19294928..19296705
RNA-Seq Expression: HG10010175
Synteny: HG10010175
Gene Ontology terms:
    GO:0005886 - plasma membrane (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
    IPR006702 - Casparian strip membrane protein domain
    IPR045009 - CASP-like protein 5
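
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state), the span of the locus can be computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr06:19294928..19296705' into its parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr06:19294928..19296705")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus spans 1,778 bp, which is longer than the 591-nt CDS of this gene, so the genomic interval presumably also contains non-coding sequence.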


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0065678.1 CASP-like protein 5B3 isoform X1 [Cucumis melo var. makuwa] | e-value: 1.1e-72 | identity: 96.18%
Query:  TQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL
        T RGNML FPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL
Subjt:  TQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL

Query:  AAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        AAASASGGVAVLFFSDLGHCSF +ECRKFQISVVLAFLSWIT+AISALIMLWILAAV
Subjt:  AAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV

KAG6593844.1 1-aminocyclopropane-1-carboxylate oxidase-like 1, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 6.9e-72 | identity: 92.45%
Query:  INGTQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTAT
        +  ++ GNMLDFPGTPGTLT FVLRILQCVFAAGS+ASMAT+VGFYNFTAFCYVIASMGLQVTWSFMLALLDAYAL+RKKMLHNPILVSLFVVGDWVTAT
Subjt:  INGTQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTAT

Query:  LSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        LSLAAASASGGVAVLFFS+LGHCSF DECRKFQISVVLAFLSWITIAISALIMLWILAA
Subjt:  LSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

TYK08224.1 CASP-like protein 5B3 isoform X1 [Cucumis melo var. makuwa] | e-value: 2.6e-87 | identity: 86.32%
Query:  MVGQSTRAAEEGHLGIFISHFYSLSTFVSSHFDSSKINGTQ----------------RGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF
        MVGQSTRAAEE HLGIF SH Y  STFVSSHFDS KIN TQ                 GNML FPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF
Subjt:  MVGQSTRAAEEGHLGIFISHFYSLSTFVSSHFDSSKINGTQ----------------RGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF

Query:  TAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAI
        TAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSF +ECRKFQISVVLAFLSWIT+AI
Subjt:  TAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAI

Query:  SALIMLWILAAV
        SALIMLWILAAV
Subjt:  SALIMLWILAAV

XP_004142183.1 CASP-like protein 5B3 isoform X1 [Cucumis sativus] | e-value: 2.9e-70 | identity: 96.05%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATS+GFYNFTAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        SGGVAVLFFSDLGHCSF  ECRKFQISVVLAFLSWIT+ ISALIMLWILAAV
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV

XP_038875242.1 CASP-like protein 5B3 isoform X1 [Benincasa hispida] | e-value: 1.5e-71 | identity: 97.37%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        SGGVAVLFFSDLGHC+F DECRKFQISVVLAFLSWITIA+SALIMLWIL AV
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
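
The % identity values reported for the hits can be related to the alignments themselves. The sketch below assumes the usual BLAST-style convention (not stated on this page) that identity is the number of identical positions divided by the alignment length, and that in the middle line a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The helper `percent_identity` is hypothetical; the two strings are the middle lines of the XP_038875242.1 alignment.

```python
def percent_identity(midlines):
    """Percent identity from the middle lines of a BLAST-style alignment."""
    aligned = sum(len(m) for m in midlines)
    # Only letters count as identities; '+' and ' ' are non-identical positions.
    identical = sum(ch.isalpha() for m in midlines for ch in m)
    return 100.0 * identical / aligned


# Middle lines of the XP_038875242.1 (Benincasa hispida) alignment.
midlines = [
    "MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFML"
    "ALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA",
    "SGGVAVLFFSDLGHC+F DECRKFQISVVLAFLSWITIA+SALIMLWIL AV",
]
pid = percent_identity(midlines)
print(round(pid, 2))
```

With 148 identical positions out of 152 aligned, this gives 97.37, which agrees with the value listed for that hit.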

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LIJ4 CASP-like protein | e-value: 1.4e-70 | identity: 96.05%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATS+GFYNFTAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        SGGVAVLFFSDLGHCSF  ECRKFQISVVLAFLSWIT+ ISALIMLWILAAV
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV

A0A1S3CDC1 CASP-like protein | e-value: 1.8e-70 | identity: 96.71%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        ML FPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        SGGVAVLFFSDLGHCSF +ECRKFQISVVLAFLSWIT+AISALIMLWILAAV
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV

A0A5A7VDG4 CASP-like protein | e-value: 5.2e-73 | identity: 96.18%
Query:  TQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL
        T RGNML FPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL
Subjt:  TQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSL

Query:  AAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
        AAASASGGVAVLFFSDLGHCSF +ECRKFQISVVLAFLSWIT+AISALIMLWILAAV
Subjt:  AAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV

A0A5D3CAJ7 CASP-like protein | e-value: 1.3e-87 | identity: 86.32%
Query:  MVGQSTRAAEEGHLGIFISHFYSLSTFVSSHFDSSKINGTQ----------------RGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF
        MVGQSTRAAEE HLGIF SH Y  STFVSSHFDS KIN TQ                 GNML FPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF
Subjt:  MVGQSTRAAEEGHLGIFISHFYSLSTFVSSHFDSSKINGTQ----------------RGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNF

Query:  TAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAI
        TAFCYVIASMGLQVTWS MLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSF +ECRKFQISVVLAFLSWIT+AI
Subjt:  TAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAI

Query:  SALIMLWILAAV
        SALIMLWILAAV
Subjt:  SALIMLWILAAV

A0A6J1KMZ0 CASP-like protein | e-value: 2.4e-70 | identity: 96.03%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        MLDFPGTPGTLT FVLRILQCVFAAGS+ASMAT+VGFYNFTAFCYVIASMGLQVTWSFMLALLDAYAL+RKKMLHNPILVSLFVVGDWVTATLSLAAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        SGGVAVLFFS+LGHCSF DECRKFQISVVLAFLSWITIAISALIMLWILAA
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

SwissProt top hits (e-value, % identity, alignment)
B6T990 CASP-like protein 5B3 | e-value: 8.0e-39 | identity: 57.14%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M D  G+PGT +G  LR+ QCV A  S+ +MAT+ GF N+TAFCY+IASMGLQ+ WSF LA LD Y+L  K+ LHNP+LVSLFVVGDWVTA LS AAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV
        S GV +LF  D+  C    +  C ++ +SVVLAF++W  IA SA+ M W+L ++
Subjt:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV

B8BD96 CASP-like protein 5B3 | e-value: 4.3e-40 | identity: 57.14%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M D  G+PGT +G  LR+ QCVFA  S+ +MA++ GF N+TAFCY+IASMGLQ+ WSF LA LD Y+L  K+ LHNP+LVSLFVVGDWVTA LS AAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV
        S GV +LF  D+  C    +  C ++++SV+LAF++W  IA SA+ M W+LA++
Subjt:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV

D5ACW4 CASP-like protein 5B1 (Fragment) | e-value: 6.8e-38 | identity: 57.62%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M D PGTPGT+ G  LR+ Q +FAA S+  M TS  F NFTAFCY+ A+M LQ  WSF+LA +D YAL+ K+ L N IL+SLFVVGDWVTATLSLAAA +
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        + G+ VLF  DL +C  M  CR++Q+S  +AF SW+ IAIS+LI L +L +
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

Q6K478 CASP-like protein 5B3 | e-value: 4.3e-40 | identity: 57.14%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M D  G+PGT +G  LR+ QCVFA  S+ +MA++ GF N+TAFCY+IASMGLQ+ WSF LA LD Y+L  K+ LHNP+LVSLFVVGDWVTA LS AAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV
        S GV +LF  D+  C    +  C ++++SV+LAF++W  IA SA+ M W+LA++
Subjt:  SGGVAVLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV

Q8L7R5 CASP-like protein 5B3 | e-value: 3.4e-53 | identity: 71.52%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M+D PGTPGTLTG VLRI QCVFAAGSI+ M TS GF++FTAFCY+IA+MGLQV WSF LA+LD +ALVRKK L +P+LVSLFVVGDWVT+TLSLA AS+
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        S G+ VL+F DLG CSF  EC K+Q+SV LAFL WITIA+S+L  LW+LA+
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

Arabidopsis top hits (e-value, % identity, alignment)
AT2G28370.1 Uncharacterised protein family (UPF0497) | e-value: 2.7e-29 | identity: 44.44%
Query:  SSKINGTQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWV
        ++++    R  M D  G PGTL G  LR  Q +FAA ++  MA++  F + TAFCY++A+ GLQ  WS  LA++D YA++ K+ L N  LVSLF +GD V
Subjt:  SSKINGTQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWV

Query:  TATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        T+TL+ AAA AS G+ VL  +DL  C+  + C +F+ S  LAF+SW     S L   W LA+
Subjt:  TATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

AT2G37200.1 Uncharacterised protein family (UPF0497) | e-value: 2.6e-24 | identity: 42%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M D  G PGT  G +LR+ Q V A  S++ M T+  F + TAFC ++ ++ LQ  WS  L ++DAYAL+ ++ L N  +V  F +GD VT+TL+ AAASA
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILA
        S G+ VL  +DLG C+ ++ C +F+ +  +AF+SW  ++ S ++  W LA
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILA

AT3G23200.1 Uncharacterised protein family (UPF0497) | e-value: 2.4e-54 | identity: 71.52%
Query:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA
        M+D PGTPGTLTG VLRI QCVFAAGSI+ M TS GF++FTAFCY+IA+MGLQV WSF LA+LD +ALVRKK L +P+LVSLFVVGDWVT+TLSLA AS+
Subjt:  MLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASA

Query:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA
        S G+ VL+F DLG CSF  EC K+Q+SV LAFL WITIA+S+L  LW+LA+
Subjt:  SGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAA

AT3G53850.1 Uncharacterised protein family (UPF0497) | e-value: 5.0e-36 | identity: 56.38%
Query:  GTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVA
        G PGT+ G +LRI QC  AA SI  M ++  F   TAFCY+IASMGLQ+ WSF LA LD YAL  KK L NPILVSLFVVGDWVTA LSLAAA +S GV 
Subjt:  GTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVA

Query:  VLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV
        VL+  D+ +C+   +  C +++++V L+F++WI IA+S+ +  WILA+V
Subjt:  VLFFSDLGHCSFMDE--CRKFQISVVLAFLSWITIAISALIMLWILAAV

AT5G02060.1 Uncharacterised protein family (UPF0497) | e-value: 9.4e-35 | identity: 55.48%
Query:  GTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVA
        G+PGT++G +LR+ QC  AA SI  M +S  F N+TAFC+++ASMGLQ+ WSF LA LD YA+ RK  L +PIL+SLF VGDWVTA L+LAAA +S GV 
Subjt:  GTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAYALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVA

Query:  VLFFSDLGHCSFMD--ECRKFQISVVLAFLSWITIAISALIMLWIL
        VLF  D   C       C +FQISV L+F +W   AIS+  M WIL
Subjt:  VLFFSDLGHCSFMD--ECRKFQISVVLAFLSWITIAISALIMLWIL


Sequences
CDS sequence
ATGGTGGGCCAATCAACGCGTGCTGCCGAAGAAGGCCATTTAGGGATTTTCATTTCTCATTTTTACTCTCTTTCTACTTTTGTATCTTCCCATTTCGACTCTTCAAAGAT
CAATGGCACCCAGAGGGGAAATATGTTGGATTTTCCTGGGACTCCAGGGACATTGACTGGGTTTGTGTTGAGGATTTTGCAATGTGTTTTTGCTGCTGGGTCTATTGCTT
CCATGGCTACTTCTGTTGGCTTCTATAACTTCACAGCTTTTTGCTATGTGATCGCGTCGATGGGTCTGCAAGTGACTTGGAGTTTTATGCTGGCATTGTTAGATGCATAT
GCATTGGTAAGAAAGAAGATGCTTCACAATCCTATTCTGGTCAGCCTCTTTGTTGTTGGAGATTGGGTGACAGCAACATTGTCTCTCGCTGCAGCTTCTGCCTCTGGTGG
GGTAGCGGTATTGTTCTTCAGCGACTTAGGACACTGCAGTTTCATGGACGAATGCCGAAAGTTCCAAATCTCTGTTGTTTTAGCCTTTCTAAGCTGGATAACCATAGCCA
TTTCAGCTCTGATTATGCTCTGGATATTGGCTGCAGTTTGA
mRNA sequence
ATGGTGGGCCAATCAACGCGTGCTGCCGAAGAAGGCCATTTAGGGATTTTCATTTCTCATTTTTACTCTCTTTCTACTTTTGTATCTTCCCATTTCGACTCTTCAAAGAT
CAATGGCACCCAGAGGGGAAATATGTTGGATTTTCCTGGGACTCCAGGGACATTGACTGGGTTTGTGTTGAGGATTTTGCAATGTGTTTTTGCTGCTGGGTCTATTGCTT
CCATGGCTACTTCTGTTGGCTTCTATAACTTCACAGCTTTTTGCTATGTGATCGCGTCGATGGGTCTGCAAGTGACTTGGAGTTTTATGCTGGCATTGTTAGATGCATAT
GCATTGGTAAGAAAGAAGATGCTTCACAATCCTATTCTGGTCAGCCTCTTTGTTGTTGGAGATTGGGTGACAGCAACATTGTCTCTCGCTGCAGCTTCTGCCTCTGGTGG
GGTAGCGGTATTGTTCTTCAGCGACTTAGGACACTGCAGTTTCATGGACGAATGCCGAAAGTTCCAAATCTCTGTTGTTTTAGCCTTTCTAAGCTGGATAACCATAGCCA
TTTCAGCTCTGATTATGCTCTGGATATTGGCTGCAGTTTGA
Protein sequence
MVGQSTRAAEEGHLGIFISHFYSLSTFVSSHFDSSKINGTQRGNMLDFPGTPGTLTGFVLRILQCVFAAGSIASMATSVGFYNFTAFCYVIASMGLQVTWSFMLALLDAY
ALVRKKMLHNPILVSLFVVGDWVTATLSLAAASASGGVAVLFFSDLGHCSFMDECRKFQISVVLAFLSWITIAISALIMLWILAAV
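
The protein sequence corresponds to the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first 13 codons of the CDS can be translated and compared with the start of the protein. The helper `translate` is hypothetical and written here only for illustration; it is not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon ('*' marks a stop codon)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 39 nucleotides of the CDS sequence listed above.
cds_start = "ATGGTGGGCCAATCAACGCGTGCTGCCGAAGAAGGCCAT"
print(translate(cds_start))
```

The result, `MVGQSTRAAEEGH`, matches the first 13 residues of the protein sequence. Applying the same function to the full 591-nt CDS would be expected to yield the 196-residue protein followed by `*` for the terminal TGA stop codon.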